BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 009954
(521 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|388520789|gb|AFK48456.1| unknown [Lotus japonicus]
Length = 527
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 328/501 (65%), Positives = 389/501 (77%), Gaps = 15/501 (2%)
Query: 25 RSRTGERG--RDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRH 82
R + GER DRHHRD+K GG + R D+ Y+R RD+DR H D +D ERRH++
Sbjct: 34 RKQDGERRDFHDRHHRDYKDGGFNGR-DRYNSYNRHRSRDYDR---HNDRVKDGERRHKY 89
Query: 83 RSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPE 142
+ S R R S+S S S S S+SKR SGFDMAPP+A +AV GQ G+
Sbjct: 90 EAHS---KRSRGESRSPSRSPSRSESKRVSGFDMAPPSADGT--SAVSGQTLGINHLNQG 144
Query: 143 MAQNMLPFGATQ--LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMT 200
AQN FG +Q +GA LM VQ MTQQATRHARRVY+GGLPPL NEQ+IATFFS VMT
Sbjct: 145 TAQNFSLFGISQPQIGALSLMQVQPMTQQATRHARRVYIGGLPPLTNEQSIATFFSHVMT 204
Query: 201 AIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN 260
AIGGNSAG GD+VVNVYINHEKKFAF+EMRTVEEASNAM+LDGI+FEGV+VRVRRPTDYN
Sbjct: 205 AIGGNSAGAGDSVVNVYINHEKKFAFLEMRTVEEASNAMSLDGIVFEGVSVRVRRPTDYN 264
Query: 261 PTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGT 320
PTLAAALGP QPSP LNL+AVGL+ G IGG +G DR+FVGGLPYYF E QI+ELL++FG
Sbjct: 265 PTLAAALGPCQPSPYLNLSAVGLSGGTIGGTDGLDRIFVGGLPYYFAEEQIRELLQAFGP 324
Query: 321 LHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
L FDLV+D++TGNSKGYGFC+YQDPAVTD+ACA+LNGLK+GDKTLTVRRAT SG SKTE
Sbjct: 325 LRSFDLVRDKETGNSKGYGFCIYQDPAVTDMACASLNGLKVGDKTLTVRRATVSGHSKTE 384
Query: 381 QESILAQAQQHIAIQKMALQTSGMNTLGGGM--SLFGETLAKVLCLTEAITADALADDEE 438
QE I AQAQQ+I +QK+AL+ G+N G + E+ KVLCLTEAIT D L D+ E
Sbjct: 385 QEHIFAQAQQNITMQKVALEVVGLNIPGVERVPTTIDESATKVLCLTEAITTDELMDNGE 444
Query: 439 YEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
YEEI+EDMR+ECGK+GTL+NVVIPRP+ +G +TPG+GKVFLEY D AK+AL+GRK
Sbjct: 445 YEEIVEDMRDECGKFGTLMNVVIPRPNPSGEQTPGIGKVFLEYSDTAASFAAKSALNGRK 504
Query: 499 FGGNTVNAFYYPEDKYFNKDY 519
FGGN V A+YYPE+K+ N ++
Sbjct: 505 FGGNMVTAYYYPEEKFHNMEF 525
>gi|359497050|ref|XP_002267854.2| PREDICTED: splicing factor U2af large subunit B-like, partial
[Vitis vinifera]
Length = 410
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 308/411 (74%), Positives = 330/411 (80%), Gaps = 45/411 (10%)
Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQAT 171
SGFDMAPP AA+LPGAAVPG+LPGVP VP M QNM PFGATQLGA PLMPVQ MTQQAT
Sbjct: 44 SGFDMAPPVAALLPGAAVPGELPGVPQMVPGMIQNMFPFGATQLGALPLMPVQAMTQQAT 103
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
RHARRVYVGGLPPLANEQ IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR+
Sbjct: 104 RHARRVYVGGLPPLANEQTIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRS 163
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
VEEASNAMALDGI+FEGV+VRVRRPTDYNP+LAAALGP QPSP+LNLAAVGL G IGGA
Sbjct: 164 VEEASNAMALDGIMFEGVSVRVRRPTDYNPSLAAALGPSQPSPHLNLAAVGLMPGVIGGA 223
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTDI
Sbjct: 224 EGPDRIFVGGLPYYFTEEQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 283
Query: 352 ACAALNGLKMGDKTLTVRRATA-SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
ACAALNGLKMGDKTLTVRRAT SGQ+K+EQ++ILAQAQQHIAIQK+ALQ G+N G
Sbjct: 284 ACAALNGLKMGDKTLTVRRATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLNLPGA- 342
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
G LV+VVIPRP NG
Sbjct: 343 -------------------------------------------GALVHVVIPRPSPNGDL 359
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
PGVGKVFLEY D G ++A+NALSGRKFGGN V+A YYPEDKY++ DY A
Sbjct: 360 IPGVGKVFLEYSDTAGSSSARNALSGRKFGGNVVSAVYYPEDKYYDGDYGA 410
>gi|156070760|gb|ABU45175.1| unknown [Solanum melongena]
Length = 553
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 302/458 (65%), Positives = 353/458 (77%), Gaps = 26/458 (5%)
Query: 67 DRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPG 126
DR R+ ++D+E RHRH+ S + R SRSPSKSKR SGFDMAPP +A+L G
Sbjct: 117 DRRRNNDKDREDRHRHKPSSRARSR----------SRSPSKSKRISGFDMAPPTSALLSG 166
Query: 127 AA-VPGQLPGVPS-AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPP 184
A V GQ+PG+ + ++P M NM P A Q GA P+MPVQ MTQQATRHARRVYVGGLPP
Sbjct: 167 ATDVAGQVPGITNPSIPGMFSNMFPVAAGQFGALPVMPVQAMTQQATRHARRVYVGGLPP 226
Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDG+
Sbjct: 227 TANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGV 286
Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
IFEG V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL G+ GG EGPDR+FVGGLPY
Sbjct: 287 IFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSGGLEGPDRIFVGGLPY 346
Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
YFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDK
Sbjct: 347 YFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTDIACAALNGIKMGDK 406
Query: 365 TLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLC 423
TLTVRRA Q EQES+L AQQ IA+Q+ LQ + T K+LC
Sbjct: 407 TLTVRRANQGITQPNPEQESVLLHAQQQIALQRFMLQPGALAT-------------KILC 453
Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
LT+ ++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ NG TPG+GKVFLEY D
Sbjct: 454 LTQVVSVDELKDDDDYQDILEDMRIECGKFGALLNVVIPRPNPNGEPTPGLGKVFLEYAD 513
Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
+ A+ L+GRKFGGN V A +YPE+K+ DY A
Sbjct: 514 VDSSSKARQGLNGRKFGGNQVIAVFYPENKFSEGDYEA 551
>gi|188998293|gb|ACD67872.1| U2 snRNP auxiliary factor large subunit [Solanum melongena]
Length = 554
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 302/458 (65%), Positives = 353/458 (77%), Gaps = 26/458 (5%)
Query: 67 DRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPG 126
DR R+ ++D+E RHRH+ S + R SRSPSKSKR SGFDMAPP +A+L G
Sbjct: 118 DRRRNNDKDREDRHRHKPSSRARSR----------SRSPSKSKRISGFDMAPPTSALLSG 167
Query: 127 AA-VPGQLPGVPS-AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPP 184
A V GQ+PG+ + ++P M NM P A Q GA P+MPVQ MTQQATRHARRVYVGGLPP
Sbjct: 168 ATDVAGQVPGITNPSIPGMFSNMFPVAAGQFGALPVMPVQAMTQQATRHARRVYVGGLPP 227
Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDG+
Sbjct: 228 TANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGV 287
Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
IFEG V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL G+ GG EGPDR+FVGGLPY
Sbjct: 288 IFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSGGLEGPDRIFVGGLPY 347
Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
YFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDK
Sbjct: 348 YFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTDIACAALNGIKMGDK 407
Query: 365 TLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLC 423
TLTVRRA Q EQES+L AQQ IA+Q+ LQ + T K+LC
Sbjct: 408 TLTVRRANQGITQPNPEQESVLLHAQQQIALQRFMLQPGALAT-------------KILC 454
Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
LT+ ++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ NG TPG+GKVFLEY D
Sbjct: 455 LTQVVSVDELKDDDDYQDILEDMRIECGKFGALLNVVIPRPNPNGEPTPGLGKVFLEYAD 514
Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
+ A+ L+GRKFGGN V A +YPE+K+ DY A
Sbjct: 515 VDSSSKARQGLNGRKFGGNQVIAVFYPENKFSEGDYEA 552
>gi|75338884|sp|Q9ZR40.1|U2A2B_NICPL RecName: Full=Splicing factor U2af large subunit B; AltName:
Full=NpU2AF65b; AltName: Full=U2 auxiliary factor 65 kDa
subunit B; AltName: Full=U2 small nuclear
ribonucleoprotein auxiliary factor large subunit B;
Short=U2 snRNP auxiliary factor large subunit B
gi|3850821|emb|CAA77135.1| U2 snRNP auxiliary factor, large subunit [Nicotiana
plumbaginifolia]
Length = 573
Score = 584 bits (1505), Expect = e-164, Method: Compositional matrix adjust.
Identities = 284/410 (69%), Positives = 328/410 (80%), Gaps = 15/410 (3%)
Query: 112 SGFDMAPPAAAMLPG-AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA 170
SGFDMAPP +AMLPG A GQ+PG +P M NM P + Q GA P+MP+Q MTQQA
Sbjct: 175 SGFDMAPPTSAMLPGITAAAGQVPGTNPPIPGMFPNMFPLASGQFGALPVMPIQAMTQQA 234
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGLP ANEQ++ATFFS VM+AIGGN+AGPGDAVVNVYIN+EKKFAFVEMR
Sbjct: 235 TRHARRVYVGGLPAHANEQSVATFFSHVMSAIGGNTAGPGDAVVNVYINYEKKFAFVEMR 294
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+VEEASNAMALDGIIFEG +VRRP+DYNP+LAA LGP QP+PNLNLAAVGL+ G+ GG
Sbjct: 295 SVEEASNAMALDGIIFEGAPCKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLSPGSAGG 354
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 355 LEGPDRIFVGGLPYYFTEAQIRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDVSVTD 414
Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
IACAALNG+KMGDKTLTVRRA + Q K EQES+L AQQ IA+Q++ LQ + + T
Sbjct: 415 IACAALNGIKMGDKTLTVRRANQGTTQPKPEQESVLLHAQQQIALQRLMLQPATLAT--- 471
Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
KVL LTE I+AD L DDE+Y++ILEDMR ECGK+G+LVNVVIPRP NG
Sbjct: 472 ----------KVLSLTEVISADELNDDEDYQDILEDMRTECGKFGSLVNVVIPRPSPNGE 521
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
TPGVGKVFLEY D + A+ +L+GRKFGGN V A +YPE+K++ DY
Sbjct: 522 PTPGVGKVFLEYADVDSSSKARQSLNGRKFGGNQVVAVFYPENKFYEGDY 571
>gi|75338883|sp|Q9ZR39.1|U2A2A_NICPL RecName: Full=Splicing factor U2af large subunit A; AltName:
Full=NpU2AF65a; AltName: Full=U2 auxiliary factor 65 kDa
subunit A; AltName: Full=U2 small nuclear
ribonucleoprotein auxiliary factor large subunit A;
Short=U2 snRNP auxiliary factor large subunit A
gi|3850823|emb|CAA77136.1| U2 snRNP auxiliary factor, large subunit [Nicotiana
plumbaginifolia]
Length = 555
Score = 584 bits (1505), Expect = e-164, Method: Compositional matrix adjust.
Identities = 284/412 (68%), Positives = 325/412 (78%), Gaps = 15/412 (3%)
Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA 170
SGFDMAPP A+LPGA GQ+PG A+P + NM P ++Q GA P+MPVQ MTQQA
Sbjct: 157 SGFDMAPPTTALLPGATDAAGQVPGTNPAIPGLFSNMFPLASSQFGALPMMPVQAMTQQA 216
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 217 TRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 276
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+VEEASNAMALDG+IFEG V+VRRP+DYNP+LAA LGP QPSPNLNLAAVG G+ GG
Sbjct: 277 SVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGSTPGSSGG 336
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 337 LEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTD 396
Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
IACAALNG+KMGDKTLTVRRA + Q EQES+L AQQ IA+Q+ LQ + T
Sbjct: 397 IACAALNGIKMGDKTLTVRRANQGTTQPNPEQESVLLHAQQQIALQRFMLQPGALAT--- 453
Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
KVLCLTE +T D L DD++Y++ILEDMR EC K+G LVNVVIPRP+ NG
Sbjct: 454 ----------KVLCLTEVVTVDELNDDDDYQDILEDMRTECEKFGALVNVVIPRPNPNGV 503
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
TPG+GKVFLEY D G + A+ L+GRKFGGN V A +YPE+K+ DY A
Sbjct: 504 PTPGLGKVFLEYADVDGSSKARQGLNGRKFGGNQVVAVFYPENKFSEGDYEA 555
>gi|156070776|gb|ABU45190.1| unknown [Capsicum frutescens]
Length = 551
Score = 583 bits (1503), Expect = e-164, Method: Compositional matrix adjust.
Identities = 282/412 (68%), Positives = 328/412 (79%), Gaps = 15/412 (3%)
Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA 170
SGFDMAPP +A+LPGA V GQ+PG ++P M NM P A Q GA P+MPVQ MTQQA
Sbjct: 153 SGFDMAPPTSALLPGATDVTGQVPGANPSIPGMFSNMFPLAAGQFGALPIMPVQAMTQQA 212
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 213 TRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 272
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+VEEASNAMALDG+IFEG V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL G+ GG
Sbjct: 273 SVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSGG 332
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 333 LEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTD 392
Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
IACAALNG+KMGDKTLTVRRA + Q EQES+L AQQ IA+Q+ LQ + T
Sbjct: 393 IACAALNGIKMGDKTLTVRRANQGTNQPNPEQESVLLHAQQQIALQRFMLQPGALAT--- 449
Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
KVLCLT+ ++ D L DD++Y++ILEDMR ECGK+G+L+NVVIPRP+ +G
Sbjct: 450 ----------KVLCLTQVVSVDELNDDDDYQDILEDMRVECGKFGSLLNVVIPRPNPSGE 499
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
TPG+GKVFLEY D + A+ L+GRKFGGN V A +YPE+K+ +Y A
Sbjct: 500 PTPGLGKVFLEYADVESSSRARQGLNGRKFGGNEVIAVFYPENKFSEGEYEA 551
>gi|156070797|gb|ABU45209.1| unknown [Solanum bulbocastanum]
Length = 558
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 282/411 (68%), Positives = 326/411 (79%), Gaps = 16/411 (3%)
Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPS-AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
SGFDMAPP +A+L GA V GQ+PG + ++P M NM P A Q GA P+MPVQ MTQQ
Sbjct: 159 SGFDMAPPTSALLSGATDVAGQVPGTTNPSIPGMFSNMFPLAAGQFGALPIMPVQAMTQQ 218
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
ATRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEM
Sbjct: 219 ATRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEM 278
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R+VEEASNAMALDG++FEG V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL G+ G
Sbjct: 279 RSVEEASNAMALDGVVFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSG 338
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VT
Sbjct: 339 GLEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVT 398
Query: 350 DIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
DIACAALNG+KMGDKTLTVRRA + Q EQES+L AQQ IA+Q+ LQ + T
Sbjct: 399 DIACAALNGIKMGDKTLTVRRANQGTTQPNPEQESVLLHAQQQIALQRFMLQPGALAT-- 456
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
KVLCLTE ++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ NG
Sbjct: 457 -----------KVLCLTEVVSVDELKDDDDYQDILEDMRIECGKFGALLNVVIPRPNPNG 505
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
TPG+GKVFLEY D + A+ L+GRKFGGN V A +YPE+K+ DY
Sbjct: 506 EPTPGLGKVFLEYADVDSSSKARQGLNGRKFGGNQVIAVFYPENKFSEGDY 556
>gi|297840477|ref|XP_002888120.1| hypothetical protein ARALYDRAFT_475241 [Arabidopsis lyrata subsp.
lyrata]
gi|297333961|gb|EFH64379.1| hypothetical protein ARALYDRAFT_475241 [Arabidopsis lyrata subsp.
lyrata]
Length = 589
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 284/411 (69%), Positives = 324/411 (78%), Gaps = 17/411 (4%)
Query: 113 GFDMAPPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPF-GATQLGAFPLMPVQVMTQQ 169
GFDMAPP A GQ+P VP+ +P M NM P QLGA P++PVQ MTQQ
Sbjct: 190 GFDMAPPDMLAATAVAAAGQVPSVPTTATIPGMFPNMFPMVPGQQLGALPVLPVQAMTQQ 249
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
ATRHARRVYVGGLPP ANEQ++ATFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVEM
Sbjct: 250 ATRHARRVYVGGLPPTANEQSVATFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVEM 309
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R+VEEASNAMALDGII EGV V+VRRPTDYNP+LAA LGP QP+PNLNLAAVGL+SG+ G
Sbjct: 310 RSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLAAVGLSSGSTG 369
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G EGPDR+FVGGLPYYFTE QI+ELLESFG L GF+LVKDR+TGNSKGY FCVYQDP+VT
Sbjct: 370 GLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAFCVYQDPSVT 429
Query: 350 DIACAALNGLKMGDKTLTVRRATASG-QSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
DIACAALNG+KMGDKTLTVRRA Q K EQE +L AQQ IA+Q++ LQ G
Sbjct: 430 DIACAALNGIKMGDKTLTVRRAIQGVIQPKPEQEEVLLHAQQQIALQRLMLQPGG----- 484
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
T K++CLT+ +TAD L DDEEY +I+EDMR+E GK+G LVNVVIPRP+ +
Sbjct: 485 --------TPTKIVCLTQVVTADDLRDDEEYADIMEDMRQEGGKFGNLVNVVIPRPNPDH 536
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
TPGVGKVFLEY D G + A++ ++GRKFGGN V A YYPEDKY DY
Sbjct: 537 DPTPGVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYLQGDY 587
>gi|356509477|ref|XP_003523474.1| PREDICTED: splicing factor U2af large subunit B-like [Glycine max]
Length = 600
Score = 577 bits (1486), Expect = e-162, Method: Compositional matrix adjust.
Identities = 286/415 (68%), Positives = 324/415 (78%), Gaps = 20/415 (4%)
Query: 112 SGFDMAPPAAAMLPGA-AVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMT 167
SGFDMAPPA+AML GA AV GQ+ G +P M NM P +Q+ A P+MPVQ MT
Sbjct: 197 SGFDMAPPASAMLTGASAVAGQITGANPTIPGMFPNMFPLATSQMQQFSALPVMPVQAMT 256
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ++ATFFSQVM IGGN+AGPGDAVVNVYINH+KKFAFV
Sbjct: 257 QQATRHARRVYVGGLPPTANEQSVATFFSQVMAKIGGNTAGPGDAVVNVYINHDKKFAFV 316
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAMALDGIIFEG V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL G+
Sbjct: 317 EMRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLTPGS 376
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG +GPDRVFVGGLPYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD A
Sbjct: 377 AGGLDGPDRVFVGGLPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLA 436
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASG---QSKTEQESILAQAQQHIAIQKMALQTSGM 404
VTDIACAALNG+KMGDKTLTVRRA Q K EQESIL AQQ IA+QK+ LQ + +
Sbjct: 437 VTDIACAALNGIKMGDKTLTVRRANQGANPQQPKPEQESILMHAQQQIALQKLMLQPALV 496
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
T KV+CLT A+++D L DDE+YEEIL+DMR+EC K+GTLVNVVIPRP
Sbjct: 497 AT-------------KVVCLTHAVSSDELKDDEDYEEILDDMRQECSKFGTLVNVVIPRP 543
Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G GVGKVFLEY D G A+ L+GRKF GN V A +YPE+K+ DY
Sbjct: 544 PSDGEPAAGVGKVFLEYVDIDGATKARAGLNGRKFDGNQVVAVFYPENKFAQGDY 598
>gi|168028774|ref|XP_001766902.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681881|gb|EDQ68304.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 486
Score = 577 bits (1486), Expect = e-162, Method: Compositional matrix adjust.
Identities = 287/411 (69%), Positives = 323/411 (78%), Gaps = 21/411 (5%)
Query: 110 RRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQ 168
+ SGFDMAPP A + V GQ+PG+P A+P + M PFG TQ G P MP Q MTQ
Sbjct: 96 KTSGFDMAPPGATV-----VAGQIPGMPPAMPGVFPAMFPFGGTQQFGGLPGMPAQAMTQ 150
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGLPP+ANEQ IAT+FSQVM A+GGN+AGPGDAVVNVYIN EKKFAFVE
Sbjct: 151 QATRHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFVE 210
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MRTVEEASNAMALDGIIFEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL GA
Sbjct: 211 MRTVEEASNAMALDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAA 270
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GGA+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP+V
Sbjct: 271 GGADGPDRIFVGGLPYYLTEVQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPSV 330
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
DIACA LNG+KM DKTL VRRATASGQ K +Q ++LA AQQ IAIQK+ALQ
Sbjct: 331 VDIACATLNGMKMDDKTLNVRRATASGQPKPDQANVLAHAQQQIAIQKLALQA------- 383
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
+T KV+ LTE +T + L DDEEY+EI+EDM ECGKYGTLVN VIPRP ++G
Sbjct: 384 -------KTPTKVVALTEVVTPNQLEDDEEYQEIMEDMGTECGKYGTLVNCVIPRP-RSG 435
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PG+GKVFLEY D G + AK +L GR+F N V A YYPEDK+ DY
Sbjct: 436 ENVPGLGKVFLEYSDIAGASKAKASLHGRRFDENLVVAVYYPEDKFAAGDY 486
>gi|30696485|ref|NP_176287.3| Splicing factor U2af large subunit B [Arabidopsis thaliana]
gi|209572798|sp|Q8L716.2|U2A2B_ARATH RecName: Full=Splicing factor U2af large subunit B; AltName:
Full=U2 auxiliary factor 65 kDa subunit B; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit B; Short=U2 snRNP auxiliary factor large
subunit B
gi|332195625|gb|AEE33746.1| Splicing factor U2af large subunit B [Arabidopsis thaliana]
Length = 589
Score = 573 bits (1477), Expect = e-161, Method: Compositional matrix adjust.
Identities = 282/411 (68%), Positives = 322/411 (78%), Gaps = 17/411 (4%)
Query: 113 GFDMAPPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPF-GATQLGAFPLMPVQVMTQQ 169
GFDMAPP A GQ+P VP+ +P M NM P QLGA P++PVQ MTQQ
Sbjct: 190 GFDMAPPDMLAATAVAAAGQVPSVPTTATIPGMFSNMFPMVPGQQLGALPVLPVQAMTQQ 249
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
ATRHARRVYVGGLPP ANEQ+++TFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVEM
Sbjct: 250 ATRHARRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVEM 309
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R+VEEASNAMALDGII EGV V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL+SG+ G
Sbjct: 310 RSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLSSGSTG 369
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G EGPDR+FVGGLPYYFTE QI+ELLESFG L GF+LVKDR+TGNSKGY FCVYQDP+VT
Sbjct: 370 GLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAFCVYQDPSVT 429
Query: 350 DIACAALNGLKMGDKTLTVRRATASG-QSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
DIACAALNG+KMGDKTLTVRRA Q K EQE +L AQQ IA+Q++ Q G
Sbjct: 430 DIACAALNGIKMGDKTLTVRRAIQGAIQPKPEQEEVLLYAQQQIALQRLMFQPGG----- 484
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
T K++CLT+ +TAD L DDEEY EI+EDMR+E GK+G LVNVVIPRP+ +
Sbjct: 485 --------TPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFGNLVNVVIPRPNPDH 536
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
TPGVGKVFLEY D G + A++ ++GRKFGGN V A YYPEDKY DY
Sbjct: 537 DPTPGVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDY 587
>gi|357455533|ref|XP_003598047.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355487095|gb|AES68298.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 626
Score = 573 bits (1477), Expect = e-161, Method: Compositional matrix adjust.
Identities = 281/414 (67%), Positives = 322/414 (77%), Gaps = 19/414 (4%)
Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMTQ 168
SGFDMAPP +A+L V GQ+ G A+P M NM P Q+ A P++PVQ MTQ
Sbjct: 224 SGFDMAPPTSAILGATGVAGQITGASPAIPGMFPNMFPLPTNQVQPFSALPVLPVQAMTQ 283
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGL P ANEQ++ATFFSQVM IGGN+AGPGDAVVNVYINH+KKFAFVE
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVE 343
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MR+VEEASNAMALDGIIFEG V+VRRPTDYNP+LAAALGP QP+PNLNL VGL+ G+
Sbjct: 344 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 403
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GG +GPDR+FVGG+PYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD AV
Sbjct: 404 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLAV 463
Query: 349 TDIACAALNGLKMGDKTLTVRRA---TASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
TDIACAALNG+KMGDKTLTVRRA T Q K EQESIL AQQ IA+QK+ LQ + +
Sbjct: 464 TDIACAALNGIKMGDKTLTVRRANQNTNPMQPKPEQESILMHAQQQIALQKLMLQPALVA 523
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
T KVLCLT A++ D L DDE+YEEIL+DMR+EC K+G LVNVVIPRP
Sbjct: 524 T-------------KVLCLTHAVSPDELKDDEDYEEILDDMRQECSKFGNLVNVVIPRPR 570
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G PGVGKVFLEY D G A++ L+GRKFGGN V A +YPE+K+ DY
Sbjct: 571 PDGELCPGVGKVFLEYADVDGSTKARSGLNGRKFGGNQVIAVFYPENKFAQGDY 624
>gi|356517814|ref|XP_003527581.1| PREDICTED: splicing factor U2af large subunit A-like [Glycine max]
Length = 605
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 284/415 (68%), Positives = 323/415 (77%), Gaps = 20/415 (4%)
Query: 112 SGFDMAPPAAAMLPGA-AVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMT 167
SGFDMAPPA+AML GA AV GQ+ G +P M NM P Q+ A P+MPVQ MT
Sbjct: 202 SGFDMAPPASAMLAGASAVAGQITGANPTIPGMFPNMFPLATNQMQQFSALPVMPVQAMT 261
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ++ATFFSQVM IGGN+AGPGDAVVNVYINH+KKFAFV
Sbjct: 262 QQATRHARRVYVGGLPPTANEQSVATFFSQVMAKIGGNTAGPGDAVVNVYINHDKKFAFV 321
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAMALDGIIFEG V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL G+
Sbjct: 322 EMRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLTPGS 381
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG +GPDR+FVGGLPYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD A
Sbjct: 382 AGGLDGPDRIFVGGLPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLA 441
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASG---QSKTEQESILAQAQQHIAIQKMALQTSGM 404
VTDIACAALNG+KMGDKTLTVRRA Q K EQESIL AQQ IA+QK+ LQ + +
Sbjct: 442 VTDIACAALNGIKMGDKTLTVRRANQGANPQQPKPEQESILMHAQQQIALQKLMLQPALV 501
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
T KV+CLT A+++D L DDE+Y+EIL+DMR+EC K+GTLVNVVIPRP
Sbjct: 502 AT-------------KVVCLTHAVSSDELKDDEDYDEILDDMRQECSKFGTLVNVVIPRP 548
Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G GVGKVFLEY D G A+ L+GRKF GN V A +YPE+K+ DY
Sbjct: 549 PPDGEPAAGVGKVFLEYVDIDGATKARAGLNGRKFDGNQVVAVFYPENKFAQGDY 603
>gi|156070781|gb|ABU45194.1| unknown [Petunia integrifolia subsp. inflata]
Length = 557
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 279/412 (67%), Positives = 323/412 (78%), Gaps = 19/412 (4%)
Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA 170
SGFDMAPP +A+LPGA GQ+PG ++P M NM P + Q GA P+MP+Q MTQQA
Sbjct: 162 SGFDMAPPTSALLPGATDTAGQVPGASPSIPGMFSNMFPLTSGQFGALPVMPIQAMTQQA 221
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 222 TRHARRVYVGGLPPSANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 281
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+VEEASNAMALDG+IFEG V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL G+ GG
Sbjct: 282 SVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSGG 341
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 342 LEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTD 401
Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
IACAALNG+KMGDKTLTVRRA + Q EQES+L AQQ IA+QK Q + T
Sbjct: 402 IACAALNGIKMGDKTLTVRRANQGTPQPNPEQESVLLHAQQQIALQKFMFQPGALAT--- 458
Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
KVLCLT+A++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ NG
Sbjct: 459 ----------KVLCLTQAVSVDELNDDDDYQDILEDMRTECGKFGALLNVVIPRPNPNGE 508
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
TPG+GK Y D G + A+ L+GRKFGGN V A +YPE+K+ DY A
Sbjct: 509 PTPGIGK----YADVDGSSKARQGLNGRKFGGNQVVAVFYPENKFSEGDYEA 556
>gi|22655131|gb|AAM98156.1| putative U2 snRNP auxiliary factor [Arabidopsis thaliana]
Length = 589
Score = 570 bits (1469), Expect = e-160, Method: Compositional matrix adjust.
Identities = 281/411 (68%), Positives = 321/411 (78%), Gaps = 17/411 (4%)
Query: 113 GFDMAPPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPF-GATQLGAFPLMPVQVMTQQ 169
GFDMAPP A GQ+P VP+ +P M NM P QLGA P++PVQ MTQQ
Sbjct: 190 GFDMAPPDMLAATAVAAAGQVPSVPTTATIPGMFSNMFPMVPGQQLGALPVLPVQAMTQQ 249
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
ATRHA RVYVGGLPP ANEQ+++TFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVEM
Sbjct: 250 ATRHAPRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVEM 309
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R+VEEASNAMALDGII EGV V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL+SG+ G
Sbjct: 310 RSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLSSGSTG 369
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G EGPDR+FVGGLPYYFTE QI+ELLESFG L GF+LVKDR+TGNSKGY FCVYQDP+VT
Sbjct: 370 GLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAFCVYQDPSVT 429
Query: 350 DIACAALNGLKMGDKTLTVRRATASG-QSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
DIACAALNG+KMGDKTLTVRRA Q K EQE +L AQQ IA+Q++ Q G
Sbjct: 430 DIACAALNGIKMGDKTLTVRRAIQGAIQPKPEQEEVLLYAQQQIALQRLMFQPGG----- 484
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
T K++CLT+ +TAD L DDEEY EI+EDMR+E GK+G LVNVVIPRP+ +
Sbjct: 485 --------TPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFGNLVNVVIPRPNPDH 536
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
TPGVGKVFLEY D G + A++ ++GRKFGGN V A YYPEDKY DY
Sbjct: 537 DPTPGVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDY 587
>gi|224134362|ref|XP_002327819.1| predicted protein [Populus trichocarpa]
gi|222836904|gb|EEE75297.1| predicted protein [Populus trichocarpa]
Length = 541
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 284/426 (66%), Positives = 327/426 (76%), Gaps = 23/426 (5%)
Query: 108 SKRRSGFDMAPPAAAMLPGAAVP------------GQLPGVPSAVPEMAQNMLPFGATQ- 154
SKR SGFDMAPP++A+LP AA GQ+ G +P M NM P G +Q
Sbjct: 123 SKRMSGFDMAPPSSAILPNAAAAAAAAAAASAAASGQIAGTTPPIPGMFPNMFPLGTSQQ 182
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
GA P+MPVQ MTQQATRHARRVYVGGLPP ANEQ++ATFFSQVM AIGGN+AGPGDAVV
Sbjct: 183 FGALPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVV 242
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
NVYINHEKKFAFVEMR+VEEASNAMALDGIIFEG V+VRRP+DYNP+LAA LGP QP+P
Sbjct: 243 NVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNP 302
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
NLNL+AVGLA G+ GG EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGN
Sbjct: 303 NLNLSAVGLAPGSAGGLEGPDRIFVGGLPYYFTESQIRELLESFGPLRGFDLVKDRETGN 362
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIA 393
SKGY FCVYQD +VTDIACAALNG+KMGDKTLTVRRA + Q K EQE++L AQQ IA
Sbjct: 363 SKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQGTNQPKPEQENVLLHAQQQIA 422
Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+Q++ LQ KV+CLT+ +T D L DD+EYE+ILEDMR E GK+
Sbjct: 423 LQRLMLQPQPQQQ---------PVPTKVVCLTQVVTGDELKDDDEYEDILEDMRTEAGKF 473
Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
G LVNVVIPRP NG PGVGKVFLEY D G + A+ ++GRKF GN V A +YPE+K
Sbjct: 474 GLLVNVVIPRPRPNGENAPGVGKVFLEYADTEGSSKARAGMNGRKFDGNQVVAVFYPENK 533
Query: 514 YFNKDY 519
+ +Y
Sbjct: 534 FSQGEY 539
>gi|12323333|gb|AAG51641.1|AC018908_7 putative U2 snRNP auxiliary factor; 19096-22891 [Arabidopsis
thaliana]
Length = 568
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 284/412 (68%), Positives = 324/412 (78%), Gaps = 18/412 (4%)
Query: 113 GFDMAPP-AAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPF-GATQLGAFPLMPVQVMTQ 168
GFDMAPP A AA GQ+P VP+ +P M NM P QLGA P++PVQ MTQ
Sbjct: 168 GFDMAPPDMLAATAVAAAAGQVPSVPTTATIPGMFSNMFPMVPGQQLGALPVLPVQAMTQ 227
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGLPP ANEQ+++TFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVE
Sbjct: 228 QATRHARRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVE 287
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MR+VEEASNAMALDGII EGV V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL+SG+
Sbjct: 288 MRSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLSSGST 347
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GG EGPDR+FVGGLPYYFTE QI+ELLESFG L GF+LVKDR+TGNSKGY FCVYQDP+V
Sbjct: 348 GGLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAFCVYQDPSV 407
Query: 349 TDIACAALNGLKMGDKTLTVRRATASG-QSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
TDIACAALNG+KMGDKTLTVRRA Q K EQE +L AQQ IA+Q++ Q G
Sbjct: 408 TDIACAALNGIKMGDKTLTVRRAIQGAIQPKPEQEEVLLYAQQQIALQRLMFQPGG---- 463
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
T K++CLT+ +TAD L DDEEY EI+EDMR+E GK+G LVNVVIPRP+ +
Sbjct: 464 ---------TPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFGNLVNVVIPRPNPD 514
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
TPGVGKVFLEY D G + A++ ++GRKFGGN V A YYPEDKY DY
Sbjct: 515 HDPTPGVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDY 566
>gi|357455535|ref|XP_003598048.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355487096|gb|AES68299.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 629
Score = 568 bits (1463), Expect = e-159, Method: Compositional matrix adjust.
Identities = 281/417 (67%), Positives = 322/417 (77%), Gaps = 22/417 (5%)
Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMTQ 168
SGFDMAPP +A+L V GQ+ G A+P M NM P Q+ A P++PVQ MTQ
Sbjct: 224 SGFDMAPPTSAILGATGVAGQITGASPAIPGMFPNMFPLPTNQVQPFSALPVLPVQAMTQ 283
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGL P ANEQ++ATFFSQVM IGGN+AGPGDAVVNVYINH+KKFAFVE
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVE 343
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MR+VEEASNAMALDGIIFEG V+VRRPTDYNP+LAAALGP QP+PNLNL VGL+ G+
Sbjct: 344 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 403
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GG +GPDR+FVGG+PYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD AV
Sbjct: 404 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLAV 463
Query: 349 TDIACAALNGLKMGDKTLTVRRA---TASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
TDIACAALNG+KMGDKTLTVRRA T Q K EQESIL AQQ IA+QK+ LQ + +
Sbjct: 464 TDIACAALNGIKMGDKTLTVRRANQNTNPMQPKPEQESILMHAQQQIALQKLMLQPALVA 523
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG---TLVNVVIP 462
T KVLCLT A++ D L DDE+YEEIL+DMR+EC K+G LVNVVIP
Sbjct: 524 T-------------KVLCLTHAVSPDELKDDEDYEEILDDMRQECSKFGNICNLVNVVIP 570
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
RP +G PGVGKVFLEY D G A++ L+GRKFGGN V A +YPE+K+ DY
Sbjct: 571 RPRPDGELCPGVGKVFLEYADVDGSTKARSGLNGRKFGGNQVIAVFYPENKFAQGDY 627
>gi|357455537|ref|XP_003598049.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355487097|gb|AES68300.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 627
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 281/415 (67%), Positives = 321/415 (77%), Gaps = 20/415 (4%)
Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQQA 170
SGFDMAPP +A+L V GQ+ G A+P M NM P Q A P++PVQ MTQQA
Sbjct: 224 SGFDMAPPTSAILGATGVAGQITGASPAIPGMFPNMFPLPTNQPFSALPVLPVQAMTQQA 283
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGL P ANEQ++ATFFSQVM IGGN+AGPGDAVVNVYINH+KKFAFVEMR
Sbjct: 284 TRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVEMR 343
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+VEEASNAMALDGIIFEG V+VRRPTDYNP+LAAALGP QP+PNLNL VGL+ G+ GG
Sbjct: 344 SVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSAGG 403
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
+GPDR+FVGG+PYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD AVTD
Sbjct: 404 LDGPDRIFVGGVPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLAVTD 463
Query: 351 IACAALNGLKMGDKTLTVRRA---TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
IACAALNG+KMGDKTLTVRRA T Q K EQESIL AQQ IA+QK+ LQ + + T
Sbjct: 464 IACAALNGIKMGDKTLTVRRANQNTNPMQPKPEQESILMHAQQQIALQKLMLQPALVAT- 522
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG---TLVNVVIPRP 464
KVLCLT A++ D L DDE+YEEIL+DMR+EC K+G LVNVVIPRP
Sbjct: 523 ------------KVLCLTHAVSPDELKDDEDYEEILDDMRQECSKFGNICNLVNVVIPRP 570
Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G PGVGKVFLEY D G A++ L+GRKFGGN V A +YPE+K+ DY
Sbjct: 571 RPDGELCPGVGKVFLEYADVDGSTKARSGLNGRKFGGNQVIAVFYPENKFAQGDY 625
>gi|449496757|ref|XP_004160219.1| PREDICTED: splicing factor U2af large subunit B-like [Cucumis
sativus]
Length = 587
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 282/412 (68%), Positives = 321/412 (77%), Gaps = 17/412 (4%)
Query: 113 GFDMAPPAAAMLPGA-AVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQQA 170
GFDMAPP A+L GA A GQ+PG A+P M M P Q GA P+MPVQ MTQQA
Sbjct: 190 GFDMAPPTTAILSGATAAAGQIPGTTPAIPGMFPTMFPLATGQPFGALPVMPVQAMTQQA 249
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGLPP ANEQ++ATFFSQVM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 250 TRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 309
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+VEEASNAMALDGIIFEG V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL G+ GG
Sbjct: 310 SVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGG 369
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 370 LEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLSVTD 429
Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
IACAALNG+KMGDKTLTVRRA + Q K EQES+L AQQ IA+QK+ LQ ++T
Sbjct: 430 IACAALNGIKMGDKTLTVRRANQGANQPKPEQESVLLHAQQQIALQKLMLQPGAVST--- 486
Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
KVLCLT+ +T + L +DE+YE+I+EDMR E GK+GTLVNVVIPRP N
Sbjct: 487 ----------KVLCLTQVVTPEELINDEDYEDIMEDMRGEGGKFGTLVNVVIPRPRPNEA 536
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
PGVGKVFLEY D A+ L+GRKFGGN V A +YPE+K+ +Y A
Sbjct: 537 -APGVGKVFLEYADIDSATKARAGLNGRKFGGNQVMAVFYPENKFAQGEYDA 587
>gi|449441167|ref|XP_004138355.1| PREDICTED: splicing factor U2af large subunit B-like [Cucumis
sativus]
Length = 587
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 282/412 (68%), Positives = 321/412 (77%), Gaps = 17/412 (4%)
Query: 113 GFDMAPPAAAMLPGA-AVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQQA 170
GFDMAPP A+L GA A GQ+PG A+P M M P Q GA P+MPVQ MTQQA
Sbjct: 190 GFDMAPPTTAILSGATAAAGQIPGTTPAIPGMFPTMFPLATGQPFGALPVMPVQAMTQQA 249
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGLPP ANEQ++ATFFSQVM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 250 TRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 309
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+VEEASNAMALDGIIFEG V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL G+ GG
Sbjct: 310 SVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGG 369
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 370 LEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLSVTD 429
Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
IACAALNG+KMGDKTLTVRRA + Q K EQES+L AQQ IA+QK+ LQ ++T
Sbjct: 430 IACAALNGIKMGDKTLTVRRANQGANQPKPEQESVLLHAQQQIALQKLMLQPGAVST--- 486
Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
KVLCLT+ +T + L +DE+YE+I+EDMR E GK+GTLVNVVIPRP N
Sbjct: 487 ----------KVLCLTQVVTPEELINDEDYEDIMEDMRGEGGKFGTLVNVVIPRPRPNEA 536
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
PGVGKVFLEY D A+ L+GRKFGGN V A +YPE+K+ +Y A
Sbjct: 537 -APGVGKVFLEYADIDSATKARAGLNGRKFGGNQVMAVFYPENKFAQGEYDA 587
>gi|297798226|ref|XP_002866997.1| hypothetical protein ARALYDRAFT_490965 [Arabidopsis lyrata subsp.
lyrata]
gi|297312833|gb|EFH43256.1| hypothetical protein ARALYDRAFT_490965 [Arabidopsis lyrata subsp.
lyrata]
Length = 567
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 312/539 (57%), Positives = 358/539 (66%), Gaps = 65/539 (12%)
Query: 35 RHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKER---RHRHRSRSHSSDR 91
R H S DR R+K DRE + R R RD + KER + R R R H S R
Sbjct: 42 RDHERETSRSKDREREKGRDRDRERDSEVSRRSRDRDGEKGKERSREKDRDRERHHRSSR 101
Query: 92 FRNRSKSLSPSRS-------------------------------------------PSKS 108
R+ S+ S R PSKS
Sbjct: 102 HRDHSRDRSERRERGGRDDDDYRRSRDRDHDRRRDDRGGRRIRRSRSRSKDRSERSPSKS 161
Query: 109 -KRRSGFDMAPP-AAAMLPGAAVPGQLPGVPSAVP--EMAQNMLPFGATQ-LGAFPLMPV 163
KR SGFDMAPP +A + GAAV GQ+P P +P M NM P Q G +MP+
Sbjct: 162 NKRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPI 221
Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
Q MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKK
Sbjct: 222 QAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKK 281
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
FAFVEMR+VEEASNAM+LDGIIFEG V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL
Sbjct: 282 FAFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGL 341
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVY
Sbjct: 342 TPGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGALKGFDLVKDRETGNSKGYAFCVY 401
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
QD +VTDIACAALNG+KMGDKTLTVRRA + Q K EQES+L AQQ IA Q++ LQ
Sbjct: 402 QDLSVTDIACAALNGIKMGDKTLTVRRANQGTMQPKPEQESVLLHAQQQIAFQRIMLQPG 461
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
M T V+CLT+ +T D L DDEEYE+I+EDMR+E GK+G L NVVIP
Sbjct: 462 VMATT-------------VVCLTQVVTEDELRDDEEYEDIMEDMRQEGGKFGALTNVVIP 508
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
RP NG PG+GKVFL+Y D G A+ ++GRKFGGN V A YYPEDK+ DY A
Sbjct: 509 RPSPNGEPVPGLGKVFLKYADTDGSTRARTGMNGRKFGGNEVVAVYYPEDKFEQGDYGA 567
>gi|168021052|ref|XP_001763056.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685868|gb|EDQ72261.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 503
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 282/414 (68%), Positives = 330/414 (79%), Gaps = 14/414 (3%)
Query: 109 KRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMT 167
+++SGFDMAPP AA++ GAA+ GQ+PG+ +P + M PFG TQ G P MP Q MT
Sbjct: 99 RKQSGFDMAPPGAAVVSGAALAGQIPGIAQPMPGVYPGMFPFGGTQQFGGIPGMPAQAMT 158
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPPLANEQ IAT+FSQVM A+GGN+AGPGDAVVNVYIN EKKFAFV
Sbjct: 159 QQATRHARRVYVGGLPPLANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFV 218
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG- 286
EMRTVEEASNAMALDGI+FEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL G
Sbjct: 219 EMRTVEEASNAMALDGIMFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGN 278
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
A GGA+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP
Sbjct: 279 AAGGADGPDRIFVGGLPYYLTEIQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDP 338
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
+V DIACA LNG+KM DKTL VRRATA + K +Q ++LA AQQ IAIQ L S M+
Sbjct: 339 SVVDIACATLNGMKMDDKTLNVRRATARLARPKPDQANVLAHAQQQIAIQ--VLVYSWMS 396
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
+ ET V+ LT+ ++ D L DDEEY++ILEDM+EECGKYG LV +VIPRP
Sbjct: 397 PV--------ETPTNVVALTQVVSPDELKDDEEYQDILEDMKEECGKYGNLVKLVIPRP- 447
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
++G + PGVGKVF+EY D G A AK +L GR+FGG++V A YYP +K+ +DY
Sbjct: 448 RDGEDVPGVGKVFVEYSDTAGAAKAKASLHGRRFGGHSVVAVYYPAEKFSIEDY 501
>gi|168026451|ref|XP_001765745.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682922|gb|EDQ69336.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 491
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 289/408 (70%), Positives = 331/408 (81%), Gaps = 15/408 (3%)
Query: 113 GFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQQAT 171
GFDMAPP AA++ GAAVPGQLPG+ +P + M PFG TQ G P MP Q MTQQAT
Sbjct: 96 GFDMAPPGAAVIAGAAVPGQLPGMAQPMPGVFPGMFPFGGTQQFGGIPGMPAQAMTQQAT 155
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
RHARRVYVGGLPP+ANEQ IAT+FSQVM A+GGN+AGPGDAVVNVYIN EKKFAFVEMRT
Sbjct: 156 RHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFVEMRT 215
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
VEEASNAMALDGI+FEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL GA GGA
Sbjct: 216 VEEASNAMALDGIMFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAAGGA 275
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTD+
Sbjct: 276 DGPDRIFVGGLPYYLTEIQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTDV 335
Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGM 411
A AALNGLKMGDKTL+VRRA+ASGQ K +Q ++LA AQQ IAIQ M+ L
Sbjct: 336 AIAALNGLKMGDKTLSVRRASASGQPKPDQANVLAHAQQQIAIQVFW-----MSPL---- 386
Query: 412 SLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGET 471
ET KV+ LT+ ++ D L DDEEY++ILEDM+EECGKYG L+ VVIPRP ++G +
Sbjct: 387 ----ETSTKVVALTQVVSPDELKDDEEYQDILEDMKEECGKYGNLLRVVIPRP-RDGEDV 441
Query: 472 PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PGVGKVF+EY D G A AK +L GR+FGG++V A YYPE+K+ DY
Sbjct: 442 PGVGKVFVEYSDTAGAAKAKASLHGRRFGGHSVVAVYYPEEKFAAGDY 489
>gi|147840634|emb|CAN68321.1| hypothetical protein VITISV_032193 [Vitis vinifera]
Length = 565
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 283/413 (68%), Positives = 323/413 (78%), Gaps = 15/413 (3%)
Query: 109 KRRSGFDMAPPAAAMLPGAAVP-GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMT 167
KR SGFDMAPPA+AML GAA GQ+PG +P M NM P + Q GA P+MPVQ MT
Sbjct: 164 KRVSGFDMAPPASAMLAGAAAAAGQIPGTTPTIPGMFPNMFPLASGQFGALPVMPVQAMT 223
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGL P ANEQ++ATFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFV
Sbjct: 224 QQATRHARRVYVGGLSPTANEQSVATFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFV 283
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAMALDGIIFEG V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL G+
Sbjct: 284 EMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGS 343
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +
Sbjct: 344 AGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLS 403
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q K EQE++L AQQ IA+Q++ Q + T
Sbjct: 404 VTDIACAALNGIKMGDKTLTVRRANQGASQPKPEQENVLLHAQQQIALQRLMFQPGALAT 463
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
KV+CLT+ + AD L DDE YE+I+EDMR E GK+G LVNV IPRP
Sbjct: 464 -------------KVVCLTQVVNADELQDDEAYEDIVEDMRIEGGKFGNLVNVAIPRPKP 510
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
NG TPG+GKVFLEY D G A+ L+GRKF GN V A +YPE+K+ +Y
Sbjct: 511 NGEPTPGLGKVFLEYADIDGAXKARTGLNGRKFDGNQVVAVFYPENKFSQGEY 563
>gi|338762830|gb|AEI98617.1| hypothetical protein 111O18.4 [Coffea canephora]
Length = 570
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 287/443 (64%), Positives = 327/443 (73%), Gaps = 21/443 (4%)
Query: 78 RRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVP 137
+RHR RSR ++ KR SGFDMAPP ++ GA Q+ G
Sbjct: 146 QRHRSRSREGRAEHRSRSRSRSR------SKKRISGFDMAPPTNPLMTGATSLPQVTGAA 199
Query: 138 SAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQ 197
AVP + NM QLGA P+MPVQ MTQQATRHARRVYVGGLPP ANEQ++ATFFS
Sbjct: 200 PAVPGVFPNMFSLPTGQLGALPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSH 259
Query: 198 VMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPT 257
VM+AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDGIIFEG V+VRRP+
Sbjct: 260 VMSAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPS 319
Query: 258 DYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLES 317
DYNP+LAA LGP QP+PNLNLAAVGL G+ GG EGPDR+FVGGLPYYFTE QI+ELLES
Sbjct: 320 DYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEGQIRELLES 379
Query: 318 FGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQ 376
FG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDKTLTVRRA Q
Sbjct: 380 FGPLRGFDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQGVTQ 439
Query: 377 SKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADD 436
K EQES+L AQQ IA+QK+ LQ + T KVLCLT+ ++AD L DD
Sbjct: 440 PKPEQESVLLHAQQQIALQKLMLQPGTLAT-------------KVLCLTQVVSADELRDD 486
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
E+Y +ILEDMR ECGK+ TLVN+VIPRP G TPGVGKVFLEY D A+ L G
Sbjct: 487 EDYADILEDMRLECGKF-TLVNLVIPRPSPTGDPTPGVGKVFLEYADVESANKARQGLHG 545
Query: 497 RKFGGNTVNAFYYPEDKYFNKDY 519
R+FGGN V A +YPE+++ DY
Sbjct: 546 RRFGGNQVVAVFYPENRFSQGDY 568
>gi|359476715|ref|XP_002271463.2| PREDICTED: splicing factor U2af large subunit B-like [Vitis
vinifera]
Length = 568
Score = 559 bits (1441), Expect = e-156, Method: Compositional matrix adjust.
Identities = 283/416 (68%), Positives = 323/416 (77%), Gaps = 18/416 (4%)
Query: 109 KRRSGFDMAPPAAAMLPGAA----VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
KR SGFDMAPPA+AML GAA GQ+PG +P M NM P + Q GA P+MPVQ
Sbjct: 164 KRVSGFDMAPPASAMLAGAAAAADFTGQIPGTTPTIPGMFPNMFPLASGQFGALPVMPVQ 223
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM+AIGGN+AGPGDAVVNVYINHEKKF
Sbjct: 224 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMSAIGGNTAGPGDAVVNVYINHEKKF 283
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
AFVEMR+VEEASNAMALDGIIFEG V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL
Sbjct: 284 AFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLT 343
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
G+ GG EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 344 PGSAGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQ 403
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D +VTDIACAALNG+KMGDKTLTVRRA + Q K EQE++L AQQ IA+Q++ Q
Sbjct: 404 DLSVTDIACAALNGIKMGDKTLTVRRANQGASQPKPEQENVLLHAQQQIALQRLMFQPGA 463
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
+ T KV+CLT+ + AD L DDE YE+I+EDMR E GK+G LVNV IPR
Sbjct: 464 LAT-------------KVVCLTQVVNADELQDDEAYEDIVEDMRIEGGKFGNLVNVAIPR 510
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
P NG TPG+GKVFLEY D G A+ L+GRKF GN V A +YPE+K+ +Y
Sbjct: 511 PKPNGEPTPGLGKVFLEYADIDGATKARTGLNGRKFDGNQVVAVFYPENKFSQGEY 566
>gi|302816055|ref|XP_002989707.1| hypothetical protein SELMODRAFT_160385 [Selaginella moellendorffii]
gi|300142484|gb|EFJ09184.1| hypothetical protein SELMODRAFT_160385 [Selaginella moellendorffii]
Length = 421
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 291/418 (69%), Positives = 333/418 (79%), Gaps = 16/418 (3%)
Query: 116 MAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-----LGAFPLMPVQVMTQQA 170
MAPP AA++ G PGQLPG+ VP + +M PF TQ P MP Q MTQQA
Sbjct: 1 MAPPGAAVVTGT-TPGQLPGITQPVPGVF-SMFPFAGTQARLLFFAGLPTMPAQAMTQQA 58
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGLPPLANEQ IATFFSQVM+AIGGN+AGPGDAVVNVYIN EKKFAFVEMR
Sbjct: 59 TRHARRVYVGGLPPLANEQTIATFFSQVMSAIGGNTAGPGDAVVNVYINQEKKFAFVEMR 118
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
TVEEASNAMALDGIIFEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL GA GG
Sbjct: 119 TVEEASNAMALDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAAGG 178
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
A+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTD
Sbjct: 179 ADGPDRIFVGGLPYYLTEGQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTD 238
Query: 351 IACAALNGLKMGDKTLTVRRATA---SGQSKTEQESILAQAQQHIAIQKMALQTSG---- 403
+ACAALNGLKMGDKTLTVRRATA SGQ K +Q ++LAQAQQ IA+QK+ALQ +
Sbjct: 239 VACAALNGLKMGDKTLTVRRATASVHSGQPKPDQANVLAQAQQQIALQKLALQGAPYYNM 298
Query: 404 -MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
M + GM++ ET KV+CL + ++ D L +D+EYEEILEDMREECGKYG++ +V+P
Sbjct: 299 MMPGVDNGMTM-PETPTKVVCLKQVVSPDELKEDDEYEEILEDMREECGKYGSVATLVLP 357
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
RP +G E GVGKVF+EY AKN+L+GRKFGGN V A Y+PEDK+ +Y+
Sbjct: 358 RPKSDGEEVAGVGKVFVEYATIEEAIKAKNSLNGRKFGGNIVAAVYFPEDKFLQGEYN 415
>gi|168056046|ref|XP_001780033.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668531|gb|EDQ55136.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 536
Score = 552 bits (1423), Expect = e-154, Method: Compositional matrix adjust.
Identities = 277/416 (66%), Positives = 319/416 (76%), Gaps = 18/416 (4%)
Query: 110 RRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
+ SGFDMAPP ++PGAAVPGQ+ G+P +P + +M PFG Q G P MP Q MTQQ
Sbjct: 131 KTSGFDMAPPGGTIVPGAAVPGQISGMPPQMPGVFPSMFPFGGAQFGGLPGMPAQAMTQQ 190
Query: 170 -ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
ATRHARRVYVGGLPP+ANEQ IAT+FSQVM A+GGN+AGPGDAVVNVYIN EKKFAFVE
Sbjct: 191 QATRHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFVE 250
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MRTVEEASNAM+LDGIIFEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL GA
Sbjct: 251 MRTVEEASNAMSLDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAA 310
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GGA+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP+V
Sbjct: 311 GGADGPDRIFVGGLPYYLTEPQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPSV 370
Query: 349 -TDIACAALNGLKMGDKTLTVRRATAS---GQSKTEQESILAQAQQHIAIQKMALQTSGM 404
TD+A AALNGLKMGDKTL+VRRA+A GQ K +Q ++L AQQ IA+Q
Sbjct: 371 TTDVAIAALNGLKMGDKTLSVRRASARYGIGQPKPDQANVLIHAQQQIALQVTLKMLLHR 430
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
T + + + +T + L DDEEY+EILEDMR ECGKYG L+NVVIPRP
Sbjct: 431 KTFTAAWTFYAQV----------VTPNQLEDDEEYQEILEDMRMECGKYGNLLNVVIPRP 480
Query: 465 DQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
GET PG+GKVFL+Y D G AK +L GR+F N V A +YPEDK+ KD+
Sbjct: 481 --RAGETVPGLGKVFLDYSDTTGATKAKTSLHGRRFDENLVVAVFYPEDKFAAKDF 534
>gi|334187224|ref|NP_001190937.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
gi|332661290|gb|AEE86690.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
Length = 551
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 302/523 (57%), Positives = 349/523 (66%), Gaps = 49/523 (9%)
Query: 35 RHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHR----HRSRSHSSD 90
R H S DR R+K DRE + R R RD + KER HR R H S
Sbjct: 42 RDHERETSRSKDREREKGRDKDRERDSEVSRRSRDRDGEKSKERSRDKDRDHRERHHRSS 101
Query: 91 RFRNRSKSLSPSRSPSK---------------------------SKRRSGFDMAPPAAAM 123
R R+ S+ R +R SGFDMAPPA+AM
Sbjct: 102 RHRDHSRERGERRERGGGRRSRRSRSRSKDRSERRTRSRSPSKSKQRVSGFDMAPPASAM 161
Query: 124 LPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQVMTQQATRHARRVYV 179
L A V GQ+P P +P M NM P Q G +MP+Q MTQQATRHARRVYV
Sbjct: 162 LAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQAMTQQATRHARRVYV 221
Query: 180 GGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAM 239
GGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAM
Sbjct: 222 GGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAM 281
Query: 240 ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFV 299
+LDGIIFEG V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL GA GG EGPDR+FV
Sbjct: 282 SLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLTPGASGGLEGPDRIFV 341
Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
GGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+
Sbjct: 342 GGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGI 401
Query: 360 KMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
KMGDKTLTVRRA + K EQE++L AQQ IA Q++ LQ + T
Sbjct: 402 KMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGAVATT----------- 450
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
V+CLT+ +T D L DDEEY +I+EDMR+E GK+G L NVVIPRP NG G+GKVF
Sbjct: 451 --VVCLTQVVTEDELRDDEEYGDIMEDMRQEGGKFGALTNVVIPRPSPNGEPVAGLGKVF 508
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
L+Y D G A+ ++GRKFGGN V A YYPEDK+ DY A
Sbjct: 509 LKYADTDGSTRARFGMNGRKFGGNEVVAVYYPEDKFEQGDYGA 551
>gi|302820212|ref|XP_002991774.1| hypothetical protein SELMODRAFT_161898 [Selaginella moellendorffii]
gi|300140455|gb|EFJ07178.1| hypothetical protein SELMODRAFT_161898 [Selaginella moellendorffii]
Length = 420
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 287/416 (68%), Positives = 325/416 (78%), Gaps = 13/416 (3%)
Query: 116 MAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-----LGAFPLMPVQVMTQQA 170
MAPP AA++ G PGQLPG+ VP + +M PF TQ P MP Q MTQQA
Sbjct: 1 MAPPGAAVVTGT-TPGQLPGITQPVPGVF-SMFPFAGTQASLLFFAGLPTMPAQAMTQQA 58
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TRHARRVYVGGLPPLANEQ IATFFSQVM+AIGGN+AGPGDAVVNVYIN EKKFAFVEMR
Sbjct: 59 TRHARRVYVGGLPPLANEQTIATFFSQVMSAIGGNTAGPGDAVVNVYINQEKKFAFVEMR 118
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
TVEEASNAMALDGIIFEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL GA GG
Sbjct: 119 TVEEASNAMALDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAAGG 178
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
A+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTD
Sbjct: 179 ADGPDRIFVGGLPYYLTEGQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTD 238
Query: 351 IACAALNGLKMGDKTLTVRRATA---SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
+ACAALNGLKMGDKTLTVRRATA SGQ K +Q ++LAQAQQ IA+Q N +
Sbjct: 239 VACAALNGLKMGDKTLTVRRATASVHSGQPKPDQANVLAQAQQQIALQLALQGAPYYNMM 298
Query: 408 GGGMS---LFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
G+ ET KV+CL + ++ D L +D+EYEEILEDMREECGKYG++ +V+PRP
Sbjct: 299 MPGVDNGMTMPETPTKVVCLKQVVSPDELKEDDEYEEILEDMREECGKYGSVATLVLPRP 358
Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
NG E GVGKVF+EY AKN+L+GRKFGGN V A Y+PEDK+ +Y+
Sbjct: 359 KSNGEEVAGVGKVFVEYATIEEAIKAKNSLNGRKFGGNIVAAVYFPEDKFLQGEYN 414
>gi|297735185|emb|CBI17547.3| unnamed protein product [Vitis vinifera]
Length = 393
Score = 547 bits (1409), Expect = e-153, Method: Compositional matrix adjust.
Identities = 266/390 (68%), Positives = 305/390 (78%), Gaps = 14/390 (3%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+PG +P M NM P + Q GA P+MPVQ MTQQATRHARRVYVGGL P ANEQ+
Sbjct: 15 GQIPGTTPTIPGMFPNMFPLASGQFGALPVMPVQAMTQQATRHARRVYVGGLSPTANEQS 74
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ATFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDGIIFEG
Sbjct: 75 VATFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAP 134
Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL G+ GG EGPDR+FVGGLPYYFTE Q
Sbjct: 135 VKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQ 194
Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
I+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDKTLTVRR
Sbjct: 195 IRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRR 254
Query: 371 AT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAIT 429
A + Q K EQE++L AQQ IA+Q++ Q + T KV+CLT+ +
Sbjct: 255 ANQGASQPKPEQENVLLHAQQQIALQRLMFQPGALAT-------------KVVCLTQVVN 301
Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
AD L DDE YE+I+EDMR E GK+G LVNV IPRP NG TPG+GKVFLEY D G
Sbjct: 302 ADELQDDEAYEDIVEDMRIEGGKFGNLVNVAIPRPKPNGEPTPGLGKVFLEYADIDGATK 361
Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A+ L+GRKF GN V A +YPE+K+ +Y
Sbjct: 362 ARTGLNGRKFDGNQVVAVFYPENKFSQGEY 391
>gi|122245120|sp|Q2QKB4.1|U2A2B_WHEAT RecName: Full=Splicing factor U2af large subunit B; AltName:
Full=U2 auxiliary factor 65 kDa subunit B; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit B; Short=U2 snRNP auxiliary factor large
subunit B
gi|68036764|gb|AAY84880.1| U2AF large subunit [Triticum aestivum]
Length = 543
Score = 543 bits (1400), Expect = e-152, Method: Compositional matrix adjust.
Identities = 283/424 (66%), Positives = 325/424 (76%), Gaps = 20/424 (4%)
Query: 102 SRSPSKSKRRSGFDMAPPAAAMLP---GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
SRS SKSKR SGFD+ P A ++LP P QLPG S++P M NMLPF Q+
Sbjct: 134 SRSHSKSKRVSGFDLGPTAQSVLPQFPTIPTPSQLPG--SSIPGMFPNMLPFADGQINPL 191
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
+ P Q MTQQATRHARRVYVGGLPP ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYI
Sbjct: 192 VMQP-QAMTQQATRHARRVYVGGLPPSANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYI 250
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
NH+KKFAFVEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QPS NLNL
Sbjct: 251 NHDKKFAFVEMRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSLAAALGPSQPSSNLNL 310
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
AAVGL G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY
Sbjct: 311 AAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGY 370
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKM 397
FCVYQD VTDIACAALNG+KMGDKTLTVRRA S Q + EQE+IL QAQQ + +QK+
Sbjct: 371 AFCVYQDLNVTDIACAALNGIKMGDKTLTVRRANQGSAQPRPEQENILLQAQQQVQLQKL 430
Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
Q + T KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYG LV
Sbjct: 431 VYQVGALPT-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGNLV 477
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
VVIPRP +G GVGKVFLEY D G AK A+ GRKFGGN V A +YPE+K+ ++
Sbjct: 478 KVVIPRPHPSGEPVSGVGKVFLEYADVDGSTKAKTAMHGRKFGGNPVVAVFYPENKFADE 537
Query: 518 DYSA 521
DY A
Sbjct: 538 DYDA 541
>gi|224094725|ref|XP_002310209.1| predicted protein [Populus trichocarpa]
gi|222853112|gb|EEE90659.1| predicted protein [Populus trichocarpa]
Length = 394
Score = 543 bits (1399), Expect = e-152, Method: Compositional matrix adjust.
Identities = 265/387 (68%), Positives = 306/387 (79%), Gaps = 15/387 (3%)
Query: 135 GVPSAVPEMAQNMLPFGA-TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIAT 193
G +P M NM P G Q GA P+MPVQ MTQQATRHARRVYVGGLPP+ANEQ++AT
Sbjct: 19 GTTPPIPGMFPNMFPLGTGQQFGALPVMPVQAMTQQATRHARRVYVGGLPPIANEQSVAT 78
Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
FFSQVM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDGIIFEG V+V
Sbjct: 79 FFSQVMAAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKV 138
Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
RRP+DYNP+LAA LGP QP+PNLNLAAVGL G+ GG EGPDR+FVGGLPYYFTE QI+E
Sbjct: 139 RRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQIRE 198
Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT- 372
LLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDKTLTVRRA
Sbjct: 199 LLESFGALRGFDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQ 258
Query: 373 ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
+ Q K EQE++L AQQ IA+Q++ LQ + T KV+CLT+ +T D
Sbjct: 259 GTNQPKPEQENVLLHAQQQIALQRLMLQPPPVVT-------------KVVCLTQVVTVDE 305
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
L DD+EYE+ILED+R E GK+G LVNVVIPRP +G PGVGKVFLEY D G + A+
Sbjct: 306 LKDDDEYEDILEDIRMEAGKFGQLVNVVIPRPRPDGENAPGVGKVFLEYADTEGSSKARA 365
Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
++GRKFGGN V A ++PE+K+ +Y
Sbjct: 366 GMNGRKFGGNHVVAVFFPENKFSQGEY 392
>gi|357156009|ref|XP_003577312.1| PREDICTED: splicing factor U2af large subunit B-like [Brachypodium
distachyon]
Length = 576
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 273/413 (66%), Positives = 311/413 (75%), Gaps = 18/413 (4%)
Query: 113 GFDMAPPAAAML--PGAAVPGQLPGVPSAVPEMAQNMLPFGA-TQLGAFPLMPVQVMTQQ 169
GFD P L PGA PGQLP V +P M NM F A TQ + P Q MTQQ
Sbjct: 178 GFDQGPSQGVPLVTPGA-TPGQLPAVAPLIPGMLPNMFNFTAPTQFNPLAMQP-QAMTQQ 235
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
ATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVEM
Sbjct: 236 ATRHARRVYVGGLPPTANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEM 295
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QP+PNLNL AVGL G+ G
Sbjct: 296 RSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGAVGLTPGSAG 355
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD VT
Sbjct: 356 GLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVT 415
Query: 350 DIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
DIACAALNG+KMGDKTLTVRRA + Q + EQE+IL QA Q + +Q++ LQ +G
Sbjct: 416 DIACAALNGIKMGDKTLTVRRANQGASQPRPEQETILMQAHQQVQMQRLVLQ------VG 469
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
G + KV+CLT+ ++AD L DDEEYE+ILEDMREE KYG LV VIPRPD +G
Sbjct: 470 GALP------TKVVCLTQVVSADELRDDEEYEDILEDMREEGRKYGNLVKAVIPRPDPSG 523
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
PGVGKVFLEY D G AK + GRKFGGN V A +YPE+K+ DY A
Sbjct: 524 APVPGVGKVFLEYLDVDGSTKAKTGMHGRKFGGNQVVAVFYPENKFAEGDYDA 576
>gi|15234495|ref|NP_195387.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
gi|75318082|sp|O23212.2|U2A2A_ARATH RecName: Full=Splicing factor U2af large subunit A; AltName:
Full=U2 auxiliary factor 65 kDa subunit A; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit A; Short=U2 snRNP auxiliary factor large
subunit A
gi|18087531|gb|AAL58899.1|AF462805_1 At4g35590/C7A10_670 [Arabidopsis thaliana]
gi|4006898|emb|CAB16828.1| splicing factor-like protein [Arabidopsis thaliana]
gi|7270617|emb|CAB80335.1| splicing factor-like protein [Arabidopsis thaliana]
gi|23506119|gb|AAN28919.1| At4g35590/C7A10_670 [Arabidopsis thaliana]
gi|24030414|gb|AAN41365.1| putative splicing factor [Arabidopsis thaliana]
gi|332661287|gb|AEE86687.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
Length = 573
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 276/418 (66%), Positives = 318/418 (76%), Gaps = 18/418 (4%)
Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
+R SGFDMAPPA+AML A V GQ+P P +P M NM P Q G +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
AFVEMR+VEEASNAM+LDGIIFEG V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLT 348
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 349 PGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQ 408
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D +VTDIACAALNG+KMGDKTLTVRRA + K EQE++L AQQ IA Q++ LQ
Sbjct: 409 DLSVTDIACAALNGIKMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGA 468
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
+ T V+CLT+ +T D L DDEEY +I+EDMR+E GK+G L NVVIPR
Sbjct: 469 VATT-------------VVCLTQVVTEDELRDDEEYGDIMEDMRQEGGKFGALTNVVIPR 515
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
P NG G+GKVFL+Y D G A+ ++GRKFGGN V A YYPEDK+ DY A
Sbjct: 516 PSPNGEPVAGLGKVFLKYADTDGSTRARFGMNGRKFGGNEVVAVYYPEDKFEQGDYGA 573
>gi|156070782|gb|ABU45195.1| unknown [Petunia integrifolia subsp. inflata]
Length = 506
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 283/487 (58%), Positives = 339/487 (69%), Gaps = 60/487 (12%)
Query: 68 RHRDYNRDKERRHRHRSRSH-SSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPG 126
R R+ ++D E HRHR S +DR S+SPSKS+R SGFDMAPP +A+LPG
Sbjct: 46 RRRENDKDIEDPHRHRPGSRGKTDR----------SQSPSKSRRISGFDMAPPTSALLPG 95
Query: 127 AA-VPGQLPGVPSAVPEMAQNMLPF-----------------------------GATQLG 156
A GQ+PG ++P M NM P G Q G
Sbjct: 96 ATDAAGQVPGTNPSIPGMFSNMFPLASDQVLPQIPSYYTSNGLLIFSFLIHLVCGFFQCG 155
Query: 157 AFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
FP+MP+Q MTQQATRHARRVYVGGLP ANEQ++ATFFS VM AIGGN+AGPGDAV++V
Sbjct: 156 PFPVMPIQEMTQQATRHARRVYVGGLPSSANEQSVATFFSHVMYAIGGNTAGPGDAVIDV 215
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
YINHEKKFAFVEMR+VEEASNAMALDG+IFEG VRVRRP+DYN +LAA LGP QPSPNL
Sbjct: 216 YINHEKKFAFVEMRSVEEASNAMALDGVIFEGEPVRVRRPSDYNASLAATLGPSQPSPNL 275
Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
NLAAVGL G+ GG EGPD +F+GGLP YFTE QI+ELLESFG L GF+LVKDR++GNSK
Sbjct: 276 NLAAVGLTPGSSGGLEGPDCIFIGGLPDYFTEAQIRELLESFGPLRGFNLVKDRESGNSK 335
Query: 337 GYGFCVYQDPAVTDIACAALNGLK-MGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAI 394
G+ F VYQD +VT+IAC ALNG+K M DKTL VRRA + Q EQES+L Q I++
Sbjct: 336 GHAFFVYQDVSVTEIACGALNGIKIMHDKTLIVRRANQGTQQLNPEQESVL----QQISL 391
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
Q++ L + T KVLCLTEA+ D L DD++Y++ILEDMR ECGK+G
Sbjct: 392 QRLMLLPGALAT-------------KVLCLTEAVRLDELNDDDDYQDILEDMRTECGKFG 438
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
L+NV+IPRP+ NG TPGVGKVFLEY D + A+ L+GRKFGGN V A +YPE+K+
Sbjct: 439 ALLNVIIPRPNPNGEPTPGVGKVFLEYADVDSSSKAQQGLNGRKFGGNQVIAVFYPENKF 498
Query: 515 FNKDYSA 521
+Y A
Sbjct: 499 SEGNYEA 505
>gi|122232770|sp|Q2QZL4.2|U2A2B_ORYSJ RecName: Full=Splicing factor U2af large subunit B; AltName:
Full=U2 auxiliary factor 65 kDa subunit B; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit B; Short=U2 snRNP auxiliary factor large
subunit B
gi|108864649|gb|ABA95281.2| transposon protein, putative, CACTA, En/Spm sub-class, expressed
[Oryza sativa Japonica Group]
gi|222616418|gb|EEE52550.1| hypothetical protein OsJ_34796 [Oryza sativa Japonica Group]
Length = 548
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 276/414 (66%), Positives = 318/414 (76%), Gaps = 20/414 (4%)
Query: 112 SGFDMAPPAAAMLP---GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
SGFDMAPPA A++P P Q PG +A+P M NMLP G Q + P Q MTQ
Sbjct: 151 SGFDMAPPAQAVVPQFPAIPTPSQFPG--TAIPGMFPNMLPMGVGQFNPLVIQP-QAMTQ 207
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGLPP ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVE
Sbjct: 208 QATRHARRVYVGGLPPTANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 267
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL G+
Sbjct: 268 MRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTPGSA 327
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD V
Sbjct: 328 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNV 387
Query: 349 TDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
TDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q + T
Sbjct: 388 TDIACAALNGIKMGDKTLTVRRANQGAAQPRPEQESILLQAQQQVQLQKLVYQVGALPT- 446
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
KV+CLT+ ++AD L DDEEYE+I+EDMR E GKYG L+ VVIPRPD +
Sbjct: 447 ------------KVVCLTQVVSADELKDDEEYEDIMEDMRLEAGKYGNLIKVVIPRPDPS 494
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
G GVGKVFLEY D G AK A+ GRKFGGN V A +YPE+K+ + +Y A
Sbjct: 495 GLPVAGVGKVFLEYADVDGATKAKTAMHGRKFGGNPVVAVFYPENKFSSAEYDA 548
>gi|218191627|gb|EEC74054.1| hypothetical protein OsI_09051 [Oryza sativa Indica Group]
Length = 548
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 276/414 (66%), Positives = 318/414 (76%), Gaps = 20/414 (4%)
Query: 112 SGFDMAPPAAAMLP---GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
SGFDMAPPA A++P P Q PG +A+P M NMLP G Q + P Q MTQ
Sbjct: 151 SGFDMAPPAQAVVPQFPAIPTPSQFPG--TAIPGMFPNMLPMGVGQFNPLVIQP-QAMTQ 207
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGLPP ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVE
Sbjct: 208 QATRHARRVYVGGLPPTANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 267
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL G+
Sbjct: 268 MRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTPGSA 327
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD V
Sbjct: 328 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNV 387
Query: 349 TDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
TDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q + T
Sbjct: 388 TDIACAALNGIKMGDKTLTVRRANQGAAQPRPEQESILLQAQQQVQLQKLVYQVGALPT- 446
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
KV+CLT+ ++AD L DDEEYE+I+EDMR E GKYG L+ VVIPRPD +
Sbjct: 447 ------------KVVCLTQVVSADELKDDEEYEDIMEDMRLESGKYGNLIKVVIPRPDPS 494
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
G GVGKVFLEY D G AK A+ GRKFGGN V A +YPE+K+ + +Y A
Sbjct: 495 GLPVAGVGKVFLEYADVDGATKAKTAMHGRKFGGNPVVAVFYPENKFASAEYDA 548
>gi|357470349|ref|XP_003605459.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355506514|gb|AES87656.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 634
Score = 533 bits (1374), Expect = e-149, Method: Compositional matrix adjust.
Identities = 263/400 (65%), Positives = 305/400 (76%), Gaps = 28/400 (7%)
Query: 124 LPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMTQQATRHARRVYVG 180
+P A+ G LP NM P GA Q+ A P+MP+Q MTQQATRHARRVYVG
Sbjct: 257 VPNPAISGVLP-----------NMFPMGANQMPQFSALPMMPIQAMTQQATRHARRVYVG 305
Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMA 240
GLPP ANEQ++A FFSQVM IGGN+AGPGDAVVNVYINH+KKFAFVEMR+VEEASNAMA
Sbjct: 306 GLPPTANEQSVAIFFSQVMANIGGNTAGPGDAVVNVYINHDKKFAFVEMRSVEEASNAMA 365
Query: 241 LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVG 300
LDGIIFEG V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL G+ GG EGPDR+FVG
Sbjct: 366 LDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLTPGSAGGLEGPDRIFVG 425
Query: 301 GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLK 360
GLPYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVY D AVTDIACAALNG+K
Sbjct: 426 GLPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYADLAVTDIACAALNGIK 485
Query: 361 MGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
MGDKTLTVRRA + Q K EQESIL AQQ IA+QK+ Q + + T
Sbjct: 486 MGDKTLTVRRANQGTTQPKPEQESILMHAQQQIALQKLIFQPALVAT------------- 532
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
KV+CLT A+ + L +DE++EEI++DMR+EC K+G+LVNVVIPRP +G + GVGKVFL
Sbjct: 533 KVVCLTNAVAPEELKEDEDFEEIIDDMRQECSKFGSLVNVVIPRPQPDGDLSGGVGKVFL 592
Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
EY D G A+ L+GRKFGGN V A +Y E+K+ DY
Sbjct: 593 EYVDIEGATKARTGLNGRKFGGNEVIAVFYSENKFAQGDY 632
>gi|357438349|ref|XP_003589450.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355478498|gb|AES59701.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 611
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 267/414 (64%), Positives = 307/414 (74%), Gaps = 34/414 (8%)
Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMTQ 168
SGFDMAPP +A+L V GQ+ G A+P M NM P Q+ A P++PVQ MTQ
Sbjct: 224 SGFDMAPPTSAILGATGVAGQITGASPAIPGMFPNMFPLPTNQVQPFSALPVLPVQAMTQ 283
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGL P ANEQ++ATFFSQVM IGGN+AGPGDAV
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAV--------------- 328
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MR+VEEASNAMALDGIIFEG V+VRRPTDYNP+LAAALGP QP+PNLNL VGL+ G+
Sbjct: 329 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 388
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GG +GPDR+FVGG+PYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD AV
Sbjct: 389 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLAV 448
Query: 349 TDIACAALNGLKMGDKTLTVRRA---TASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
TDIACAALNG+KMGDKTLTVRRA T Q K EQESIL AQQ IA+QK+ LQ + +
Sbjct: 449 TDIACAALNGIKMGDKTLTVRRANQNTNPMQPKPEQESILMHAQQQIALQKLMLQPALVA 508
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
T KVLCLT A++ D L DDE+YEEIL+DMR+EC K+G LVNVVIPRP
Sbjct: 509 T-------------KVLCLTHAVSPDELKDDEDYEEILDDMRQECSKFGNLVNVVIPRPR 555
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G PGVGKVFLEY D G A++ L+GRKFGGN V A +YPE+K+ DY
Sbjct: 556 PDGELCPGVGKVFLEYADVDGSTKARSGLNGRKFGGNQVIAVFYPENKFAQGDY 609
>gi|357446501|ref|XP_003593528.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355482576|gb|AES63779.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 593
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 296/542 (54%), Positives = 358/542 (66%), Gaps = 79/542 (14%)
Query: 49 RDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKS 108
R K Y+R+ RD+DR H DY+RD++ R+R+ + S S R S+
Sbjct: 58 RGKYDSYNRQRGRDYDR---HNDYDRDRDTRNRYGAHSKRSRRESRSRSRSRSPSQ-SEG 113
Query: 109 KRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
KR SGFDMAPPA + P V GQ+PG+ + QN P+G +Q+GA LM VQ MTQ
Sbjct: 114 KRTSGFDMAPPATGVTP--TVSGQMPGIAHMIQGATQNFSPYGISQIGALSLMQVQPMTQ 171
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGLPP ANEQ+IA+FFSQVM AIGGNSAG GD+VVNVYINHEKKFAFVE
Sbjct: 172 QATRHARRVYVGGLPPFANEQSIASFFSQVMIAIGGNSAGSGDSVVNVYINHEKKFAFVE 231
Query: 229 MRTVEEASNAMALDGIIFEGVAV---------RV-------RRPTD-------------- 258
MRTVEEASNAMALDGI+FEG+ V R+ RRP D
Sbjct: 232 MRTVEEASNAMALDGIVFEGIGVAPIVKMVENRLRWFGHVERRPIDSVARRVDQMEDSQM 291
Query: 259 ------------------YNPTL-------AAALGPGQPSP-NLNLAAV--------GLA 284
Y+ TL A+ +P+ N +LAAV L
Sbjct: 292 DKTIRKDLEINKLDRNMVYDRTLWRNLIHVGVAVRVRRPTDYNPSLAAVLGPCQPSANLN 351
Query: 285 SGAIGGAEGP-------DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
A+G + G DR+FVGGLPYYFTE Q++ELL++FG L FD+V+D++TGNSKG
Sbjct: 352 LSAVGLSAGTIGGAEGLDRIFVGGLPYYFTEVQMRELLQAFGPLRSFDIVRDKETGNSKG 411
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
YGFC+YQDPAVTDIACAALNGLKMGDKTLTVRRAT S SK E+++I A+AQQHIA+QK+
Sbjct: 412 YGFCIYQDPAVTDIACAALNGLKMGDKTLTVRRATVSAHSKPEEDNIFARAQQHIAMQKI 471
Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
AL+ G+N G+ E+ KVLCLTEA+T + L D+ EYEEILEDMR+EC K+GTLV
Sbjct: 472 ALEVVGLNI--PGVPTNDESPTKVLCLTEAVTTEQLTDNGEYEEILEDMRDECRKFGTLV 529
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
NVVIPRP+ NG + G+GKVFLEY D C AKNAL+GRKFGG+ V AFYYPE+KY +
Sbjct: 530 NVVIPRPNPNGELSTGIGKVFLEYSDCTACLAAKNALNGRKFGGSIVTAFYYPEEKYHSM 589
Query: 518 DY 519
DY
Sbjct: 590 DY 591
>gi|168030966|ref|XP_001767993.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680835|gb|EDQ67268.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 499
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 263/414 (63%), Positives = 312/414 (75%), Gaps = 31/414 (7%)
Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGV--PSAVPEMAQNMLPF-GATQLGAFPLMPVQVMTQ 168
SGFDMAPP +LP +A+ GQ+ G+ PS + PF G TQ+G FPL +
Sbjct: 109 SGFDMAPPGVTVLPASALSGQIAGMGFPS--------IFPFAGGTQVGPFPLH-FHAIGL 159
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
TRHARRVYVGGLPP+ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYIN EK+FAFVE
Sbjct: 160 SFTRHARRVYVGGLPPMANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINQEKRFAFVE 219
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MRTVEEASNAMALDGI++EGV+VRVRRP+DYNP++AA LGP QPS +LNL AVGL GA+
Sbjct: 220 MRTVEEASNAMALDGIVYEGVSVRVRRPSDYNPSMAATLGPSQPSSHLNLTAVGLTPGAL 279
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GGA+GPDR+FVGGLPYY +E QI +LL SFG L FDLVKDRDTGNSKGYGFCVYQDP+V
Sbjct: 280 GGADGPDRIFVGGLPYYLSEEQIMDLLSSFGHLRAFDLVKDRDTGNSKGYGFCVYQDPSV 339
Query: 349 TDIACAALNGLKMGDKTLTVRRATAS---GQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
DIACAALNGLKMGD+TLTVRRA+A GQ K +Q +I+ QAQQ IA+Q A
Sbjct: 340 MDIACAALNGLKMGDRTLTVRRASARLRFGQPKPDQSNIIVQAQQQIALQVAA------- 392
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
ET KV+CL++ ++ L DD E++EI+EDM+EECGKYG+L+NVVIPRP
Sbjct: 393 ---------PETATKVICLSQVVSIVDLKDDVEFDEIVEDMKEECGKYGSLLNVVIPRPS 443
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ + PG+G VF+EY D G A AK AL RKFGG V A YY EDK+ N DY
Sbjct: 444 YDEEDVPGIGMVFVEYSDLEGAAKAKQALHNRKFGGKLVIASYYSEDKFLNGDY 497
>gi|242069419|ref|XP_002449986.1| hypothetical protein SORBIDRAFT_05g026740 [Sorghum bicolor]
gi|241935829|gb|EES08974.1| hypothetical protein SORBIDRAFT_05g026740 [Sorghum bicolor]
Length = 545
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 278/414 (67%), Positives = 315/414 (76%), Gaps = 23/414 (5%)
Query: 113 GFDMAPPAA--AMLPGAAVPGQLPGVPSAVPE---MAQNMLPFG-ATQLGAFPLMPVQVM 166
GFD PP A + P P QLPG +++P M NMLPFG A Q + P Q M
Sbjct: 146 GFDAPPPQAMGSPFPVIPTPSQLPG--TSLPNIGGMFPNMLPFGVAGQFNPLVIQP-QAM 202
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
TQQATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAF
Sbjct: 203 TQQATRHARRVYVGGLPPSANEQTVAVYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAF 262
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
VEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL +G
Sbjct: 263 VEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAG 322
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 323 STGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDL 382
Query: 347 AVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q +
Sbjct: 383 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGALP 442
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
T KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYGTLV V+IPRPD
Sbjct: 443 T-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGTLVKVIIPRPD 489
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G GVGKVFLEY D G A AK AL GRKFGGN V A Y EDK+ N +Y
Sbjct: 490 PSGQPVAGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDKFANGEY 543
>gi|223950169|gb|ACN29168.1| unknown [Zea mays]
gi|413920349|gb|AFW60281.1| splicing factor U2AF subunit [Zea mays]
Length = 594
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 270/413 (65%), Positives = 312/413 (75%), Gaps = 18/413 (4%)
Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD APP A ++ A+PGQLPG+ + +P + N+ A Q + P Q MT
Sbjct: 194 SGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQP-QAMT 252
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 253 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 312
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL G+
Sbjct: 313 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 372
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 373 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 432
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q
Sbjct: 433 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKLVYQ------ 486
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+GG + KV+CLT+ +TAD L DDEEY +I+EDMREE KYG LV VVIPRPD
Sbjct: 487 VGGALP------TKVVCLTQVVTADELRDDEEYNDIVEDMREEGRKYGNLVKVVIPRPDP 540
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ GVGKVFLEY D G AK + GRKFGGN V A +YPEDK+ + Y
Sbjct: 541 SDAPVAGVGKVFLEYADVEGSTKAKTGMHGRKFGGNQVVAVFYPEDKFAAEQY 593
>gi|212723502|ref|NP_001131562.1| uncharacterized protein LOC100192903 [Zea mays]
gi|194691860|gb|ACF80014.1| unknown [Zea mays]
gi|195646366|gb|ACG42651.1| splicing factor U2AF 65 kDa subunit [Zea mays]
gi|413920213|gb|AFW60145.1| Splicing factor U2AF subunit [Zea mays]
Length = 539
Score = 520 bits (1338), Expect = e-145, Method: Compositional matrix adjust.
Identities = 278/414 (67%), Positives = 314/414 (75%), Gaps = 23/414 (5%)
Query: 113 GFDMAPPAA--AMLPGAAVPGQLPGVPSAVPE---MAQNMLPFG-ATQLGAFPLMPVQVM 166
GFD PP A + P P QLPG S++P M NMLPFG A Q + P Q M
Sbjct: 140 GFDAPPPQAMGSTFPVIPTPSQLPG--SSLPNIGGMFPNMLPFGVAGQFNPLVIQP-QAM 196
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
TQQATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAF
Sbjct: 197 TQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAF 256
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
VEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL +G
Sbjct: 257 VEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAG 316
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 317 SNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDL 376
Query: 347 AVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q +
Sbjct: 377 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGALP 436
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
T KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYG LV V+IPRPD
Sbjct: 437 T-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGNLVKVIIPRPD 483
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G GVGKVFLEY D G A AK AL GRKFGGN V A Y EDK+ N +Y
Sbjct: 484 PSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDKFANGEY 537
>gi|226497766|ref|NP_001152419.1| LOC100286059 [Zea mays]
gi|195656099|gb|ACG47517.1| splicing factor U2AF 65 kDa subunit [Zea mays]
Length = 596
Score = 519 bits (1336), Expect = e-144, Method: Compositional matrix adjust.
Identities = 269/413 (65%), Positives = 311/413 (75%), Gaps = 18/413 (4%)
Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD APP A ++ A+PGQLPG+ + +P + N+ A Q + P Q MT
Sbjct: 196 SGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQP-QAMT 254
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 255 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 314
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL G+
Sbjct: 315 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 374
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 375 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 434
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTV RA + Q + EQESIL QAQQ + +QK+ Q
Sbjct: 435 VTDIACAALNGIKMGDKTLTVSRANQGASQPRPEQESILLQAQQQVQMQKLVYQ------ 488
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+GG + KV+CLT+ +TAD L DDEEY +I+EDMREE KYG LV VVIPRPD
Sbjct: 489 VGGALP------TKVVCLTQVVTADELRDDEEYNDIVEDMREEGRKYGNLVKVVIPRPDP 542
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ GVGKVFLEY D G AK + GRKFGGN V A +YPEDK+ + Y
Sbjct: 543 SDAPVAGVGKVFLEYADVEGSTKAKTGMHGRKFGGNQVVAVFYPEDKFAAEQY 595
>gi|115486631|ref|NP_001068459.1| Os11g0682300 [Oryza sativa Japonica Group]
gi|113645681|dbj|BAF28822.1| Os11g0682300, partial [Oryza sativa Japonica Group]
Length = 378
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 265/394 (67%), Positives = 306/394 (77%), Gaps = 17/394 (4%)
Query: 129 VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANE 188
+ GQ PG +A+P M NMLP G Q + P Q MTQQATRHARRVYVGGLPP ANE
Sbjct: 1 IAGQFPG--TAIPGMFPNMLPMGVGQFNPLVIQP-QAMTQQATRHARRVYVGGLPPTANE 57
Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG 248
Q++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG
Sbjct: 58 QSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGILFEG 117
Query: 249 VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTE 308
V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL G+ GG EGPDR+FVGGLPYYFTE
Sbjct: 118 APVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTE 177
Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD VTDIACAALNG+KMGDKTLTV
Sbjct: 178 AQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTV 237
Query: 369 RRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
RRA + Q + EQESIL QAQQ + +QK+ Q + T KV+CLT+
Sbjct: 238 RRANQGAAQPRPEQESILLQAQQQVQLQKLVYQVGALPT-------------KVVCLTQV 284
Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
++AD L DDEEYE+I+EDMR E GKYG L+ VVIPRPD +G GVGKVFLEY D G
Sbjct: 285 VSADELKDDEEYEDIMEDMRLEAGKYGNLIKVVIPRPDPSGLPVAGVGKVFLEYADVDGA 344
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
AK A+ GRKFGGN V A +YPE+K+ + +Y A
Sbjct: 345 TKAKTAMHGRKFGGNPVVAVFYPENKFSSAEYDA 378
>gi|414591747|tpg|DAA42318.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
Length = 590
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 264/413 (63%), Positives = 307/413 (74%), Gaps = 18/413 (4%)
Query: 112 SGFDMAPPAAAM--LPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD AP A+ + +PGQLPGV + +P + N+ A Q + P Q MT
Sbjct: 190 SGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPLAIQP-QAMT 248
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 249 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 308
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAM LDGI+FEG V++RRPTDYNP+LAAALGP QP+PNLNL+AVGL G+
Sbjct: 309 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 368
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGL YYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 369 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 428
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK Q
Sbjct: 429 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKFVYQ------ 482
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+GG + KV+CLT+ +T D L DDEEY++I+EDMREE KYG LV V IPRPD
Sbjct: 483 VGGALP------TKVVCLTQVVTEDELRDDEEYDDIVEDMREEGHKYGNLVKVAIPRPDP 536
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G GVGKVFLEY D G AK + GRKFGGN V A +YPEDK+ + Y
Sbjct: 537 SGAPVAGVGKVFLEYADVEGSTKAKTGMHGRKFGGNQVVAVFYPEDKFAAEQY 589
>gi|193848546|gb|ACF22733.1| U2AF large subunit [Brachypodium distachyon]
Length = 569
Score = 510 bits (1314), Expect = e-142, Method: Compositional matrix adjust.
Identities = 265/397 (66%), Positives = 307/397 (77%), Gaps = 22/397 (5%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ-----ATRHARRVYVGGLPPL 185
GQLPG S++P M NMLPF Q + P Q MTQQ ATRHARRVYVGGLPP
Sbjct: 189 GQLPG--SSIPGMFPNMLPFAVGQFNPLVMQP-QAMTQQHIFPQATRHARRVYVGGLPPT 245
Query: 186 ANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGII 245
ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+
Sbjct: 246 ANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIL 305
Query: 246 FEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYY 305
FEG V+VRRPTDYNP+LA+ALGP QPS NLNLAAVGL G+ GG EGPDR+FVGGLPYY
Sbjct: 306 FEGAPVKVRRPTDYNPSLASALGPSQPSSNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYY 365
Query: 306 FTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKT 365
FTE Q++ELLESFG+L GFDLVKDR+TGNSKGY FCVYQD VTDIACAALNG+KMGDKT
Sbjct: 366 FTEAQVRELLESFGSLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKT 425
Query: 366 LTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCL 424
LTVRRA S Q + EQE+IL QAQQ + +QK+ Q + T KV+CL
Sbjct: 426 LTVRRANQGSAQPRPEQENILLQAQQQVQLQKLVYQVGALPT-------------KVICL 472
Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
T+ +TAD L DDEEYE+I+EDMR E GKYGTLV VVIPRP +G GVGKVFLEY D
Sbjct: 473 TQVVTADELKDDEEYEDIMEDMRLEAGKYGTLVKVVIPRPHPSGEPVAGVGKVFLEYADV 532
Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
G AK A+ GRKFGGN V A +YPE+K+ ++++ A
Sbjct: 533 DGSTKAKTAMHGRKFGGNPVVAVFYPENKFSDEEFDA 569
>gi|108864648|gb|ABG22574.1| transposon protein, putative, CACTA, En/Spm sub-class, expressed
[Oryza sativa Japonica Group]
Length = 366
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 259/380 (68%), Positives = 297/380 (78%), Gaps = 15/380 (3%)
Query: 143 MAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAI 202
M NMLP G Q + P Q MTQQATRHARRVYVGGLPP ANEQ++A +F+QVM AI
Sbjct: 1 MFPNMLPMGVGQFNPLVIQP-QAMTQQATRHARRVYVGGLPPTANEQSVAIYFNQVMAAI 59
Query: 203 GGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT 262
GGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+
Sbjct: 60 GGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGILFEGAPVKVRRPTDYNPS 119
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
LAAALGP QPSPNLNLAAVGL G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L
Sbjct: 120 LAAALGPSQPSPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLR 179
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQ 381
GFDLVKDR+TGNSKGY FCVYQD VTDIACAALNG+KMGDKTLTVRRA + Q + EQ
Sbjct: 180 GFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRANQGAAQPRPEQ 239
Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEE 441
ESIL QAQQ + +QK+ Q + T KV+CLT+ ++AD L DDEEYE+
Sbjct: 240 ESILLQAQQQVQLQKLVYQVGALPT-------------KVVCLTQVVSADELKDDEEYED 286
Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
I+EDMR E GKYG L+ VVIPRPD +G GVGKVFLEY D G AK A+ GRKFGG
Sbjct: 287 IMEDMRLEAGKYGNLIKVVIPRPDPSGLPVAGVGKVFLEYADVDGATKAKTAMHGRKFGG 346
Query: 502 NTVNAFYYPEDKYFNKDYSA 521
N V A +YPE+K+ + +Y A
Sbjct: 347 NPVVAVFYPENKFSSAEYDA 366
>gi|9858779|gb|AAG01126.1|AF273333_11 BAC19.11 [Solanum lycopersicum]
Length = 532
Score = 506 bits (1304), Expect = e-141, Method: Compositional matrix adjust.
Identities = 260/415 (62%), Positives = 300/415 (72%), Gaps = 46/415 (11%)
Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPS-AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
SGFDMAPP +A+L GA V GQ+PG + ++P M NM P A Q
Sbjct: 159 SGFDMAPPTSALLSGATDVAGQVPGTTNPSIPGMFSNMFPLAAGQ--------------- 203
Query: 170 ATRHARRVYVGGLPPLANEQAIATF--FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
ATRHARRVYVGGLPP ANEQ + FS GDAVVNVYINHEKKFAFV
Sbjct: 204 ATRHARRVYVGGLPPTANEQVLKILLKFS-------------GDAVVNVYINHEKKFAFV 250
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAMALDG+IFEG V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL G+
Sbjct: 251 EMRSVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGS 310
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +
Sbjct: 311 SGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVS 370
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q EQES+L AQQ IA+Q+ LQ + T
Sbjct: 371 VTDIACAALNGIKMGDKTLTVRRANQGTTQPNPEQESVLLHAQQQIALQRFMLQPGALAT 430
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
KVLCLTE ++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+
Sbjct: 431 -------------KVLCLTEVVSVDELKDDDDYQDILEDMRIECGKFGALLNVVIPRPNP 477
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
NG TPG+GKVFLEY D + A+ L+GRKFGGN V A +YPE+K+ DY A
Sbjct: 478 NGEPTPGLGKVFLEYADVDSSSKARQGLNGRKFGGNQVIAVFYPENKFSEGDYEA 532
>gi|413920211|gb|AFW60143.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
Length = 367
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 264/379 (69%), Positives = 297/379 (78%), Gaps = 16/379 (4%)
Query: 143 MAQNMLPFG-ATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTA 201
M NMLPFG A Q + P Q MTQQATRHARRVYVGGLPP ANEQ +A +F+QVM A
Sbjct: 1 MFPNMLPFGVAGQFNPLVIQP-QAMTQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAA 59
Query: 202 IGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP 261
IGGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP
Sbjct: 60 IGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNP 119
Query: 262 TLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTL 321
+LAAALGP QPSPNLNLAAVGL +G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L
Sbjct: 120 SLAAALGPSQPSPNLNLAAVGLTAGSNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPL 179
Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTE 380
GFDLVKDR+TGNSKGY FCVYQD VTDIACAALNG+KMGDKTLTVRRA + Q + E
Sbjct: 180 RGFDLVKDRETGNSKGYAFCVYQDLTVTDIACAALNGIKMGDKTLTVRRANQGASQPRPE 239
Query: 381 QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
QESIL QAQQ + +QK+ Q + T KV+CLT+ +TAD L DDEEYE
Sbjct: 240 QESILLQAQQQVQLQKLVYQVGALPT-------------KVVCLTQVVTADELKDDEEYE 286
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
+I+EDMR E GKYG LV V+IPRPD +G GVGKVFLEY D G A AK AL GRKFG
Sbjct: 287 DIMEDMRLEAGKYGNLVKVIIPRPDPSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFG 346
Query: 501 GNTVNAFYYPEDKYFNKDY 519
GN V A Y EDK+ N +Y
Sbjct: 347 GNPVVAVCYAEDKFANGEY 365
>gi|226532558|ref|NP_001140768.1| uncharacterized protein LOC100272843 [Zea mays]
gi|194701008|gb|ACF84588.1| unknown [Zea mays]
gi|414591744|tpg|DAA42315.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
Length = 583
Score = 505 bits (1300), Expect = e-140, Method: Compositional matrix adjust.
Identities = 263/413 (63%), Positives = 305/413 (73%), Gaps = 25/413 (6%)
Query: 112 SGFDMAPPAAAM--LPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD AP A+ + +PGQLPGV + +P + N+ A Q Q MT
Sbjct: 190 SGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQ--------PQAMT 241
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 242 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 301
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAM LDGI+FEG V++RRPTDYNP+LAAALGP QP+PNLNL+AVGL G+
Sbjct: 302 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 361
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGL YYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 362 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 421
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK Q
Sbjct: 422 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKFVYQ------ 475
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+GG + KV+CLT+ +T D L DDEEY++I+EDMREE KYG LV V IPRPD
Sbjct: 476 VGGALP------TKVVCLTQVVTEDELRDDEEYDDIVEDMREEGHKYGNLVKVAIPRPDP 529
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G GVGKVFLEY D G AK + GRKFGGN V A +YPEDK+ + Y
Sbjct: 530 SGAPVAGVGKVFLEYADVEGSTKAKTGMHGRKFGGNQVVAVFYPEDKFAAEQY 582
>gi|218186084|gb|EEC68511.1| hypothetical protein OsI_36782 [Oryza sativa Indica Group]
Length = 574
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 249/389 (64%), Positives = 288/389 (74%), Gaps = 13/389 (3%)
Query: 132 QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
Q+P V A+ M NM T + P Q MTQQATRHARRVYVGGLPP ANE +
Sbjct: 196 QVPVVAPAISGMLPNMFNLTQTPFTPLVIQP-QAMTQQATRHARRVYVGGLPPTANEHTV 254
Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAV 251
A +F+QVM A+GGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG V
Sbjct: 255 AVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPV 314
Query: 252 RVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI 311
+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL G+ GG EGPDR+FVGGLPYYFTE Q+
Sbjct: 315 KVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQV 374
Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD VTDIACAALNG+KMGDKTLTVRRA
Sbjct: 375 RELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRA 434
Query: 372 T-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
+ Q + EQES+L QQ +QK+ Q G G KV+CLT+ I+
Sbjct: 435 NQGASQPRPEQESMLLHVQQQAQMQKLMFQVGG-----------GALPTKVVCLTQVISP 483
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
D L DDEEYE+I++DMREE +YG LV VVIPRPD +G GVG+VFLE+ D A
Sbjct: 484 DELRDDEEYEDIVQDMREEGCRYGNLVKVVIPRPDPSGAPVAGVGRVFLEFADIESSTKA 543
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
KN + GRKF N V A +YPEDK+ Y
Sbjct: 544 KNGMHGRKFANNQVVAVFYPEDKFAEGQY 572
>gi|115486373|ref|NP_001068330.1| Os11g0636900 [Oryza sativa Japonica Group]
gi|122248736|sp|Q2R0Q1.2|U2A2A_ORYSJ RecName: Full=Splicing factor U2af large subunit A; AltName:
Full=U2 auxiliary factor 65 kDa subunit A; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit A; Short=U2 snRNP auxiliary factor large
subunit A
gi|108864607|gb|ABA94914.2| U2 snRNP auxilliary factor, large subunit, splicing factor family
protein, expressed [Oryza sativa Japonica Group]
gi|113645552|dbj|BAF28693.1| Os11g0636900 [Oryza sativa Japonica Group]
gi|222616290|gb|EEE52422.1| hypothetical protein OsJ_34542 [Oryza sativa Japonica Group]
Length = 574
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 248/389 (63%), Positives = 288/389 (74%), Gaps = 13/389 (3%)
Query: 132 QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
Q+P V A+ M NM T + P Q MTQQATRHARRVYVGGLPP ANE +
Sbjct: 196 QVPVVAPAISGMLPNMFNLTQTPFTPLVIQP-QAMTQQATRHARRVYVGGLPPTANEHTV 254
Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAV 251
A +F+QVM A+GGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG V
Sbjct: 255 AVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPV 314
Query: 252 RVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI 311
+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL G+ GG EGPDR+FVGGLPYYFTE Q+
Sbjct: 315 KVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQV 374
Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD VTDIACAALNG+KMGDKTLTVRRA
Sbjct: 375 RELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRA 434
Query: 372 T-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
+ Q + EQES+L QQ +QK+ Q G G KV+CLT+ ++
Sbjct: 435 NQGASQPRPEQESMLLHVQQQAQMQKLMFQVGG-----------GALPTKVVCLTQVVSP 483
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
D L DDEEYE+I++DMREE +YG LV VVIPRPD +G GVG+VFLE+ D A
Sbjct: 484 DELRDDEEYEDIVQDMREEGCRYGNLVKVVIPRPDPSGAPVAGVGRVFLEFADVESSTKA 543
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
KN + GRKF N V A +YPEDK+ Y
Sbjct: 544 KNGMHGRKFANNQVVAVFYPEDKFAEGQY 572
>gi|296088196|emb|CBI35712.3| unnamed protein product [Vitis vinifera]
Length = 412
Score = 500 bits (1287), Expect = e-139, Method: Compositional matrix adjust.
Identities = 248/301 (82%), Positives = 263/301 (87%), Gaps = 4/301 (1%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
G+LPGVP VP M QNM PFGATQLGA PLMPVQ MTQQATRHARRVYVGGLPPLANEQ
Sbjct: 105 GELPGVPQMVPGMIQNMFPFGATQLGALPLMPVQAMTQQATRHARRVYVGGLPPLANEQT 164
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDGI+FE
Sbjct: 165 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIMFEACL 224
Query: 251 VRV-RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
+ PTDYNP+LAAALGP QPSP+LNLAAVGL G IGGAEGPDR+FVGGLPYYFTE
Sbjct: 225 TLIFSLPTDYNPSLAAALGPSQPSPHLNLAAVGLMPGVIGGAEGPDRIFVGGLPYYFTEE 284
Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR
Sbjct: 285 QIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 344
Query: 370 RATA-SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL--FGETLAKVLCLTE 426
RAT SGQ+K+EQ++ILAQAQQHIAIQK+ALQ G+N G GM+ ET KVLCLTE
Sbjct: 345 RATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLNLPGAGMAFTAIAETPTKVLCLTE 404
Query: 427 A 427
Sbjct: 405 V 405
>gi|413920210|gb|AFW60142.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
Length = 364
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 254/357 (71%), Positives = 286/357 (80%), Gaps = 14/357 (3%)
Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
Q MTQQATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KK
Sbjct: 19 QAMTQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKK 78
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
FAFVEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL
Sbjct: 79 FAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGL 138
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
+G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVY
Sbjct: 139 TAGSNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVY 198
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
QD VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q
Sbjct: 199 QDLTVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVG 258
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
+ T KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYG LV V+IP
Sbjct: 259 ALPT-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGNLVKVIIP 305
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
RPD +G GVGKVFLEY D G A AK AL GRKFGGN V A Y EDK+ N +Y
Sbjct: 306 RPDPSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDKFANGEY 362
>gi|357160098|ref|XP_003578657.1| PREDICTED: splicing factor U2af large subunit B-like [Brachypodium
distachyon]
Length = 534
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 269/480 (56%), Positives = 327/480 (68%), Gaps = 49/480 (10%)
Query: 69 HRDYNRDKERRHR--------------------HRSRSHSSDRFRNRSKS-------LSP 101
H D NRD++R H+ RSR+H S+R R R + S
Sbjct: 73 HGDRNRDRDRHHQEHRERSERREHRGRSDDHDYRRSRNHESER-RERDRDGHRRQRSRSR 131
Query: 102 SRSPSKSKRRSGFDMAPPAAAMLPGAAV-PGQLPGVPSAVPEMAQNMLPFGATQLGAFPL 160
SRS ++SKR SGFD P + +V PG LP VP+A+P M NM + P
Sbjct: 132 SRSRAQSKRVSGFDQGPSQTISIAAPSVTPGLLPAVPAAIPAMLPNMF---NIPIAGQP- 187
Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
Q MTQQATRHARRVYVGGLPP ANEQ +A +F+ VM AIGGN+AG GDAVVNVYINH
Sbjct: 188 ---QAMTQQATRHARRVYVGGLPPSANEQTVAIYFNHVMAAIGGNAAGLGDAVVNVYINH 244
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+KKFAFVEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+ AA LGP QP+PNLNLAA
Sbjct: 245 DKKFAFVEMRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSQAAVLGPSQPNPNLNLAA 304
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
VGL G+ GG EGPDR+FVGGLPYYFTE Q++ELLE+FG L GFD+VKDR+TGNSKGY F
Sbjct: 305 VGLTPGSAGGLEGPDRIFVGGLPYYFTEAQVQELLETFGPLRGFDIVKDRETGNSKGYAF 364
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMAL 399
CVYQD AVTDIACAALNG+++GD+TLTVRRA + + + E E+IL QAQ ++K+
Sbjct: 365 CVYQDLAVTDIACAALNGIQLGDRTLTVRRANQGAAEPRPEHENILLQAQHQAQMKKLVY 424
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+ +GG + KV+CLT+ ++ D L +DEEY++ILEDM E KYG LV
Sbjct: 425 E------VGGAIP------TKVVCLTQVVSEDDLRNDEEYKDILEDMTFEGRKYGNLVQA 472
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
VIPRP NG GVGKVFLEY D G AK + GR+F G V+A +YPE K+ + +Y
Sbjct: 473 VIPRPHPNGVPVAGVGKVFLEYADVDGSTNAKAGMHGRRFDGKVVDAVFYPEKKFADGEY 532
>gi|413920214|gb|AFW60146.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
Length = 536
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 270/417 (64%), Positives = 307/417 (73%), Gaps = 32/417 (7%)
Query: 113 GFDMAPPAA--AMLPGAAVPGQLPGVPSAVPE---MAQNMLPFGATQLGAFPLMPVQVMT 167
GFD PP A + P P QLPG S++P M NMLPFG G F P+ +
Sbjct: 140 GFDAPPPQAMGSTFPVIPTPSQLPG--SSLPNIGGMFPNMLPFGVA--GQF--NPLVIQP 193
Query: 168 QQAT----RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
Q T RHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KK
Sbjct: 194 QAMTQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKK 253
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
FAFVEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL
Sbjct: 254 FAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGL 313
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
+G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVY
Sbjct: 314 TAGSNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVY 373
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
QD VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q
Sbjct: 374 QDLTVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVG 433
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
+ T KV+CLT+ +TAD L DDEEYE+I+EDMR E G LV V+IP
Sbjct: 434 ALPT-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGN---LVKVIIP 477
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
RPD +G GVGKVFLEY D G A AK AL GRKFGGN V A Y EDK+ N +Y
Sbjct: 478 RPDPSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDKFANGEY 534
>gi|224077136|ref|XP_002305148.1| predicted protein [Populus trichocarpa]
gi|222848112|gb|EEE85659.1| predicted protein [Populus trichocarpa]
Length = 296
Score = 487 bits (1253), Expect = e-135, Method: Compositional matrix adjust.
Identities = 242/296 (81%), Positives = 256/296 (86%), Gaps = 3/296 (1%)
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP+LAA LGP QPSP LNLAAVGL G I
Sbjct: 1 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPSLAATLGPSQPSPLLNLAAVGLVPGTI 60
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GAEGPDRVFVGGLPYYFTE QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAV
Sbjct: 61 SGAEGPDRVFVGGLPYYFTEIQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAV 120
Query: 349 TDIACAALNGLKMGDKTLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
TDIACAALNGLKMGDKTLTVRRAT S GQSK+EQE+ILAQAQQHIAIQKMALQ MN
Sbjct: 121 TDIACAALNGLKMGDKTLTVRRATESGGQSKSEQENILAQAQQHIAIQKMALQAGVMNLP 180
Query: 408 GGGMSLF--GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
G G+ L T +KVLCLTEAIT + LADDEEYEEILEDMREEC K+GTL+NVVIPRP
Sbjct: 181 GVGIPLAESAYTPSKVLCLTEAITMEVLADDEEYEEILEDMREECCKFGTLINVVIPRPS 240
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
QN + PG GKVFLEY D + CA A+NAL+GRKFGGNTVNAFYYPE+KY N DY A
Sbjct: 241 QNEEKMPGAGKVFLEYSDTISCANARNALNGRKFGGNTVNAFYYPEEKYSNGDYGA 296
>gi|255568277|ref|XP_002525113.1| splicing factor u2af large subunit, putative [Ricinus communis]
gi|223535572|gb|EEF37240.1| splicing factor u2af large subunit, putative [Ricinus communis]
Length = 549
Score = 487 bits (1253), Expect = e-135, Method: Compositional matrix adjust.
Identities = 243/347 (70%), Positives = 273/347 (78%), Gaps = 21/347 (6%)
Query: 113 GFDMAPPAAAMLPGAAVP----GQLPGVPSAVPEMAQNMLPFGA-TQLGAFPLMPVQVMT 167
GFDMAPP +AML GAA GQ+PG A+P M NM P G Q G P+MPVQ MT
Sbjct: 189 GFDMAPPPSAMLTGAAAVAAAAGQIPGTAPAIPGMFPNMFPLGTGQQFGTLPVMPVQAMT 248
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFV
Sbjct: 249 QQATRHARRVYVGGLPPTANEQSVATFFSHVMAAIGGNTAGPGDAVVNVYINHEKKFAFV 308
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAMALDGIIFEG V+VRRP+DYNP+LAA LGP QP+PNLNL AVGL G+
Sbjct: 309 EMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLGAVGLTPGS 368
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +
Sbjct: 369 AGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLS 428
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q K EQE++L AQQ IA+Q++ LQ
Sbjct: 429 VTDIACAALNGIKMGDKTLTVRRANQGANQPKPEQETVLLHAQQQIALQRLMLQP----- 483
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
KV+CLT+ +TAD L DD+EYE+ILEDMR E GK+
Sbjct: 484 ----------VPTKVVCLTQVVTADELKDDDEYEDILEDMRTEGGKF 520
>gi|413920212|gb|AFW60144.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
Length = 502
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 257/379 (67%), Positives = 291/379 (76%), Gaps = 24/379 (6%)
Query: 113 GFDMAPPAA--AMLPGAAVPGQLPGVPSAVPE---MAQNMLPFG-ATQLGAFPLMPVQVM 166
GFD PP A + P P QLPG S++P M NMLPFG A Q + P Q M
Sbjct: 140 GFDAPPPQAMGSTFPVIPTPSQLPG--SSLPNIGGMFPNMLPFGVAGQFNPLVIQP-QAM 196
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
TQQATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAF
Sbjct: 197 TQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAF 256
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
VEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL +G
Sbjct: 257 VEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAG 316
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 317 SNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDL 376
Query: 347 AVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q +
Sbjct: 377 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGALP 436
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
T KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYG LV V+IPRPD
Sbjct: 437 T-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGNLVKVIIPRPD 483
Query: 466 QNGGETPGVGKVFLE-YYD 483
+G GVGKV LE YYD
Sbjct: 484 PSGQPVVGVGKVSLELYYD 502
>gi|224125466|ref|XP_002329812.1| predicted protein [Populus trichocarpa]
gi|222870874|gb|EEF08005.1| predicted protein [Populus trichocarpa]
Length = 321
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 235/294 (79%), Positives = 249/294 (84%), Gaps = 3/294 (1%)
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
MRTVEEASNAM LDGIIFEGVAVRVRRPTDYNP+LAA LGP QPSP LNLAAVGL G I
Sbjct: 1 MRTVEEASNAMTLDGIIFEGVAVRVRRPTDYNPSLAATLGPSQPSPLLNLAAVGLVPGTI 60
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GAEGPDRVFVGGLPYYFTETQI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAV
Sbjct: 61 SGAEGPDRVFVGGLPYYFTETQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAV 120
Query: 349 TDIACAALNGLKMGDKTLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
TDIACAALNGLKMGDKTLTVRR T S GQS++EQE+ILAQAQQHIAIQKMALQ MN
Sbjct: 121 TDIACAALNGLKMGDKTLTVRRGTESGGQSRSEQENILAQAQQHIAIQKMALQAGVMNLP 180
Query: 408 GGGMSLF--GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
G G+ L + +KVLCLTEAI + LADDEEYEEILEDMREEC K+GTL+NVVIPRP
Sbjct: 181 GVGIPLAESSHSPSKVLCLTEAIAMEVLADDEEYEEILEDMREECCKFGTLINVVIPRPS 240
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
Q + G GKVFLEY D CA A+NAL+GRKFGGNTVNA YYPEDKY N DY
Sbjct: 241 QTEEQISGAGKVFLEYSDTSSCANARNALNGRKFGGNTVNASYYPEDKYHNGDY 294
>gi|302813365|ref|XP_002988368.1| hypothetical protein SELMODRAFT_235524 [Selaginella moellendorffii]
gi|300143770|gb|EFJ10458.1| hypothetical protein SELMODRAFT_235524 [Selaginella moellendorffii]
Length = 353
Score = 470 bits (1209), Expect = e-130, Method: Compositional matrix adjust.
Identities = 237/366 (64%), Positives = 279/366 (76%), Gaps = 17/366 (4%)
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
L AF MP Q MTQQATRHAR+VYVGGLP L NEQ IATFF+QVM +GGN+AGPGD VV
Sbjct: 4 LAAFA-MPPQTMTQQATRHARQVYVGGLPGLVNEQTIATFFNQVMVNVGGNTAGPGDVVV 62
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
NVYIN EKKFAFVEMRTVEEASNAMALDGI F+GV+VRVRRP+DYNP++AA LGP QPSP
Sbjct: 63 NVYINQEKKFAFVEMRTVEEASNAMALDGISFQGVSVRVRRPSDYNPSVAANLGPSQPSP 122
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
+LNLAAVGL GA GG +GPDR+FVGGLPYY TE QI+ELLESFG L GFDLVKDR++GN
Sbjct: 123 SLNLAAVGLTPGAGGGVDGPDRIFVGGLPYYLTEPQIRELLESFGPLRGFDLVKDRESGN 182
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKGYGFCVYQDP VTD+ACAALNGLKMGD+TLTVRRATA+GQ Q HI
Sbjct: 183 SKGYGFCVYQDPNVTDVACAALNGLKMGDRTLTVRRATANGQQA-------GQDHAHILS 235
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
+L +G+ E +VLCL EA+ L +DE+++EILEDMR+ECGK+G
Sbjct: 236 LAKSLTMNGV--------FPDEGATRVLCLKEAVLEAELIEDEQFDEILEDMRDECGKFG 287
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
T++++VIPRP Q E GVGKVF+ + D A+ +L+GRKF G V A YYPE+++
Sbjct: 288 TVLHLVIPRPSQ-AAEVDGVGKVFVHFEDTGAATRARISLNGRKFDGRAVVATYYPEEQF 346
Query: 515 FNKDYS 520
D+S
Sbjct: 347 MVGDFS 352
>gi|302795921|ref|XP_002979723.1| hypothetical protein SELMODRAFT_13030 [Selaginella moellendorffii]
gi|300152483|gb|EFJ19125.1| hypothetical protein SELMODRAFT_13030 [Selaginella moellendorffii]
Length = 360
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 237/366 (64%), Positives = 279/366 (76%), Gaps = 17/366 (4%)
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
L AF MP Q MTQQATRHAR+VYVGGLP L NEQ IATFF+QVM +GGN+AGPGD VV
Sbjct: 11 LAAFA-MPPQTMTQQATRHARQVYVGGLPGLVNEQTIATFFNQVMVNVGGNTAGPGDVVV 69
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
NVYIN EKKFAFVEMRTVEEASNAMALDGI F+GV+VRVRRP+DYNP++AA LGP QPSP
Sbjct: 70 NVYINQEKKFAFVEMRTVEEASNAMALDGISFQGVSVRVRRPSDYNPSVAANLGPSQPSP 129
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
+LNLAAVGL GA GG +GPDR+FVGGLPYY TE QI+ELLESFG L GFDLVKDR++GN
Sbjct: 130 SLNLAAVGLTPGAGGGVDGPDRIFVGGLPYYLTEPQIRELLESFGPLRGFDLVKDRESGN 189
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKGYGFCVYQDP VTD+ACAALNGLKMGD+TLTVRRATA+GQ Q HI
Sbjct: 190 SKGYGFCVYQDPNVTDVACAALNGLKMGDRTLTVRRATANGQQA-------GQDHAHILS 242
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
+L +G+ E +VLCL EA+ L +DE+++EILEDMR+ECGK+G
Sbjct: 243 LAKSLTMNGV--------FPDEGATRVLCLKEAVLEAELIEDEQFDEILEDMRDECGKFG 294
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
T++++VIPRP Q E GVGKVF+ + D A+ +L+GRKF G V A YYPE+++
Sbjct: 295 TVLHLVIPRPSQ-AAEVDGVGKVFVHFEDTGAATRARISLNGRKFDGRAVVATYYPEEQF 353
Query: 515 FNKDYS 520
D+S
Sbjct: 354 MVGDFS 359
>gi|30690730|ref|NP_849509.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
gi|19310597|gb|AAL85029.1| putative splicing factor [Arabidopsis thaliana]
gi|332661289|gb|AEE86689.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
Length = 542
Score = 466 bits (1200), Expect = e-128, Method: Compositional matrix adjust.
Identities = 238/351 (67%), Positives = 274/351 (78%), Gaps = 18/351 (5%)
Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
+R SGFDMAPPA+AML A V GQ+P P +P M NM P Q G +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
AFVEMR+VEEASNAM+LDGIIFEG V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLT 348
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 349 PGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQ 408
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D +VTDIACAALNG+KMGDKTLTVRRA + K EQE++L AQQ IA Q++ LQ
Sbjct: 409 DLSVTDIACAALNGIKMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGA 468
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
+ T V+CLT+ +T D L DDEEY +I+EDMR+E GK+G
Sbjct: 469 VAT-------------TVVCLTQVVTEDELRDDEEYGDIMEDMRQEGGKFG 506
>gi|42573197|ref|NP_974695.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
gi|332661288|gb|AEE86688.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
Length = 565
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 237/350 (67%), Positives = 273/350 (78%), Gaps = 18/350 (5%)
Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
+R SGFDMAPPA+AML A V GQ+P P +P M NM P Q G +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
AFVEMR+VEEASNAM+LDGIIFEG V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLT 348
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 349 PGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQ 408
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D +VTDIACAALNG+KMGDKTLTVRRA + K EQE++L AQQ IA Q++ LQ
Sbjct: 409 DLSVTDIACAALNGIKMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGA 468
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+ T V+CLT+ +T D L DDEEY +I+EDMR+E GK+
Sbjct: 469 VAT-------------TVVCLTQVVTEDELRDDEEYGDIMEDMRQEGGKF 505
>gi|222423510|dbj|BAH19725.1| AT4G36690 [Arabidopsis thaliana]
Length = 565
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 236/350 (67%), Positives = 273/350 (78%), Gaps = 18/350 (5%)
Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
+R SGFDMAPPA+AML A V GQ+P P +P M NM P Q G +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
AFVEMR+VEEASNAM+LDGIIFEG V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLT 348
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 349 PGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQ 408
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D +VTDIACAALNG+KMGDKTLTVRRA + K EQE++L AQQ IA Q++ LQ
Sbjct: 409 DLSVTDIACAALNGIKMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGA 468
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+ T V+CLT+ +T D L DDEEY +I+EDMR+E G++
Sbjct: 469 VAT-------------TVVCLTQVVTEDELRDDEEYGDIMEDMRQEGGRF 505
>gi|29367529|gb|AAO72620.1| putative U2 snRNP auxiliary factor [Oryza sativa Japonica Group]
Length = 331
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 216/316 (68%), Positives = 250/316 (79%), Gaps = 12/316 (3%)
Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
Q MTQQATRHARRVYVGGLPP ANE +A +F+QVM A+GGN+AGPGDAV+NVYINH+KK
Sbjct: 22 QAMTQQATRHARRVYVGGLPPTANEHTVAVYFNQVMAAVGGNTAGPGDAVLNVYINHDKK 81
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
FAFVEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL
Sbjct: 82 FAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGL 141
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVY
Sbjct: 142 TPGSAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVY 201
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
QD VTDIACAALNG+KMGDKTLTVRRA + Q + EQES+L QQ +QK+ Q
Sbjct: 202 QDLNVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESMLLHVQQQAQMQKLMFQVG 261
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
G G KV+CLT+ ++ D L DDEEYE+I++DMREE +YG LV V+ P
Sbjct: 262 G-----------GALPTKVVCLTQVVSPDELRDDEEYEDIVQDMREEGCRYGNLVKVLNP 310
Query: 463 RPDQNGGETPGVGKVF 478
RPD +G G G+ F
Sbjct: 311 RPDPSGAPVAGFGRCF 326
>gi|413920348|gb|AFW60280.1| hypothetical protein ZEAMMB73_339264 [Zea mays]
Length = 549
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 235/361 (65%), Positives = 273/361 (75%), Gaps = 21/361 (5%)
Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD APP A ++ A+PGQLPG+ + +P + N+ A Q + P Q MT
Sbjct: 194 SGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQP-QAMT 252
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 253 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 312
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL G+
Sbjct: 313 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 372
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 373 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 432
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK+ Q
Sbjct: 433 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKLVYQ------ 486
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY---GTLVNVVIPR 463
+GG + KV+CLT+ +TAD L DDEEY +I+EDMREE KY + I R
Sbjct: 487 VGGALP------TKVVCLTQVVTADELRDDEEYNDIVEDMREEGRKYVPHNAIAECFIVR 540
Query: 464 P 464
P
Sbjct: 541 P 541
>gi|122245119|sp|Q2QKB3.1|U2A2A_WHEAT RecName: Full=Splicing factor U2af large subunit A; AltName:
Full=U2 auxiliary factor 65 kDa subunit A; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit A; Short=U2 snRNP auxiliary factor large
subunit A
gi|68036924|gb|AAY84881.1| U2AF large subunit [Triticum aestivum]
Length = 591
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 249/443 (56%), Positives = 296/443 (66%), Gaps = 28/443 (6%)
Query: 29 GERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHS 88
G R R+RHHRD + G DR R + D + D + R R
Sbjct: 131 GSRDRERHHRDHREGSRDRER---HHRDHRERSERREHRDRSDDRDYRRSCDRDAERRDR 187
Query: 89 SDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAA-AMLPGAAVPGQLPGVPSAVPEMAQNM 147
R +S SP RS S+SKR SGFD P A +L A P QLP +P+A P M NM
Sbjct: 188 DRDGHRRHRSRSPLRSESQSKRMSGFDQRPSEAIPILAPDATPSQLPELPAANPGMFPNM 247
Query: 148 LP--FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN 205
LP LG PL MTQQATRHARRVYVGGLPP+ANEQ +A FF+QVM AIGGN
Sbjct: 248 LPNLVNVPALGQ-PL----AMTQQATRHARRVYVGGLPPIANEQTVAVFFNQVMAAIGGN 302
Query: 206 SAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA 265
+ G AVVNVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG V+VRRPTDYNP+ AA
Sbjct: 303 TFALGHAVVNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSQAA 362
Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
ALGP QP+PNLNLAAVGL GA GG EGPDR+FVGGLPYYFTE Q++ELLE+FG L GFD
Sbjct: 363 ALGPSQPNPNLNLAAVGLTPGAGGGLEGPDRIFVGGLPYYFTEAQVRELLETFGPLRGFD 422
Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
+VKD++TGNSKGY FC+Y+D VTDIACAALNG+++GD+TLTVRRA + + EQE+IL
Sbjct: 423 IVKDKETGNSKGYAFCLYKDGTVTDIACAALNGIQLGDRTLTVRRANQGAEPRPEQENIL 482
Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL-AKVLCLTEAITADALADDEEYEEILE 444
QAQQ ++++ + G TL KV+CLT+ ++AD L DDEEY +ILE
Sbjct: 483 LQAQQEAQMKRLVYEV-------------GRTLTTKVVCLTQVVSADDLRDDEEYNDILE 529
Query: 445 DMREECGKY---GTLVNVVIPRP 464
DM E KY T+ I RP
Sbjct: 530 DMTLEGHKYVPHSTIAESFIIRP 552
>gi|414591746|tpg|DAA42317.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
Length = 538
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 227/348 (65%), Positives = 265/348 (76%), Gaps = 18/348 (5%)
Query: 112 SGFDMAPPAAAM--LPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD AP A+ + +PGQLPGV + +P + N+ A Q + P Q MT
Sbjct: 190 SGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPLAIQP-QAMT 248
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 249 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 308
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAM LDGI+FEG V++RRPTDYNP+LAAALGP QP+PNLNL+AVGL G+
Sbjct: 309 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 368
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGL YYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 369 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 428
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK Q
Sbjct: 429 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKFVYQ------ 482
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
+GG + KV+CLT+ +T D L DDEEY++I+EDMREE KYG
Sbjct: 483 VGGALP------TKVVCLTQVVTEDELRDDEEYDDIVEDMREEGHKYG 524
>gi|108864608|gb|ABG22562.1| U2 snRNP auxilliary factor, large subunit, splicing factor family
protein, expressed [Oryza sativa Japonica Group]
Length = 550
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 213/323 (65%), Positives = 247/323 (76%), Gaps = 13/323 (4%)
Query: 132 QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
Q+P V A+ M NM T + P Q MTQQATRHARRVYVGGLPP ANE +
Sbjct: 196 QVPVVAPAISGMLPNMFNLTQTPFTPLVIQP-QAMTQQATRHARRVYVGGLPPTANEHTV 254
Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAV 251
A +F+QVM A+GGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG V
Sbjct: 255 AVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPV 314
Query: 252 RVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI 311
+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL G+ GG EGPDR+FVGGLPYYFTE Q+
Sbjct: 315 KVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQV 374
Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD VTDIACAALNG+KMGDKTLTVRRA
Sbjct: 375 RELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRA 434
Query: 372 T-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
+ Q + EQES+L QQ +QK+ Q G G KV+CLT+ ++
Sbjct: 435 NQGASQPRPEQESMLLHVQQQAQMQKLMFQVGG-----------GALPTKVVCLTQVVSP 483
Query: 431 DALADDEEYEEILEDMREECGKY 453
D L DDEEYE+I++DMREE +Y
Sbjct: 484 DELRDDEEYEDIVQDMREEGCRY 506
>gi|414591745|tpg|DAA42316.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
Length = 538
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 227/361 (62%), Positives = 267/361 (73%), Gaps = 28/361 (7%)
Query: 112 SGFDMAPPAAAM--LPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD AP A+ + +PGQLPGV + +P + N+ A Q Q MT
Sbjct: 190 SGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQ--------PQAMT 241
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 242 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 301
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
EMR+VEEASNAM LDGI+FEG V++RRPTDYNP+LAAALGP QP+PNLNL+AVGL G+
Sbjct: 302 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 361
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
GG EGPDR+FVGGL YYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD
Sbjct: 362 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 421
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTDIACAALNG+KMGDKTLTVRRA + Q + EQESIL QAQQ + +QK Q
Sbjct: 422 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKFVYQ------ 475
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY---GTLVNVVIPR 463
+GG + KV+CLT+ +T D L DDEEY++I+EDMREE KY + + + R
Sbjct: 476 VGGALP------TKVVCLTQVVTEDELRDDEEYDDIVEDMREEGHKYVPHNAIADCFVVR 529
Query: 464 P 464
P
Sbjct: 530 P 530
>gi|224030681|gb|ACN34416.1| unknown [Zea mays]
Length = 425
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 242/379 (63%), Positives = 279/379 (73%), Gaps = 28/379 (7%)
Query: 80 HRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAM--LPGAAVPGQLPGVP 137
HR RS S S DR R SRS SKSKR SGFD AP A+ + +PGQLPGV
Sbjct: 48 HRSRSPSMSRDRDRRSRSR---SRSRSKSKRVSGFDQAPTQQALPIVAAGVIPGQLPGVT 104
Query: 138 SAVPEMA--QNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
+ +P + N+ A Q Q MTQQATRHARRVYVGGLPP ANEQ +A FF
Sbjct: 105 APIPGVGVLPNLYNLAAGQ--------PQAMTQQATRHARRVYVGGLPPTANEQTVAIFF 156
Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
+ VM AIGGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAM LDGI+FEG V++RR
Sbjct: 157 NGVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMVLDGIMFEGAPVKIRR 216
Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
PTDYNP+LAAALGP QP+PNLNL+AVGL G+ GG EGPDR+FVGGL YYFTE Q++ELL
Sbjct: 217 PTDYNPSLAAALGPSQPNPNLNLSAVGLTPGSAGGLEGPDRIFVGGLQYYFTEAQVRELL 276
Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-AS 374
ESFG L GFDLVKDR+TGNSKGY FCVYQD VTDIACAALNG+KMGDKTLTVRRA +
Sbjct: 277 ESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRANQGA 336
Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
Q + EQESIL QAQQ + +QK Q +GG + KV+CLT+ +T D L
Sbjct: 337 SQPRPEQESILLQAQQQVQMQKFVYQ------VGGALP------TKVVCLTQVVTEDELR 384
Query: 435 DDEEYEEILEDMREECGKY 453
DDEEY++I+EDMREE KY
Sbjct: 385 DDEEYDDIVEDMREEGHKY 403
>gi|242069431|ref|XP_002449992.1| hypothetical protein SORBIDRAFT_05g026786 [Sorghum bicolor]
gi|241935835|gb|EES08980.1| hypothetical protein SORBIDRAFT_05g026786 [Sorghum bicolor]
Length = 296
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 202/292 (69%), Positives = 233/292 (79%), Gaps = 16/292 (5%)
Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
Q MTQ ATRHARRVYVGGLPP ANEQ +A +F+Q+M AIGGN+AGPGDAV+NVYINH+KK
Sbjct: 9 QAMTQHATRHARRVYVGGLPPDANEQTVAVYFNQIMAAIGGNTAGPGDAVLNVYINHDKK 68
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
FA VEMR+VEEASNAMALDGI+FEGV V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL
Sbjct: 69 FASVEMRSVEEASNAMALDGIMFEGVPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGL 128
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
+G+ GG E PDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKD++TGNSKGY FC Y
Sbjct: 129 TAGS-GGLEDPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDKETGNSKGYAFCDY 187
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
QD VTDIACAALNG+KMGDK LTVRRA + Q EQESIL QAQQ + +QK+A
Sbjct: 188 QDLTVTDIACAALNGIKMGDKILTVRRANQGASQPTPEQESILLQAQQQVQMQKLAHPVG 247
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
T KV+CL ++AD L +DE YE+I++DMREE +YG
Sbjct: 248 AAPT-------------KVVCLVHVVSADEL-EDEVYEDIMDDMREEARRYG 285
>gi|384249807|gb|EIE23288.1| hypothetical protein COCSUDRAFT_23864, partial [Coccomyxa
subellipsoidea C-169]
Length = 464
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 224/429 (52%), Positives = 274/429 (63%), Gaps = 51/429 (11%)
Query: 111 RSGFDMAPPAA-----AMLPGAAVPGQLPGV----------PSAVPEMAQN-----MLPF 150
+SGFD PP LP PG +PGV +A P N P
Sbjct: 58 KSGFDQPPPGGIPPVFGGLPAGLPPG-MPGVEAIAAVAAPLAAAAPTGFSNGGFSGAPPM 116
Query: 151 GATQLGAFPLMP-VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
TQ+G +MP VQ +QQATRHARRVYVGGLPP NEQ IATFFS + AIGG +AGP
Sbjct: 117 IGTQMGG--MMPGVQPPSQQATRHARRVYVGGLPPTGNEQNIATFFSNALAAIGGTTAGP 174
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G +VVNVYIN+EKKFAFVE RTVEE SNAMALDGI+FEGV+VRVRRP DYNP A+ALGP
Sbjct: 175 GASVVNVYINYEKKFAFVEFRTVEETSNAMALDGIMFEGVSVRVRRPNDYNPAAASALGP 234
Query: 270 GQPSPNLNLAAVGLASG---AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
P+PNLNLAA+GL +G A+ + +RVFVGGLPYY E Q +ELL SFG + FDL
Sbjct: 235 SVPNPNLNLAAIGLQAGGMNAVAMIDAAERVFVGGLPYYLNEEQCRELLGSFGGIKSFDL 294
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
VKDR+TGNSKGYGF VY DP VTDIACA LNG++MG++TLTVRRA TE + A
Sbjct: 295 VKDRETGNSKGYGFVVYTDPNVTDIACAGLNGMRMGERTLTVRRA-------TENQGGAA 347
Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
A A+ T+ ++L L EA++ D LA+DEEY +I++DM
Sbjct: 348 GAATAAALSADPFPTA----------------TRILALQEAVSLDELANDEEYVDIVQDM 391
Query: 447 REECGKYGTLVNVVIPRPDQNGGETP-GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
R+E K+GT+++V+IPRP G P G+GKVF+ + + G + + GR+FGG TV
Sbjct: 392 RDEASKFGTVIDVLIPRPAPEGQPPPSGLGKVFINFAEKEGAVNSFRVMHGRRFGGRTVV 451
Query: 506 AFYYPEDKY 514
A Y E Y
Sbjct: 452 ASYVQEADY 460
>gi|359497129|ref|XP_003635431.1| PREDICTED: splicing factor U2af large subunit B-like, partial
[Vitis vinifera]
Length = 238
Score = 367 bits (942), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 181/238 (76%), Positives = 199/238 (83%), Gaps = 3/238 (1%)
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
IGGAEGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP
Sbjct: 1 VIGGAEGPDRIFVGGLPYYFTEEQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDP 60
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATA-SGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
AVTDIACAALNGLKMGDKTLTVRRAT SGQ+K+EQ++ILAQAQQHIAIQK+ALQ G+N
Sbjct: 61 AVTDIACAALNGLKMGDKTLTVRRATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLN 120
Query: 406 TLGGGMSL--FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
G GM+ ET KVLCLTE I D L DDE YEEILEDMR+E GK+G LV+VVIPR
Sbjct: 121 LPGAGMAFTAIAETPTKVLCLTEVINIDELRDDEAYEEILEDMRDEGGKFGALVHVVIPR 180
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
P NG PGVGKVFLEY D G ++A+NALSGRKFGGN V+A YYPEDKY++ DY A
Sbjct: 181 PSPNGDLIPGVGKVFLEYSDTAGSSSARNALSGRKFGGNVVSAVYYPEDKYYDGDYGA 238
>gi|297736736|emb|CBI25913.3| unnamed protein product [Vitis vinifera]
Length = 6467
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 181/238 (76%), Positives = 199/238 (83%), Gaps = 3/238 (1%)
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
IGGAEGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP
Sbjct: 6230 VIGGAEGPDRIFVGGLPYYFTEEQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDP 6289
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATA-SGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
AVTDIACAALNGLKMGDKTLTVRRAT SGQ+K+EQ++ILAQAQQHIAIQK+ALQ G+N
Sbjct: 6290 AVTDIACAALNGLKMGDKTLTVRRATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLN 6349
Query: 406 TLGGGMSL--FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
G GM+ ET KVLCLTE I D L DDE YEEILEDMR+E GK+G LV+VVIPR
Sbjct: 6350 LPGAGMAFTAIAETPTKVLCLTEVINIDELRDDEAYEEILEDMRDEGGKFGALVHVVIPR 6409
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
P NG PGVGKVFLEY D G ++A+NALSGRKFGGN V+A YYPEDKY++ DY A
Sbjct: 6410 PSPNGDLIPGVGKVFLEYSDTAGSSSARNALSGRKFGGNVVSAVYYPEDKYYDGDYGA 6467
>gi|159473054|ref|XP_001694654.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
reinhardtii]
gi|158276466|gb|EDP02238.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
reinhardtii]
Length = 306
Score = 358 bits (919), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 178/302 (58%), Positives = 222/302 (73%), Gaps = 12/302 (3%)
Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFA 225
++QQATRHARR+YVGGLPP A EQ+I++FFS + AIGGN+AGPG+AVVNVYIN EK FA
Sbjct: 4 VSQQATRHARRIYVGGLPPTATEQSISSFFSHALAAIGGNTAGPGNAVVNVYINREKNFA 63
Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLAS 285
FVE+RTVEE SN+MALDGI+FEGV+VRVRRP DYNP A +LGP P+P LNLAA+GL
Sbjct: 64 FVELRTVEETSNSMALDGIMFEGVSVRVRRPNDYNPAAAVSLGPSTPNPALNLAAIGLNP 123
Query: 286 GAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
+ PDR+FVGGLPYY TE Q +ELL SFG + FDLVKDRDTGNSKGYGF VYQD
Sbjct: 124 N-----DNPDRIFVGGLPYYLTEDQCRELLGSFGAIKSFDLVKDRDTGNSKGYGFVVYQD 178
Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
+VTDIACA LNGLKMGD+TLTVRRAT ++ +A +G+
Sbjct: 179 TSVTDIACAGLNGLKMGDRTLTVRRATEGAPGGGAAPAMGPAGLGGLAGLGGLNPLAGV- 237
Query: 406 TLGGGM---SLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
GG+ L T +++ LT+A++A+ + DD+EY++ILEDM++E ++G NV+IP
Sbjct: 238 ---GGVVVNPLGLATATRIVVLTDAVSAEEIIDDQEYQDILEDMKDEASRHGLCNNVLIP 294
Query: 463 RP 464
RP
Sbjct: 295 RP 296
>gi|302846543|ref|XP_002954808.1| splicing factor U2AF, large subunit [Volvox carteri f. nagariensis]
gi|300259991|gb|EFJ44214.1| splicing factor U2AF, large subunit [Volvox carteri f. nagariensis]
Length = 532
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 184/356 (51%), Positives = 235/356 (66%), Gaps = 14/356 (3%)
Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGD-AVVNVYINHEKKF 224
++QQATRHARR+YVGGLPP A EQ+I++FFS + AIGGN+AGPG + I +
Sbjct: 187 VSQQATRHARRIYVGGLPPTATEQSISSFFSHALAAIGGNTAGPGGFPFHSTSITSPQSI 246
Query: 225 AFVEMRT-VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
+R +EE SNAMALDGI+FEGV+VRVRRP DYNP AA+LGP P+PNLNLAA+GL
Sbjct: 247 RSSILREFIEETSNAMALDGIMFEGVSVRVRRPNDYNPAAAASLGPSTPNPNLNLAAIGL 306
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
++ A GGA+ DR+FVGGLPYY TE Q +ELL SFG + FDLVKDR+TGNSKGYGF VY
Sbjct: 307 SNAAGGGADQADRIFVGGLPYYLTEEQCRELLGSFGPIKSFDLVKDRETGNSKGYGFVVY 366
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
QD AVTDIACA LNGL+MGD+TLTVRRAT S + + + A + AL
Sbjct: 367 QDSAVTDIACAGLNGLRMGDRTLTVRRATEGAPSASGAGAAASTALGPPGLVPAALAN-- 424
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
G + + L ++A+ + D+ EYE+IL DM++E ++G NV+IPR
Sbjct: 425 ----------LGVGVGVGVGLNPLVSAEEIVDNTEYEDILADMKDEASRHGLCNNVLIPR 474
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
P PG+ KV +E+ D A+NA+ GR+F G VNA Y E+ YF+ Y
Sbjct: 475 PTAENPNPPGMCKVIMEFNDVNSAVKARNAMHGRRFAGRVVNATYLTEEAYFSGRY 530
>gi|308801273|ref|XP_003077950.1| U2 snRNP auxiliary factor, large subunit (ISS) [Ostreococcus tauri]
gi|116056401|emb|CAL52690.1| U2 snRNP auxiliary factor, large subunit (ISS) [Ostreococcus tauri]
Length = 388
Score = 345 bits (885), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 175/362 (48%), Positives = 239/362 (66%), Gaps = 30/362 (8%)
Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFA 225
+T Q TRHARR+Y+GG P +ANEQ +++FF+ + A+GG ++ VVNVYIN EKKFA
Sbjct: 49 ITAQTTRHARRIYLGGCPTMANEQELSSFFNDALVAVGGTTSEEA-PVVNVYINLEKKFA 107
Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLAS 285
FVE R+VEE SNA+ALDG++ +G VR+RRP DYNP +A LGP P+P LNL A+GL
Sbjct: 108 FVEFRSVEECSNALALDGVMIQGEPVRIRRPNDYNPQIAQGLGPSTPNPKLNLQAIGLDP 167
Query: 286 GAIGGA-------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A+ + E P+R+F+GGLPYY E Q++ELLE+FG + FDLV+D++ GNSKGY
Sbjct: 168 SALARSATTNILQEDPNRIFIGGLPYYLEEPQVRELLEAFGPIARFDLVRDKENGNSKGY 227
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
GF VYQD AVTDIAC LNG++MG+KTLTVRRA Q +T+ L Q +
Sbjct: 228 GFVVYQDAAVTDIACQGLNGMQMGEKTLTVRRAE---QGRTD----LIGGQVSVPPPPAI 280
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTE-AITADALADDEEYEEILEDMREECGKYGTLV 457
+ ++V+ T IT + LADDEE+E I+EDM EECGKYG ++
Sbjct: 281 APAN--------------PPSEVVSFTNMGITEEELADDEEFENIMEDMNEECGKYGKII 326
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
+VVIPRP ++G G+GKVF+ Y A++AL+GR+FGGN+V A + + + ++
Sbjct: 327 SVVIPRPSKSGESVTGIGKVFVRYESVEDATKARDALNGRRFGGNSVVADFIDIESFASQ 386
Query: 518 DY 519
+
Sbjct: 387 TF 388
>gi|302813497|ref|XP_002988434.1| hypothetical protein SELMODRAFT_24180 [Selaginella moellendorffii]
gi|300143836|gb|EFJ10524.1| hypothetical protein SELMODRAFT_24180 [Selaginella moellendorffii]
Length = 339
Score = 337 bits (864), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 189/374 (50%), Positives = 234/374 (62%), Gaps = 38/374 (10%)
Query: 147 MLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
+LP G QL MP Q T+ ARRVYVGGLP + +E IATFF+ M I GN+
Sbjct: 3 VLPAGIAQLPVVLRMP---QMPQITKPARRVYVGGLPAVVDEARIATFFNHAMAVIEGNT 59
Query: 207 AGPG-DAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA 265
G G DAVV+V+I+H K +AFVEMR+VEEASNAMALDGIIFEG VR+RRP++YNP A
Sbjct: 60 YGQGGDAVVSVFIDHAKNYAFVEMRSVEEASNAMALDGIIFEGSQVRIRRPSNYNPEHAM 119
Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
G QPSP+L L VGL A A+GPDR+F+GGLPY + + ++++LLE FG L D
Sbjct: 120 LFGSSQPSPSLRLDKVGLVYRA--HADGPDRIFIGGLPYEWGDAEVRQLLEPFGALRALD 177
Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
+VKD T SKGYGF VY++PA TD ACAALN + K L V RAT S S +L
Sbjct: 178 IVKDSYTRKSKGYGFAVYENPASTDAACAALNQKPLEGKILRVHRATNS--SGNPALVLL 235
Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILED 445
Q+ + LG +V+CL A++ + L D++EY EI+ED
Sbjct: 236 PQSSE----------------LG----------TRVVCLCNAVSEEMLRDEKEYAEIIED 269
Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
M+EECGKYG LV+V IPR G PG+GKVFLEY D V A++ L GR F TV
Sbjct: 270 MKEECGKYGPLVSVEIPR----GDGAPGLGKVFLEYKDLVSALKARHGLQGRSFDKRTVQ 325
Query: 506 AFYYPEDKYFNKDY 519
A YYPEDK+ KDY
Sbjct: 326 ATYYPEDKFSAKDY 339
>gi|255073589|ref|XP_002500469.1| RNA binding protein [Micromonas sp. RCC299]
gi|226515732|gb|ACO61727.1| RNA binding protein [Micromonas sp. RCC299]
Length = 489
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 188/376 (50%), Positives = 246/376 (65%), Gaps = 22/376 (5%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG-PGDAVVNVYINHEKK 223
V QQATRHARRVYVG LP E +A FF+ M AIGG A PGD V+NVYIN+EKK
Sbjct: 115 VPNQQATRHARRVYVGNLPGTVTEPKVAAFFNNAMHAIGGTVAALPGDPVLNVYINYEKK 174
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
FAFVE RTVEE SN MALDG + EG+A+RVRRP DYN A++LGP QP LNL A+GL
Sbjct: 175 FAFVEFRTVEETSNCMALDGAVLEGIAMRVRRPNDYNVMAASSLGPSQPKDGLNLEAIGL 234
Query: 284 ------------ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
A+ ++ + R+F+GGLPY+ TET +KEL+E+FG F LV DR+
Sbjct: 235 NPAAAGGGGAGAANASLTEEDLQHRLFIGGLPYFLTETMVKELVEAFGPTKQFQLVVDRE 294
Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
TGNSKGYGF VYQD +VTD+AC L+G+KMG+K+LTV+RA G + + + + H
Sbjct: 295 TGNSKGYGFFVYQDHSVTDVACQGLHGMKMGEKSLTVQRAMQGG-AGAPKPTAASVGPGH 353
Query: 392 IAI---QKMALQTSGMNTLGGGMSL----FGETLAKVLCLTEAITADALADDEEYEEILE 444
A+ ++A +G + G+S+ ++V+ LTE + + L DD EY EI+E
Sbjct: 354 TALPGADEVAAHLAGASGAPAGLSVPPPPSEHPASRVVSLTEMLDVEELRDDVEYGEIME 413
Query: 445 DMREECGKYGTLVNVVIPRP-DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNT 503
DMREECGK+G + ++VIPRP D +G PG+GKVF+ Y D G A A+NAL GRKFGGN
Sbjct: 414 DMREECGKFGRIESIVIPRPGDADGAAVPGLGKVFVRYEDDAGAAAARNALHGRKFGGNV 473
Query: 504 VNAFYYPEDKYFNKDY 519
V A + E + ++ +
Sbjct: 474 VKADFIDETVFASRAF 489
>gi|302796203|ref|XP_002979864.1| hypothetical protein SELMODRAFT_153568 [Selaginella moellendorffii]
gi|300152624|gb|EFJ19266.1| hypothetical protein SELMODRAFT_153568 [Selaginella moellendorffii]
Length = 325
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 185/363 (50%), Positives = 229/363 (63%), Gaps = 41/363 (11%)
Query: 158 FPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPG-DAVVNV 216
P MP Q T+ ARRVYVGGLP + +E IATFF+ M I GN+ G G DAVV+V
Sbjct: 1 MPQMP------QITKPARRVYVGGLPAVVDEARIATFFNHAMAVIEGNTYGQGGDAVVSV 54
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
+I+H K +AFVEMR+VEEASNAMALDGIIFEG VR+RRP++YNP A G QPSP+L
Sbjct: 55 FIDHAKNYAFVEMRSVEEASNAMALDGIIFEGSQVRIRRPSNYNPEHAMLFGSSQPSPSL 114
Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
L VGL A A+GPDR+F+GGLPY + + ++++LLE FG L D+VKD T SK
Sbjct: 115 RLDKVGLVYRA--HADGPDRIFIGGLPYEWGDAEVRQLLEPFGALRALDIVKDSYTRKSK 172
Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
GYGF VY++PA TD ACAALN + K L V RAT S S +L Q+ +
Sbjct: 173 GYGFAVYENPASTDAACAALNQKPLEGKILRVHRATNS--SGNPALVLLPQSSE------ 224
Query: 397 MALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
LG +V+CL A++ + L D++EY EI+EDM+EECGKYG L
Sbjct: 225 ----------LG----------TRVVCLCNAVSEEMLRDEKEYAEIIEDMKEECGKYGPL 264
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
V+V IPR G PG+GKVFLEY D V A++ L GR F TV A YYPEDK+
Sbjct: 265 VSVEIPR----GDGAPGLGKVFLEYKDLVSALKARHGLQGRSFDKRTVQATYYPEDKFSA 320
Query: 517 KDY 519
KDY
Sbjct: 321 KDY 323
>gi|303273844|ref|XP_003056274.1| RNA binding protein [Micromonas pusilla CCMP1545]
gi|226462358|gb|EEH59650.1| RNA binding protein [Micromonas pusilla CCMP1545]
Length = 564
Score = 325 bits (832), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 192/409 (46%), Positives = 245/409 (59%), Gaps = 55/409 (13%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG---NSAGPG-DAVVNVYINHEK 222
+ QATRHARR+YVGGLP ANE + ATFFS + AIGG +A G + V+NVY+NHEK
Sbjct: 156 STQATRHARRIYVGGLPATANEASTATFFSNALAAIGGVVQTAAAAGVEPVLNVYMNHEK 215
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
KFAFVE RTVEE SNA+ALDG++F+GV++RVRRP DYN +AA LGP PS +L+LAA+G
Sbjct: 216 KFAFVEFRTVEETSNAIALDGVVFDGVSLRVRRPNDYNAAIAATLGPSTPSTDLDLAAIG 275
Query: 283 LASGA------------------IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
L GA + + +R+FVGGLPY+ TE +KEL+E+FG F
Sbjct: 276 LVPGAGGAAGGAGAGGAAGGQNNLSPEDTANRLFVGGLPYFLTEPMVKELVEAFGPTKHF 335
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
LV DR+TGNSKGYGF VYQD AVTD+AC L+G+KMG+KTLTVRRAT +G +
Sbjct: 336 MLVMDRETGNSKGYGFFVYQDHAVTDVACQGLHGMKMGEKTLTVRRATGAGGGAAGATNA 395
Query: 385 --------------LAQAQQHIA-IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA-- 427
+A+A+ ++ A G G SL L+ CLT+
Sbjct: 396 SGLAGGAGGAPSTRVAEAEDATTGGERAARAPRGAMPRGAXASLTARFLSCRCCLTDPPP 455
Query: 428 --------------ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE--T 471
+ + L DD EY EI EDMREECGK+G ++ V IPRP GG+
Sbjct: 456 PSNPPSRVLSLNDMLDVEDLRDDVEYGEITEDMREECGKHGVVLEVRIPRPAAAGGDEIV 515
Query: 472 PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
PG+GKVF++Y + G A+ AL GRKFGG V A + E + YS
Sbjct: 516 PGLGKVFVQYEEVAGAEAARKALHGRKFGGQIVVADFVDEAAFAEGRYS 564
>gi|145344032|ref|XP_001416543.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576769|gb|ABO94836.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 291
Score = 315 bits (806), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 165/306 (53%), Positives = 201/306 (65%), Gaps = 23/306 (7%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
T QATRHARR+YVGG+P NE + FF+ + A+GG + G VVNVYIN EKKFAF
Sbjct: 1 TAQATRHARRIYVGGIPLTTNEADVNAFFNNALLAVGGTNGAEGQPVVNVYINVEKKFAF 60
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
VE R+VEEASNA+ALDGI+ +GV VR+RRP DYNP+LA LGP P+P LNLAA+GL
Sbjct: 61 VEFRSVEEASNALALDGIVLDGVPVRIRRPNDYNPSLAHDLGPSMPNPALNLAAIGLDPS 120
Query: 287 A-----IGGA---EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A +GG E DR+F+GGLPY+ E QI+ELLE+FG + FDLV+D++TGNSKGY
Sbjct: 121 ALQRAGVGGNLLHEHEDRIFIGGLPYFLDEAQIRELLEAFGPIRQFDLVRDKETGNSKGY 180
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
GF VY+D +VTDIAC LNG+ MGDKTLTVRRA S Q ++ AI
Sbjct: 181 GFVVYEDVSVTDIACQGLNGMTMGDKTLTVRRAEQSNAPGGVQPGMMNVPPPPPAIAAPP 240
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
M L T + LADDEEYE I+EDM+EECGK+G +V+
Sbjct: 241 TNPPSTVVSFDNMGL---------------TEEELADDEEYENIMEDMQEECGKHGEIVS 285
Query: 459 VVIPRP 464
VVIPRP
Sbjct: 286 VVIPRP 291
>gi|242069429|ref|XP_002449991.1| hypothetical protein SORBIDRAFT_05g026783 [Sorghum bicolor]
gi|241935834|gb|EES08979.1| hypothetical protein SORBIDRAFT_05g026783 [Sorghum bicolor]
Length = 249
Score = 311 bits (796), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 153/228 (67%), Positives = 178/228 (78%), Gaps = 2/228 (0%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P Q+MT++AT RRVYVG LPP ANEQ I FF+QVM IGGN+AGPGDAV + +
Sbjct: 11 PHTTTQLMTREATLFTRRVYVGDLPPSANEQTIGVFFNQVMAVIGGNTAGPGDAVCGICM 70
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
NHE++FA VE R EEASNAMALDGI+FEGV V+VRRP DYN + AAA+GP QPS LNL
Sbjct: 71 NHEQRFALVEFRMAEEASNAMALDGILFEGVPVKVRRPADYNLSQAAAMGPTQPSRKLNL 130
Query: 279 AAVGLASG-AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
AAVGL +G A GG+E PDR+FVGGLPYY++E Q+++LLE G L GF+LVKDR+TGNSKG
Sbjct: 131 AAVGLTAGSAGGGSEDPDRIFVGGLPYYYSEAQVRDLLECIGPLRGFELVKDRETGNSKG 190
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG-QSKTEQESI 384
Y FCVY D TDIACA LNG+KMGDK LTVRRA S Q + EQESI
Sbjct: 191 YAFCVYMDTTATDIACADLNGIKMGDKILTVRRANQSASQPRPEQESI 238
>gi|348510223|ref|XP_003442645.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
[Oreochromis niloticus]
Length = 467
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 183/484 (37%), Positives = 260/484 (53%), Gaps = 44/484 (9%)
Query: 46 DRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSP 105
DR R+++ K R G R D+HR +++D+ R R + + R+R S +
Sbjct: 18 DRERERHKKRSRSG--SPGRGDKHRSWSKDRGSRSREKRSRSRDRKSRDRRSSSRDHKKH 75
Query: 106 SKSKRRSG-------FDMAPPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPFGATQLG 156
S S RR+ +D+ PP + P Q + +A +P +A +L T
Sbjct: 76 SHSPRRTRKKRTCKYWDVPPPGFEHI----TPMQYKAMQAAGQIPTIA--LLATSTTTGV 129
Query: 157 AFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
A V ++ Q TR ARR+YVG +P E+++A FF+ M + G S P + V+ V
Sbjct: 130 AAAPTQVPIVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAV 188
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P + P P
Sbjct: 189 QINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHVP-- 246
Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
G+ S + + P ++F+GGLP Y + Q+KELL SFG L F+LVKD T SK
Sbjct: 247 -----GVVSTVV--PDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSK 299
Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQ 395
GY FC Y D + TD A A LNG+++GDK L V+RA+ ++ I + +Q
Sbjct: 300 GYAFCEYVDISATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPTSIIETPVTLQVPGLQ 359
Query: 396 KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
+ LQ SGM T +VLCL + + L DDE+YEEILED+REEC KYGT
Sbjct: 360 R--LQNSGMPT-------------EVLCLLNMVMPEELVDDEDYEEILEDIREECCKYGT 404
Query: 456 LVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ ++ IPRP +G E PG GK+F+EY A C A AL+GRKF V YY D Y
Sbjct: 405 VRSIEIPRP-VDGVEVPGCGKIFVEYVSAADCQKAMQALTGRKFANRVVVTKYYDPDMYH 463
Query: 516 NKDY 519
++
Sbjct: 464 RHEF 467
>gi|412990165|emb|CCO19483.1| predicted protein [Bathycoccus prasinos]
Length = 495
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 171/388 (44%), Positives = 221/388 (56%), Gaps = 52/388 (13%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG----NSAGPGDAVVNVYINHEKKF 224
QATRHARRVYVGG PP +E +A FF+ + A+GG + G + VVNVY+NHEK F
Sbjct: 121 QATRHARRVYVGGFPPNVSEVRVADFFNNALMAVGGIAETQTEGNANPVVNVYMNHEKHF 180
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
AFVE R EE SN MALD I F+ +RVRRP DYN A LGP P+ +NL A+GL+
Sbjct: 181 AFVEFRNAEETSNCMALDSISFDSSQLRVRRPNDYNQPAAMKLGPIVPNIKMNLEAIGLS 240
Query: 285 SGAI-------------GGAEGP-------DRVFVGGLPYYFTETQIKELLESFGTLHGF 324
+ + G A G DRVFVGGLPY+ TE QI+ELLE+FG + F
Sbjct: 241 NEVLQRMQSGVASGQNNGNANGSNVADPNEDRVFVGGLPYFLTEAQIRELLEAFGPITRF 300
Query: 325 DLVKDRDTGNSKGYGFCVYQD-PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
DLV+DRDTG SKGYGF VY+D PA+TDIA L+G++MGDK LTVRRA A+ E
Sbjct: 301 DLVRDRDTGGSKGYGFVVYRDGPAITDIAIQGLHGMQMGDKQLTVRRANAT------LER 354
Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLF------------GETLAKVLCLTE-AITA 430
+ + + + Q+ Q G GG + F E + L L I
Sbjct: 355 MQQEQRAALQQQQQQHQLLGN---GGAAAQFLPQSAAPAAPEVDENATECLVLKNMGIKD 411
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
+ L D EEYE I+ED +EEC K+G ++ + IP+P + G VF+ + A A
Sbjct: 412 EELNDPEEYEIIVEDTQEECEKFGKVLGMKIPKP-----PSKSAGVVFVRFETAESARKA 466
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ +L+GRKF GN V+A Y + Y D
Sbjct: 467 RKSLNGRKFAGNIVSAQYDSIETYEKHD 494
>gi|452823555|gb|EME30564.1| U2 snRNP auxiliary factor large subunit, putative isoform 1
[Galdieria sulphuraria]
Length = 522
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 156/371 (42%), Positives = 220/371 (59%), Gaps = 34/371 (9%)
Query: 148 LPFGATQLGAFPLMPV-QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
L F A P+ P Q TQQAT+HARR+YVG LP E +A FF+ + G
Sbjct: 181 LDFSALSQYMIPVAPTTQPNTQQATKHARRLYVGNLPSDVTESEVADFFNSALYLAKGVD 240
Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
PGD V +VY+N +K+FAF+E+ + EA+ A+ +DG++F G+++R+RRP DYNP + A
Sbjct: 241 V-PGDPVQSVYLNLDKRFAFIELNSAAEAAAAIQMDGVLFRGMSLRMRRPNDYNPNIHA- 298
Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
P P + +A+G+ S + +GPD+VF+GGLPY+ TE QIKE+L S+G L+ F+L
Sbjct: 299 --PVYPPIGFDPSALGVVSTQV--PDGPDKVFIGGLPYHLTEDQIKEILSSYGPLNAFNL 354
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
VKD +TG SKGY F Y+DP++ + A LNG+ MGDKTLTVRRA+
Sbjct: 355 VKDPNTGLSKGYAFFQYKDPSIVEAAIKGLNGMTMGDKTLTVRRASQV------------ 402
Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
+SG LG S ++L L + + L DDEEYE+I+ED+
Sbjct: 403 --------------SSGSVELGQSFSPTVRYPTRILELRNMVEPEELVDDEEYEDIIEDV 448
Query: 447 REECGKYGTLVNVVIPRPDQ-NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
REE KYG + V IPRP + + PG+GKVF+ + A AL+GR+FGG +V
Sbjct: 449 REESSKYGEVTEVKIPRPSKTDEANPPGLGKVFVSFKTVSDAEKAFAALTGRRFGGKSVI 508
Query: 506 AFYYPEDKYFN 516
A YY E++Y++
Sbjct: 509 ANYYDEERYYS 519
>gi|348510219|ref|XP_003442643.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Oreochromis niloticus]
Length = 466
Score = 281 bits (718), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 177/465 (38%), Positives = 251/465 (53%), Gaps = 42/465 (9%)
Query: 65 RTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG-------FDMA 117
R D+HR +++D+ R R + + R+R S + S S RR+ +D+
Sbjct: 34 RGDKHRSWSKDRGSRSREKRSRSRDRKSRDRRSSSRDHKKHSHSPRRTRKKRTCKYWDVP 93
Query: 118 PPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHAR 175
PP + P Q + +A +P +A +L T A V ++ Q TR AR
Sbjct: 94 PPGFEHI----TPMQYKAMQAAGQIPTIA--LLATSTTTGVAAAPTQVPIVGSQMTRQAR 147
Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEA 235
R+YVG +P E+++A FF+ M + G S P + V+ V IN +K FAF+E R+V+E
Sbjct: 148 RLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDKNFAFLEFRSVDET 206
Query: 236 SNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
+ AMA DGIIF+G ++++RRP DY P + P P G+ S + + P
Sbjct: 207 TQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHVP-------GVVSTVV--PDSPH 257
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY FC Y D + TD A A
Sbjct: 258 KLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCEYVDISATDQAVAG 317
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQTSGMNTLGGGMSLF 414
LNG+++GDK L V+RA+ ++ I + +Q+ LQ SGM T
Sbjct: 318 LNGMQLGDKKLIVQRASVGAKNANPTSIIETPVTLQVPGLQR--LQNSGMPT-------- 367
Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
+VLCL + + L DDE+YEEILED+REEC KYGT+ ++ IPRP +G E PG
Sbjct: 368 -----EVLCLLNMVMPEELVDDEDYEEILEDIREECCKYGTVRSIEIPRP-VDGVEVPGC 421
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
GK+F+EY A C A AL+GRKF V YY D Y ++
Sbjct: 422 GKIFVEYVSAADCQKAMQALTGRKFANRVVVTKYYDPDMYHRHEF 466
>gi|147902896|ref|NP_001080595.1| U2 small nuclear RNA auxiliary factor 2 [Xenopus laevis]
gi|111185517|gb|AAH44032.2| U2af2 protein [Xenopus laevis]
Length = 456
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 165/418 (39%), Positives = 234/418 (55%), Gaps = 37/418 (8%)
Query: 107 KSKRRSGFDMAPPAAAML-----PGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 70 KKKIRKYWDIPPPGFEHITPLQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT-------- 120
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQD 179
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVP 232
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC
Sbjct: 233 GVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFC 290
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L +
Sbjct: 291 EYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMS 346
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
S + +GG + +VLCL + + L DD+EYEEI+ED+R+ECGKYG + ++ I
Sbjct: 347 SQVQ-MGGHPT-------EVLCLMNMVVPEELIDDDEYEEIVEDVRDECGKYGAVKSIEI 398
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 399 PRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDGYHRRDF 455
>gi|348510221|ref|XP_003442644.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Oreochromis niloticus]
Length = 467
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 176/467 (37%), Positives = 250/467 (53%), Gaps = 45/467 (9%)
Query: 65 RTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG-------FDMA 117
R D+HR +++D+ R R + + R+R S + S S RR+ +D+
Sbjct: 34 RGDKHRSWSKDRGSRSREKRSRSRDRKSRDRRSSSRDHKKHSHSPRRTRKKRTCKYWDVP 93
Query: 118 PPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHAR 175
PP + P Q + +A +P +A +L T A V ++ Q TR AR
Sbjct: 94 PPGFEHI----TPMQYKAMQAAGQIPTIA--LLATSTTTGVAAAPTQVPIVGSQMTRQAR 147
Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEA 235
R+YVG +P E+++A FF+ M + G S P + V+ V IN +K FAF+E R+V+E
Sbjct: 148 RLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDKNFAFLEFRSVDET 206
Query: 236 SNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
+ AMA DGIIF+G ++++RRP DY P + P P G+ S + + P
Sbjct: 207 TQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHVP-------GVVSTVV--PDSPH 257
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY FC Y D + TD A A
Sbjct: 258 KLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCEYVDISATDQAVAG 317
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK---MALQTSGMNTLGGGMS 412
LNG+++GDK L V+RA+ ++ + + +Q LQ SGM T
Sbjct: 318 LNGMQLGDKKLIVQRASVGAKNAN---PVSTSGNTPVTLQVPGLQRLQNSGMPT------ 368
Query: 413 LFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
+VLCL + + L DDE+YEEILED+REEC KYGT+ ++ IPRP +G E P
Sbjct: 369 -------EVLCLLNMVMPEELVDDEDYEEILEDIREECCKYGTVRSIEIPRP-VDGVEVP 420
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
G GK+F+EY A C A AL+GRKF V YY D Y ++
Sbjct: 421 GCGKIFVEYVSAADCQKAMQALTGRKFANRVVVTKYYDPDMYHRHEF 467
>gi|47575746|ref|NP_001001217.1| U2 small nuclear RNA auxiliary factor 2 isoform 2 [Xenopus
(Silurana) tropicalis]
gi|45709722|gb|AAH67966.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2
[Xenopus (Silurana) tropicalis]
Length = 456
Score = 278 bits (710), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 165/413 (39%), Positives = 233/413 (56%), Gaps = 27/413 (6%)
Query: 107 KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVM 166
K K R +D+ PP + P Q + +A A +LP A PV V+
Sbjct: 70 KKKIRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVV 125
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K FAF
Sbjct: 126 GSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAF 184
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++ G+ S
Sbjct: 185 LEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVST 237
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
+ + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D
Sbjct: 238 VV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDI 295
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L +S +
Sbjct: 296 NVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMSSQVQ- 350
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+GG + +VLCL + + L DD+EYEEI+ED+R+ECGKYG + ++ IPRP
Sbjct: 351 MGGHPT-------EVLCLMNMVLPEELLDDDEYEEIVEDVRDECGKYGAVKSIEIPRP-V 402
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 403 DGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDGYHRRDF 455
>gi|159487587|ref|XP_001701804.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
reinhardtii]
gi|158281023|gb|EDP06779.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
reinhardtii]
Length = 309
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 145/328 (44%), Positives = 208/328 (63%), Gaps = 20/328 (6%)
Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
FF+Q+M A G + PG V++ ++N++K+FAF+EMR VEE SNAMA DGI +G ++V
Sbjct: 1 FFNQIMMASGATTQ-PGPPVMSCFMNNDKRFAFLEMRCVEETSNAMAFDGIQCQGEVLKV 59
Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
RRP DYNP A LGP PSP +NLA +G+ + + +GP++V++GGLP +E Q+++
Sbjct: 60 RRPHDYNPAAAKLLGPTDPSPKVNLALLGVINTLV--EDGPNKVYIGGLPACLSEEQVRQ 117
Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
+L++FGTL F+LV DR+TGNSKGYGFC Y DP+VTD A L+ L + K LT RRA
Sbjct: 118 ILQAFGTLKAFNLVLDRETGNSKGYGFCEYADPSVTDSAIQGLSALIIQGKPLTARRANT 177
Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADAL 433
S ++ ++++ Q Q + S + GGG + V+ L++ ++ D L
Sbjct: 178 SAETSLTLQTLIEQQQAAL--------VSTTSPAGGGHT--------VVRLSKMVSRDDL 221
Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG-GETPGVGKVFLEYYDAVGCATAKN 492
DD EY ++L+D+ EE GKYG LV V IPRP G + PGVG VFL Y D VG A+
Sbjct: 222 LDDGEYADLLDDITEEVGKYGKLVGVEIPRPGAAGAADPPGVGLVFLCYEDTVGAKRAQV 281
Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
AL GR+FG N A +Y ++ +D++
Sbjct: 282 ALKGRQFGANVAEATFYDRARFDARDFA 309
>gi|452823554|gb|EME30563.1| U2 snRNP auxiliary factor large subunit, putative isoform 2
[Galdieria sulphuraria]
Length = 538
Score = 275 bits (702), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 157/386 (40%), Positives = 219/386 (56%), Gaps = 48/386 (12%)
Query: 148 LPFGATQLGAFPLMPV-QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
L F A P+ P Q TQQAT+HARR+YVG LP E +A FF+ + G
Sbjct: 181 LDFSALSQYMIPVAPTTQPNTQQATKHARRLYVGNLPSDVTESEVADFFNSALYLAKGVD 240
Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
PGD V +VY+N +K+FAF+E+ + EA+ A+ +DG++F G+++R+RRP DYNP + A
Sbjct: 241 V-PGDPVQSVYLNLDKRFAFIELNSAAEAAAAIQMDGVLFRGMSLRMRRPNDYNPNIHAP 299
Query: 267 LGPGQPSPNLNLAA----------VGLASGAIGGA-----EGPDRVFVGGLPYYFTETQI 311
+ P P L +G A+G +GPD+VF+GGLPY+ TE QI
Sbjct: 300 VYP----PVCQLLTCFLGYIEKFQIGFDPSALGVVSTQVPDGPDKVFIGGLPYHLTEDQI 355
Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
KE+L S+G L+ F+LVKD +TG SKGY F Y+DP++ + A LNG+ MGDKTLTVRRA
Sbjct: 356 KEILSSYGPLNAFNLVKDPNTGLSKGYAFFQYKDPSIVEAAIKGLNGMTMGDKTLTVRRA 415
Query: 372 TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITAD 431
+ +SG LG S ++L L + +
Sbjct: 416 SQV--------------------------SSGSVELGQSFSPTVRYPTRILELRNMVEPE 449
Query: 432 ALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ-NGGETPGVGKVFLEYYDAVGCATA 490
L DDEEYE+I+ED+REE KYG + V IPRP + + PG+GKVF+ + A
Sbjct: 450 ELVDDEEYEDIIEDVREESSKYGEVTEVKIPRPSKTDEANPPGLGKVFVSFKTVSDAEKA 509
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFN 516
AL+GR+FGG +V A YY E++Y++
Sbjct: 510 FAALTGRRFGGKSVIANYYDEERYYS 535
>gi|45387787|ref|NP_991252.1| U2 small nuclear RNA auxiliary factor 2b [Danio rerio]
gi|41389016|gb|AAH65869.1| U2 small nuclear RNA auxiliary factor 2b [Danio rerio]
Length = 475
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 171/447 (38%), Positives = 243/447 (54%), Gaps = 36/447 (8%)
Query: 76 KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
KERRHR RS + + S SP R K K + +D+ PP + P Q
Sbjct: 61 KERRHR---RSDHTQNHPQENVSRSPHRE-KKKKIKKYWDVPPPGFEHI----TPMQYKA 112
Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
+ +A A +LP A PV V+ Q TR ARR+YVG +P E+++ FF
Sbjct: 113 MQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFF 172
Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+ ++++RR
Sbjct: 173 NAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQAQSLKIRR 231
Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIK 312
P DY P PG S N ++ G+ S + PD ++F+GGLP Y + Q+K
Sbjct: 232 PHDYQPL------PGM-SENPSVYVPGVVSTVV-----PDSIHKLFIGGLPNYLNDDQVK 279
Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
ELL SFG L F+LVKD TG SKGY FC Y D V D A A LNG+++ DK L V+RA+
Sbjct: 280 ELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNVNDQAIAGLNGMQLADKKLLVQRAS 339
Query: 373 ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
++ T + + + + +Q L ++ M +GG +VLCL + +
Sbjct: 340 VGAKNAT----MTSINETPVTLQVPGLTSNPMIQMGG-------IPTEVLCLMNMVAPEE 388
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
L DDEEYEEI+ED++EEC KYG + ++ IPRP +G + PG GK+F+E+ A
Sbjct: 389 LIDDEEYEEIVEDVKEECSKYGQVKSIEIPRP-VDGLDIPGTGKIFVEFTSVYDSQKAMQ 447
Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
L+GRKF V Y D Y +D+
Sbjct: 448 GLTGRKFANRVVVTKYCDPDAYHRRDF 474
>gi|410903107|ref|XP_003965035.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
[Takifugu rubripes]
Length = 461
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/358 (42%), Positives = 209/358 (58%), Gaps = 27/358 (7%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ Q TR ARR+YVG +P E+++A FF+ M + G S P + V+ V IN +K
Sbjct: 130 VPVVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDK 188
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P + P P G
Sbjct: 189 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVP-------G 241
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + + P ++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY FC
Sbjct: 242 VVSTVV--PDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 299
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQT 401
Y D + TD A A LNG+++GDK L V+RA+ ++ I A + +Q+ LQ
Sbjct: 300 YVDVSATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPSAIIEAPVTLQVPGLQR--LQN 357
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
SGM T +VLCL + + L DD++YEEILED+REEC KYG++ ++ I
Sbjct: 358 SGMPT-------------EVLCLLNMVMPEELVDDDDYEEILEDVREECCKYGSVRSIEI 404
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PRP +G + PG GK+F+EY A C A AL+GRKF V YY D Y ++
Sbjct: 405 PRP-VDGVDVPGCGKIFVEYVSASDCQKAMQALTGRKFANRVVVTKYYDLDLYHRHEF 461
>gi|410903105|ref|XP_003965034.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Takifugu rubripes]
Length = 454
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/358 (42%), Positives = 209/358 (58%), Gaps = 27/358 (7%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ Q TR ARR+YVG +P E+++A FF+ M + G S P + V+ V IN +K
Sbjct: 123 VPVVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDK 181
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P + P P G
Sbjct: 182 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVP-------G 234
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + + P ++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY FC
Sbjct: 235 VVSTVV--PDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 292
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQT 401
Y D + TD A A LNG+++GDK L V+RA+ ++ I A + +Q+ LQ
Sbjct: 293 YVDVSATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPSAIIEAPVTLQVPGLQR--LQN 350
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
SGM T +VLCL + + L DD++YEEILED+REEC KYG++ ++ I
Sbjct: 351 SGMPT-------------EVLCLLNMVMPEELVDDDDYEEILEDVREECCKYGSVRSIEI 397
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PRP +G + PG GK+F+EY A C A AL+GRKF V YY D Y ++
Sbjct: 398 PRP-VDGVDVPGCGKIFVEYVSASDCQKAMQALTGRKFANRVVVTKYYDLDLYHRHEF 454
>gi|410903103|ref|XP_003965033.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Takifugu rubripes]
Length = 446
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/358 (42%), Positives = 209/358 (58%), Gaps = 27/358 (7%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ Q TR ARR+YVG +P E+++A FF+ M + G S P + V+ V IN +K
Sbjct: 115 VPVVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDK 173
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P + P P G
Sbjct: 174 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVP-------G 226
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + + P ++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY FC
Sbjct: 227 VVSTVV--PDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 284
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQT 401
Y D + TD A A LNG+++GDK L V+RA+ ++ I A + +Q+ LQ
Sbjct: 285 YVDVSATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPSAIIEAPVTLQVPGLQR--LQN 342
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
SGM T +VLCL + + L DD++YEEILED+REEC KYG++ ++ I
Sbjct: 343 SGMPT-------------EVLCLLNMVMPEELVDDDDYEEILEDVREECCKYGSVRSIEI 389
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PRP +G + PG GK+F+EY A C A AL+GRKF V YY D Y ++
Sbjct: 390 PRP-VDGVDVPGCGKIFVEYVSASDCQKAMQALTGRKFANRVVVTKYYDLDLYHRHEF 446
>gi|389610875|dbj|BAM19048.1| U2 small nuclear riboprotein auxiliary factor 50 [Papilio polytes]
Length = 422
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 150/361 (41%), Positives = 212/361 (58%), Gaps = 21/361 (5%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ FF+Q M + G + G+ V+ I
Sbjct: 83 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQM-HLSGLAQAAGNPVLACQI 141
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P +N+
Sbjct: 142 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGTENPAINV 195
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A G+ S + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 196 PA-GVISTVV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGY 252
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
F Y D ++TD A A LNG+++GDK L V+RA+ ++ T I Q +
Sbjct: 253 AFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTLGVYI----QSMTGAAPVT 308
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
LQ +G+ G G + +VLCL +T D L D+EEYE+ILED++EEC KYG + +
Sbjct: 309 LQVAGLTLAGAGPA------TEVLCLLNMVTPDELRDEEEYEDILEDIKEECNKYGCVRS 362
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP + G + PG GKVF+E+ C A+ L+GRKF V YY DKY ++
Sbjct: 363 IEIPRPLE-GVDVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYYDPDKYHRRE 421
Query: 519 Y 519
+
Sbjct: 422 F 422
>gi|325191172|emb|CCA25959.1| splicing factor U2af large subunit putative [Albugo laibachii Nc14]
Length = 553
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 153/359 (42%), Positives = 218/359 (60%), Gaps = 16/359 (4%)
Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFA 225
M QQ TRHARR+YVGG+ + NE I+ FF+ V+ G G AVV+VYIN E+ FA
Sbjct: 207 MAQQ-TRHARRIYVGGIGEV-NETEISAFFNDVIDRALGERQ-EGGAVVSVYINRERHFA 263
Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA-LGPGQPSPNLNLAAVGLA 284
FVE++++E + M LDGI F G ++VRRP DYNP L LGP P LNLAA+G+
Sbjct: 264 FVELKSIELTTACMNLDGIAFRGQPLKVRRPNDYNPGLVPKDLGP---IPALNLAALGIV 320
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
S + +GP +VF+GG+PY+ TE QIKELL++FG L F LVKD T SKGY FC Y
Sbjct: 321 STTV--QDGPGKVFIGGIPYHLTEEQIKELLQAFGPLKSFHLVKDLTTNLSKGYAFCEYM 378
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE---SILAQAQQHIAIQKMALQT 401
D VTD AC LN +K+GD+TLTVRRA + +K ++ A + + + A+QT
Sbjct: 379 DSGVTDAACIGLNDMKLGDRTLTVRRALSQESAKVIANAAGTVNAGVEMGLDPSRAAMQT 438
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
M G + G + ++VL L +T + L D++EY +I++D+R EC +YG + +++
Sbjct: 439 ISM--AGVHLGPIG-SPSRVLVLRNMVTPEELEDEDEYRDIMDDIRSECERYGRVTTIIL 495
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
PR + G+ +GKV++E+ D A N L GR F V+A Y E ++ ++ +
Sbjct: 496 PRAKEGYGDE-ALGKVYIEFGDISTSQAAANELHGRGFANRVVSAQYMEEAQFERRELT 553
>gi|114052735|ref|NP_001040494.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 [Bombyx mori]
gi|95103122|gb|ABF51502.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 1
[Bombyx mori]
Length = 417
Score = 271 bits (692), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 149/361 (41%), Positives = 211/361 (58%), Gaps = 26/361 (7%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ FF+Q M + G + G+ V+ I
Sbjct: 83 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQM-HLSGLAQAAGNPVLACQI 141
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P +N+
Sbjct: 142 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGTENPAINV 195
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A G+ S + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 196 PA-GVISTVV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGY 252
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
F Y D ++TD A A LNG+++GDK L V+RA+ ++ T + A Q
Sbjct: 253 AFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTLALTGAAPVQ--------- 303
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
+Q +G+ G G +VLCL +T D L D+EEYE+ILED++EEC KYG + +
Sbjct: 304 IQVAGLTLAGAGPP------TEVLCLLNMVTPDELRDEEEYEDILEDIKEECNKYGVVRS 357
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP + G E PG GKVF+E+ C A+ L+GRKF V Y+ DKY ++
Sbjct: 358 IEIPRPIE-GVEVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYFDPDKYHRRE 416
Query: 519 Y 519
+
Sbjct: 417 F 417
>gi|357623461|gb|EHJ74600.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 [Danaus
plexippus]
Length = 350
Score = 270 bits (691), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 150/361 (41%), Positives = 212/361 (58%), Gaps = 26/361 (7%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ FF+Q M + G + G+ V+ I
Sbjct: 16 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQM-HLSGLAQAAGNPVLACQI 74
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P +N+
Sbjct: 75 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGTENPAINV 128
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A G+ S + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 129 PA-GVISTVV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGY 185
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
F Y D ++TD A A LNG+++GDK L V+RA+ ++ T LA +
Sbjct: 186 AFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNST-----LAMT----GAAPVT 236
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
LQ +G+ G G + +VLCL +T D L D+EEYE+ILED++EEC KYG + +
Sbjct: 237 LQVAGLTLAGAGPA------TEVLCLLNMVTPDELRDEEEYEDILEDIKEECNKYGCVRS 290
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP + G E PG GKVF+E+ C A+ L+GRKF V Y+ DKY ++
Sbjct: 291 IEIPRPIE-GVEVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYFDPDKYHRRE 349
Query: 519 Y 519
+
Sbjct: 350 F 350
>gi|62859443|ref|NP_001016998.1| U2 small nuclear RNA auxiliary factor 2 isoform 1 [Xenopus
(Silurana) tropicalis]
gi|89269799|emb|CAJ83531.1| U2 (RNU2) small nuclear RNA auxiliary factor 2 [Xenopus (Silurana)
tropicalis]
gi|115292148|gb|AAI22001.1| U2 small nuclear RNA auxiliary factor 2 [Xenopus (Silurana)
tropicalis]
Length = 465
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 165/422 (39%), Positives = 233/422 (55%), Gaps = 36/422 (8%)
Query: 107 KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVM 166
K K R +D+ PP + P Q + +A A +LP A PV V+
Sbjct: 70 KKKIRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVV 125
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K FAF
Sbjct: 126 GSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAF 184
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++ G+ S
Sbjct: 185 LEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVST 237
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQI---------KELLESFGTLHGFDLVKDRDTGNSKG 337
+ + ++F+GGLP Y + Q+ KELL SFG L F+LVKD TG SKG
Sbjct: 238 VV--PDSAHKLFIGGLPNYLNDDQVTMESLSLWVKELLTSFGPLKAFNLVKDSATGLSKG 295
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q
Sbjct: 296 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVP 351
Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
L +S + +GG + +VLCL + + L DD+EYEEI+ED+R+ECGKYG +
Sbjct: 352 GLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDDEYEEIVEDVRDECGKYGAVK 403
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
++ IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +
Sbjct: 404 SIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDGYHRR 462
Query: 518 DY 519
D+
Sbjct: 463 DF 464
>gi|289741197|gb|ADD19346.1| splicing factor U2AF large subunit [Glossina morsitans morsitans]
Length = 423
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 155/391 (39%), Positives = 217/391 (55%), Gaps = 29/391 (7%)
Query: 134 PGVPSAVPEMAQNMLPFGATQLGAFPLMP---VQVMTQQATRHARRVYVGGLPPLANEQA 190
PG P + M G A P +P V V+ TR ARR+YVG +P E
Sbjct: 57 PGFEHITPLQYKAMQAAGQIPANALPEIPQAAVPVVGSTITRQARRLYVGNIPFGVTEDE 116
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M G A G+ V+ IN +K FAF+E R+ +E + AMA DGI F+G +
Sbjct: 117 MMEFFNQQMHLTGLAQAA-GNPVLACQINLDKNFAFLEFRSTDETTQAMAFDGISFKGQS 175
Query: 251 VRVRRPTDYNPTLAAALGPG--QPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTE 308
+++RRP DY P PG + +P A G+ S + + P ++F+GGLP Y E
Sbjct: 176 LKIRRPHDYQPM------PGVVESTPVAQPVANGVISAVV--PDSPHKIFIGGLPNYLNE 227
Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD A A LNG+++GDK L V
Sbjct: 228 DQVKELLLSFGQLRAFNLVKDAATGLSKGYAFCEYIDHSITDQAIAGLNGMQLGDKKLIV 287
Query: 369 RRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
+RA+ ++ H + +Q G++ +G +VLCL +
Sbjct: 288 QRASVGAKNAQ---------NNHTTAAPVMIQVPGLSMVG-----ISGPPTEVLCLLNMV 333
Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA 488
T D L D+EEYE+ILED++EEC KYG + +V IPRP + G + PG GKVF+E+ + C
Sbjct: 334 TPDELRDEEEYEDILEDIKEECNKYGVVRSVEIPRPIE-GVDVPGCGKVFVEFNSVMDCQ 392
Query: 489 TAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A+ AL+GRKF V Y+ DKY +++
Sbjct: 393 KAQQALTGRKFSDRVVVTSYFDPDKYHRREF 423
>gi|195167317|ref|XP_002024480.1| GL15893 [Drosophila persimilis]
gi|198469588|ref|XP_001355063.2| GA22177 [Drosophila pseudoobscura pseudoobscura]
gi|194107878|gb|EDW29921.1| GL15893 [Drosophila persimilis]
gi|198146942|gb|EAL32119.2| GA22177 [Drosophila pseudoobscura pseudoobscura]
Length = 418
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 155/392 (39%), Positives = 222/392 (56%), Gaps = 44/392 (11%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 68 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 110
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G +
Sbjct: 111 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 169
Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
+++RRP DY P PG +P + A V +SG I + P ++F+GGLP Y
Sbjct: 170 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 221
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L
Sbjct: 222 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 281
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
V+RA+ ++ + + Q + LQ G++T+ + +VLCL
Sbjct: 282 VQRASVGAKNAQNASN---------SSQSVMLQVPGLSTV-----VSSGPPTEVLCLLNM 327
Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ + C
Sbjct: 328 VTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDC 386
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A+ AL+GRKF V Y+ DKY +++
Sbjct: 387 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 418
>gi|194770152|ref|XP_001967161.1| GF19596 [Drosophila ananassae]
gi|190619281|gb|EDV34805.1| GF19596 [Drosophila ananassae]
Length = 416
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 155/392 (39%), Positives = 221/392 (56%), Gaps = 44/392 (11%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 66 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G +
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 167
Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
+++RRP DY P PG +P + A V +SG I + P ++F+GGLP Y
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L
Sbjct: 220 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
V+RA+ ++ + Q + LQ G++T+ + +VLCL
Sbjct: 280 VQRASVGAKNAQNASN---------TTQSVMLQVPGLSTV-----VTSGPPTEVLCLLNM 325
Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ + C
Sbjct: 326 VTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDC 384
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A+ AL+GRKF V Y+ DKY +++
Sbjct: 385 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416
>gi|195448282|ref|XP_002071589.1| GK10063 [Drosophila willistoni]
gi|194167674|gb|EDW82575.1| GK10063 [Drosophila willistoni]
Length = 416
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 155/392 (39%), Positives = 221/392 (56%), Gaps = 44/392 (11%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 66 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G +
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 167
Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
+++RRP DY P PG +P + A V +SG I + P ++F+GGLP Y
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L
Sbjct: 220 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
V+RA+ ++ + Q + LQ G++T+ + +VLCL
Sbjct: 280 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSTV-----VTSGPPTEVLCLLNM 325
Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ + C
Sbjct: 326 VTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDC 384
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A+ AL+GRKF V Y+ DKY +++
Sbjct: 385 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416
>gi|291241059|ref|XP_002740425.1| PREDICTED: U2 (RNU2) small nuclear RNA auxiliary factor 2-like
[Saccoglossus kowalevskii]
Length = 466
Score = 268 bits (684), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 168/421 (39%), Positives = 240/421 (57%), Gaps = 27/421 (6%)
Query: 101 PSRS--PSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
PSRS P + K +D+ PP + P Q + A ++ Q L Q A
Sbjct: 71 PSRSARPKRKKPFMYWDIPPPGFEHI----APLQYKAMQGA-GQIPQTALENQMAQAAAN 125
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
MP+ + Q TR ARR+YVG +P E+A+ FF++ M + A G+ V+ V I
Sbjct: 126 SNMPI--VGSQMTRQARRLYVGNIPFGVTEEAMMDFFNRQMKSFRITQAQ-GNPVLAVQI 182
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
+ K FAF+E R+V+E + AMA DGI+F+G ++++RRP DY P A P P+
Sbjct: 183 DLNKNFAFLEFRSVDETTQAMAFDGILFQGQSLKIRRPKDYQPVPGMAEMPSVHVPDYLF 242
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
+ G+ S + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD T SKGY
Sbjct: 243 SPTGVVSTVV--PDSPHKIFIGGLPNYLNEDQVKELLTSFGELKAFNLVKDSATSLSKGY 300
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
FC Y D +TD A A LNG+++GDK L V+RA+ ++ AQ Q IA ++
Sbjct: 301 AFCEYIDEKITDQAIAGLNGMQLGDKKLIVQRASVGAKN--------AQTAQMIA--QLN 350
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
+Q G+N G L G T +VLCL +T D L D+EEYEEIL+D+R+ECGKYG + +
Sbjct: 351 IQVPGVNI---GQGLVGPT-TEVLCLMNMVTPDELQDEEEYEEILDDVRQECGKYGQVRS 406
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP + G E PG GK+F+E+ + + A+ AL+GRKF V + DKY ++
Sbjct: 407 LEIPRPIE-GVEVPGCGKIFVEFTNVLESQKAQTALAGRKFNNRIVVTSFCDPDKYHRRE 465
Query: 519 Y 519
+
Sbjct: 466 F 466
>gi|255089803|ref|XP_002506823.1| predicted protein [Micromonas sp. RCC299]
gi|226522096|gb|ACO68081.1| predicted protein [Micromonas sp. RCC299]
Length = 554
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 163/358 (45%), Positives = 209/358 (58%), Gaps = 23/358 (6%)
Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG--PGD---AVVNVYINH 220
+ QATRHARRVYVGG P NE +A+F + + AIGG S P + V++VYIN
Sbjct: 209 INVQATRHARRVYVGGFPDNTNEPELASFIANALVAIGGASGAYDPDNGMTCVLSVYINR 268
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+K FAFVE RTVEEASNAMALDG++ G +RVRRP DY P AA +GP P+ +LNLAA
Sbjct: 269 DKLFAFVEFRTVEEASNAMALDGVVMAGSQLRVRRPNDYQPQQAALIGPTTPADSLNLAA 328
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
VGL G G + G +++VG LP Y TE Q+ ELL+SFG + F+LV D+DTG KGYGF
Sbjct: 329 VGLIPGVNGQSSG-RKLYVGNLPPYLTELQVLELLQSFGAVQAFNLVVDKDTGTLKGYGF 387
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
Y D A + A L G+++GDK L V+RA G A Q
Sbjct: 388 FEYADAAADEAAMEGLTGMRLGDKVLNVKRAAYDGGVGQGVGQASGSA-----------Q 436
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
G G + GE+ ++ + LT +T + L D E EILED +EEC +G L VV
Sbjct: 437 APGFAP--GSLPANGESASECVRLTNMVTREELTDPTEAREILEDTQEECAGFGELTRVV 494
Query: 461 IPRPDQNGGETP-GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE---DKY 514
+P P + E P GVG+VFL + DA G A A +L+GRKF V+A + DKY
Sbjct: 495 MPLPRRTRLEDPAGVGEVFLLFADAEGAARAVRSLNGRKFADRVVSAGFITRAEFDKY 552
>gi|386764548|ref|NP_001245708.1| U2 small nuclear riboprotein auxiliary factor 50, isoform B
[Drosophila melanogaster]
gi|383293438|gb|AFH07421.1| U2 small nuclear riboprotein auxiliary factor 50, isoform B
[Drosophila melanogaster]
Length = 427
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 156/393 (39%), Positives = 220/393 (55%), Gaps = 46/393 (11%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 77 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 119
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G +
Sbjct: 120 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 178
Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
+++RRP DY P PG +P + A V +SG I + P ++F+GGLP Y
Sbjct: 179 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 230
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L
Sbjct: 231 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 290
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM-NTLGGGMSLFGETLAKVLCLTE 426
V+RA+ ++ + Q + LQ G+ N + G +VLCL
Sbjct: 291 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSNVVTSGPP------TEVLCLLN 335
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
+T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ +
Sbjct: 336 MVTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLD 394
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
C A+ AL+GRKF V Y+ DKY +++
Sbjct: 395 CQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 427
>gi|17136764|ref|NP_476891.1| U2 small nuclear riboprotein auxiliary factor 50, isoform A
[Drosophila melanogaster]
gi|386764552|ref|NP_001245710.1| U2 small nuclear riboprotein auxiliary factor 50, isoform D
[Drosophila melanogaster]
gi|195351420|ref|XP_002042232.1| GM13406 [Drosophila sechellia]
gi|195479195|ref|XP_002100800.1| GE15975 [Drosophila yakuba]
gi|195555160|ref|XP_002077042.1| GD24494 [Drosophila simulans]
gi|4033485|sp|Q24562.1|U2AF2_DROME RecName: Full=Splicing factor U2AF 50 kDa subunit; AltName: Full=U2
auxiliary factor 50 kDa subunit; AltName: Full=U2 snRNP
auxiliary factor large subunit
gi|349761|gb|AAA03548.1| RNA binding protein [Drosophila melanogaster]
gi|7293214|gb|AAF48596.1| U2 small nuclear riboprotein auxiliary factor 50, isoform A
[Drosophila melanogaster]
gi|17861976|gb|AAL39465.1| LD03714p [Drosophila melanogaster]
gi|194124075|gb|EDW46118.1| GM13406 [Drosophila sechellia]
gi|194188324|gb|EDX01908.1| GE15975 [Drosophila yakuba]
gi|194203060|gb|EDX16636.1| GD24494 [Drosophila simulans]
gi|220943258|gb|ACL84172.1| U2af50-PA [synthetic construct]
gi|220953438|gb|ACL89262.1| U2af50-PA [synthetic construct]
gi|383293440|gb|AFH07423.1| U2 small nuclear riboprotein auxiliary factor 50, isoform D
[Drosophila melanogaster]
Length = 416
Score = 267 bits (683), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 156/393 (39%), Positives = 220/393 (55%), Gaps = 46/393 (11%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 66 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G +
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 167
Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
+++RRP DY P PG +P + A V +SG I + P ++F+GGLP Y
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L
Sbjct: 220 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM-NTLGGGMSLFGETLAKVLCLTE 426
V+RA+ ++ + Q + LQ G+ N + G +VLCL
Sbjct: 280 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSNVVTSGPP------TEVLCLLN 324
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
+T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ +
Sbjct: 325 MVTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLD 383
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
C A+ AL+GRKF V Y+ DKY +++
Sbjct: 384 CQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416
>gi|194893848|ref|XP_001977952.1| GG19328 [Drosophila erecta]
gi|190649601|gb|EDV46879.1| GG19328 [Drosophila erecta]
Length = 416
Score = 267 bits (683), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 156/393 (39%), Positives = 220/393 (55%), Gaps = 46/393 (11%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 66 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G +
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 167
Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
+++RRP DY P PG +P + A V +SG I + P ++F+GGLP Y
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L
Sbjct: 220 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM-NTLGGGMSLFGETLAKVLCLTE 426
V+RA+ ++ + Q + LQ G+ N + G +VLCL
Sbjct: 280 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSNVVTSGPP------TEVLCLLN 324
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
+T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ +
Sbjct: 325 MVTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLD 383
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
C A+ AL+GRKF V Y+ DKY +++
Sbjct: 384 CQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416
>gi|386764550|ref|NP_001245709.1| U2 small nuclear riboprotein auxiliary factor 50, isoform C
[Drosophila melanogaster]
gi|383293439|gb|AFH07422.1| U2 small nuclear riboprotein auxiliary factor 50, isoform C
[Drosophila melanogaster]
Length = 360
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 156/393 (39%), Positives = 220/393 (55%), Gaps = 46/393 (11%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 10 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 52
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G +
Sbjct: 53 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 111
Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
+++RRP DY P PG +P + A V +SG I + P ++F+GGLP Y
Sbjct: 112 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 163
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L
Sbjct: 164 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 223
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM-NTLGGGMSLFGETLAKVLCLTE 426
V+RA+ ++ + Q + LQ G+ N + G +VLCL
Sbjct: 224 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSNVVTSGPP------TEVLCLLN 268
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
+T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ +
Sbjct: 269 MVTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLD 327
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
C A+ AL+GRKF V Y+ DKY +++
Sbjct: 328 CQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 360
>gi|328721670|ref|XP_001951521.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 1
[Acyrthosiphon pisum]
Length = 416
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 160/430 (37%), Positives = 227/430 (52%), Gaps = 50/430 (11%)
Query: 95 RSKSLSPSRSPSKSKRRSGFDMAPP-----AAAMLPGAAVPGQLPGVPSAVPEMAQNMLP 149
RS+S SP + K +D+ PP A GQ+P + +P+ Q +P
Sbjct: 32 RSRSKSPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIPA--NTMPDTPQTAVP 89
Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
V+ TR ARR+YVG +P E + FF+Q M + G +
Sbjct: 90 ---------------VVGSTITRQARRLYVGNIPFGVTEDEMMEFFNQQM-HLSGLAQAA 133
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G+ V+ IN +K FAF+E R+++E + AMA DGI F+G ++++RRP DY PT +
Sbjct: 134 GNPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPT--PGMTE 191
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
P N N SG P ++F+GGLP Y + Q+KELL SFG L F+LVKD
Sbjct: 192 SNPVTNYN-------SGMTLDMNSPHKIFIGGLPAYLNDEQVKELLTSFGQLKAFNLVKD 244
Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
TG SKGY FC Y D +TD A A LNG+++G+K L V+RA+ ++ L Q
Sbjct: 245 AATGLSKGYAFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASIGAKNPG-----LGQV- 298
Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
+ +Q G+ +G +VLCL +T D L D+EEYE+ILED+REE
Sbjct: 299 ------PVTIQVPGLTVVGT-----AGPPTEVLCLLNMVTPDELKDEEEYEDILEDIREE 347
Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
C KYG + ++ IPRP + G + PG GKVF+E+ V C A+ AL+GRKF V +
Sbjct: 348 CNKYGVVRSLEIPRPIE-GIDVPGCGKVFIEFNAIVDCQKAQQALAGRKFNNRVVVTSFM 406
Query: 510 PEDKYFNKDY 519
DKY +++
Sbjct: 407 EPDKYHRREF 416
>gi|195042782|ref|XP_001991497.1| GH12033 [Drosophila grimshawi]
gi|195134983|ref|XP_002011915.1| GI14308 [Drosophila mojavensis]
gi|193901255|gb|EDW00122.1| GH12033 [Drosophila grimshawi]
gi|193909169|gb|EDW08036.1| GI14308 [Drosophila mojavensis]
Length = 416
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 157/392 (40%), Positives = 223/392 (56%), Gaps = 44/392 (11%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 66 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQD 167
Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
+++RRP DY P PG +P + A V +SG I + P ++F+GGLP Y
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAVKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L
Sbjct: 220 DEQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
V+RA+ ++ AQ + + Q + LQ G++T+ + +VLCL
Sbjct: 280 VQRASVGAKN--------AQNAANTS-QSVMLQVPGLSTV-----VTSGPPTEVLCLLNM 325
Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ + C
Sbjct: 326 VTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDC 384
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A+ AL+GRKF V Y+ DKY +++
Sbjct: 385 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416
>gi|156404394|ref|XP_001640392.1| predicted protein [Nematostella vectensis]
gi|156227526|gb|EDO48329.1| predicted protein [Nematostella vectensis]
Length = 332
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 149/350 (42%), Positives = 204/350 (58%), Gaps = 21/350 (6%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TR ARR+YVG +P E + FF+ M N+A PG+ V+ IN E+ FAF+E+R
Sbjct: 2 TRQARRLYVGNIPFGVTENLMIEFFNAKMKEAKLNTA-PGNPVIAAQINTEQNFAFIELR 60
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+VEE + AMA DGII +G A+++RRP DY P PG S N ++ G+ S +
Sbjct: 61 SVEETTQAMAFDGIILQGQALKIRRPKDYQPI------PGM-SENASVHVPGVVSTVV-- 111
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
+ P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKGY FC Y D +TD
Sbjct: 112 PDSPHKIFIGGLPNYLNEDQVKELLSSFGELRAFNLVKDSATGLSKGYAFCEYVDLGITD 171
Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
+A LNG+++GDK L V+RA+ + QA + Q LQ G++
Sbjct: 172 VAIQGLNGMQLGDKKLIVQRASVGAKQNLNN----PQAMNMVPAQ---LQIPGLDI---S 221
Query: 411 MSLFGETLA-KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
M++ G A +VL L +T D L DDEE+EEI +D+REEC KYG + ++ IPRP +
Sbjct: 222 MAVPGAVAATEVLALMNMVTPDELGDDEEFEEIYDDVREECSKYGRVKSMEIPRPMEGLM 281
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
E PGVGK+F+E+ A AL GRKF V YYP ++Y + +
Sbjct: 282 EPPGVGKIFVEFSSIDDAKKAAAALGGRKFANRVVVTSYYPPEEYHRRIF 331
>gi|328721668|ref|XP_003247369.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 2
[Acyrthosiphon pisum]
Length = 451
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 162/437 (37%), Positives = 231/437 (52%), Gaps = 50/437 (11%)
Query: 95 RSKSLSPSRSPSKSKRRSGFDMAPP-----AAAMLPGAAVPGQLPGVPSAVPEMAQNMLP 149
RS+S SP + K +D+ PP A GQ+P + +P+ Q +P
Sbjct: 53 RSRSKSPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIPA--NTMPDTPQTAVP 110
Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
V+ TR ARR+YVG +P E + FF+Q M + G +
Sbjct: 111 ---------------VVGSTITRQARRLYVGNIPFGVTEDEMMEFFNQQM-HLSGLAQAA 154
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G+ V+ IN +K FAF+E R+++E + AMA DGI F+G ++++RRP DY PT +
Sbjct: 155 GNPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPT--PGMTE 212
Query: 270 GQPSPNLN----LAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLH 322
P N N L + S + G PD ++F+GGLP Y + Q+KELL SFG L
Sbjct: 213 SNPVTNYNSGMTLDMMKYDSSSFGLGTVPDSPHKIFIGGLPAYLNDEQVKELLTSFGQLK 272
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F+LVKD TG SKGY FC Y D +TD A A LNG+++G+K L V+RA+ ++
Sbjct: 273 AFNLVKDAATGLSKGYAFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASIGAKNPG--- 329
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
L Q + +Q G+ +G +VLCL +T D L D+EEYE+I
Sbjct: 330 --LGQV-------PVTIQVPGLTVVGT-----AGPPTEVLCLLNMVTPDELKDEEEYEDI 375
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
LED+REEC KYG + ++ IPRP + G + PG GKVF+E+ V C A+ AL+GRKF
Sbjct: 376 LEDIREECNKYGVVRSLEIPRPIE-GIDVPGCGKVFIEFNAIVDCQKAQQALAGRKFNNR 434
Query: 503 TVNAFYYPEDKYFNKDY 519
V + DKY +++
Sbjct: 435 VVVTSFMEPDKYHRREF 451
>gi|270011684|gb|EFA08132.1| hypothetical protein TcasGA2_TC005736 [Tribolium castaneum]
Length = 432
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 143/363 (39%), Positives = 208/363 (57%), Gaps = 33/363 (9%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 101 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 159
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY QP P +
Sbjct: 160 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMAE 208
Query: 279 AAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
+++ + +G I + P ++F+GGLP Y E Q+KELL SFG L F+LVKD G SK
Sbjct: 209 SSISVPAGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSK 268
Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
GY F Y D +TD A A LNG+++GDK L V+RA+ ++ T I
Sbjct: 269 GYAFAEYIDITMTDQAIAGLNGMQLGDKRLIVQRASVGAKNAT-------------VIPA 315
Query: 397 MALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
+ +Q G++ +G +VLCL +T D L D+EEYE+ILED++EEC KYG +
Sbjct: 316 VQIQVPGLSLVGA-----SGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEECNKYGVV 370
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ IPRP +G E PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 371 RSIEIPRP-IDGVEVPGCGKVFVEFNSVLDCQKAQQTLTGRKFSNRVVVTSYFDPDKYHR 429
Query: 517 KDY 519
+++
Sbjct: 430 REF 432
>gi|410903109|ref|XP_003965036.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 4
[Takifugu rubripes]
Length = 455
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 151/367 (41%), Positives = 209/367 (56%), Gaps = 36/367 (9%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ Q TR ARR+YVG +P E+++A FF+ M + G S P + V+ V IN +K
Sbjct: 115 VPVVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDK 173
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P + P P G
Sbjct: 174 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVP-------G 226
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQI---------KELLESFGTLHGFDLVKDRDTG 333
+ S + + P ++F+GGLP Y + Q+ KELL SFG L F+LVKD T
Sbjct: 227 VVSTVV--PDSPHKLFIGGLPNYLNDDQVLIRRLGWRVKELLTSFGPLKAFNLVKDSATS 284
Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI- 392
SKGY FC Y D + TD A A LNG+++GDK L V+RA+ ++ I A +
Sbjct: 285 LSKGYAFCEYVDVSATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPSAIIEAPVTLQVP 344
Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
+Q+ LQ SGM T +VLCL + + L DD++YEEILED+REEC K
Sbjct: 345 GLQR--LQNSGMPT-------------EVLCLLNMVMPEELVDDDDYEEILEDVREECCK 389
Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
YG++ ++ IPRP +G + PG GK+F+EY A C A AL+GRKF V YY D
Sbjct: 390 YGSVRSIEIPRP-VDGVDVPGCGKIFVEYVSASDCQKAMQALTGRKFANRVVVTKYYDLD 448
Query: 513 KYFNKDY 519
Y ++
Sbjct: 449 LYHRHEF 455
>gi|91088649|ref|XP_974465.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
factor 2 [Tribolium castaneum]
Length = 450
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 143/363 (39%), Positives = 208/363 (57%), Gaps = 33/363 (9%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 119 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 177
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY QP P +
Sbjct: 178 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMAE 226
Query: 279 AAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
+++ + +G I + P ++F+GGLP Y E Q+KELL SFG L F+LVKD G SK
Sbjct: 227 SSISVPAGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSK 286
Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
GY F Y D +TD A A LNG+++GDK L V+RA+ ++ T I
Sbjct: 287 GYAFAEYIDITMTDQAIAGLNGMQLGDKRLIVQRASVGAKNAT-------------VIPA 333
Query: 397 MALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
+ +Q G++ +G +VLCL +T D L D+EEYE+ILED++EEC KYG +
Sbjct: 334 VQIQVPGLSLVGA-----SGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEECNKYGVV 388
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ IPRP +G E PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 389 RSIEIPRP-IDGVEVPGCGKVFVEFNSVLDCQKAQQTLTGRKFSNRVVVTSYFDPDKYHR 447
Query: 517 KDY 519
+++
Sbjct: 448 REF 450
>gi|115496604|ref|NP_001068804.1| splicing factor U2AF 65 kDa subunit [Bos taurus]
gi|89994093|gb|AAI14161.1| U2 small nuclear RNA auxiliary factor 2 [Bos taurus]
gi|296477253|tpg|DAA19368.1| TPA: U2 (RNU2) small nuclear RNA auxiliary factor 2 [Bos taurus]
Length = 475
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 169/420 (40%), Positives = 236/420 (56%), Gaps = 26/420 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPLSTINQTPVTLQVPGL 363
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+ECGKYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECGKYGLVKSI 415
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474
>gi|193629757|ref|XP_001950852.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Acyrthosiphon
pisum]
Length = 446
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 162/437 (37%), Positives = 231/437 (52%), Gaps = 50/437 (11%)
Query: 95 RSKSLSPSRSPSKSKRRSGFDMAPP-----AAAMLPGAAVPGQLPGVPSAVPEMAQNMLP 149
RS+S SP + K +D+ PP A GQ+P + +P+ Q +P
Sbjct: 48 RSRSKSPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIPA--NTMPDTPQTAVP 105
Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
V+ TR ARR+YVG +P E + FF+Q M + G +
Sbjct: 106 ---------------VVGSTITRQARRLYVGNIPFGVTEDEMMEFFNQQM-HLSGLAQAA 149
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G+ V+ IN +K FAF+E R+++E + AMA DGI F+G ++++RRP DY PT +
Sbjct: 150 GNPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPT--PGMTE 207
Query: 270 GQPSPNLN----LAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLH 322
P N N L + S + G PD ++F+GGLP Y + Q+KELL SFG L
Sbjct: 208 SNPVTNYNSGMTLDMMKYDSSSFGLGTVPDSPHKIFIGGLPAYLNDEQVKELLTSFGQLK 267
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F+LVKD TG SKGY FC Y D +TD A A LNG+++G+K L V+RA+ ++
Sbjct: 268 AFNLVKDAATGLSKGYAFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASIGAKNPG--- 324
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
L QA + +Q G+ +G +VLCL +T D L D+EEYE+I
Sbjct: 325 --LGQA-------PVTIQVPGLTVVGT-----AGPPTEVLCLLNMVTPDELKDEEEYEDI 370
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
LED+REEC KYG + ++ IPRP + G + PG GKVF+E+ C A+ AL+GRKF
Sbjct: 371 LEDIREECNKYGVVRSLEIPRPIE-GIDVPGCGKVFIEFNAIPDCQKAQQALAGRKFNNR 429
Query: 503 TVNAFYYPEDKYFNKDY 519
V + DKY +++
Sbjct: 430 VVVTSFMEPDKYHRREF 446
>gi|432908697|ref|XP_004077989.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Oryzias latipes]
Length = 474
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 175/445 (39%), Positives = 244/445 (54%), Gaps = 29/445 (6%)
Query: 76 KERRHRHRS-RSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLP 134
KERRHR +S H S + SP R K K + +D+ PP + P Q
Sbjct: 57 KERRHRRKSVHLHQSSCLKTSCYVRSPHRE-KKKKIKKYWDVPPPGFEHI----TPMQYK 111
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
+ +A A +LP A PV V+ Q TR ARR+YVG +P E+++ F
Sbjct: 112 AMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDF 171
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
F+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++R
Sbjct: 172 FNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIR 230
Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
RP DY P PG S N ++ G+ S + + ++F+GGLP Y + Q+KEL
Sbjct: 231 RPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKEL 281
Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
L SFG L F+LVKD TG SKGY FC Y D + D A A LNG+++GDK L V+RA+
Sbjct: 282 LTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVG 341
Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
++ T L Q + LQ G+N+ ++ G +VLCL + + L
Sbjct: 342 SKNAT-----LTSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAPEELL 389
Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
DDEEYEEI+ED+REECGKYG + ++ IPRP +G E PG GK+F+E+ A L
Sbjct: 390 DDEEYEEIVEDVREECGKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDAQKAMQGL 448
Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
+GRKF V Y D Y +D+
Sbjct: 449 TGRKFANRVVVTKYCDPDAYHRRDF 473
>gi|291190480|ref|NP_001167275.1| Splicing factor U2AF 65 kDa subunit [Salmo salar]
gi|223648990|gb|ACN11253.1| Splicing factor U2AF 65 kDa subunit [Salmo salar]
Length = 474
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 173/465 (37%), Positives = 243/465 (52%), Gaps = 67/465 (14%)
Query: 76 KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKR-RSGFDM-AP--------------- 118
KERR RHR + + + S SP+ K K+ R +D+ AP
Sbjct: 55 KERR-RHRRSIQTQKQSQETVVSRSPALHREKKKKVRKYWDVPAPGFEHITPLQYKAMQA 113
Query: 119 ----PAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHA 174
PA A+LP P LP P++VP V+ Q TR A
Sbjct: 114 AGQIPATALLPTMITPEGLPPAPTSVP-----------------------VVGSQMTRQA 150
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
RR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E
Sbjct: 151 RRLYVGNIPFGITEEAMMDFFNAQM-CLGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDE 209
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
+ AMA DGIIF+G ++++RRP DY P + P P G+ S + +
Sbjct: 210 TTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSESPSVYVP-------GVVSTVV--PDSA 260
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY FC Y D + D A A
Sbjct: 261 HKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATALSKGYAFCEYVDVNLNDQAIA 320
Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
LNG+++GDK L V+RA+ ++ ++ Q + +Q L + M +LGG
Sbjct: 321 GLNGMQLGDKKLLVQRASVGAKNA----ALTGMNQTPVTLQVPGLMPTSMASLGG----- 371
Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
+VLCL + + L DDEEYEEI+ED+R+ECGKYG + ++ IPRP +G E PG
Sbjct: 372 --LPTEVLCLMNMVAVEELLDDEEYEEIVEDVRDECGKYGQVKSIEIPRP-VDGLEVPGT 428
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
GK+F+E+ A AL+GRKF V Y D Y +D+
Sbjct: 429 GKIFVEFMTLFDSQKAMQALTGRKFANRVVVTKYCDPDAYHRRDF 473
>gi|197692223|dbj|BAG70075.1| U2 small nuclear RNA auxiliary factor 2 isoform b [Homo sapiens]
gi|197692475|dbj|BAG70201.1| U2 small nuclear RNA auxiliary factor 2 isoform b [Homo sapiens]
Length = 471
Score = 261 bits (666), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 169/420 (40%), Positives = 236/420 (56%), Gaps = 30/420 (7%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S NL++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENLSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470
>gi|195999450|ref|XP_002109593.1| hypothetical protein TRIADDRAFT_21652 [Trichoplax adhaerens]
gi|190587717|gb|EDV27759.1| hypothetical protein TRIADDRAFT_21652 [Trichoplax adhaerens]
Length = 476
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 207/351 (58%), Gaps = 22/351 (6%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
Q TR +RR+YVG +P EQA+ FF++ M G A GD V+ V IN +K FAF+E
Sbjct: 148 QITRQSRRLYVGNIPFGITEQAMMDFFNEKMVTTGLTQAN-GDPVLAVQINFDKNFAFLE 206
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R++EE +NAMA DGI+F+ A+++RRP DY P G P+ + N+ G+ S +
Sbjct: 207 FRSIEETTNAMAFDGIMFQNQALKIRRPKDYQPPT------GDPNSSANIHVPGVISTVV 260
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+ P ++F+GGLP Y TE Q+KELL+SFG L F+LVKD TG SKGY FC Y V
Sbjct: 261 --PDTPHKLFIGGLPNYLTEDQVKELLQSFGELKAFNLVKDSATGLSKGYAFCEYVVVEV 318
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
TD A A LN +++G+K L V+RA+ + S+ + + +Q LQ S N LG
Sbjct: 319 TDQAIAGLNNMQLGEKKLVVQRASVGAKHNY---SVRCLSGIPVTVQVPGLQISN-NALG 374
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
+ ++L L +T + L DDEEYE+I+ED+R E K + ++ IPRP + G
Sbjct: 375 --------EVTEILQLMNMVTEEELVDDEEYEDIIEDVRAEVSKIAPVKSLEIPRPIE-G 425
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ GVGK+++E+++ C A+ +L+GRKF V +YP + Y + +
Sbjct: 426 VDVAGVGKIYIEFHNLDDCLKAQQSLAGRKFANRVVMTSFYPPESYHMRQF 476
>gi|242019185|ref|XP_002430045.1| Splicing factor U2AF 50 kDa subunit, putative [Pediculus humanus
corporis]
gi|212515110|gb|EEB17307.1| Splicing factor U2AF 50 kDa subunit, putative [Pediculus humanus
corporis]
Length = 445
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 148/362 (40%), Positives = 214/362 (59%), Gaps = 28/362 (7%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 111 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 169
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 170 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMSENPSVN 223
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ A G+ S + + P ++F+GGLP Y E Q+KELL SFG L F+LV D TG SKG
Sbjct: 224 VPA-GVISTVV--PDSPHKIFIGGLPNYLNEDQLKELLMSFGQLRAFNLVMDSTTGLSKG 280
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC++ D VTD A A LNG+++GDK L V+RA+ ++ + L Q Q + IQ
Sbjct: 281 YAFCLFVDINVTDQAIAGLNGMQLGDKKLIVQRASVGAKN-----TALGQ-QAPVQIQVP 334
Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
L + GM+ +VLCL +T L D+EEYE+ILED++EEC K+G +
Sbjct: 335 GLTSVGMSG----------PPTEVLCLLNMVTPSELNDEEEYEDILEDIKEECNKHGVVK 384
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
++ IPRP G + PG GKVF+E+ + C A+ AL+GRKF V Y+ DKY +
Sbjct: 385 SLEIPRPIL-GVDVPGCGKVFVEFNSVLDCQKAQQALTGRKFNHRVVVTSYFDPDKYHRR 443
Query: 518 DY 519
++
Sbjct: 444 EF 445
>gi|410351437|gb|JAA42322.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
Length = 475
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 168/420 (40%), Positives = 234/420 (55%), Gaps = 26/420 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGL 363
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 415
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474
>gi|347968827|ref|XP_311994.4| AGAP002908-PA [Anopheles gambiae str. PEST]
gi|333467820|gb|EAA08228.4| AGAP002908-PA [Anopheles gambiae str. PEST]
Length = 446
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 151/368 (41%), Positives = 210/368 (57%), Gaps = 37/368 (10%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 109 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQMH-LSGLAQAAGNPVLACQI 167
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA D I F+G ++++RRP DY P PG +
Sbjct: 168 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPM------PGM----TDS 217
Query: 279 AAVGLA---SGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
AAV + SG I + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG
Sbjct: 218 AAVNVPEKFSGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAATG 277
Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
KGY F Y + VTD A A LNG+++GDK L V+RA+ +K +++A Q +
Sbjct: 278 LGKGYAFAEYVEYTVTDQAIAGLNGMQLGDKKLIVQRASVG--AKNSNAAVVAPVQIQVP 335
Query: 394 IQKMALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECG 451
G+SL G + +VLCL +T D L D+EEYE+ILED+REEC
Sbjct: 336 ----------------GLSLVGSSGPPTEVLCLLNMVTPDELKDEEEYEDILEDIREECN 379
Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
KYG + +V IPRP + G + PG GKVF+E+ V C A+ AL+GRKF V Y+
Sbjct: 380 KYGVVRSVEIPRPIE-GVDVPGCGKVFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFDP 438
Query: 512 DKYFNKDY 519
DKY +++
Sbjct: 439 DKYHRREF 446
>gi|1334149|emb|CAA45875.1| unnamed protein product [Mus musculus]
Length = 492
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 169/420 (40%), Positives = 232/420 (55%), Gaps = 26/420 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 95 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 150
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 151 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 209
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 210 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 262
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 263 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 320
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T L I + L
Sbjct: 321 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT-----LVSLPSTINQTPVTL 375
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
Q G+ + M G +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 376 QVPGLMSSQVQM---GGHPTEVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 432
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 433 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 491
>gi|417410850|gb|JAA51891.1| Putative splicing factor u2af large subunit rrm superfamily,
partial [Desmodus rotundus]
Length = 455
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 167/421 (39%), Positives = 235/421 (55%), Gaps = 27/421 (6%)
Query: 99 LSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
+ P R K K R +D+ PP + P Q + +A A +LP A
Sbjct: 61 IRPPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAV 116
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V I
Sbjct: 117 TPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQI 175
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 176 NQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSV 228
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 229 YVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGY 286
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q
Sbjct: 287 AFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPG 342
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + +
Sbjct: 343 LMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKS 394
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D
Sbjct: 395 IEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRD 453
Query: 519 Y 519
+
Sbjct: 454 F 454
>gi|6005926|ref|NP_009210.1| splicing factor U2AF 65 kDa subunit isoform a [Homo sapiens]
gi|267188|sp|P26368.4|U2AF2_HUMAN RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; Short=hU2AF(65);
Short=hU2AF65; AltName: Full=U2 snRNP auxiliary factor
large subunit
gi|37545|emb|CAA45409.1| splicing factor U2AF [Homo sapiens]
gi|380783065|gb|AFE63408.1| splicing factor U2AF 65 kDa subunit isoform a [Macaca mulatta]
gi|410212804|gb|JAA03621.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|410260574|gb|JAA18253.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|410291504|gb|JAA24352.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
Length = 475
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 168/420 (40%), Positives = 234/420 (55%), Gaps = 26/420 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGL 363
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 415
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474
>gi|228543|prf||1805352A splicing factor U2AF:SUBUNIT=large
Length = 475
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 168/420 (40%), Positives = 234/420 (55%), Gaps = 26/420 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPLHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGL 363
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 415
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474
>gi|297277970|ref|XP_001091568.2| PREDICTED: splicing factor U2AF 65 kDa subunit [Macaca mulatta]
Length = 471
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 168/425 (39%), Positives = 235/425 (55%), Gaps = 36/425 (8%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
RSP K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 74 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 131
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+
Sbjct: 132 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 183
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S
Sbjct: 184 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 236
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
N ++ G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG
Sbjct: 237 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 294
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKGY FC Y D VTD A A LNG+++GDK L V+RA+ ++ T Q + +
Sbjct: 295 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTL 354
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
Q L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG
Sbjct: 355 QVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYG 406
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ ++ IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y
Sbjct: 407 LVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSY 465
Query: 515 FNKDY 519
+D+
Sbjct: 466 HRRDF 470
>gi|348526424|ref|XP_003450719.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Oreochromis niloticus]
Length = 475
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 172/447 (38%), Positives = 247/447 (55%), Gaps = 32/447 (7%)
Query: 76 KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG---FDMAPPAAAMLPGAAVPGQ 132
KERRHR RS + + ++ L RSP + K++ +D+ PP + P Q
Sbjct: 57 KERRHRRRSVPVCNYIWASKQSKLL--RSPHREKKKKVKKYWDVPPPGFEHI----TPMQ 110
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
+ +A A +LP A PV V+ Q TR ARR+YVG +P E+++
Sbjct: 111 YKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMM 170
Query: 193 TFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVR 252
FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G +++
Sbjct: 171 DFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLK 229
Query: 253 VRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIK 312
+RRP DY P PG S N ++ G+ S + + ++F+GGLP Y + Q+K
Sbjct: 230 IRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVK 280
Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
ELL SFG L F+LVKD TG SKGY FC Y D + D A A LNG+++GDK L V+RA+
Sbjct: 281 ELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRAS 340
Query: 373 ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
++ T L+ Q + LQ G+N+ ++ G +VLCL + +
Sbjct: 341 VGSKNAT-----LSSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAPEE 388
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+E+ A
Sbjct: 389 LLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDSQKAMQ 447
Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
L+GRKF V Y D Y +D+
Sbjct: 448 GLTGRKFANRVVVTKYCDPDAYHRRDF 474
>gi|327365322|ref|NP_001192160.1| splicing factor U2AF 65 kDa subunit isoform 1 [Mus musculus]
gi|348551789|ref|XP_003461711.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 2 [Cavia
porcellus]
gi|392343893|ref|XP_003748811.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Rattus norvegicus]
gi|136628|sp|P26369.3|U2AF2_MOUSE RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; AltName: Full=U2 snRNP
auxiliary factor large subunit
gi|55101|emb|CAA45874.1| splicing factor U2AF [Mus musculus]
gi|26347321|dbj|BAC37309.1| unnamed protein product [Mus musculus]
Length = 475
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 169/420 (40%), Positives = 232/420 (55%), Gaps = 26/420 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T L I + L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT-----LVSLPSTINQTPVTL 358
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
Q G+ + M G +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 359 QVPGLMSSQVQM---GGHPTEVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 415
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474
>gi|410351435|gb|JAA42321.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
Length = 471
Score = 258 bits (660), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 168/420 (40%), Positives = 235/420 (55%), Gaps = 30/420 (7%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470
>gi|410982179|ref|XP_003997437.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Felis catus]
Length = 471
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 167/421 (39%), Positives = 235/421 (55%), Gaps = 27/421 (6%)
Query: 99 LSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
+S R K K R +D+ PP + P Q + +A A +LP A
Sbjct: 77 ISLPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAV 132
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V I
Sbjct: 133 TPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQI 191
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 192 NQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSV 244
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 245 YVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGY 302
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q
Sbjct: 303 AFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPG 358
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + +
Sbjct: 359 LMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKS 410
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D
Sbjct: 411 IEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRD 469
Query: 519 Y 519
+
Sbjct: 470 F 470
>gi|432908699|ref|XP_004077990.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
[Oryzias latipes]
Length = 479
Score = 258 bits (659), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 174/444 (39%), Positives = 243/444 (54%), Gaps = 31/444 (6%)
Query: 76 KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
KERRHR S + + S SP R K K + +D+ PP + P Q
Sbjct: 66 KERRHRRNSPPAYP---QENTASRSPHRE-KKKKIKKYWDVPPPGFEHI----TPMQYKA 117
Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
+ +A A +LP A PV V+ Q TR ARR+YVG +P E+++ FF
Sbjct: 118 MQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFF 177
Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RR
Sbjct: 178 NAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRR 236
Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
P DY P PG S N ++ G+ S + + ++F+GGLP Y + Q+KELL
Sbjct: 237 PHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELL 287
Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
SFG L F+LVKD TG SKGY FC Y D + D A A LNG+++GDK L V+RA+
Sbjct: 288 TSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVGS 347
Query: 376 QSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALAD 435
++ T L Q + LQ G+N+ ++ G +VLCL + + L D
Sbjct: 348 KNAT-----LTSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAPEELLD 395
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
DEEYEEI+ED+REECGKYG + ++ IPRP +G E PG GK+F+E+ A L+
Sbjct: 396 DEEYEEIVEDVREECGKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDAQKAMQGLT 454
Query: 496 GRKFGGNTVNAFYYPEDKYFNKDY 519
GRKF V Y D Y +D+
Sbjct: 455 GRKFANRVVVTKYCDPDAYHRRDF 478
>gi|60279268|ref|NP_001012496.1| splicing factor U2AF 65 kDa subunit isoform b [Homo sapiens]
gi|164565377|ref|NP_598432.2| splicing factor U2AF 65 kDa subunit isoform 2 [Mus musculus]
gi|109461136|ref|XP_001060115.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 6 [Rattus
norvegicus]
gi|338709958|ref|XP_001496159.3| PREDICTED: splicing factor U2AF 65 kDa subunit [Equus caballus]
gi|348551787|ref|XP_003461710.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 1 [Cavia
porcellus]
gi|359318549|ref|XP_003638845.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Canis lupus
familiaris]
gi|395861318|ref|XP_003802936.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Otolemur garnettii]
gi|397471087|ref|XP_003807136.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Pan paniscus]
gi|403308602|ref|XP_003944746.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Saimiri boliviensis
boliviensis]
gi|14250571|gb|AAH08740.1| U2 small nuclear RNA auxiliary factor 2 [Homo sapiens]
gi|27695339|gb|AAH43071.1| U2af2 protein [Mus musculus]
gi|39644972|gb|AAH30574.1| U2 small nuclear RNA auxiliary factor 2 [Homo sapiens]
gi|119592810|gb|EAW72404.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_c [Homo
sapiens]
gi|148699339|gb|EDL31286.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2,
isoform CRA_a [Mus musculus]
gi|149016700|gb|EDL75886.1| similar to U2 (RNU2) small nuclear RNA auxiliary factor 2 isoform b
[Rattus norvegicus]
gi|261858294|dbj|BAI45669.1| U2 small nuclear RNA auxiliary factor 2 [synthetic construct]
gi|325463253|gb|ADZ15397.1| U2 small nuclear RNA auxiliary factor 2 [synthetic construct]
gi|380783067|gb|AFE63409.1| splicing factor U2AF 65 kDa subunit isoform b [Macaca mulatta]
gi|389618965|gb|AFK92990.1| U2 small nuclear RNA auxiliary factor 2 [Sus scrofa]
gi|410212802|gb|JAA03620.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|410260572|gb|JAA18252.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|410291502|gb|JAA24351.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|431902970|gb|ELK09152.1| Splicing factor U2AF 65 kDa subunit [Pteropus alecto]
Length = 471
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 168/420 (40%), Positives = 235/420 (55%), Gaps = 30/420 (7%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470
>gi|344270173|ref|XP_003406920.1| PREDICTED: LOW QUALITY PROTEIN: splicing factor U2AF 65 kDa
subunit-like [Loxodonta africana]
Length = 471
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 168/420 (40%), Positives = 235/420 (55%), Gaps = 30/420 (7%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470
>gi|157132061|ref|XP_001662443.1| splicing factor u2af large subunit [Aedes aegypti]
gi|108881728|gb|EAT45953.1| AAEL002818-PA [Aedes aegypti]
Length = 418
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 150/368 (40%), Positives = 208/368 (56%), Gaps = 37/368 (10%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 81 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 139
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +
Sbjct: 140 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGM----TDS 189
Query: 279 AAVGLA---SGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
AAV + SG I + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG
Sbjct: 190 AAVSVPEKFSGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAATG 249
Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
SKGY F Y + +TD A A LNG+++GDK L V+RA+ ++ Q Q
Sbjct: 250 LSKGYAFAEYVEYTITDQAIAGLNGMQLGDKKLIVQRASVGAKNANVAAVAPVQIQVP-- 307
Query: 394 IQKMALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECG 451
G+SL G + +VLCL +T D L D+EEYE+ILED++EEC
Sbjct: 308 ----------------GLSLVGSSGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEECN 351
Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
KYG + +V IPRP + G + PG GKVF+E+ V C A+ AL+GRKF V Y+
Sbjct: 352 KYGVVRSVEIPRPIE-GVDVPGCGKVFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFDP 410
Query: 512 DKYFNKDY 519
DKY +++
Sbjct: 411 DKYHRREF 418
>gi|170054347|ref|XP_001863087.1| splicing factor u2af large subunit [Culex quinquefasciatus]
gi|167874693|gb|EDS38076.1| splicing factor u2af large subunit [Culex quinquefasciatus]
Length = 438
Score = 257 bits (657), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 146/369 (39%), Positives = 205/369 (55%), Gaps = 39/369 (10%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 101 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQMH-LSGLAQAAGNPVLACQI 159
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY QP P +
Sbjct: 160 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMTD 208
Query: 279 AAVGLASGAIGGA------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
+AV G + P ++F+GGLP Y E Q+KELL SFG L F+LVKD T
Sbjct: 209 SAVAPVQEKFSGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAAT 268
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
G SKGY F Y + ++TD A A LNG+++GDK L V+RA+ ++ Q Q
Sbjct: 269 GLSKGYAFAEYVEYSITDQAIAGLNGMQLGDKKLIVQRASVGAKNANVAAVAPVQIQVP- 327
Query: 393 AIQKMALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREEC 450
G+SL G + +VLCL +T D L D+EEYE+ILED++EEC
Sbjct: 328 -----------------GLSLVGSSGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEEC 370
Query: 451 GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYP 510
KYG + + IPRP + G + PG GKVF+E+ V C A+ AL+GRKF V Y+
Sbjct: 371 NKYGVVRSAEIPRPIE-GVDVPGCGKVFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFD 429
Query: 511 EDKYFNKDY 519
DKY +++
Sbjct: 430 PDKYHRREF 438
>gi|444724150|gb|ELW64768.1| Splicing factor U2AF 65 kDa subunit [Tupaia chinensis]
Length = 447
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 166/417 (39%), Positives = 233/417 (55%), Gaps = 27/417 (6%)
Query: 103 RSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
R K K R +D+ PP + P Q + +A A +LP A P
Sbjct: 57 RHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTP 112
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K
Sbjct: 113 VPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDK 171
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++ G
Sbjct: 172 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPG 224
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC
Sbjct: 225 VVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCE 282
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L +S
Sbjct: 283 YVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMSS 338
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
+ +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IP
Sbjct: 339 QVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIP 390
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
RP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 391 RP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 446
>gi|301782083|ref|XP_002926459.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Ailuropoda
melanoleuca]
Length = 496
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 168/425 (39%), Positives = 236/425 (55%), Gaps = 40/425 (9%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
RSP K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 103 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 160
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+
Sbjct: 161 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 212
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S
Sbjct: 213 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 265
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
N ++ G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG
Sbjct: 266 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 323
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKGY FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +
Sbjct: 324 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTL 379
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
Q L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG
Sbjct: 380 QVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYG 431
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ ++ IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y
Sbjct: 432 LVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSY 490
Query: 515 FNKDY 519
+D+
Sbjct: 491 HRRDF 495
>gi|327280715|ref|XP_003225097.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Anolis carolinensis]
Length = 456
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 165/418 (39%), Positives = 233/418 (55%), Gaps = 37/418 (8%)
Query: 107 KSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 70 KKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT-------- 120
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQD 179
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVP 232
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC
Sbjct: 233 GVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFC 290
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L +
Sbjct: 291 EYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMS 346
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ I
Sbjct: 347 SQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGVVKSIEI 398
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 399 PRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 455
>gi|344277364|ref|XP_003410472.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Loxodonta
africana]
Length = 471
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 177/495 (35%), Positives = 257/495 (51%), Gaps = 37/495 (7%)
Query: 25 RSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRS 84
+S+ G +RH + +S R RD+ + R+HD+ R+ R R RS
Sbjct: 13 KSKHGREEENRHRK--RSHSHSRNRDRKRRSQSRDRRNHDQ--------RNDPRDQRRRS 62
Query: 85 RSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMA 144
+ S D R + +R K K +D+ PP + P Q + +A A
Sbjct: 63 KPWSRDAEEERGGLIPSARHDRKRKVHKYWDVPPPGFEHI----TPMQYKAMQAAGQIPA 118
Query: 145 QNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG 204
LP A PV + Q TR ARR+YVG +P E+A+ FF+ + +G
Sbjct: 119 TAFLPTMTPDGLAMIPTPVPMGGSQMTRKARRLYVGNIPFGITEEAMMDFFN-IQMRLGV 177
Query: 205 NSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
+ PG+ ++ V IN +K FAF+E R+V+E + A ALDGIIF+G ++++RRP DY P +
Sbjct: 178 LTQAPGNPILAVQINQDKNFAFLEFRSVDETTQATALDGIIFQGQSLKIRRPHDYQPLPS 237
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
S NL+ G+AS + ++ ++F+ GLP Y + Q+KELL SFG L F
Sbjct: 238 M-------SENLSAYMAGVASTVVPDSD--HKLFIEGLPTYLNDDQVKELLTSFGPLKAF 288
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
LVKD TG SKGY C Y D TD A A LNG+++GDK L V R + ++ T +
Sbjct: 289 SLVKDSATGLSKGYAVCEYVDINDTDQATAGLNGMQLGDKKLLVLRGSVGAKNGT----L 344
Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
Q + Q L++S + +GG + +VLCL + + L DDEEYEEI+E
Sbjct: 345 STINQVPVTPQVPGLRSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIME 396
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D+REEC KYG + ++ IPRP +G E PG GK+F+E+ C A L+GRKF V
Sbjct: 397 DVREECSKYGLVKSMEIPRP-VDGVEVPGCGKIFVEFTTVFDCQKAMQGLTGRKFANRVV 455
Query: 505 NAFYYPEDKYFNKDY 519
Y D Y ++D+
Sbjct: 456 VTKYCDLDSYHHRDF 470
>gi|63101571|gb|AAH94451.1| U2af2 protein, partial [Mus musculus]
Length = 403
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 168/425 (39%), Positives = 236/425 (55%), Gaps = 40/425 (9%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
RSP K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 10 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 67
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+
Sbjct: 68 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 119
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S
Sbjct: 120 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 172
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
N ++ G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG
Sbjct: 173 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 230
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKGY FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +
Sbjct: 231 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTL 286
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
Q L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG
Sbjct: 287 QVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYG 338
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ ++ IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y
Sbjct: 339 LVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSY 397
Query: 515 FNKDY 519
+D+
Sbjct: 398 HRRDF 402
>gi|348526426|ref|XP_003450720.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Oreochromis niloticus]
Length = 487
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 174/457 (38%), Positives = 249/457 (54%), Gaps = 40/457 (8%)
Query: 76 KERRHRHRSRSHSSDRFRNRSKSL-----SPS-----RSPSKSKRRSG---FDMAPPAAA 122
KERRHR RS + + ++ L SP RSP + K++ +D+ PP
Sbjct: 57 KERRHRRRSVPVCNYIWASKQSKLLLRQESPHYTGMYRSPHREKKKKVKKYWDVPPPGFE 116
Query: 123 MLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGL 182
+ P Q + +A A +LP A PV V+ Q TR ARR+YVG +
Sbjct: 117 HI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNI 172
Query: 183 PPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALD 242
P E+++ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA D
Sbjct: 173 PFGITEESMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFD 231
Query: 243 GIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGL 302
GIIF+G ++++RRP DY P PG S N ++ G+ S + + ++F+GGL
Sbjct: 232 GIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGL 282
Query: 303 PYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMG 362
P Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D + D A A LNG+++G
Sbjct: 283 PNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLG 342
Query: 363 DKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVL 422
DK L V+RA+ ++ T L+ Q + LQ G+N+ ++ G +VL
Sbjct: 343 DKKLLVQRASVGSKNAT-----LSSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVL 390
Query: 423 CLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYY 482
CL + + L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+E+
Sbjct: 391 CLMNMVAPEELLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFT 449
Query: 483 DAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A L+GRKF V Y D Y +D+
Sbjct: 450 SVFDSQKAMQGLTGRKFANRVVVTKYCDPDAYHRRDF 486
>gi|432908695|ref|XP_004077988.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Oryzias latipes]
Length = 458
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 175/444 (39%), Positives = 242/444 (54%), Gaps = 43/444 (9%)
Query: 76 KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
KERRHR RNRS P R K K + +D+ PP + P Q
Sbjct: 57 KERRHR-----------RNRS----PHRE-KKKKIKKYWDVPPPGFEHI----TPMQYKA 96
Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
+ +A A +LP A PV V+ Q TR ARR+YVG +P E+++ FF
Sbjct: 97 MQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFF 156
Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RR
Sbjct: 157 NAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRR 215
Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
P DY P PG S N ++ G+ S + + ++F+GGLP Y + Q+KELL
Sbjct: 216 PHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELL 266
Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
SFG L F+LVKD TG SKGY FC Y D + D A A LNG+++GDK L V+RA+
Sbjct: 267 TSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVGS 326
Query: 376 QSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALAD 435
++ T L Q + LQ G+N+ ++ G +VLCL + + L D
Sbjct: 327 KNAT-----LTSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAPEELLD 374
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
DEEYEEI+ED+REECGKYG + ++ IPRP +G E PG GK+F+E+ A L+
Sbjct: 375 DEEYEEIVEDVREECGKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDAQKAMQGLT 433
Query: 496 GRKFGGNTVNAFYYPEDKYFNKDY 519
GRKF V Y D Y +D+
Sbjct: 434 GRKFANRVVVTKYCDPDAYHRRDF 457
>gi|260800970|ref|XP_002595369.1| hypothetical protein BRAFLDRAFT_113856 [Branchiostoma floridae]
gi|229280615|gb|EEN51381.1| hypothetical protein BRAFLDRAFT_113856 [Branchiostoma floridae]
Length = 524
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 151/372 (40%), Positives = 210/372 (56%), Gaps = 26/372 (6%)
Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
+P A A MP+ Q TR ARR+YVG +P E+A+ FF+ M +
Sbjct: 178 IPSAALLANAGTAMPI---GSQMTRQARRLYVGNIPFGVTEEAMIDFFNTQMHR-ASLAQ 233
Query: 208 GPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAAL 267
PG+ V+ +N +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P A
Sbjct: 234 APGNPVLACQVNLDKNFAFLEFRSVDETTLAMAFDGIIFQGQSLKLRRPHDYQPVPGMAE 293
Query: 268 GPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
P P G+ S + + P ++F+GGLP Y + Q+KELL SFG L F+LV
Sbjct: 294 NPDIHVPGGFPVIPGVVSTVV--QDSPHKIFIGGLPNYLNDDQVKELLLSFGQLKAFNLV 351
Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
KD T SKGY FC Y DP VTD A A LNG+++GDK L V+RA+ ++
Sbjct: 352 KDSSTALSKGYAFCEYVDPNVTDQAIAGLNGMQLGDKKLIVQRASVGAKNAQN------- 404
Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMR 447
Q + LQ G+ G +VLCL + + L D+EEYE+ILED+R
Sbjct: 405 -------QPVQLQIPGLTLTGN-----AGPPTEVLCLMNMVMPEELMDEEEYEDILEDVR 452
Query: 448 EECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAF 507
EECGKYG +++V IPRP + G + PG GK+++E+ + C A+ AL+GRKF V
Sbjct: 453 EECGKYGAVLSVEIPRPIE-GVDVPGCGKIYVEFRSIMDCQKAQQALTGRKFAQRIVVTS 511
Query: 508 YYPEDKYFNKDY 519
YY DKY +D+
Sbjct: 512 YYDPDKYHRRDF 523
>gi|226478958|emb|CAX72974.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
Length = 520
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 176/458 (38%), Positives = 247/458 (53%), Gaps = 30/458 (6%)
Query: 74 RDKERRHRHRSRSHSSDRFRNRS------KSLSPSRSPSKS-KRRSGFDMAPPA-AAMLP 125
R+++R H H R HS R R+ S + S RSPS S +D+ PP + P
Sbjct: 81 RERKRSHSHGHRRHSKSRHRDYSGGHKSRRHQSHHRSPSNSVSAHKYWDVPPPGFEHVTP 140
Query: 126 GAAVPGQLPG-VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPP 184
Q G VP V Q +P A V R ARR+YVG +P
Sbjct: 141 AQYKALQTSGQVPVNVYAAGQVPMPVHAPNAPLTLTTNVPFAGSAVCRQARRLYVGNIPF 200
Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
A E+ + FF++ M A G A G+ ++ V IN EK FAF+E R+V+E + +ALDG+
Sbjct: 201 TATEENMMEFFNKQMRAQGLIQA-EGNPIIAVQINMEKNFAFLEFRSVDETTQGLALDGV 259
Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
+F+ A+++RRP DY P + P P G+ S + + P ++FVGGLP
Sbjct: 260 LFQNQALKLRRPRDYAPLPGVSEQPSVIVP-------GVVSTVV--QDSPHKIFVGGLPN 310
Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
Y E Q+KELL SFG L GF+LVKD TG SKGY FC Y D VTD ACA LNG+++GDK
Sbjct: 311 YLNEDQVKELLLSFGPLKGFNLVKDGSTGLSKGYAFCEYVDSNVTDHACAGLNGMQLGDK 370
Query: 365 TLTVRRATASGQSKTEQESILAQAQQHIA-IQKMALQTSGMNTLGGGMSLF--GETLAKV 421
L V+RA+ + T +L Q ++ +++ A+Q NT G G G +V
Sbjct: 371 KLIVQRASVGAKHTT---GVLPQCLLQMSGLEEGAVQ----NTTGSGNLTVRSGGPPTEV 423
Query: 422 LCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEY 481
LCL I L DDEEYE+I+ED+R EC KYG + ++ IPRP + G + PGVGK+++E+
Sbjct: 424 LCLMNMIETSELEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIR-GIDVPGVGKIYVEF 482
Query: 482 YDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ C A AL+GRKF V ++ + Y +++
Sbjct: 483 ASLIDCQKAATALTGRKFNQRLVVTSFFSPNSYHRREF 520
>gi|355703931|gb|EHH30422.1| hypothetical protein EGK_11092, partial [Macaca mulatta]
Length = 453
Score = 254 bits (648), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 175/466 (37%), Positives = 247/466 (53%), Gaps = 45/466 (9%)
Query: 63 HDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPS-----------------RSP 105
H R++ R +E R+ + S FR+ +S+ P RSP
Sbjct: 10 HAHILHMRNFWRSRESRYGVFETNLKS--FRSAPRSVFPKQYKPLTRGAKEEHGGLIRSP 67
Query: 106 ---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
K K R +D+ PP + P Q + +A A +LP A P
Sbjct: 68 RHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTP 123
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K
Sbjct: 124 VPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDK 182
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++ G
Sbjct: 183 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPG 235
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC
Sbjct: 236 VVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCE 293
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y D VTD A A LNG+++GDK L V+RA+ ++ T Q + +Q L +S
Sbjct: 294 YVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGLMSS 353
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
+ +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IP
Sbjct: 354 QVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIP 405
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
RP +G E PG GK+F+E+ C A L+GRKF V Y
Sbjct: 406 RP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKY 450
>gi|390341852|ref|XP_792919.3| PREDICTED: splicing factor U2AF 50 kDa subunit-like
[Strongylocentrotus purpuratus]
Length = 386
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/359 (39%), Positives = 207/359 (57%), Gaps = 37/359 (10%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
Q TR ARR+YVG +P E A+ FF+ M +G A PG V+ V +NH+K FAF+E
Sbjct: 57 QMTRQARRLYVGNIPFGVTEDAMVEFFNGKMHNVGLAQA-PGPPVLAVQVNHDKNFAFLE 115
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R+VEE + AMA DGI+F+ A+++RRP DY + P P G+ S +
Sbjct: 116 FRSVEETTQAMAFDGILFQNQALKIRRPKDYQAIPGMSATPTVHVP-------GVVSTVV 168
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+ P+++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY FC Y + +
Sbjct: 169 --QDSPNKIFIGGLPNYLNDDQVKELLSSFGPLKAFNLVKDSATSLSKGYAFCEYVETNL 226
Query: 349 TDI-------ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ- 400
TD+ A A LNG+++G+K L V+RA+ ++ Q Q I I ++L
Sbjct: 227 TDLGWETTDKAIAGLNGMQLGEKKLIVQRASVGAKNAMNQGQ-----QVQINIPGLSLPG 281
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
T+G NT ++LCL +T + L DDEEY++I+ED++EEC KYG + ++
Sbjct: 282 TTGPNT-------------EILCLMNMVTPEELKDDEEYDDIVEDVKEECQKYGQVRSLE 328
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP G + PG GK+++E+ + A+ AL+GRKF TV +Y DKY +++
Sbjct: 329 IPRPIP-GLDVPGCGKIYVEFMTVMDAQAAQRALAGRKFANRTVVTSFYDVDKYHRREF 386
>gi|351710523|gb|EHB13442.1| Splicing factor U2AF 65 kDa subunit [Heterocephalus glaber]
Length = 904
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 164/412 (39%), Positives = 231/412 (56%), Gaps = 27/412 (6%)
Query: 93 RNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGA 152
R ++S R K K R +D+ PP + P Q + +A A +LP
Sbjct: 144 RTAAQSCRSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMT 199
Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA 212
A PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+
Sbjct: 200 PDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNP 258
Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP 272
V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG
Sbjct: 259 VLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM- 311
Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
S N ++ G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD T
Sbjct: 312 SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSAT 369
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
G SKGY FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q +
Sbjct: 370 GLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPV 425
Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
+Q L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC K
Sbjct: 426 TLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSK 477
Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
YG + ++ IPRP +G E PG GK+F+E+ C A L+GRKF V
Sbjct: 478 YGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVV 528
>gi|303283510|ref|XP_003061046.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457397|gb|EEH54696.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 378
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 150/368 (40%), Positives = 214/368 (58%), Gaps = 30/368 (8%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVG L ++ + FF ++M A G G VV+ YIN EK FAF+E
Sbjct: 19 QATRHARRVYVGALTADVDDAHLTQFFEEIMLATGATKRVDGGCVVSTYINREKLFAFIE 78
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL----- 283
+TVEEASNA+ DG+++ G +R+RRP DYN A+ LGP QP+PNLN +A+G+
Sbjct: 79 FQTVEEASNALGFDGVVYGGQQLRLRRPNDYNIAQASLLGPQQPNPNLNYSAIGINHTPT 138
Query: 284 -ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ + P ++FVGGLP Y TE Q+KEL+ SFG + F+LV D+DTG SKGY F
Sbjct: 139 PMVASTENSTSPYKLFVGGLPNYITENQVKELVCSFGEIKAFNLVFDKDTGLSKGYAFWE 198
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRA----------TASGQSKTEQESILAQAQQHI 392
+ DP+V++ A L+G+++G+K + V+ A A+G+ T S +A QQ
Sbjct: 199 FLDPSVSEAAIKGLDGMRLGEKLINVKFANGNPPPIGGYNAAGEDGT---STVAAQQQLG 255
Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
+ + L T+ L GG+ + L ++ + LAD E EILED EEC
Sbjct: 256 YVANVPLATA--TALTGGVE------TTCVRLKGMVSREELADPTEAAEILEDTEEECKG 307
Query: 453 YGTLVNVVIPRPDQNGGETP-GVGKVFLEYYDAVGCA-TAKNALSGRKFGGNTVNAFYYP 510
+G+LV V++PRP + P GVG+V L++ V CA A+ +L+GRKF V A +
Sbjct: 308 FGSLVKVLMPRPGPHPDLDPVGVGEVMLKFA-TVECARRAQRSLNGRKFADRLVGAVFVK 366
Query: 511 EDKYFNKD 518
E + +D
Sbjct: 367 ESVFDERD 374
>gi|388856534|emb|CCF49840.1| related to pre-mRNA splicing factor U2AF large chain [Ustilago
hordei]
Length = 718
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/377 (38%), Positives = 214/377 (56%), Gaps = 39/377 (10%)
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
P +V Q R ARR+YVG + ANE + FF++ M + + PG+ V+ +N +
Sbjct: 330 PEEVAQQNNNRQARRLYVGNITHSANEPNMVAFFNEQMLKLKLGTE-PGEPAVSAQVNVD 388
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K +AFVE R EEA+NAM+ DGI+F+G ++++RRP DY GP P N+
Sbjct: 389 KGYAFVEFRHPEEATNAMSFDGIVFQGQSLKIRRPKDYT-------GPDV-RPASNIHVP 440
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G+ S + + P ++FVGGLP Y T+ Q+ ELL++FG L F+LVKD G SKG+ FC
Sbjct: 441 GVISTNV--PDSPHKIFVGGLPTYLTDDQVIELLQAFGELRAFNLVKDTANGASKGFAFC 498
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
Y D A+TD+AC LNG+++GD+ L V+RA+ + K ++I A A+ + +
Sbjct: 499 EYVDTALTDLACQGLNGMELGDRNLVVQRASVGSEKKA--QAIAATGANAGALGDAGMPS 556
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
S G G GE + ++ L +T + L DDEEY +I+ED+REEC KYGT+ +V +
Sbjct: 557 SVQQFAGEGGDA-GEPRSCMVMLN-MVTPEELQDDEEYADIVEDIREECTKYGTVTDVRV 614
Query: 462 PRPDQ-------------------NGGETP-----GVGKVFLEYYDAVGCATAKNALSGR 497
PRP + +GGE P GVG+V++ Y + CA A A++GR
Sbjct: 615 PRPAKESKGAAAHQWKRTQDESAASGGEKPATEREGVGRVYVRYAETGDCAQALRAIAGR 674
Query: 498 KFGGNTVNAFYYPEDKY 514
+FGG TV + ED +
Sbjct: 675 QFGGRTVICAFLKEDNW 691
>gi|71022561|ref|XP_761510.1| hypothetical protein UM05363.1 [Ustilago maydis 521]
gi|46101379|gb|EAK86612.1| hypothetical protein UM05363.1 [Ustilago maydis 521]
Length = 727
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/379 (37%), Positives = 211/379 (55%), Gaps = 41/379 (10%)
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
P ++ Q A R ARR+YVG + ANEQ I FF++ M + + PG+ V+ +N +
Sbjct: 340 PAELAAQNANRQARRLYVGNITHQANEQNIVAFFNEQMLKLKLGTE-PGEPAVSAQVNVD 398
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K +AFVE R EEA+NAM+ DGI+F+ ++++RRP DY GP P+ N+
Sbjct: 399 KGYAFVEFRHPEEATNAMSFDGIVFQAQSLKIRRPKDYT-------GPDIRPPS-NIHVP 450
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G+ S + + P ++FVGGLP Y + Q+ ELL++FG L F+LVKD TG SKG+ FC
Sbjct: 451 GVISTNV--PDSPHKIFVGGLPTYLNDDQVIELLQAFGELRAFNLVKDTGTGASKGFAFC 508
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
Y DPA+TD+AC LNG+++GD+ L V+RA+ + K + LA ++ A
Sbjct: 509 EYVDPALTDLACQGLNGMELGDRNLVVQRASVGSEKKAQ---ALAATGANMGALGGAAVP 565
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
S + G GE ++ L +T + L DDEEY +I+ED+R+EC KYG + +V I
Sbjct: 566 SSVQKFAGDGGDAGEPTTCMVMLN-MVTPEELQDDEEYADIVEDIRDECNKYGAVSDVRI 624
Query: 462 PRP--------------DQNGG------------ETPGVGKVFLEYYDAVGCATAKNALS 495
PRP Q+ G E GVG+V++ Y + CA A A++
Sbjct: 625 PRPAKESKGAAAHQWKRSQDEGATTVDGEKATSAEREGVGRVYVRYGETEHCAQALRAIA 684
Query: 496 GRKFGGNTVNAFYYPEDKY 514
GR+FGG TV + ED +
Sbjct: 685 GRQFGGRTVICAFLREDDW 703
>gi|308477324|ref|XP_003100876.1| CRE-UAF-1 protein [Caenorhabditis remanei]
gi|308264450|gb|EFP08403.1| CRE-UAF-1 protein [Caenorhabditis remanei]
Length = 496
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 155/385 (40%), Positives = 219/385 (56%), Gaps = 33/385 (8%)
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
G + P +NM GA G+ V V+ T +RR+YVG +P NE+A+ F
Sbjct: 145 GFENITPMEYKNMQASGAVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 203
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
F+Q M + G + PG+ ++ IN +K FAF+E R+++E + MA DGI F G ++VR
Sbjct: 204 FNQQM-HLCGLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 262
Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
RP DY P+ Q + ++N A + ++S + + P+++F+GGLP Y TE Q+KEL
Sbjct: 263 RPRDYQPS--------QNTFDMN-ARMPVSSIVV---DSPNKIFIGGLPNYLTEDQVKEL 310
Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
L SFG L F L D GNSKGY F Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 311 LCSFGPLKAFSLNMD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 369
Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
+T + L + IA ++ Q +G T ++LCL +T D L
Sbjct: 370 ---QTRHNTHLPNSASAIAGIDLS-QGAGRAT-------------EILCLMNMVTEDELR 412
Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
DE+YEEILED+REEC KYG + ++ IPRP + PGVGKVF+E+ C A+ AL
Sbjct: 413 SDEDYEEILEDVREECSKYGIVRSLEIPRP-YDEQPVPGVGKVFVEFATTSDCQRAQAAL 471
Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
+GRKF TV YY DKY N+ +
Sbjct: 472 TGRKFANRTVVTSYYDVDKYHNRQF 496
>gi|339243511|ref|XP_003377681.1| splicing factor U2AFsubunit [Trichinella spiralis]
gi|316973494|gb|EFV57074.1| splicing factor U2AFsubunit [Trichinella spiralis]
Length = 402
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 140/357 (39%), Positives = 206/357 (57%), Gaps = 30/357 (8%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ T +RR+YVG +P E+A+ FF+Q M + G + G+ ++ IN +K
Sbjct: 76 VPVVGPSVTCQSRRLYVGNIPFGCTEEAMMDFFNQQMH-LCGLAQALGNPILACQINLDK 134
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R++ E + AMA DGI ++G ++++RRP DY QP P N G
Sbjct: 135 NFAFIEFRSIAETTAAMAFDGINYQGQSLKIRRPRDY-----------QPLPGQNDTLAG 183
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
L S + A+ P ++F+GGLP Y +E Q+KELL SFG L F+L+KD T SKGY F
Sbjct: 184 LVSSVV--ADSPYKLFIGGLPNYLSEEQVKELLISFGQLKAFNLIKDPATQISKGYAFAE 241
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y D +TD A A LNG+++GDK L V+ A S+ A+ A +A+Q
Sbjct: 242 YSDSTLTDQAIAGLNGMQLGDKKLVVQLA-----------SVGAKNNMFSAAAPVAIQVP 290
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
GMN + + ++LCL + A+ L D+EEY++I+ED++EEC KYG++ +V IP
Sbjct: 291 GMNVVNPAAT----PATEILCLMNMVVAEELVDNEEYDDIVEDIKEECCKYGSVKSVEIP 346
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
RP + G + PGVGKVF+E+ + C A+ AL+GRKF V Y+ D Y + +
Sbjct: 347 RPIE-GLDVPGVGKVFVEFGTVMECQKAQQALTGRKFANRVVVTSYFDPDLYHRRQF 402
>gi|355756173|gb|EHH59920.1| hypothetical protein EGM_10153, partial [Macaca fascicularis]
Length = 442
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 173/458 (37%), Positives = 245/458 (53%), Gaps = 45/458 (9%)
Query: 63 HDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPS-----------------RSP 105
H R++ R +E R+ + S FR+ +S+ P RSP
Sbjct: 10 HAHILHMRNFWRSRESRYGVFETNLKS--FRSAPRSVFPKQYKPLTRGAKEEHGGLIRSP 67
Query: 106 ---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
K K R +D+ PP + P Q + +A A +LP A P
Sbjct: 68 RHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTP 123
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K
Sbjct: 124 VPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDK 182
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++ G
Sbjct: 183 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPG 235
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC
Sbjct: 236 VVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCE 293
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y D VTD A A LNG+++GDK L V+RA+ ++ T Q + +Q L +S
Sbjct: 294 YVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGLMSS 353
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
+ +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IP
Sbjct: 354 QVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIP 405
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
RP +G E PG GK+F+E+ C A L+GRKF
Sbjct: 406 RP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFA 442
>gi|354486866|ref|XP_003505598.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Cricetulus
griseus]
gi|344242983|gb|EGV99086.1| Splicing factor U2AF 65 kDa subunit [Cricetulus griseus]
Length = 469
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 163/414 (39%), Positives = 234/414 (56%), Gaps = 29/414 (7%)
Query: 107 KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQV 165
K K R +D+ PP + P Q + +A A +LP + L A P MPV V
Sbjct: 83 KKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTVTSDGLVASP-MPVPV 137
Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFA 225
+ Q TR ARR+YVG +P E+A+ FF+ M +G + PG+ V+ V IN EK FA
Sbjct: 138 VGSQMTRQARRLYVGNIPFGITEEAMKDFFNAQM-QLGVLTQVPGNPVLAVQINQEKNFA 196
Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLAS 285
F+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++ G+ S
Sbjct: 197 FLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVS 249
Query: 286 GAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
+ + ++F+GG+P Y + ++KELL SFGTL F+LVKD TG SKGY FC Y D
Sbjct: 250 TVV--PDSAHKLFIGGMPSYLNDDKVKELLTSFGTLKAFNLVKDSATGLSKGYAFCEYLD 307
Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
TD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L +S +
Sbjct: 308 INATDQAIAGLNGMQLGDKKLIVQRASVGSKNAT----LSTINQTPVTVQVPGLMSSQVQ 363
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
+GG + +VLCL + L DDEEYEEI+ED+R+EC KYG + ++ IPRP
Sbjct: 364 -MGGHPT-------EVLCLMNMVLPKELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRPV 415
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ G E PG G +F+E+ C A L+GR++ V Y D Y ++D+
Sbjct: 416 E-GVEVPGCGNIFVEFTSVFDCQKAMQGLTGRRYANKVVVTKYCDPDSYHSRDF 468
>gi|449669310|ref|XP_004206989.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Hydra
magnipapillata]
Length = 480
Score = 251 bits (641), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 148/367 (40%), Positives = 201/367 (54%), Gaps = 35/367 (9%)
Query: 160 LMPVQVMTQ--QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
++P + Q Q T ARR+Y G LP E + FF+ M + PG+ V+
Sbjct: 143 VIPATALPQGAQMTMQARRLYCGNLPFGITEDLMVDFFNAKMRE-SDMARQPGNPVLACQ 201
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
IN EK FAF+E R+VEE + AMA DGII +G A+++RRP DY P + G P+
Sbjct: 202 INLEKNFAFLEFRSVEETTLAMAFDGIILQGQALKIRRPKDYQP-IPGINGMAYPTLFAE 260
Query: 278 LAAV---GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
A G+ S + ++ ++VFVGGLP Y E Q+KELL +FG L F+LVKD TG
Sbjct: 261 SQATHIPGVVSTVV--SDTINKVFVGGLPNYLNEDQVKELLSTFGELRAFNLVKDSATGL 318
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKGY FC Y D +TD+A A +NG+++GDK L V+RA+ ++ T Q +I
Sbjct: 319 SKGYAFCEYVDIGITDVAIAGMNGMQLGDKKLIVQRASVGSKTMTAQLNI---------- 368
Query: 395 QKMALQTSGMNTLGGGMSLFGE-TLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
G L E T +LCL + A+ L DDE+Y+EI ED+REEC KY
Sbjct: 369 --------------PGFDLNKEITATNILCLMNMVVAEELMDDEDYDEIFEDIREECSKY 414
Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
G + ++ IPRP+ N G+GKVF+EY + A AL+GRKF V YY D
Sbjct: 415 GRIRSMQIPRPN-NEFLVSGIGKVFIEYATSGESKVASEALAGRKFANRVVVTAYYDPDS 473
Query: 514 YFNKDYS 520
Y D+S
Sbjct: 474 YHRHDFS 480
>gi|348688506|gb|EGZ28320.1| hypothetical protein PHYSODRAFT_309221 [Phytophthora sojae]
Length = 694
Score = 251 bits (640), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 148/357 (41%), Positives = 218/357 (61%), Gaps = 18/357 (5%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMT-AIGGNSAGPGDAVVNVYINHEKKFAFV 227
Q TRHARR+YVGG+ ++ E I FF+ V+ A+G G +VV+VYIN E+ FAFV
Sbjct: 351 QQTRHARRLYVGGIGEIS-EPEITAFFNDVIDRALGEKQEG--GSVVSVYINRERHFAFV 407
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGLASG 286
E+RT+E + M LDG+ + G +++RRP DYNP T+ LGP P LNLAA+G+ S
Sbjct: 408 ELRTIELTTACMNLDGVSYNGQPLKIRRPNDYNPATVPKDLGP---IPQLNLAALGIVST 464
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
+ ++GP ++F+GGLPY+ E Q+KELL++FG L F LVK+ + SKGYGFC Y D
Sbjct: 465 TV--SDGPGKIFIGGLPYHLNEEQVKELLQAFGPLRSFHLVKELSSNLSKGYGFCEYMDI 522
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSK---TEQESILAQAQQHIAIQKMALQTSG 403
VTD AC LN +++GDKTLTVRRA + +K + ++ + + + A+Q
Sbjct: 523 NVTDAACLGLNDMRLGDKTLTVRRAMSQENAKAVASAAGTVNTGLEMGLDPSRAAMQAM- 581
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
+L G SL T ++V+ L +T + L D+EEY +IL+D++ EC ++G + ++++PR
Sbjct: 582 --SLAGIPSLPLGTPSRVIVLLNMVTPEELEDEEEYADILDDIKGECERFGAVPSLLLPR 639
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
P G VGKVF+E+ D +A L GR F TV + E KY ++ +
Sbjct: 640 P--RDGIPSAVGKVFVEFGDVQSAQSAATELHGRGFSNRTVAVEFMDEGKYARRELA 694
>gi|1710361|gb|AAB38280.1| splicing factor U2AF65 [Caenorhabditis briggsae]
Length = 488
Score = 251 bits (640), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 152/387 (39%), Positives = 218/387 (56%), Gaps = 35/387 (9%)
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
G + P +NM GA G+ V V+ T +RR+YVG +P NE+A+ F
Sbjct: 136 GFENITPMEYKNMQASGAVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 194
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
F+Q M + + PG+ ++ IN +K FAF+E R+++E + MA DGI F G ++VR
Sbjct: 195 FNQQM-HLCNLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 253
Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
RP DY P+ Q + ++N A + ++S + A +++F+GGLP Y TE Q+KEL
Sbjct: 254 RPRDYQPS--------QNTFDMN-ARMPVSSIVVDSA---NKIFIGGLPNYLTEDQVKEL 301
Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
L SFG L F L D GNSKGY F Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 302 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 360
Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMN-TLGGGMSLFGETLAKVLCLTEAITADAL 433
Q + + + A +G++ + G G + ++LCL +T D L
Sbjct: 361 ------------QTRHNTHLPNSASAIAGIDLSQGAGRA------TEILCLMNMVTEDEL 402
Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
DE+YEEILED+REEC KYG + ++ IPRP + PGVGKVF+E+ C A+ A
Sbjct: 403 RSDEDYEEILEDVREECSKYGIVRSLEIPRP-YDDHPVPGVGKVFVEFATTSDCQRAQAA 461
Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDYS 520
L+GRKF TV YY DKY N+ ++
Sbjct: 462 LTGRKFANRTVVTSYYDVDKYHNRQFN 488
>gi|384939340|gb|AFI33275.1| splicing factor U2AF 65 kDa subunit isoform a [Macaca mulatta]
Length = 475
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 167/420 (39%), Positives = 234/420 (55%), Gaps = 26/420 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGL 363
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+E++R+EC KYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEEVRDECSKYGLVKSI 415
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474
>gi|268575804|ref|XP_002642882.1| C. briggsae CBR-UAF-1 protein [Caenorhabditis briggsae]
gi|60415989|sp|P90727.2|U2AF2_CAEBR RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; Short=U2AF65; AltName:
Full=U2 snRNP auxiliary factor large subunit
Length = 488
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 152/387 (39%), Positives = 218/387 (56%), Gaps = 35/387 (9%)
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
G + P +NM GA G+ V V+ T +RR+YVG +P NE+A+ F
Sbjct: 136 GFENITPMEYKNMQASGAVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 194
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
F+Q M + + PG+ ++ IN +K FAF+E R+++E + MA DGI F G ++VR
Sbjct: 195 FNQQM-HLCNLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 253
Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
RP DY P+ Q + ++N A + ++S + A +++F+GGLP Y TE Q+KEL
Sbjct: 254 RPRDYQPS--------QNTFDMN-ARMPVSSIVVDSA---NKIFIGGLPNYLTEDQVKEL 301
Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
L SFG L F L D GNSKGY F Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 302 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 360
Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMN-TLGGGMSLFGETLAKVLCLTEAITADAL 433
Q + + + A +G++ + G G + ++LCL +T D L
Sbjct: 361 ------------QTRHNTHLPNSASAIAGIDLSQGAGRA------TEILCLMNMVTEDEL 402
Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
DE+YEEILED+REEC KYG + ++ IPRP + PGVGKVF+E+ C A+ A
Sbjct: 403 RSDEDYEEILEDVREECSKYGIVRSLEIPRP-YDDHPVPGVGKVFVEFATTSDCQRAQAA 461
Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDYS 520
L+GRKF TV YY DKY N+ ++
Sbjct: 462 LTGRKFANRTVVTSYYDVDKYHNRQFN 488
>gi|410905623|ref|XP_003966291.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Takifugu
rubripes]
Length = 458
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 156/389 (40%), Positives = 219/389 (56%), Gaps = 32/389 (8%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P + +P M + L T PV V+ Q TR ARR+YVG +P E++
Sbjct: 101 GQIPAT-ALLPTMTPDGLAVTPT--------PVPVVGSQMTRQARRLYVGNIPFGITEES 151
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G +
Sbjct: 152 MMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQS 210
Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
+++RRP DY P PG S N ++ G+ S + + ++F+GGLP Y + Q
Sbjct: 211 LKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQ 261
Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
+KELL SFG L F+LVKD TG SKGY FC Y D + D A A LNG+++GDK L V+R
Sbjct: 262 VKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQR 321
Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
A+ Q I + LQ G+N+ ++ G +VLCL +
Sbjct: 322 ASXXXXXS---------FQTSINQTPVTLQVPGLNS---SVTQMGGVPTEVLCLMNMVAP 369
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
+ L DDEEYEEI+ED+R+ECGKYG + ++ IPRP +G E PG GK+F+E+ A
Sbjct: 370 EELLDDEEYEEIVEDVRDECGKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFMSVFDSQKA 428
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
L+GRKF V Y D Y +D+
Sbjct: 429 MQGLTGRKFANRVVVTKYCDPDAYHRRDF 457
>gi|343426615|emb|CBQ70144.1| related to pre-mRNA splicing factor U2AF large chain [Sporisorium
reilianum SRZ2]
Length = 710
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 151/413 (36%), Positives = 225/413 (54%), Gaps = 45/413 (10%)
Query: 132 QLPGVPSAVPEMAQNMLP---FGATQLGAFPLM----PVQVMTQQATRHARRVYVGGLPP 184
Q P S P Q P FG G++P P ++ Q A R ARR+YVG +
Sbjct: 289 QHPYSRSQQPPHGQPYSPQAGFGGADNGSYPSSQGPSPAELAAQNANRQARRLYVGNITH 348
Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
NE I FF++ M + + PG+ V+ +N +K +AFVE R +EA+NAM+ DGI
Sbjct: 349 QTNEHNIVAFFNEQMLKLKLGTE-PGEPAVSAQVNVDKGYAFVEFRHPDEATNAMSFDGI 407
Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
+F+ ++++RRP DY GP P+ N+ G+ S + + P ++FVGGLP
Sbjct: 408 VFQAQSLKIRRPKDYT-------GPDVRPPS-NIHVPGVISTNV--PDSPFKIFVGGLPT 457
Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
Y T+ Q+ ELL++FG L F+LVKD TG SKG+ FC Y D A+TD+AC LNG+++GD+
Sbjct: 458 YLTDDQVIELLQAFGELRAFNLVKDTGTGASKGFAFCEYVDTALTDLACQGLNGMELGDR 517
Query: 365 TLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCL 424
L V+RA+ + K + LA + +A S + G GE + ++ L
Sbjct: 518 NLVVQRASVGSEKKAQ---ALAATGANSGALGIAAVPSSVQQSAGEDGDAGEPTSCMVML 574
Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP----------------DQNG 468
+T + L DDEEY +I+ED+R+EC KYG + +V +PRP D++G
Sbjct: 575 N-MVTPEELQDDEEYADIVEDIRDECTKYGAVTDVRVPRPAKESKGAAAHQWKRSQDESG 633
Query: 469 --GETP-----GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
G+ P GVG+V++ Y + CA A A++GR+FGG TV + ED +
Sbjct: 634 AEGDKPDAEREGVGRVYVRYGETEHCAQALRAIAGRQFGGRTVICAFLKEDDW 686
>gi|327280717|ref|XP_003225098.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Anolis carolinensis]
Length = 467
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 166/429 (38%), Positives = 233/429 (54%), Gaps = 48/429 (11%)
Query: 107 KSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 70 KKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT-------- 120
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQD 179
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVP 232
Query: 282 GLASGAIGGAEGPDRVFVGGLPYY-----------FTETQIKELLESFGTLHGFDLVKDR 330
G+ S + + ++F+GGLP Y F Q+KELL SFG L F+LVKD
Sbjct: 233 GVVSTVV--PDSAHKLFIGGLPNYLNDDQVMFLPPFLSCQVKELLTSFGPLKAFNLVKDS 290
Query: 331 DTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQ 390
TG SKGY FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q
Sbjct: 291 ATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQT 346
Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC 450
+ +Q L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC
Sbjct: 347 PVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDEC 398
Query: 451 GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYP 510
KYG + ++ IPRP +G E PG GK+F+E+ C A L+GRKF V Y
Sbjct: 399 SKYGVVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCD 457
Query: 511 EDKYFNKDY 519
D Y +D+
Sbjct: 458 PDSYHRRDF 466
>gi|341891946|gb|EGT47881.1| hypothetical protein CAEBREN_25972 [Caenorhabditis brenneri]
Length = 491
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 146/358 (40%), Positives = 208/358 (58%), Gaps = 34/358 (9%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ T +RR+YVG +P NE+A+ FF+Q M + G + PG+ ++ IN +K
Sbjct: 167 VPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDFFNQQM-HLCGLAQAPGNPILLCQINLDK 225
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+++E + MA DGI F G ++VRRP DY P+ Q + ++N + +
Sbjct: 226 NFAFIEFRSIDETTAGMAFDGINFMGQQLKVRRPRDYQPS--------QNTFDMN-SRMP 276
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
++S + A +++F+GGLP Y TE Q+KELL SFG L F L D GNSKGY F
Sbjct: 277 VSSIVVDSA---NKIFIGGLPNYLTEDQVKELLCSFGPLKAFSLNVD-SQGNSKGYAFAE 332
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y DP +TD A A LNG+++GDK L V+ A A+ Q + + + A +
Sbjct: 333 YLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN------------QTRHNTHLPNSASAIA 380
Query: 403 GMN-TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
G++ + G G + +VLCL +T D L DE+YEEILED+REEC KYG + ++ I
Sbjct: 381 GIDLSQGAGRA------TEVLCLMNMVTEDELKSDEDYEEILEDVREECSKYGIVRSLEI 434
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PRP + PGVGKVF+E+ C A+ AL+GRKF TV YY DKY N+ +
Sbjct: 435 PRP-YDEHPVPGVGKVFVEFASTSDCQRAQAALTGRKFANRTVVTSYYDVDKYHNRQF 491
>gi|47221657|emb|CAF97922.1| unnamed protein product [Tetraodon nigroviridis]
Length = 458
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 156/389 (40%), Positives = 221/389 (56%), Gaps = 32/389 (8%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P + +P M + L T PV V+ Q TR ARR+YVG +P E++
Sbjct: 101 GQIPAT-ALLPTMTPDGLAVTPT--------PVPVVGSQMTRQARRLYVGNIPFGITEES 151
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G +
Sbjct: 152 MMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQS 210
Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
+++RRP DY P PG S N ++ G+ S + + ++F+GGLP Y + Q
Sbjct: 211 LKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQ 261
Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
+KELL SFG L F+LVKD TG SKGY FC Y D + D A A LNG+++GDK L V+R
Sbjct: 262 VKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQR 321
Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
A+ ++ T L Q + LQ G+N+ ++ G +VLCL +
Sbjct: 322 ASVGSKNAT-----LTSINQ----TPVTLQVPGLNS---SVTQMGGVPTEVLCLMNMVAP 369
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
+ L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+E+ A
Sbjct: 370 EELLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFMSVFDSQKA 428
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
L+GRKF V Y D Y +D+
Sbjct: 429 MQGLTGRKFANRVVVTKYCDPDAYHRRDF 457
>gi|348526428|ref|XP_003450721.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
[Oreochromis niloticus]
Length = 458
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 156/389 (40%), Positives = 222/389 (57%), Gaps = 32/389 (8%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P + +P M + L T PV V+ Q TR ARR+YVG +P E++
Sbjct: 101 GQIPAT-ALLPTMTPDGLAVTPT--------PVPVVGSQMTRQARRLYVGNIPFGITEES 151
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G +
Sbjct: 152 MMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQS 210
Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
+++RRP DY P PG S N ++ G+ S + + ++F+GGLP Y + Q
Sbjct: 211 LKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQ 261
Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
+KELL SFG L F+LVKD TG SKGY FC Y D + D A A LNG+++GDK L V+R
Sbjct: 262 VKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQR 321
Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
A+ ++ T L+ Q + LQ G+N+ ++ G +VLCL +
Sbjct: 322 ASVGSKNAT-----LSSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAP 369
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
+ L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+E+ A
Sbjct: 370 EELLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDSQKA 428
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
L+GRKF V Y D Y +D+
Sbjct: 429 MQGLTGRKFANRVVVTKYCDPDAYHRRDF 457
>gi|71480064|ref|NP_001025127.1| U2 small nuclear RNA auxiliary factor 2a [Danio rerio]
gi|68533572|gb|AAH98548.1| U2 small nuclear RNA auxiliary factor 2a [Danio rerio]
Length = 465
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 157/406 (38%), Positives = 227/406 (55%), Gaps = 26/406 (6%)
Query: 114 FDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRH 173
+D+ PP + P Q + +A A +LP + A PV V+ Q TR
Sbjct: 85 WDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPEGLAVTPTPVPVVGSQMTRQ 140
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
ARR+YVG +P E+++ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+
Sbjct: 141 ARRLYVGNIPFGITEESMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 199
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
E + AMA DGIIF+G ++++RRP DY P PG S N ++ G+ S + +
Sbjct: 200 ETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDS 250
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D ++D A
Sbjct: 251 AHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNISDQAI 310
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
A LNG+++GDK L V+RA+ ++ T + Q + +Q L S +N +GG
Sbjct: 311 AGLNGMQLGDKKLLVQRASVGSKNTT----LTGINQTPVTLQVPGLMNSSVNQMGG---- 362
Query: 414 FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
+VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G + PG
Sbjct: 363 ---IPTEVLCLMNMVAPEELLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLDIPG 418
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
GK+F+E+ A L+GRK V Y D Y +D+
Sbjct: 419 TGKIFVEFMSVFDSQKAMQGLTGRKSANRVVVTKYCDPDAYHRRDF 464
>gi|384939342|gb|AFI33276.1| splicing factor U2AF 65 kDa subunit isoform b [Macaca mulatta]
Length = 471
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 167/420 (39%), Positives = 235/420 (55%), Gaps = 30/420 (7%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+E++R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEEVRDECSKYGLVKSI 411
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470
>gi|301117646|ref|XP_002906551.1| splicing factor U2af large subunit, putative [Phytophthora
infestans T30-4]
gi|262107900|gb|EEY65952.1| splicing factor U2af large subunit, putative [Phytophthora
infestans T30-4]
Length = 569
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 147/357 (41%), Positives = 215/357 (60%), Gaps = 18/357 (5%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMT-AIGGNSAGPGDAVVNVYINHEKKFAFV 227
Q TRHARR+YVGG+ ++ E I FF+ V+ A+G G +VV+VYIN E+ FAFV
Sbjct: 226 QQTRHARRLYVGGIGEIS-EPEITAFFNDVIDRALGEKQEG--GSVVSVYINRERHFAFV 282
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGLASG 286
E+R++E + M LDG+ + G +++RRP DYNP T+ LGP P LNLAA+G+ S
Sbjct: 283 ELRSIELTTACMNLDGVSYNGQPLKIRRPNDYNPATVPKDLGP---IPQLNLAALGIVST 339
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
+ ++GP ++F+GGLPY+ E Q+KELL++FG L F LVK+ + SKGYGFC Y D
Sbjct: 340 TV--SDGPGKIFIGGLPYHLNEEQVKELLQAFGPLRSFHLVKELSSNLSKGYGFCEYMDI 397
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSK---TEQESILAQAQQHIAIQKMALQTSG 403
VTD AC LN +++GDKTLTVRRA + +K + ++ + + A+Q
Sbjct: 398 NVTDAACIGLNDMQLGDKTLTVRRAMSQENAKAVASAAGTVNTGLEMGLDPSLAAMQAMS 457
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
M G S+ T ++V+ L +T + L D+EEY +IL+D++ EC ++G + ++++PR
Sbjct: 458 M---AGIPSVPLGTPSRVIVLLNMVTPEELEDEEEYADILDDIKGECERFGAVPSLLLPR 514
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
P G VGKVF+E+ D A L GR F TV + E KY ++ S
Sbjct: 515 PRD--GVLSAVGKVFVEFADVQSAQAAATELHGRGFSNRTVAVEFMDEGKYARREIS 569
>gi|345480698|ref|XP_001604333.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Nasonia
vitripennis]
Length = 455
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 148/363 (40%), Positives = 212/363 (58%), Gaps = 37/363 (10%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 128 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 186
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 187 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 240
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 241 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 288
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ +K I AQA I + +
Sbjct: 289 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVG--AKNPMPMIGAQAPVQIQVPGL 346
Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ TSG T +VLCL +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 347 SMVGTSGPPT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 393
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 394 RSVEIPRPIE-GVDVPGCGKVFVEFNSVLDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 452
Query: 517 KDY 519
+++
Sbjct: 453 REF 455
>gi|198432988|ref|XP_002130386.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
factor (U2AF) 2 isoform 1 [Ciona intestinalis]
Length = 482
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 146/360 (40%), Positives = 211/360 (58%), Gaps = 29/360 (8%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V Q TR ARR+YVG +P E A+ FF+ M I G + PG ++ V IN +K
Sbjct: 150 VPVAGSQMTRQARRLYVGNIPFGVTEDAMMDFFNNQMQ-IAGLAQAPGQPILAVQINLDK 208
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + A+A DGI F ++++RRP+DY P L +L QP+ +L G
Sbjct: 209 NFAFLEFRSVDETTQALAFDGINFMNQSLKIRRPSDYKP-LPGSLE--QPAIHLP----G 261
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + ++ ++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY F
Sbjct: 262 VISTVVQDSQ--HKMFIGGLPNYLNDDQVKELLTSFGPLRAFNLVKDSATALSKGYAFAE 319
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK-MALQT 401
+ D ++TD A A LNG+++GDK L V+RA SI A+ H AI + LQ
Sbjct: 320 FADYSLTDQAIAGLNGMQLGDKKLIVQRA-----------SIGAKNNPHGAIMAPVTLQI 368
Query: 402 SGM-NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
GM + G G + VLCL + + L DDEEYEEI+ED+++ECGK G++V++
Sbjct: 369 PGMAHATGAGPA------TTVLCLMNMVLPEELTDDEEYEEIMEDVKDECGKLGSVVSLE 422
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
IPRP E GVGK+++E+ + + A ALSGRKF V ++ ++Y ++ +S
Sbjct: 423 IPRPGPGLTEADGVGKIYVEFANHLDTQKAAQALSGRKFSNRVVVTSFFDPERYHHRQFS 482
>gi|50882018|gb|AAT85577.1| U2 small nuclear ribonucleoprotein auxiliary factor large subunit
[Alvinella pompejana]
Length = 479
Score = 248 bits (632), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 159/416 (38%), Positives = 234/416 (56%), Gaps = 40/416 (9%)
Query: 113 GFD-MAPPAAAMLPGAAVPGQL--PGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMT-- 167
GF+ M P L G PG L PG+P P +A N + A P P+ + T
Sbjct: 95 GFEHMTPMQYKALHGLGPPGGLAAPGMPVVAPVIAAN-------NVVASPTAPMALNTTI 147
Query: 168 ----QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
+R ARR+YVG +P E+ + +F+ M + G + G+ V+ ++N +K
Sbjct: 148 PFAGSAISRQARRLYVGNIPFGVTEEMMMDYFNTQMK-MAGLAQAEGNPVIACHVNLDKN 206
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
FAF+E R+V+E + AMA DGI F+G ++++RRP DY P A P ++A G+
Sbjct: 207 FAFLEFRSVDETTQAMAFDGINFQGQSLKIRRPKDYQPLPGMAEVP-------SVAVPGV 259
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
S + + ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKGY FC Y
Sbjct: 260 VSTVV--QDSAHKIFIGGLPNYLNEDQVKELLTSFGLLKAFNLVKDSATGLSKGYAFCEY 317
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
DP++TD ACA LNG+++GD+ L V+RA+ ++ ++L Q + +Q G
Sbjct: 318 LDPSITDQACAGLNGMQLGDEKLIVQRASVGAKNAQGGPNVLPVQLQIPGLNMAQVQGPG 377
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
T +VLCL +T + L D+EEYEEILED++EEC KYG + ++ IPR
Sbjct: 378 PTT-------------EVLCLMNMVTPEDLEDEEEYEEILEDVKEECSKYGYVKSIEIPR 424
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
P + G E PGVGK+F+E+ + C A+ AL+GRKF V + Y+ D+Y +++
Sbjct: 425 PIK-GVEVPGVGKIFVEFNSVIDCQKAQQALTGRKFSNRVVVSSYFDPDRYHRREF 479
>gi|328766440|gb|EGF76494.1| hypothetical protein BATDEDRAFT_21058 [Batrachochytrium
dendrobatidis JAM81]
Length = 551
Score = 247 bits (631), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 136/352 (38%), Positives = 205/352 (58%), Gaps = 27/352 (7%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
Q R +R+Y G +P E+ I +F S +G PG+A VN YIN E+ +AFV
Sbjct: 227 QPIARQFKRLYFGNIPVDCIEERILSFASSSYEKLG-LPKDPGNAAVNAYINRERNYAFV 285
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
E R+ EEA+ AMALDG +F+G ++VRRP DYNP AA G QPS + A
Sbjct: 286 EFRSPEEATRAMALDGSLFDGNILKVRRPKDYNPE-AAPDGATQPS----------IAPA 334
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
E D++FVG +P Y ++ Q++ELL++FG L F L++D TG SKG+ FC Y D
Sbjct: 335 TSAQESLDKIFVGAIPTYLSDDQVQELLKTFGELKTFSLIRDSATGLSKGFAFCEYVDGQ 394
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
+TD AC LNG+++G+K L V+RA+ T I A Q + ++ L T +
Sbjct: 395 ITDAACQGLNGMELGEKKLIVQRASVGSNKNT----ISAVGQSQLLPMEI-LATIAKDPC 449
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
+ +VL L + ++ L DE+Y++IL D++EEC K+GT++++ IPRP +
Sbjct: 450 ---------KVTRVLLLLNMVVSEDLVSDEDYQDILLDIQEECEKFGTILDIAIPRP-VS 499
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
G GVGK+F+++ + A+A++AL+GRKF V A ++ EDK+ +D+
Sbjct: 500 GQSNAGVGKIFVKFDNVKQSASAQHALAGRKFADRVVIASFFDEDKFDQQDF 551
>gi|309271453|ref|XP_003085312.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Mus musculus]
Length = 730
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 161/419 (38%), Positives = 227/419 (54%), Gaps = 29/419 (6%)
Query: 103 RSPS--KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPL 160
RSP K K R +D+ PP + P Q + +A +A +LP A
Sbjct: 338 RSPCHEKKKVRKYWDVPPPGFEHV----TPMQYKAMQAAGQILATALLPTMTPGGLAVTP 393
Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
MPV V+ Q TR ARR+YVG +P E+++ F + M + G + PG+ ++ V IN
Sbjct: 394 MPVPVVGSQMTRQARRLYVGTIPFGITEESMLDFLNTQM-HLRGLTQAPGNPILAVQINL 452
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+K FAF+E R+V+E + A+A DGIIF+G ++++RRP DY P + P P
Sbjct: 453 DKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMSGSPSVYVP------ 506
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
G+ S + + ++F+GGLP Y + Q+KELL S G L F+LVKD TG SKGY F
Sbjct: 507 -GVVSTIV--PDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYAF 563
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
C Y D VTD A A LNG+ +GDK L V+RA+ +K SI+ Q + LQ
Sbjct: 564 CEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVG--AKNVALSIINQT-------PVTLQ 614
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
G+ + M G VLCL + L DDEEYEEI++D+R+EC KYG + ++
Sbjct: 615 VPGLTSSQVQM---GGHPTTVLCLMNMVLPKELLDDEEYEEIVDDVRDECNKYGLVKSIE 671
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ + C A L+GRKF V Y D Y +++
Sbjct: 672 IPRP-MDGVEVPGCGKIFVEFTSVIDCQKAMQGLTGRKFANRVVVTKYCDPDSYHGRNF 729
>gi|71996475|ref|NP_001022967.1| Protein UAF-1, isoform a [Caenorhabditis elegans]
gi|6226906|sp|P90978.2|U2AF2_CAEEL RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; Short=U2AF65; AltName:
Full=U2 snRNP auxiliary factor large subunit
gi|3334906|gb|AAC26982.1| splicing factor U2AF65 [Caenorhabditis elegans]
gi|351018334|emb|CCD62278.1| Protein UAF-1, isoform a [Caenorhabditis elegans]
Length = 496
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 149/385 (38%), Positives = 211/385 (54%), Gaps = 33/385 (8%)
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
G + P +NM G G+ V V+ T +RR+YVG +P NE+A+ F
Sbjct: 145 GFETTTPMEYKNMQAAGQVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 203
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
F+Q M + G + PG+ ++ IN +K FAF+E R+++E + MA DGI F G ++VR
Sbjct: 204 FNQQM-HLCGLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 262
Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
RP DY P+ Q + ++N + + +++ + A +++F+GGLP Y TE Q+KEL
Sbjct: 263 RPRDYQPS--------QNTFDMN-SRMPVSTIVVDSA---NKIFIGGLPNYLTEDQVKEL 310
Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
L SFG L F L D GNSKGY F Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 311 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 369
Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
Q L S G +S ++LCL +T D L
Sbjct: 370 QQR-----------------HNTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELK 412
Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
D+EYEEILED+R+EC KYG + ++ IPRP ++ PGVGKVF+E+ C A+ AL
Sbjct: 413 ADDEYEEILEDVRDECSKYGIVRSLEIPRPYED-HPVPGVGKVFVEFASTSDCQRAQAAL 471
Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
+GRKF TV YY DKY N+ +
Sbjct: 472 TGRKFANRTVVTSYYDVDKYHNRQF 496
>gi|71996485|ref|NP_001022969.1| Protein UAF-1, isoform c [Caenorhabditis elegans]
gi|351018336|emb|CCD62280.1| Protein UAF-1, isoform c [Caenorhabditis elegans]
Length = 474
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 149/385 (38%), Positives = 211/385 (54%), Gaps = 33/385 (8%)
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
G + P +NM G G+ V V+ T +RR+YVG +P NE+A+ F
Sbjct: 123 GFETTTPMEYKNMQAAGQVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 181
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
F+Q M + G + PG+ ++ IN +K FAF+E R+++E + MA DGI F G ++VR
Sbjct: 182 FNQQM-HLCGLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 240
Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
RP DY P+ Q + ++N + + +++ + A +++F+GGLP Y TE Q+KEL
Sbjct: 241 RPRDYQPS--------QNTFDMN-SRMPVSTIVVDSA---NKIFIGGLPNYLTEDQVKEL 288
Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
L SFG L F L D GNSKGY F Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 289 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 347
Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
Q L S G +S ++LCL +T D L
Sbjct: 348 QQR-----------------HNTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELK 390
Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
D+EYEEILED+R+EC KYG + ++ IPRP ++ PGVGKVF+E+ C A+ AL
Sbjct: 391 ADDEYEEILEDVRDECSKYGIVRSLEIPRPYED-HPVPGVGKVFVEFASTSDCQRAQAAL 449
Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
+GRKF TV YY DKY N+ +
Sbjct: 450 TGRKFANRTVVTSYYDVDKYHNRQF 474
>gi|71996490|ref|NP_001022970.1| Protein UAF-1, isoform d [Caenorhabditis elegans]
gi|351018337|emb|CCD62281.1| Protein UAF-1, isoform d [Caenorhabditis elegans]
Length = 471
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 149/385 (38%), Positives = 211/385 (54%), Gaps = 33/385 (8%)
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
G + P +NM G G+ V V+ T +RR+YVG +P NE+A+ F
Sbjct: 120 GFETTTPMEYKNMQAAGQVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 178
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
F+Q M + G + PG+ ++ IN +K FAF+E R+++E + MA DGI F G ++VR
Sbjct: 179 FNQQM-HLCGLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 237
Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
RP DY P+ Q + ++N + + +++ + A +++F+GGLP Y TE Q+KEL
Sbjct: 238 RPRDYQPS--------QNTFDMN-SRMPVSTIVVDSA---NKIFIGGLPNYLTEDQVKEL 285
Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
L SFG L F L D GNSKGY F Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 286 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 344
Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
Q L S G +S ++LCL +T D L
Sbjct: 345 QQR-----------------HNTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELK 387
Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
D+EYEEILED+R+EC KYG + ++ IPRP ++ PGVGKVF+E+ C A+ AL
Sbjct: 388 ADDEYEEILEDVRDECSKYGIVRSLEIPRPYED-HPVPGVGKVFVEFASTSDCQRAQAAL 446
Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
+GRKF TV YY DKY N+ +
Sbjct: 447 TGRKFANRTVVTSYYDVDKYHNRQF 471
>gi|309266895|ref|XP_003086891.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial [Mus
musculus]
Length = 493
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 162/419 (38%), Positives = 229/419 (54%), Gaps = 29/419 (6%)
Query: 103 RSPS--KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPL 160
RSP K K R +D+ PP + P Q + +A +A +LP A
Sbjct: 101 RSPCHEKKKVRKYWDVPPPGFEHV----TPMQYKAMQAAGQILATALLPTMTPGGLAVTP 156
Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
MPV V+ Q TR ARR+YVG +P E+++ F + M + G + PG+ ++ V IN
Sbjct: 157 MPVPVVGSQMTRQARRLYVGTIPFGITEESMLDFLNTQM-HLRGLTQAPGNPILAVQINL 215
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+K FAF+E R+V+E + A+A DGIIF+G ++++RRP DY P + G PS +
Sbjct: 216 DKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMS---GSPS----VYV 268
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
G+ S + + ++F+GGLP Y + Q+KELL S G L F+LVKD TG SKGY F
Sbjct: 269 PGVVSTIV--PDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYAF 326
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
C Y D VTD A A LNG+ +GDK L V+RA+ +K SI+ Q + LQ
Sbjct: 327 CEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVG--AKNVALSIINQT-------PVTLQ 377
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
G+ + M G VLCL + L DDEEYEEI++D+R+EC KYG + ++
Sbjct: 378 VPGLTSSQVQM---GGHPTTVLCLMNMVLPKELLDDEEYEEIVDDVRDECNKYGLVKSIE 434
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ + C A L+GRKF V Y D Y +++
Sbjct: 435 IPRP-MDGVEVPGCGKIFVEFTSVIDCQKAMQGLTGRKFANRVVVTKYCDPDSYHGRNF 492
>gi|322792032|gb|EFZ16137.1| hypothetical protein SINV_12499 [Solenopsis invicta]
Length = 344
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 146/364 (40%), Positives = 211/364 (57%), Gaps = 41/364 (11%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 19 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 77
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 78 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 131
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 132 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 179
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I AQA I +
Sbjct: 180 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVP-- 233
Query: 398 ALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
G+S+ G + +VLCL +T + L ++EEYE+ILED++EEC KYG
Sbjct: 234 ------------GLSMVGTSGPATEVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGV 281
Query: 456 LVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ +V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 282 VRSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYH 340
Query: 516 NKDY 519
+++
Sbjct: 341 RREF 344
>gi|148697816|gb|EDL29763.1| mCG68163 [Mus musculus]
Length = 472
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 162/419 (38%), Positives = 229/419 (54%), Gaps = 29/419 (6%)
Query: 103 RSPS--KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPL 160
RSP K K R +D+ PP + P Q + +A +A +LP A
Sbjct: 80 RSPCHEKKKVRKYWDVPPPGFEHV----TPMQYKAMQAAGQILATALLPTMTPGGLAVTP 135
Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
MPV V+ Q TR ARR+YVG +P E+++ F + M + G + PG+ ++ V IN
Sbjct: 136 MPVPVVGSQMTRQARRLYVGTIPFGITEESMLDFLNTQM-HLRGLTQAPGNPILAVQINL 194
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+K FAF+E R+V+E + A+A DGIIF+G ++++RRP DY P + G PS +
Sbjct: 195 DKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMS---GSPS----VYV 247
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
G+ S + + ++F+GGLP Y + Q+KELL S G L F+LVKD TG SKGY F
Sbjct: 248 PGVVSTIV--PDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYAF 305
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
C Y D VTD A A LNG+ +GDK L V+RA+ +K SI+ Q + LQ
Sbjct: 306 CEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVG--AKNVALSIINQT-------PVTLQ 356
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
G+ + M G VLCL + L DDEEYEEI++D+R+EC KYG + ++
Sbjct: 357 VPGLTSSQVQM---GGHPTTVLCLMNMVLPKELLDDEEYEEIVDDVRDECNKYGLVKSIE 413
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
IPRP +G E PG GK+F+E+ + C A L+GRKF V Y D Y +++
Sbjct: 414 IPRP-MDGVEVPGCGKIFVEFTSVIDCQKAMQGLTGRKFANRVVVTKYCDPDSYHGRNF 471
>gi|350417886|ref|XP_003491628.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 2
[Bombus impatiens]
Length = 428
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 147/363 (40%), Positives = 212/363 (58%), Gaps = 39/363 (10%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 216 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 263
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I AQA I + +
Sbjct: 264 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 319
Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ TSG T +VLCL +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 320 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 366
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 367 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 425
Query: 517 KDY 519
+++
Sbjct: 426 REF 428
>gi|383854116|ref|XP_003702568.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Megachile
rotundata]
Length = 432
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 148/363 (40%), Positives = 214/363 (58%), Gaps = 35/363 (9%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ G + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 216 V------PGTV--PDSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 267
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I AQA I + +
Sbjct: 268 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 323
Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ TSG T +VLCL +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 324 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 370
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 371 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 429
Query: 517 KDY 519
+++
Sbjct: 430 REF 432
>gi|340715832|ref|XP_003396412.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Bombus
terrestris]
gi|350417884|ref|XP_003491627.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 1
[Bombus impatiens]
Length = 432
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 148/363 (40%), Positives = 214/363 (58%), Gaps = 35/363 (9%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ G + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 216 V------PGTV--PDSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 267
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I AQA I + +
Sbjct: 268 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 323
Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ TSG T +VLCL +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 324 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 370
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 371 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 429
Query: 517 KDY 519
+++
Sbjct: 430 REF 432
>gi|307176032|gb|EFN65791.1| Splicing factor U2AF 50 kDa subunit [Camponotus floridanus]
Length = 432
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 148/363 (40%), Positives = 214/363 (58%), Gaps = 35/363 (9%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ G + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 216 V------PGTV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSATGLSKG 267
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I AQA I + +
Sbjct: 268 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 323
Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ TSG T +VLCL +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 324 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 370
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 371 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 429
Query: 517 KDY 519
+++
Sbjct: 430 REF 432
>gi|332026432|gb|EGI66560.1| Splicing factor U2AF 50 kDa subunit [Acromyrmex echinatior]
Length = 435
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 147/363 (40%), Positives = 212/363 (58%), Gaps = 39/363 (10%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 110 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 168
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 169 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 222
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 223 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 270
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I AQA I + +
Sbjct: 271 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 326
Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ TSG T +VLCL +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 327 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 373
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 374 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 432
Query: 517 KDY 519
+++
Sbjct: 433 REF 435
>gi|402590758|gb|EJW84688.1| hypothetical protein WUBG_04401 [Wuchereria bancrofti]
Length = 477
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 139/358 (38%), Positives = 203/358 (56%), Gaps = 34/358 (9%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ T +RR+YVG +P +E A+ FF+Q M + G + PG+ V+ +N +K
Sbjct: 153 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQM-HLCGLAQAPGNPVLACQMNLDK 211
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+++E + MA DGI F G +++RRP DY P S + +L +
Sbjct: 212 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPM----------STSYDLGNM- 260
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + + P ++F+GGLP Y Q+KELL SFG L F+LV ++ TG SKGY F
Sbjct: 261 MVSNIV--PDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAE 318
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y DP++TD A A LNG+++GDK L V+ + A+ ++ Q + +Q +
Sbjct: 319 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVA 366
Query: 403 GMN-TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
G++ + G G +VLCL +T D L DDEEYE+ILED+REEC KYG + ++ I
Sbjct: 367 GIDLSHGAGPP------TEVLCLMNMVTEDELKDDEEYEDILEDIREECAKYGIVKSLEI 420
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PR G + GVGKVF+E+ C A+ AL+GRKF TV YY D Y + +
Sbjct: 421 PR-SVPGVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYHRRQF 477
>gi|307195151|gb|EFN77144.1| Splicing factor U2AF 50 kDa subunit [Harpegnathos saltator]
Length = 402
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 145/364 (39%), Positives = 210/364 (57%), Gaps = 41/364 (11%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 77 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 135
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P++N
Sbjct: 136 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 189
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 190 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 237
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I QA I +
Sbjct: 238 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGTQAPVQIQVP-- 291
Query: 398 ALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
G+S+ G + +VLCL +T + L ++EEYE+ILED++EEC KYG
Sbjct: 292 ------------GLSMVGSSGPATEVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGV 339
Query: 456 LVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ +V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 340 VRSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYH 398
Query: 516 NKDY 519
+++
Sbjct: 399 RREF 402
>gi|170575889|ref|XP_001893425.1| U2 auxiliary factor 65 kDa subunit [Brugia malayi]
gi|158600599|gb|EDP37742.1| U2 auxiliary factor 65 kDa subunit, putative [Brugia malayi]
Length = 502
Score = 244 bits (624), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/358 (38%), Positives = 203/358 (56%), Gaps = 34/358 (9%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ T +RR+YVG +P +E A+ FF+Q M + G + PG+ V+ +N +K
Sbjct: 178 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQM-HLCGLAQAPGNPVLACQMNLDK 236
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+++E + MA DGI F G +++RRP DY P S + +L +
Sbjct: 237 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPM----------STSYDLGNMM 286
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+++ + P ++F+GGLP Y Q+KELL SFG L F+LV ++ TG SKGY F
Sbjct: 287 VSNIV---PDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAE 343
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y DP++TD A A LNG+++GDK L V+ + A+ ++ Q + +Q +
Sbjct: 344 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVA 391
Query: 403 GMN-TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
G++ + G G +VLCL +T D L DDEEYE+ILED+REEC KYG + ++ I
Sbjct: 392 GIDLSHGAGPP------TEVLCLMNMVTEDELKDDEEYEDILEDIREECAKYGIVKSLEI 445
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PR G + GVGKVF+E+ C A+ AL+GRKF TV YY D Y + +
Sbjct: 446 PR-SVPGVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYHRRQF 502
>gi|393909510|gb|EJD75480.1| hypothetical protein LOAG_17389 [Loa loa]
Length = 502
Score = 244 bits (623), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 138/358 (38%), Positives = 203/358 (56%), Gaps = 34/358 (9%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ T +RR+YVG +P +E A+ FF+Q M + G + PG+ V+ +N +K
Sbjct: 178 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQM-HLCGLAQAPGNPVLACQMNLDK 236
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+++E + MA DGI F G +++RRP DY P S + +L +
Sbjct: 237 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPM----------STSYDLGNMM 286
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+++ + P ++F+GGLP Y Q+KELL SFG L F+LV ++ TG SKGY F
Sbjct: 287 VSNIV---PDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAE 343
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y DP++TD A A LNG+++GDK L V+ + A+ ++ Q + +Q +
Sbjct: 344 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVA 391
Query: 403 GMN-TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
G++ + G G +VLCL +T D L DDEEYE+ILED+REEC KYG + ++ I
Sbjct: 392 GIDLSHGAGPP------TEVLCLMNMVTEDELKDDEEYEDILEDIREECAKYGIVKSLEI 445
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PR G + GVGKVF+E+ C A+ AL+GRKF TV YY D Y + +
Sbjct: 446 PR-SVPGVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYHRRQF 502
>gi|380016747|ref|XP_003692335.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Apis florea]
Length = 428
Score = 244 bits (623), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 146/363 (40%), Positives = 211/363 (58%), Gaps = 39/363 (10%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA D I F+G ++++RRP DY P PG +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 216 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 263
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I AQA I + +
Sbjct: 264 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 319
Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ TSG T +VLCL +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 320 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 366
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 367 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 425
Query: 517 KDY 519
+++
Sbjct: 426 REF 428
>gi|66520699|ref|XP_623055.1| PREDICTED: splicing factor U2AF 50 kDa subunit [Apis mellifera]
Length = 432
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 147/363 (40%), Positives = 213/363 (58%), Gaps = 35/363 (9%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
N +K FAF+E R+++E + AMA D I F+G ++++RRP DY P PG +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ G + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKG
Sbjct: 216 V------PGTV--PDSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 267
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D ++TD A A LNG+++GDK L V+RA+ ++ I AQA I + +
Sbjct: 268 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 323
Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ TSG T +VLCL +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 324 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 370
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+V IPRP + G + PG GKVF+E+ + C A+ L+GRKF V Y+ DKY
Sbjct: 371 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 429
Query: 517 KDY 519
+++
Sbjct: 430 REF 432
>gi|405976087|gb|EKC40607.1| Splicing factor U2AF 50 kDa subunit [Crassostrea gigas]
Length = 428
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 205/349 (58%), Gaps = 31/349 (8%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R ARR+YVG +P E+A+ FF+ M G A G V+ V IN +K FAF+E R
Sbjct: 111 SRQARRLYVGNIPFGVTEEAMMDFFNHQMKMTGLAQA-EGSPVIAVQINLDKNFAFLEFR 169
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+V+E + AMA DGI F+G ++++RRP DY P A +P++N+ G+ S +
Sbjct: 170 SVDETTQAMAFDGINFQGQSLKIRRPRDYQPLPGMA-----ETPSVNVP--GVVSTVV-- 220
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
+ P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKGY FC Y DP VTD
Sbjct: 221 QDSPHKIFIGGLPNYLNEDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDPNVTD 280
Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
CA LNG+++GDK L V+RA+ L + +Q LQ G+N G
Sbjct: 281 QGCAGLNGMQLGDKKLIVQRAS------------LGAKNSQVPVQ---LQIPGLNLNQG- 324
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
+VLCL I + L D+EEYE+ILED++EEC KYG + ++ IPRP + G +
Sbjct: 325 ----AGPPTEVLCLMNMIVPEELEDEEEYEDILEDVKEECSKYGVVRSIEIPRPIK-GVD 379
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PG GK+F+E+ + C A+ AL+GRKF V YY DKY +++
Sbjct: 380 VPGCGKIFVEFNSIIDCQKAQQALTGRKFSNRVVVTSYYDPDKYHRREF 428
>gi|198432986|ref|XP_002130494.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
factor (U2AF) 2 isoform 2 [Ciona intestinalis]
Length = 472
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/360 (40%), Positives = 208/360 (57%), Gaps = 35/360 (9%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V Q TR ARR+YVG +P E A+ FF+ M I G + PG ++ V IN +K
Sbjct: 146 VPVAGSQMTRQARRLYVGNIPFGVTEDAMMDFFNNQMQ-IAGLAQAPGQPILAVQINLDK 204
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+V+E + A+A DGI F ++++RRP+DY P L +L QP+ +L G
Sbjct: 205 NFAFLEFRSVDETTQALAFDGINFMNQSLKIRRPSDYKP-LPGSLE--QPAIHLP----G 257
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ S + ++ ++F+GGLP Y + Q+KELL SFG L F+LVKD T SKGY F
Sbjct: 258 VISTVVQDSQ--HKMFIGGLPNYLNDDQVKELLTSFGPLRAFNLVKDSATALSKGYAFAE 315
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH--IAIQKMALQ 400
+ D ++TD A A LNG+++GDK L V+RA SI A+ H I I MA
Sbjct: 316 FADYSLTDQAIAGLNGMQLGDKKLIVQRA-----------SIGAKNNPHGAIMIPGMAHA 364
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
T G G + VLCL + + L DDEEYEEI+ED+++ECGK G++V++
Sbjct: 365 T------GAGPA------TTVLCLMNMVLPEELTDDEEYEEIMEDVKDECGKLGSVVSLE 412
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
IPRP E GVGK+++E+ + + A ALSGRKF V ++ ++Y ++ +S
Sbjct: 413 IPRPGPGLTEADGVGKIYVEFANHLDTQKAAQALSGRKFSNRVVVTSFFDPERYHHRQFS 472
>gi|403175591|ref|XP_003888994.1| hypothetical protein PGTG_22302 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171670|gb|EHS64431.1| hypothetical protein PGTG_22302 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 713
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 145/369 (39%), Positives = 201/369 (54%), Gaps = 35/369 (9%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIG----GNSAGPG-----DAVVNVY 217
TQ R ARR+YVG + ANE +A FF+ M +G N G + VV V
Sbjct: 329 TQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLARNNEDGMAISISENPVVAVQ 388
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
+NHEK +AFVE R EEA++ M+ DGIIF+ A+++RRP DY GP P
Sbjct: 389 VNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYT-------GPDHAGPT-- 439
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN--S 335
G+ S + + P+++F+GGLP Y T+ Q+ ELL+SFG L F+LVKD +G S
Sbjct: 440 -HIPGVVSTNV--PDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGHVS 496
Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL-AQAQQHIAI 394
KG+ FC Y DP +TDIAC LNG+++GD+ L V+RA +K E+E+ Q +
Sbjct: 497 KGFAFCEYVDPDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKENNPDGQRNNYNQF 556
Query: 395 QKMA-LQTSGMNTLGGGMSLFGE-TLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
A Q + + GE +VL + + + L DD+EY EILED+R+ECGK
Sbjct: 557 NNFAGGQATAAASSVLAAVKSGEGEKTRVLQMLNMVNQEELVDDQEYGEILEDIRDECGK 616
Query: 453 YGTLVNVVIPRPDQN---------GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNT 503
YG + V IPRP +N G+GKVF+ + CA A A++GR+F G
Sbjct: 617 YGKIEGVRIPRPIKNEKGRIDLKASESVDGLGKVFVMFEKVDECAAALLAIAGRQFAGRV 676
Query: 504 VNAFYYPED 512
+ Y PED
Sbjct: 677 IICAYAPED 685
>gi|427789501|gb|JAA60202.1| Putative splicing factor u2af large subunit rrm superfamily
[Rhipicephalus pulchellus]
Length = 462
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 146/361 (40%), Positives = 210/361 (58%), Gaps = 28/361 (7%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V ++ TR ARR+YVG +P +E+ + +F+ M A G + A PG+ V+ I
Sbjct: 130 PPAAVPIVGGTITRQARRLYVGNIPFGCSEEEMMDYFNAQMHACGFSQA-PGNPVLACQI 188
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG S ++
Sbjct: 189 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGM-SETPSV 241
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A G+ S + + P ++F+GGLP Y E Q++ELL SFG L F+LVKD TG SKGY
Sbjct: 242 AVPGVISTVV--QDSPHKIFIGGLPNYLNEDQVRELLMSFGQLRAFNLVKDSATGLSKGY 299
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
FC Y + A TD A LNG+++GDK L V+RA+ +K Q + Q + IQ
Sbjct: 300 AFCEYVEVATTDQAIMGLNGMQLGDKKLIVQRASVG--AKNSQMN-----QAPVQIQVPG 352
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
LQ G G G +VLCL + + L D+EEYE+ILED+ EEC KYG + +
Sbjct: 353 LQLQG----GAGPP------TEVLCLMNLVCPEELKDEEEYEDILEDIHEECNKYGVVKS 402
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP +G E PG GK ++E+ + C A+ +L+GRKF V Y+ DKY ++
Sbjct: 403 IEIPRPI-DGVEVPGCGKAYVEFNSVIDCQKAQQSLTGRKFSNRVVVTSYFDPDKYHRRE 461
Query: 519 Y 519
+
Sbjct: 462 F 462
>gi|226483519|emb|CAX74060.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
gi|226483521|emb|CAX74061.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
Length = 518
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 170/447 (38%), Positives = 234/447 (52%), Gaps = 36/447 (8%)
Query: 76 KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
K +RH R RS SS + + P GF+ PA A+ GQ
Sbjct: 105 KSKRHHSRHRSASSPTLVYKYWDVPPP----------GFEHVTPAQYKALQAS--GQ--- 149
Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
+P V Q +P A V R ARR+YVG +P A E+ + FF
Sbjct: 150 IPVNVYAAGQVPMPVHAPNAPLTLTTNVPFAGSAVCRQARRLYVGNIPFTATEENMMEFF 209
Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
++ M A G A G+ ++ V IN EK FAF+E R+V+E + +ALDG++F+ A+++RR
Sbjct: 210 NKQMRAQGLIQA-EGNPIIAVQINMEKNFAFLEFRSVDETTQGLALDGVLFQNQALKLRR 268
Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
P DY P + P P G+ S + + P ++FVGGLP Y E Q+KELL
Sbjct: 269 PRDYAPLPGVSEQPSVIVP-------GVVSTVV--QDSPHKIFVGGLPNYLNEDQVKELL 319
Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
SFG L GF+LVKD TG SKGY FC Y D VTD ACA LNG+++GDK L V+RA+
Sbjct: 320 LSFGPLKGFNLVKDGSTGLSKGYAFCEYVDANVTDHACAGLNGMQLGDKKLIVQRASVGA 379
Query: 376 QSKTEQESILAQAQQHI-AIQKMALQTSGMNTLGGGMSLF--GETLAKVLCLTEAITADA 432
+ T +L Q + ++ +Q NT G G G +VLCL I
Sbjct: 380 KHTT---GVLPQTLLSLPGLEDGTVQ----NTTGSGNITIRSGGPPTEVLCLMNMIETSE 432
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
L DDEEYE+I+ED+R EC KYG + ++ IPRP G E PGVGK+++E+ + C A
Sbjct: 433 LEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIP-GVEVPGVGKIYVEFASLIDCQKAAT 491
Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
AL+GRKF V ++ D Y +++
Sbjct: 492 ALTGRKFNQRLVVTSFFSPDNYHRREF 518
>gi|328862941|gb|EGG12041.1| hypothetical protein MELLADRAFT_41759 [Melampsora larici-populina
98AG31]
Length = 397
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 141/368 (38%), Positives = 198/368 (53%), Gaps = 46/368 (12%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA---------VVNVY 217
TQ R ARR+YVG + ANE +A FF+ M +G D VV V
Sbjct: 26 TQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLVRNGEDGSMISISENPVVAVQ 85
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALG--PGQPSPN 275
+NHEK +AFVE R EEA++ M+ DGIIF+ A+++RRP DY T + PG S N
Sbjct: 86 VNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYTGTEHTSTNHIPGVVSTN 145
Query: 276 LNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN- 334
+ + P+++F+GGLP Y T+ Q+ ELL+SFG L F+LVKD +G
Sbjct: 146 V--------------PDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGQ 191
Query: 335 -SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
SKG+ FC Y D +TDIAC LNG+++GD+ L V+RA +K E+E+ +
Sbjct: 192 VSKGFAFCEYVDSDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKENAEGAGGAGMG 251
Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+ +G N G KVL + + + L DDEEY+EILED++EEC KY
Sbjct: 252 VGGF----NGTNRASEG------ERTKVLQMLNMVNPEELVDDEEYKEILEDIKEECSKY 301
Query: 454 GTLVNVVIPRPDQN---------GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
G + +V IPRP +N G+GKVF+++ CA A +A++GR+F G +
Sbjct: 302 GQIEDVKIPRPAKNEKGRMDLKSSESIEGLGKVFIKFERIEDCAQALSAIAGRQFAGRVI 361
Query: 505 NAFYYPED 512
Y ED
Sbjct: 362 ICAYASED 369
>gi|440900150|gb|ELR51345.1| Splicing factor U2AF 65 kDa subunit, partial [Bos grunniens mutus]
Length = 411
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 162/425 (38%), Positives = 230/425 (54%), Gaps = 39/425 (9%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
RSP K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 17 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 74
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+
Sbjct: 75 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 126
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S
Sbjct: 127 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 179
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
N ++ G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG
Sbjct: 180 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 237
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKGY FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +
Sbjct: 238 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTL 293
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
Q L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+ECGKYG
Sbjct: 294 QVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECGKYG 345
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ ++ IPR ++F+E+ C A L+GRKF V Y D Y
Sbjct: 346 LVKSIEIPRQAWVFTSLILFLQIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSY 405
Query: 515 FNKDY 519
+D+
Sbjct: 406 HRRDF 410
>gi|321479007|gb|EFX89963.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 1
[Daphnia pulex]
Length = 487
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 154/422 (36%), Positives = 224/422 (53%), Gaps = 56/422 (13%)
Query: 134 PGVPSAVPEMA-QNMLPFGATQLGAFPLMPVQVMTQQA------------TRHARRVYVG 180
P + +P + +++ P + A +P +MT+ A TR ARR+YVG
Sbjct: 86 PSIYWDIPPIGFEHITPMQYKAMQAAGQIPANIMTEVAPQVQVPVVGNTITRQARRLYVG 145
Query: 181 GLP-----PLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEA 235
+P A E+ + FF+Q M + G S PG ++ IN +K FAF+E R+++E
Sbjct: 146 NIPFGVSDVRAAEEEMMDFFNQQM-HLSGLSQAPGHPILACQINLDKNFAFLEFRSIDET 204
Query: 236 SNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA---- 291
S AMA DGI F+G +++RRP DY+PT G G G++SG I
Sbjct: 205 SQAMAFDGINFKGQTLKIRRPHDYHPTPGGGGGGGGGG-GGPETTPGMSSGGITEKRAGG 263
Query: 292 --------------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+ P ++F+GGLP Y E Q ELL SFG L F+LVKD TG SKG
Sbjct: 264 GGGGGGGTMSTIVPDTPHKLFIGGLPNYLNEEQ--ELLMSFGQLRAFNLVKDTATGLSKG 321
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y FC Y D VTD A + LNG+++GDK + ++RA+ ++ A A + +Q
Sbjct: 322 YAFCEYADVTVTDQAISGLNGMQLGDKKIIIQRASVGAKN--------AAAYAQMPVQ-- 371
Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
+Q G N G G+ +VLCL +T + L DDEEYEEI++D+REEC ++G +
Sbjct: 372 -IQVPGFNLAAGP----GQP-TEVLCLLNMVTPEELRDDEEYEEIVDDIREECNRHGAVR 425
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
+V IPRP + + PGVGKVF+E+ C A+ AL+GRKF V +Y ++Y +
Sbjct: 426 SVEIPRPLEGVDDVPGVGKVFVEFISVSDCVKAQQALTGRKFANRIVVTSFYEPERYHRR 485
Query: 518 DY 519
D+
Sbjct: 486 DF 487
>gi|256074204|ref|XP_002573416.1| splicing factor u2af large subunit [Schistosoma mansoni]
gi|238658595|emb|CAZ29648.1| splicing factor u2af large subunit, putative [Schistosoma mansoni]
Length = 521
Score = 241 bits (614), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 170/447 (38%), Positives = 234/447 (52%), Gaps = 36/447 (8%)
Query: 76 KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
K +RH R RS SS + + P GF+ PA A+ GQ
Sbjct: 108 KSKRHHSRHRSASSPTLVYKYWDVPPP----------GFEHVTPAQYKALQAS--GQ--- 152
Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
+P V Q +P A V R ARR+YVG +P A E+ + FF
Sbjct: 153 IPVNVYAAGQVPMPVHAPNAPLTLTTNVPFAGSAVCRQARRLYVGNIPFTATEENMMEFF 212
Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
++ M A G A G+ ++ V IN EK FAF+E R+V+E + +ALDG++F+ A+++RR
Sbjct: 213 NKQMRAQGLIQA-EGNPIIAVQINMEKNFAFLEFRSVDETTQGLALDGVLFQNQALKLRR 271
Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
P DY P + P P G+ S + + P ++FVGGLP Y E Q+KELL
Sbjct: 272 PRDYAPLPGVSEQPSVIVP-------GVVSTVV--QDSPHKIFVGGLPNYLNEDQVKELL 322
Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
SFG L GF+LVKD TG SKGY FC Y D VTD ACA LNG+++GDK L V+RA+
Sbjct: 323 LSFGPLKGFNLVKDGSTGLSKGYAFCEYVDANVTDHACAGLNGMQLGDKKLIVQRASVGA 382
Query: 376 QSKTEQESILAQAQQHI-AIQKMALQTSGMNTLGGGMSLF--GETLAKVLCLTEAITADA 432
+ T +L Q + ++ +Q NT G G G +VLCL I
Sbjct: 383 KHTT---GVLPQTLLSLPGLEDGTVQ----NTTGSGNITIRSGGPPTEVLCLMNMIETSE 435
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
L DDEEYE+I+ED+R EC KYG + ++ IPRP G E PGVGK+++E+ + C A
Sbjct: 436 LEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIP-GVEVPGVGKIYVEFASLIDCQKAAT 494
Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
AL+GRKF V ++ D Y +++
Sbjct: 495 ALTGRKFNQRLVVTSFFSPDNYHRREF 521
>gi|299470773|emb|CBN79819.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 636
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 143/404 (35%), Positives = 222/404 (54%), Gaps = 58/404 (14%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
+ Q TRHARR+Y+GG P E+ +++FF++V+ G AV +VY++ EK FAF
Sbjct: 239 SNQQTRHARRLYIGGCPK-TTEEEMSSFFNEVINR-ALEYPIDGGAVASVYVSQEKAFAF 296
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
+E++T+E A++ + LDGI+++ +++RRP+DYNP L A P P LNL+ +G+ S
Sbjct: 297 LELKTMELATSVLELDGIVYKETQLKMRRPSDYNPQLVPAAS--GPIPKLNLSVLGIISS 354
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
+ ++ ++VF+GGLPY TE Q++ELL +FG L F LV+D + SKGYGFC Y +
Sbjct: 355 TVPDSD--NKVFIGGLPYNLTEDQVRELLSAFGPLKSFHLVRDPGSPTSKGYGFCEYLNA 412
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQS----------------------------- 377
VT IAC L+G+ +GDKTLTVR AT G S
Sbjct: 413 GVTAIACEGLHGMTLGDKTLTVRPATDRGSSGGQQQQQQQQQQQQHYQQQAMPGMQMGGG 472
Query: 378 --------KTEQESI-LAQAQQHI-------AIQKMALQTSGMNTLGGGMSLFGETLAK- 420
T+QE + LAQA + A+ +A +G S L +
Sbjct: 473 QQMGLTAPPTQQEQMQLAQAMAALDPNASQAAMASLAQGGAGAVPSAAAESAVAANLIRA 532
Query: 421 -----VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
VL + +T L D +EYE+I++D+++EC +GT++++++PRP + + VG
Sbjct: 533 IPPTRVLVMLNMVTEAELRDPQEYEDIVDDIQQECSSHGTVLSIIVPRPGE-ADASRAVG 591
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
KVF+EY A +L+GR+FG N V Y E+K+ +++
Sbjct: 592 KVFVEYDTKDSAQKAALSLAGRQFGANIVKVEYLNEEKFARREF 635
>gi|443731660|gb|ELU16702.1| hypothetical protein CAPTEDRAFT_155651 [Capitella teleta]
Length = 480
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 145/349 (41%), Positives = 209/349 (59%), Gaps = 25/349 (7%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R ARR+YVG +P E+ + FF+ M + S G+ V+ +N +K FAF+E R
Sbjct: 157 SRQARRLYVGNIPFGVTEEMMMDFFNTQMK-MAALSQAEGNPVIACQVNLDKNFAFLEYR 215
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+V+E ++AMALDGI F+G ++++RRP DY P A P N+A G+ S +
Sbjct: 216 SVDETTHAMALDGINFQGQSLKIRRPKDYQPLPGIAETP-------NVAVPGVVSTVV-- 266
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
+ ++F+GGLP Y E Q+KELL SFG L F LVKD TG SKGY FC Y D ++TD
Sbjct: 267 QDSAHKIFIGGLPNYLNEDQVKELLTSFGPLKAFSLVKDSATGLSKGYAFCEYLDISITD 326
Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
ACA LNG+++GDK L V+RA+ ++ +I++ A + I +A+
Sbjct: 327 QACAGLNGMQLGDKKLIVQRASVGAKNA----AIISSAPMQMQIPGLAVNP--------- 373
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
M+ G +VLCL + + L D+EEYE+ILED++EEC KYG + ++ IPRP + G E
Sbjct: 374 MAAAGPA-TEVLCLMNMVLPEELEDEEEYEDILEDVKEECSKYGFVKSLEIPRPIK-GVE 431
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
PGVGK+F+E++ A+NALSGRKF TV Y DKY +++
Sbjct: 432 VPGVGKIFVEFHSITDSQKAQNALSGRKFANRTVITSYCDPDKYHRREF 480
>gi|340373805|ref|XP_003385430.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Amphimedon
queenslandica]
Length = 529
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 141/370 (38%), Positives = 195/370 (52%), Gaps = 56/370 (15%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
Q TR ARR+Y+G +P E+ + FF++ M SA PG V+ V IN +K FAF+E
Sbjct: 198 QLTRQARRLYIGNIPFGIAEEVMVNFFNEKMLEAKLCSA-PGIPVLAVQINMDKNFAFIE 256
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R+VEE +NAMA DGI+ +G ++++RRP DY P + P G+ S +
Sbjct: 257 FRSVEETTNAMAFDGIVLQGQSLKIRRPKDYAPIPGVDIMPKH--------VPGVISTVV 308
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+GP +VF GGLP Y ++ Q+KELL SFG L F+LVKD T SKGY F Y D V
Sbjct: 309 --PDGPHKVFCGGLPTYLSDDQVKELLSSFGDLKAFNLVKDSGTSFSKGYCFFEYLDTDV 366
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQE-----------------SILAQAQQH 391
TD A LNG+ +GDK L V+RA+ + E + + + AQ
Sbjct: 367 TDGAIQGLNGMALGDKKLVVQRASVGAKVMEEYDISTDITSMAMPISIPGLQMPSTAQ-- 424
Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG 451
T VLCL T + L DD+EYE ILED+REEC
Sbjct: 425 -------------------------TATTVLCLMNMTTEEELRDDDEYEGILEDVREECS 459
Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
YG +++V PRP + G PG+GK+F+E+ C A+ AL+GRKF V ++
Sbjct: 460 NYGQVLSVAAPRPVE-GTLVPGLGKIFVEFAATDQCQRAQTALAGRKFANRVVVTSFFDL 518
Query: 512 DKYFNKDYSA 521
+KY K+++A
Sbjct: 519 EKYRQKNFAA 528
>gi|331243454|ref|XP_003334370.1| splicing factor U2AF subunit [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 600
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 147/379 (38%), Positives = 203/379 (53%), Gaps = 55/379 (14%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIG----GNSAGPG-----DAVVNVY 217
TQ R ARR+YVG + ANE +A FF+ M +G N G + VV V
Sbjct: 216 TQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLARNNEDGMAISISENPVVAVQ 275
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
+NHEK +AFVE R EEA++ M+ DGIIF+ A+++RRP DY GP P
Sbjct: 276 VNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYT-------GPDHAGPT-- 326
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN--S 335
G+ S + + P+++F+GGLP Y T+ Q+ ELL+SFG L F+LVKD +G S
Sbjct: 327 -HIPGVVSTNV--PDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGHVS 383
Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQ 395
KG+ FC Y DP +TDIAC LNG+++GD+ L V+RA +K E+E+ Q++ Q
Sbjct: 384 KGFAFCEYVDPDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKEN-NPDGQRNNYNQ 442
Query: 396 KMALQTSGMNTLGGGMSLF-------------GETLAKVLCLTEAITADALADDEEYEEI 442
N GG + GE +VL + + + L DD+EY EI
Sbjct: 443 --------FNNFAGGQATAAASSVLAAVKSGEGEK-TRVLQMLNMVNQEELVDDQEYGEI 493
Query: 443 LEDMREECGKYGTLVNVVIPRPDQN---------GGETPGVGKVFLEYYDAVGCATAKNA 493
LED+R+ECGKYG + V IPRP +N G+GKVF+ + CA A A
Sbjct: 494 LEDIRDECGKYGKIEGVRIPRPIKNEKGRIDLKASESVDGLGKVFVMFEKVDECAAALLA 553
Query: 494 LSGRKFGGNTVNAFYYPED 512
++GR+F G + Y PED
Sbjct: 554 IAGRQFAGRVIICAYAPED 572
>gi|195380577|ref|XP_002049047.1| GJ20975 [Drosophila virilis]
gi|194143844|gb|EDW60240.1| GJ20975 [Drosophila virilis]
Length = 428
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 128/354 (36%), Positives = 200/354 (56%), Gaps = 26/354 (7%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
TR ARR+YVG +P ++ + FF++ + + G G+AV+ N +K FAF+E
Sbjct: 96 VTRQARRLYVGNIPFSTTDEDMMAFFNEQIHRLNGTQGHDGNAVLTCQTNLDKNFAFLEF 155
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNP----TLAAALGPGQPSPNLNLAAVGLAS 285
R+++EA+ A+ DGI + G +++RRP DY+P T A + + + ++ + + ++
Sbjct: 156 RSMDEATQAITFDGISYRGQTLKIRRPHDYHPVASITTAEIVEIAKGATQIHASNLPISP 215
Query: 286 GAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
+ P ++++GGLP E Q+KELL +FG L GF+LVKD TG SKG+ FC Y D
Sbjct: 216 VV---PDSPHKIYIGGLPTCLNEEQVKELLVTFGQLRGFNLVKDTITGQSKGFAFCEYLD 272
Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
P++T+ A A LNG+++GD+ L V+R+ +A + +A Q LQ G
Sbjct: 273 PSITEQAIAGLNGMQLGDRKLVVQRS-------------IAGVRNMVASQLPVLQVPGF- 318
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
+ + +VLCL + L D++EYE+I D+++EC KYG + ++ IPRP
Sbjct: 319 ----PIDVSTCKATEVLCLLNMVLPSELLDNDEYEDIRTDIKQECAKYGKVKSLKIPRPI 374
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ +T G GKVF+ + C A NALSGRKF G V +Y DKY KD+
Sbjct: 375 GDPPQT-GCGKVFVRFESIEDCKKALNALSGRKFSGRIVMTSFYDLDKYKRKDF 427
>gi|358338608|dbj|GAA57078.1| splicing factor U2AF large subunit [Clonorchis sinensis]
Length = 518
Score = 237 bits (605), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 166/422 (39%), Positives = 228/422 (54%), Gaps = 50/422 (11%)
Query: 113 GFDMAPPA--AAMLPGAAVP------GQLPGVPSAVPEMAQNM---LPFGATQLGAFPLM 161
GF+ PA AM +P GQ+P +P+ P + +PF + +
Sbjct: 132 GFEHVTPAQYKAMQASGQIPVNVYAAGQIP-MPAHAPNAPLTLTTNIPFAGSAV------ 184
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
R ARR+YVG +P A E+ + FF++ M A G A G ++ V IN E
Sbjct: 185 ---------CRQARRLYVGNIPFTATEENMMEFFNKQMRAQGLVQAE-GSPIIAVQINME 234
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K FAF+E R+V+E + +ALDGI+F A+++RRP DY P + P P
Sbjct: 235 KNFAFLEFRSVDETTQGLALDGILFHNQALKLRRPRDYAPLPGVSETPSVIVP------- 287
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G+ S + + P +VFVGGLP Y E Q+KELL SFG L GF+LVKD TG SKGY FC
Sbjct: 288 GVVSTVV--QDSPHKVFVGGLPNYLNEDQVKELLLSFGPLKGFNLVKDGSTGLSKGYAFC 345
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT--ASGQSKTEQESILAQAQQHIAIQKMAL 399
Y DP VTD ACA LNG+++GDK L V+RA+ A + T ++L Q + +
Sbjct: 346 EYVDPNVTDHACAGLNGMQLGDKKLIVQRASVGAKHNATTAAPALL----QLPGLTDTLV 401
Query: 400 QTSGMNTLGGGMSLF--GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
Q NT G G G +VLCL I L DDEEYE+I+ED+R EC KYG +
Sbjct: 402 Q----NTTGTGNITIRSGGPPTEVLCLMNMIDPAELEDDEEYEDIVEDVRAECSKYGVVR 457
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
++ IPRP + G E PGVGK+++E+ + C A AL+GRKF V ++ D Y +
Sbjct: 458 SLEIPRPIR-GVEVPGVGKIYVEFASLIDCQKAATALTGRKFNQRLVVTSFFNPDNYHRR 516
Query: 518 DY 519
++
Sbjct: 517 EF 518
>gi|195489053|ref|XP_002092574.1| GE11595 [Drosophila yakuba]
gi|194178675|gb|EDW92286.1| GE11595 [Drosophila yakuba]
Length = 437
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/362 (36%), Positives = 205/362 (56%), Gaps = 38/362 (10%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIG--GNSAGPGDAVVNVYINHEKKFAFVE 228
TR ARR+YVG +P E+ + FF+Q + A+G G G AV+ N EK FAF+E
Sbjct: 98 TRQARRLYVGNIPFGVTEEEMMEFFNQQLMALGLEGAQYLDGKAVLTCQTNLEKNFAFLE 157
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALG----------PGQPSPNLNL 278
R+++EA+ A+ DGIIF G +++RRP DY P + + P + N +
Sbjct: 158 FRSMDEATQALNFDGIIFRGQILKIRRPHDYQPVPSIRVSNMESYRSFRLPATTTTNPPI 217
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
+ + ++S + P++++VGGLP + QIK+LL+SFG L G +LVKD +T SKG+
Sbjct: 218 STIAVSSIV---PDSPNKIYVGGLPTCLDQDQIKDLLQSFGELKGLNLVKDINTSLSKGF 274
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
F Y DP+VTD A A L+G+++GD+ L V+R+ G++ +++Q+
Sbjct: 275 AFFEYIDPSVTDHAIAGLHGMQLGDRRLVVQRSIPGGKN-------------GLSVQQPI 321
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
+Q G++TL L + ++LCL + + L D+EE+E+I D+++EC KYG + +
Sbjct: 322 VQVPGISTL-----LDPGSPTEILCLLNMVLPEELLDNEEFEDIRSDIKQECAKYGDVRS 376
Query: 459 VVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ IPRP G+ P G GKVF+++ A ALSGRKF G V +Y DKY
Sbjct: 377 IKIPRP---VGQFPKRGCGKVFVQFESVEDSQKALKALSGRKFSGRIVMTSFYDPDKYLQ 433
Query: 517 KD 518
D
Sbjct: 434 DD 435
>gi|297837403|ref|XP_002886583.1| hypothetical protein ARALYDRAFT_338287 [Arabidopsis lyrata subsp.
lyrata]
gi|297332424|gb|EFH62842.1| hypothetical protein ARALYDRAFT_338287 [Arabidopsis lyrata subsp.
lyrata]
Length = 233
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/246 (53%), Positives = 169/246 (68%), Gaps = 20/246 (8%)
Query: 276 LNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNS 335
+ L+ G +G + EGPDR+FVGGLP+YFT+ QI+E+LE G L GF+L+KDR TG+S
Sbjct: 4 VELSTTGSTTGDL--VEGPDRIFVGGLPHYFTDAQIREILECLGPLRGFNLLKDRQTGDS 61
Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESI-LAQAQQHIA 393
KGY FCVYQDP+VTDIACAALNG+K+GDKTL VRRA + Q K EQE + AQQ IA
Sbjct: 62 KGYAFCVYQDPSVTDIACAALNGIKIGDKTLAVRRAMQGTIQPKPEQEEVLQQIAQQQIA 121
Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+Q++ L+ G+ T K++CL++ +T D L + EEY +I MR+E GK+
Sbjct: 122 LQRLMLEPGGIPT-------------KIVCLSQLVTIDNLRNYEEYADI---MRQEGGKF 165
Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
G LVNVVIPRP+ + G TPGVG VFLEY D G + A+ ++GR GG V A YYPEDK
Sbjct: 166 GNLVNVVIPRPNPDHGPTPGVGNVFLEYADVDGSSKARLEMNGRIVGGYQVVAVYYPEDK 225
Query: 514 YFNKDY 519
Y DY
Sbjct: 226 YAQGDY 231
>gi|443898020|dbj|GAC75358.1| splicing factor U2AF, large subunit [Pseudozyma antarctica T-34]
Length = 699
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 145/407 (35%), Positives = 222/407 (54%), Gaps = 42/407 (10%)
Query: 132 QLPGV-PSAVPEMAQNMLPFGATQLGAFPLMPVQVM-TQQATRHARRVYVGGLPPLANEQ 189
Q PG+ P + Q PF P Q + T + R ARR+YVG + ANE
Sbjct: 284 QHPGMYPQDGQQYTQGQPPFAGQYQQGHPASHNQALATADSGRQARRLYVGNITHQANEP 343
Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
++ FF++ M + + PG+ V+ +N +K +AFVE R +EA+NAM+ DGI+F+G
Sbjct: 344 SMVAFFNEQMLKLKLGTE-PGEPAVSAQVNVDKGYAFVEFRHPDEATNAMSFDGIVFQGQ 402
Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
++++RRP DY GP P+ ++ G+ S + + P ++FVGGLP Y T+
Sbjct: 403 SLKIRRPKDYT-------GPDVRPPS-SIHVPGVISTNV--PDSPFKIFVGGLPTYLTDD 452
Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
Q+ ELL++FG L F+LVKD T SKG+ FC Y D A+TD+AC LNG+++GD+ L V+
Sbjct: 453 QVIELLQAFGELRSFNLVKDPATNASKGFAFCEYVDTALTDLACQGLNGMELGDRNLVVQ 512
Query: 370 RATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAIT 429
RA+ + K + A A + + + +S G G GE + ++ L +T
Sbjct: 513 RASVGSEKKAQ-----AIAAYGANVGALGVPSSVQQFAGAGGDA-GEPTSCMVMLN-MVT 565
Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRP--------------DQNG------- 468
+ L DDEEY +I+ED+R+EC K+GT+ +V +PRP QN
Sbjct: 566 PEELQDDEEYADIVEDIRDECTKFGTVNDVRVPRPAKESKGAAAHQWKRSQNDEAADAGK 625
Query: 469 -GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
E GVG+V++ Y + CA A +++GR+FGG TV + ED +
Sbjct: 626 PSEREGVGRVYVRYAETDQCAQALKSIAGRQFGGRTVICAFLKEDDW 672
>gi|195124159|ref|XP_002006561.1| GI21125 [Drosophila mojavensis]
gi|193911629|gb|EDW10496.1| GI21125 [Drosophila mojavensis]
Length = 427
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/352 (36%), Positives = 199/352 (56%), Gaps = 23/352 (6%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
TR ARR+YVG +P ++ + FF++ + + G + G+AV+ N +K FAF+E
Sbjct: 96 VTRQARRLYVGNIPFSTTDEDMMAFFNEQINRLNGTNGVDGNAVLTCQTNLDKNFAFLEF 155
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R+++EA+ A+ DGI++ G +++RRP DY+P A++ + + +A + S I
Sbjct: 156 RSMDEATQAINFDGILYRGQTLKIRRPHDYHPM--ASVSSSEAADAAKGSATHVNSVPIS 213
Query: 290 GA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+ P +++VGGLP E Q+KELL +FG L GF+LVK+ TG SKG+ FC Y DP
Sbjct: 214 PMVPDSPHKIYVGGLPTCLNEEQVKELLVTFGKLRGFNLVKEAVTGQSKGFAFCEYVDPC 273
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
+T+ A A LNG+++GD+ L V+R+ +A + +A Q LQ G
Sbjct: 274 ITEQAIAGLNGMQLGDRKLIVQRS-------------IAGVRNLVANQLPVLQVPGF--- 317
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
+ + +VLCL + L DD+EY++I D+++EC KYG + ++ IPRP +
Sbjct: 318 --PVDVSTGKATEVLCLLNMVLPSELTDDDEYDDIRTDIKQECAKYGKVKSLKIPRPGDD 375
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ G GKVF+ + C A NALSGRKF G V +Y +KY KD+
Sbjct: 376 SIQG-GCGKVFVRFESIDDCKKALNALSGRKFSGRIVMTSFYDLEKYKRKDF 426
>gi|392572624|gb|EIW65769.1| hypothetical protein TREMEDRAFT_41238 [Tremella mesenterica DSM
1558]
Length = 596
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 147/415 (35%), Positives = 210/415 (50%), Gaps = 74/415 (17%)
Query: 125 PGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPP 184
PG P G+PSA+ +A + P GA L R A+R+YVGG+
Sbjct: 201 PGRVPPPPELGIPSAL--IAGSFPPPGANGL----------------RQAKRIYVGGITE 242
Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
+ ++ FF+ M+ G PGD + V +NHEK FAF+E R+ EEAS+A+ LD +
Sbjct: 243 SMTDASLLEFFNTTMSERGFTLEIPGDPIGAVQVNHEKAFAFLEFRSAEEASSALKLDNV 302
Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
+FE V +RV+RP DY L P Q + GA ++ P+++F+GGLP
Sbjct: 303 MFEDVPLRVKRPKDYT-----GLDPLQHT----------MGGAQAMSDSPNKLFIGGLPT 347
Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
Y E Q+ ELL+SFG L F+LVKD D+ +KG+ F Y DP+ TD+A + LN +GD+
Sbjct: 348 YLDEAQVMELLKSFGELRSFNLVKDPDSSENKGFAFAEYTDPSNTDMAISGLNNFSLGDR 407
Query: 365 TLTVRRATASGQSKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
L V+RA S T ES LA++ AI Q SG +
Sbjct: 408 ILVVQRAAVGRASGTTDAIPGSESFLAKS----AIFAQENQQSG-------------PTS 450
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE--------- 470
+V+ L +TAD L DD+EY+EILED+ EC ++G + V +PRP +
Sbjct: 451 RVMLLLNMVTADELYDDQEYQEILEDITSECSRFGEIEGVRVPRPVPKSKKWEPSESAVV 510
Query: 471 ----------TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
GVG+VF+ Y D A N+L GR+FGG T+ PE+++
Sbjct: 511 TQERARRADLAAGVGRVFVMYKDLASTEKAMNSLGGRQFGGRTIVVANVPEEEFL 565
>gi|430813569|emb|CCJ29085.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 545
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 137/397 (34%), Positives = 209/397 (52%), Gaps = 48/397 (12%)
Query: 129 VPG--QLPGVPSAVPEMAQNMLPFGATQL-----GAFPLMPVQVMTQQATRHARRVYVGG 181
VPG LPG P Q+M+ GA + Q + +R +RR++VG
Sbjct: 182 VPGLFPLPGAPR------QSMMDLSKLSTVHKGPGAMNIPNPQALQPLQSRQSRRIHVGN 235
Query: 182 LPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMAL 241
+P +E+ + FF+ M+ + ++G + V++ +NHEK +AF+E R E+A+ A+
Sbjct: 236 IPQPIDEEHLVNFFNDTMSCLNVTTSG-DNPVISAQVNHEKGYAFLEFRQPEDATVAIGF 294
Query: 242 DGIIFEGVAVRVRRPTDY----NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRV 297
DGI + ++++RRP DY PT + PG S N + P+++
Sbjct: 295 DGISYMNNSLKIRRPMDYIVPQMPTDDGSYVPGVISTNF--------------TDTPNKI 340
Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALN 357
+GGLP Y + Q+ ELL+SFG L F+L+KD T SKG+ FC Y DP VTDIAC LN
Sbjct: 341 HIGGLPTYLDDEQVIELLKSFGELKAFNLIKDAATNESKGFAFCEYVDPDVTDIACEGLN 400
Query: 358 GLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGET 417
G+++GDK L V+RA+ T+Q+ I +I +A + +
Sbjct: 401 GMELGDKILVVKRASIG----TKQKPISTSGGGIASITMLAEEEGQLRP----------- 445
Query: 418 LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKV 477
+VL + +T + L DD+EYEEI ED+R+EC KYG ++++ IPR GVGKV
Sbjct: 446 -TRVLQMFNMVTPEELQDDDEYEEISEDIRDECSKYGKVLDLKIPRGIGGSRSNFGVGKV 504
Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ + + C A L+GRKF TV +YPE+ Y
Sbjct: 505 YVRFETEMSCLKAMKDLAGRKFSDRTVLTSFYPEENY 541
>gi|389745686|gb|EIM86867.1| hypothetical protein STEHIDRAFT_57258 [Stereum hirsutum FP-91666
SS1]
Length = 417
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 137/364 (37%), Positives = 199/364 (54%), Gaps = 27/364 (7%)
Query: 156 GAFPLMPVQV------MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
G P MPVQ + +R +RR+Y+G + P EQ + FF+ M + + P
Sbjct: 39 GLPPPMPVQTFGMGMGVNPNLSRQSRRLYIGSITPDITEQNLTDFFNSKMIEMNIGTGAP 98
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY G
Sbjct: 99 GPPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFVNGPLKIRRPKDYG-------GM 151
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
P+P L++ G+ S + + P ++FVGGLP Y E Q+ ELL+SFG L F+LV++
Sbjct: 152 EMPAPPLHVP--GVVSTNV--PDSPHKIFVGGLPSYLNEEQVMELLKSFGDLKAFNLVRE 207
Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
G SKG+ F Y DP VTD+A +L+G+++GD+ L V+RA+ ++ L
Sbjct: 208 NGNGPSKGFAFFEYVDPEVTDVAIQSLSGMELGDRYLVVQRASVGAKAGQPGMPNLPY-D 266
Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
Q I + L G S +++L + +T + L DD EY ++LED+REE
Sbjct: 267 QFPEIPR--------PILPAGASDLSSANSRILLMLNMVTPEDLIDDSEYADLLEDIREE 318
Query: 450 CGKYGTLVNVVIPRPD-QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
YG + +V IPRPD Q E GVG+V++ Y DA G A AL+GR F G ++ A
Sbjct: 319 VANYGDVDDVRIPRPDAQRADEAAGVGRVYVRYKDAEGAAKGMQALAGRSFAGRSIIATV 378
Query: 509 YPED 512
ED
Sbjct: 379 LSED 382
>gi|384496094|gb|EIE86585.1| hypothetical protein RO3G_11296 [Rhizopus delemar RA 99-880]
Length = 502
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 132/358 (36%), Positives = 192/358 (53%), Gaps = 37/358 (10%)
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
P Q M + ++++YVG +P +E + FF+ + + VV V INHE
Sbjct: 182 PPQRMEDATPKQSKKLYVGQIPSTTDEVTLCDFFNATIR----HELQDKTPVVGVQINHE 237
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K FAF+E T ++A+ M LDGI F+G +++RRP Y P P + P
Sbjct: 238 KNFAFIEFHTSQQATACMVLDGISFQGNTLKIRRPNHYQP-------PEEQVP------- 283
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
GL++ + P++VFVGGLP Y T+ Q+ ELL SFG L F+LVKD TG +KG+ FC
Sbjct: 284 GLSTNV---PDTPNKVFVGGLPVYLTDNQVMELLTSFGELRAFNLVKDTATGANKGFAFC 340
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
Y DP+VTD+AC LNG+++GDK L V+RA+ + HI M+
Sbjct: 341 EYADPSVTDLACQGLNGMELGDKKLIVQRASVGAK--------------HIPPDYMSGPI 386
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
N + S E +VL L ++ + L DDEEY++I ED+ EEC K+G +V++ I
Sbjct: 387 LPANYVPV-TSAKEEDATRVLQLMNMVSPEELEDDEEYQDIWEDIAEECAKFGNIVDMKI 445
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
P+P Q + PG G +F+ Y A AL+GRKF TV A + E Y + ++
Sbjct: 446 PKP-QKDQQVPGCGLIFVRYETTDETLAALRALAGRKFADRTVVATFVDEQNYLSDNF 502
>gi|395331854|gb|EJF64234.1| hypothetical protein DICSQDRAFT_144911 [Dichomitus squalens
LYAD-421 SS1]
Length = 587
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 154/468 (32%), Positives = 239/468 (51%), Gaps = 56/468 (11%)
Query: 71 DYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR--SGFDMAPPAAAMLPGAA 128
D D+ R +HR + +R RS + P +P ++R SG+D+ P A
Sbjct: 126 DERGDRRGRGKHREGLGTPER---RSPT-PPDAAPLSQRKRKASGWDVHAPGYEQY--TA 179
Query: 129 VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQV------MTQQATRHARRVYVGGL 182
+ + G+ +P + +P G P MPV + +R +RR+Y+G +
Sbjct: 180 MQAKQTGL-FNLPGANRTQIPPILAIPGLPPPMPVSTFGMGTGVNPNLSRQSRRLYIGSI 238
Query: 183 PPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALD 242
P NEQ + FF+ M + + PG+ V+ V N+EK +AFVE R+ E+A+ AMA D
Sbjct: 239 TPDINEQNLTDFFNSKMKEMNLGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFD 298
Query: 243 GIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGL 302
GIIF +++RRP DY GP +PN+++ G+ S + + +++FVGGL
Sbjct: 299 GIIFLNGPLKIRRPKDYG-------GPDVIAPNMHVP--GVVSTNV--PDSANKIFVGGL 347
Query: 303 PYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMG 362
P Y E Q+ ELL SFG L F+LV++ G SKG+ F Y DP+VTD+A +L+G+++G
Sbjct: 348 PTYLNEEQVMELLSSFGELKAFNLVRENGNGPSKGFAFFEYVDPSVTDVAIQSLSGMELG 407
Query: 363 DKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVL 422
DK L V+RA+ +K Q I + I K L ++ E+ ++L
Sbjct: 408 DKYLVVQRASVG--AKPGQSPIPGMYDLNPEIPKPILPVGDLS----------ESQDRIL 455
Query: 423 CLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP------------------ 464
+ + + L DD+EY +ILED++EECGKYG + ++ IPRP
Sbjct: 456 LMLNMVVPEELQDDQEYADILEDVKEECGKYGEVEDLRIPRPVKKDKAKWGEGGRDSALA 515
Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
Q E GVG+V+++Y A A AL+GR F G ++ A +D
Sbjct: 516 QQRADEAAGVGRVYVKYASPRSAANALKALAGRSFAGRSIIATLLSDD 563
>gi|391337926|ref|XP_003743315.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Metaseiulus
occidentalis]
Length = 430
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 138/371 (37%), Positives = 194/371 (52%), Gaps = 59/371 (15%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
V+ TR ARR+YVG +P EQ + +F+ M A A G+ V+ IN +K F
Sbjct: 103 VIGSTITRQARRLYVGNIPFGCTEQEMIDYFNVQMHACAFAQAQ-GNPVLACQINMDKNF 161
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
AF+E R+++E S AM+ DGI F+G ++++RRP DY P + G P G+
Sbjct: 162 AFLEFRSIDETSAAMSFDGINFKGQSLKIRRPHDYQPMPGMSESQGSVIP-------GVV 214
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
S + + P +VF+GGLP Y E Q++ELL SFG L F+LVKD TG SKGY FC Y
Sbjct: 215 STVV--QDSPHKVFIGGLPNYLNEDQVRELLMSFGQLKAFNLVKDTATGLSKGYAFCEYA 272
Query: 345 DPAVTDIACAALNGLKMGD----------------KTLTVRRATASGQSKTEQESILAQA 388
+ +TD A A LNG+++GD + V+
Sbjct: 273 EVTITDDAIAGLNGMQLGDKKLIVQRASVGAKNSNMAVPVQ------------------- 313
Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMRE 448
+Q GM + G S +VLCL +T D L D+EEY++ILED+++
Sbjct: 314 ----------IQVPGMPNVPIGSS---GPATEVLCLMNLVTPDELRDEEEYDDILEDIQD 360
Query: 449 ECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
EC KYG + ++ IPRP Q G + PGVGKVF+E+ C A+ AL+GRKF V Y
Sbjct: 361 ECNKYGHVKSIEIPRPIQ-GVDVPGVGKVFVEFNSVADCQKAQQALTGRKFSNRVVVTSY 419
Query: 509 YPEDKYFNKDY 519
+ DKY + +
Sbjct: 420 FEPDKYHRRQF 430
>gi|302854386|ref|XP_002958701.1| hypothetical protein VOLCADRAFT_121741 [Volvox carteri f.
nagariensis]
gi|300255941|gb|EFJ40221.1| hypothetical protein VOLCADRAFT_121741 [Volvox carteri f.
nagariensis]
Length = 294
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 195/335 (58%), Gaps = 43/335 (12%)
Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
+E ++ F+ VM A G + PG V++ Y+N+EK+FAF+E R+VEE SNAMA DG+
Sbjct: 2 SEVSLTQLFNNVMMAAGATTQ-PGGPVISCYMNNEKRFAFLEFRSVEETSNAMAFDGLQC 60
Query: 247 EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYF 306
+G ++VRRP DYNP A LGP +PS +NLA +G+ + + +GP++VFVGGLP Y
Sbjct: 61 QGETLKVRRPHDYNPAAAKLLGPTEPSAKINLALLGVVNTLV--EDGPNKVFVGGLPGYL 118
Query: 307 TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTL 366
+E Q++++L++FG L F+LV DRDTG SKGYGFC Y DP +TD+A L+ L +G K L
Sbjct: 119 SEEQVRQILQAFGPLRAFNLVTDRDTGASKGYGFCEYADPNITDVAIQGLSALIVGGKPL 178
Query: 367 TVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTE 426
TVRRA +G++ ++++ Q Q +
Sbjct: 179 TVRRANTAGEASATLQTLIQQQQAAL---------------------------------- 204
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVI-PRPDQNGGETPGVGKVFLEYYDAV 485
L DD EY +++ED+ +E GKYG LV V I P G + PGVG V+L Y D
Sbjct: 205 -----DLVDDGEYMDLMEDVTQEVGKYGKLVGVEIPRPPPDGGPDPPGVGFVYLCYEDPR 259
Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
G A+ AL+GRKFG N A +Y ++ KD S
Sbjct: 260 GAERAQVALNGRKFGDNLAEATFYDRSRFDAKDLS 294
>gi|426244214|ref|XP_004015921.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Ovis aries]
Length = 471
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 162/421 (38%), Positives = 227/421 (53%), Gaps = 33/421 (7%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 79 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 134
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 135 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 193
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 194 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 246
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + G+ ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 247 VPGVVSTVVPGSA--HKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 304
Query: 340 FCVYQDPAVTD-IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
FC Y D VTD ++ A + +G R A +S + Q + +Q
Sbjct: 305 FCEYVDINVTDQVSPAPAHPALLGSPLRAGRGACSSSPFASTIN------QTPVTLQVPG 358
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+ECGKYG + +
Sbjct: 359 LMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECGKYGLVKS 410
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP +G E PG GK+F+E+ C A L+GRKF V Y D Y +D
Sbjct: 411 IEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRD 469
Query: 519 Y 519
+
Sbjct: 470 F 470
>gi|302685922|ref|XP_003032641.1| hypothetical protein SCHCODRAFT_75908 [Schizophyllum commune H4-8]
gi|300106335|gb|EFI97738.1| hypothetical protein SCHCODRAFT_75908 [Schizophyllum commune H4-8]
Length = 556
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 131/356 (36%), Positives = 203/356 (57%), Gaps = 43/356 (12%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R +RR+Y+G + P NE +A FF+ MT + + GPG+ V+ V N+EK +AFVE R
Sbjct: 190 SRQSRRLYIGSITPEINEHNLAEFFNSKMTEMNIGTGGPGNPVLAVQCNYEKNYAFVEFR 249
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+ ++A+ AMA DGIIF +++RRP DY+ ++A+A P +++ G+ S +
Sbjct: 250 SADDATAAMAFDGIIFLNGPLKIRRPKDYDISVASA-------PMIHVP--GIISTNV-- 298
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK-GYGFCVYQDPAVT 349
+ +++FVGGLP Y E Q++ELL SFG L F+LV++ TG SK GY F Y DP VT
Sbjct: 299 PDSANKIFVGGLPAYLNEEQVQELLTSFGELKAFNLVRETGTGASKQGYAFFEYVDPNVT 358
Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
D+A +LNG+++GD+ L V+RA+ + T A+ AI K + +T G
Sbjct: 359 DVAIQSLNGMELGDRFLVVQRASVGAKDGTIPN---LPAELMPAIPKPIMPAGQTDTSGD 415
Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP----- 464
A+VL + +T D L DD+EY ++LED++EEC K+G + ++ +PRP
Sbjct: 416 ---------ARVLLMLNMVTPDDLVDDDEYGDLLEDIKEECSKFGPVEDLRVPRPVKKEK 466
Query: 465 --------------DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
Q E GVG+V++++ DA A A +L+GR F G ++ A
Sbjct: 467 KWAPGEGGREAAVEAQRADEAAGVGRVYVKFVDAKDAAVALKSLAGRSFAGRSIIA 522
>gi|390479436|ref|XP_002762565.2| PREDICTED: splicing factor U2AF 65 kDa subunit [Callithrix jacchus]
Length = 453
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 151/377 (40%), Positives = 214/377 (56%), Gaps = 37/377 (9%)
Query: 107 KSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 70 KKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT-------- 120
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQD 179
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVP 232
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC
Sbjct: 233 GVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFC 290
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L +
Sbjct: 291 EYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMS 346
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ I
Sbjct: 347 SQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEI 398
Query: 462 PRPDQNGGETPGVGKVF 478
PRP +G E PG GK +
Sbjct: 399 PRP-VDGVEVPGCGKAW 414
>gi|170088030|ref|XP_001875238.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164650438|gb|EDR14679.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 370
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/367 (35%), Positives = 198/367 (53%), Gaps = 34/367 (9%)
Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
G P MPVQ +R +RR+Y+G + P NEQ +A FF+ M + + GP
Sbjct: 4 GLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITPEVNEQNLADFFNSKMIEMSIGTGGP 63
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G+ V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY
Sbjct: 64 GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG--------- 114
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
+ +A+ G+ + PD ++FVGGLP Y E Q+ ELL+SFG L F+L
Sbjct: 115 -----GMEIASPGVHVPGVVSTNVPDSINKIFVGGLPTYLNEEQVMELLKSFGDLKAFNL 169
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
V++ G SKG+ F Y D VTD+A +LNG+++GD+ L V+RA+ + T
Sbjct: 170 VRENGNGPSKGFAFFEYVDVGVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNL 229
Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
Q I + + +G + T +++L + +T D L DD+EY ++ ED+
Sbjct: 230 PYDQFPEIPR-PIMPAGKDP---------ATDSRILLMLNMVTPDDLTDDQEYGDLYEDV 279
Query: 447 REECGKYGTLVNVVIPRPDQ-NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
+EEC YG + ++ IPRPD E GVG+V+++Y D+ A N L+GR F G ++
Sbjct: 280 KEECSNYGAVEDLRIPRPDAVRLDEASGVGRVYVKYKDSESATAALNNLAGRSFAGRSII 339
Query: 506 AFYYPED 512
A ED
Sbjct: 340 ATLLSED 346
>gi|194756144|ref|XP_001960339.1| GF13310 [Drosophila ananassae]
gi|190621637|gb|EDV37161.1| GF13310 [Drosophila ananassae]
Length = 434
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 140/362 (38%), Positives = 205/362 (56%), Gaps = 33/362 (9%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP-GDAVVNVYINHEKKFAFVE 228
TR ARR+YVG +P E+ + FF+Q + A+G +S G AV+ N EK FAF+E
Sbjct: 93 VTRQARRLYVGNIPFGVTEEEMMGFFNQQLIALGSSSLKTDGKAVLTCQTNLEKNFAFLE 152
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALG-PG-----QPSPNLNLAAVG 282
R+++EA+ A+ DGI+F G +++RRP DY+P + + PG SP + ++ G
Sbjct: 153 FRSMDEATQAINFDGIVFRGQTLKIRRPHDYHPVASISCSEPGFATTTMTSPQIVVSTTG 212
Query: 283 ---LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
+ S + + P ++++GGLP ETQIKELL SFG L GF+LVKD +T SKG+
Sbjct: 213 PNHVISTLV--PDSPQKIYIGGLPTCLNETQIKELLLSFGQLKGFNLVKDANTSLSKGFA 270
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
F Y DP VT+ A A LNG+++GD+ L V+R+ A G++ + L
Sbjct: 271 FFEYVDPLVTEQAIAGLNGMQLGDRKLVVQRSIAGGRNSG-------------GVPATVL 317
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
Q G+ + E+ +VLCL + + L DDEEYE+I D+++EC KYG + ++
Sbjct: 318 QVPGLTAIPN-----TESPTEVLCLLNMVLPEELLDDEEYEDIRTDIQQECAKYGDVRSL 372
Query: 460 VIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
IPRP G++P G GKVF+++ C A ALSGRKF G V + DKY
Sbjct: 373 KIPRPIPK-GDSPKRGCGKVFVQFESVDDCQKAMRALSGRKFSGRIVMTSFSDPDKYLAD 431
Query: 518 DY 519
D+
Sbjct: 432 DF 433
>gi|124360614|gb|ABN08613.1| RNA-binding region RNP-1 (RNA recognition motif) [Medicago
truncatula]
Length = 257
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 128/199 (64%), Positives = 151/199 (75%), Gaps = 6/199 (3%)
Query: 49 RDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKS 108
R K Y+R+ RD+DR H DY+RD++ R+R+ + S S R +RS+S S S S S+
Sbjct: 58 RGKYDSYNRQRGRDYDR---HNDYDRDRDTRNRYGAHSKRSRR-ESRSRSRSRSPSQSEG 113
Query: 109 KRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
KR SGFDMAPPA + P V GQ+PG+ + QN P+G +Q+GA LM VQ MTQ
Sbjct: 114 KRTSGFDMAPPATGVTP--TVSGQMPGIAHMIQGATQNFSPYGISQIGALSLMQVQPMTQ 171
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGLPP ANEQ+IA+FFSQVM AIGGNSAG GD+VVNVYINHEKKFAFVE
Sbjct: 172 QATRHARRVYVGGLPPFANEQSIASFFSQVMIAIGGNSAGSGDSVVNVYINHEKKFAFVE 231
Query: 229 MRTVEEASNAMALDGIIFE 247
MRTVEEASNAMALDGI+FE
Sbjct: 232 MRTVEEASNAMALDGIVFE 250
>gi|410054709|ref|XP_003954504.1| PREDICTED: LOW QUALITY PROTEIN: splicing factor U2AF 65 kDa subunit
[Pan troglodytes]
Length = 394
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 196/331 (59%), Gaps = 23/331 (6%)
Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG 248
+A+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G
Sbjct: 86 EAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQG 144
Query: 249 VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTE 308
++++RRP DY P PG S N ++ G+ S + + ++F+GGLP Y +
Sbjct: 145 QSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLND 195
Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
Q+KELL SFG L F+LVKD TG SKGY FC Y D VTD A A LNG+++GDK L V
Sbjct: 196 DQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLV 255
Query: 369 RRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
+RA+ ++ T + Q + +Q L +S + +GG + +VLCL +
Sbjct: 256 QRASVGAKNAT----LSTINQTPVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMV 303
Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA 488
+ L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+E+ C
Sbjct: 304 LPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQ 362
Query: 489 TAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A L+GRKF V Y D Y +D+
Sbjct: 363 KAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 393
>gi|195429288|ref|XP_002062695.1| GK19586 [Drosophila willistoni]
gi|194158780|gb|EDW73681.1| GK19586 [Drosophila willistoni]
Length = 466
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 136/368 (36%), Positives = 195/368 (52%), Gaps = 48/368 (13%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
TR ARR+YVG +P ++ + FF+ + ++G G V+ N EK FAF+E
Sbjct: 129 VTRQARRLYVGNIPFGVTDKEMMNFFNVQLQSLGLKQFHDGTPVLTCQTNLEKNFAFLEF 188
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R++ E + A+A DG+ F G +++RRP DY+P + + +L VGL+ +
Sbjct: 189 RSMGETTQAIAFDGVNFRGQTLKIRRPHDYHPVTSLS----------SLETVGLSDTIVT 238
Query: 290 GA---------------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
A + P ++++G LP E QIKELL SFG L GF+LVKD +TG
Sbjct: 239 SAHTPVPMKDLVSTLVPDSPQKIYIGSLPPCLDEAQIKELLLSFGRLRGFNLVKDANTGM 298
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKGY F Y D AVT+ A A LNG+ +GD+ L V+R+ A G++ A H
Sbjct: 299 SKGYAFFEYVDSAVTEQAIAGLNGMLLGDRRLVVQRSIAGGRN----------ASNHSPA 348
Query: 395 QKMALQTSGMNTLGGGMSLFGETLA-KVLCLTEAITADALADDEEYEEILEDMREECGKY 453
LQ G S+F A ++LCL + + L DDEEYE+I D+++EC K+
Sbjct: 349 S--VLQVPGFP------SVFSTGAATEILCLLNMVQPEDLLDDEEYEDICVDIKQECDKH 400
Query: 454 GTLVNVVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
G + + IPRP G+TP G GKVF+ + C A NALSGRKF G V ++
Sbjct: 401 GKVKGLKIPRPLV--GKTPRAGCGKVFVRFESMEDCQKALNALSGRKFNGRIVVTSFFNL 458
Query: 512 DKYFNKDY 519
DKY ++
Sbjct: 459 DKYEKNEF 466
>gi|297302956|ref|XP_001119590.2| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial
[Macaca mulatta]
Length = 432
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 154/378 (40%), Positives = 215/378 (56%), Gaps = 30/378 (7%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
+S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411
Query: 460 VIPRPDQNGGETPGVGKV 477
IPR +G E PG GKV
Sbjct: 412 EIPR-LVDGVEVPGCGKV 428
>gi|387193280|gb|AFJ68695.1| splicing factor U2AF 65 kDa subunit [Nannochloropsis gaditana
CCMP526]
Length = 424
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 145/363 (39%), Positives = 204/363 (56%), Gaps = 41/363 (11%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
TQQ+TRHARRVYVGG A++ + FF+Q++ P VV + +N +K FAF
Sbjct: 82 TQQSTRHARRVYVGGNFGDASDFEVLAFFNQIINE-SLERPSPAGPVVAIQVNRQKHFAF 140
Query: 227 VEMRTVEEASNA-MALDGIIFEGVAVRVRRPTDYNPTL------------AAALGPGQPS 273
+E+ +V ++ M LDG+ F G ++V+RPTDY+P L A Q S
Sbjct: 141 LELNSVPLTTSVIMQLDGVPFRGNPLKVKRPTDYHPELLPLDTPPPPTLKVANFRALQAS 200
Query: 274 PNLNLAAVGL-ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
L +A+ GL A GA + P ++FVGGLPY+ T+ Q++ELL +FG L GFDL KD T
Sbjct: 201 GALPMASTGLTAPGANSVPDSPYKIFVGGLPYHVTDDQVRELLSAFGPLRGFDLKKDPAT 260
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
G SKGYGFC Y D AV D+A L+G+ +G KTLTV+ A AS Q + +Q
Sbjct: 261 GMSKGYGFCEYIDHAVGDVAIQGLHGMDLGGKTLTVKYALASQQLQQQQSMQQMLLSTTP 320
Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
A KVL L +T D L DD+EY+EI+ED+REE K
Sbjct: 321 A-------------------------TKVLVLANMVTPDELKDDQEYQEIVEDVREEVAK 355
Query: 453 YGTLVNVVIPR-PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
+G ++++VIPR + + +P VGK+F+EY ++ A +L GR+F G V A +Y E
Sbjct: 356 FGEVLSLVIPRPEEPSAPPSPAVGKIFVEYAESSQTKAAAQSLQGRRFAGRIVQASFYDE 415
Query: 512 DKY 514
+K+
Sbjct: 416 EKF 418
>gi|198456623|ref|XP_001360392.2| GA16338 [Drosophila pseudoobscura pseudoobscura]
gi|198135682|gb|EAL24967.2| GA16338 [Drosophila pseudoobscura pseudoobscura]
Length = 491
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 135/384 (35%), Positives = 196/384 (51%), Gaps = 47/384 (12%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFA 225
TR ARR+YVG +P E I FF+Q +G N G G AV++ N +K FA
Sbjct: 121 VTRQARRLYVGNIPFGVTEDDIMAFFNQQFLLLGDNCGGQLCLDGKAVLSCQANLDKNFA 180
Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT-----------LAAALGPGQPSP 274
F+E R+++EA+ A DGI F G +++RRP DY+P + A+G S
Sbjct: 181 FIEFRSMQEATQATTFDGISFRGQVLKIRRPHDYHPVGSVGAAAGAGSIPDAVGGCASSA 240
Query: 275 NLNLAAVGLASGAIGGA-------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
+ +G++G + P ++++GGLP ETQIKELL SFG L GF+LV
Sbjct: 241 AAKSRSSSAETGSLGSQAISNLVPDSPHKIYIGGLPTCLNETQIKELLLSFGQLRGFNLV 300
Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
KD T SKGY F Y DP +T+ A LNG+++GD+ L V+R+ SG+
Sbjct: 301 KDPSTTLSKGYAFFEYVDPLLTEQVIANLNGMQLGDRRLIVQRSIPSGRYA--------- 351
Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMR 447
Q I IQ L + + G++ +VLCL + + L D+EEYE+I D+
Sbjct: 352 GNQQIPIQVPGLVATSLTGSTAGLN----NATQVLCLLNMVLPEELLDNEEYEDIRADIE 407
Query: 448 EECGKYGTLVNVVIPRPD------------QNGGETPGVGKVFLEYYDAVGCATAKNALS 495
+EC KYG ++++ IPRP + G GKV++ + A ALS
Sbjct: 408 QECSKYGEVLSLKIPRPQVSGGEGEGEGGGDSATRPKGCGKVYVHFGSIEDSEKALGALS 467
Query: 496 GRKFGGNTVNAFYYPEDKYFNKDY 519
GRKF G V ++ DKY ++D+
Sbjct: 468 GRKFSGRIVIGSFFDRDKYLSEDF 491
>gi|449547880|gb|EMD38847.1| hypothetical protein CERSUDRAFT_81656 [Ceriporiopsis subvermispora
B]
Length = 476
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 134/381 (35%), Positives = 206/381 (54%), Gaps = 47/381 (12%)
Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
G P MPVQ +R +RR+Y+G + P NEQ +A FF+ M + + P
Sbjct: 95 GLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITPDINEQNLAEFFNGKMKEMDIGTGAP 154
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G+ V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY GP
Sbjct: 155 GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLNGPLKIRRPKDYG-------GP 207
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
+P +++ G+ S + + ++VFVGGLP Y E Q+ ELL+SFG L F+LV++
Sbjct: 208 DVLAPMMHVP--GVVSTNV--PDSANKVFVGGLPMYLNEEQVMELLKSFGELKAFNLVRE 263
Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
G SKG+ F Y DP+VTD+A +L+G+++GDK L V+RA+ +K Q I
Sbjct: 264 NGNGPSKGFAFFEYVDPSVTDVAIQSLSGMELGDKYLVVQRASVG--AKPGQSPIDEMYG 321
Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
I + + + +++ T +++L + + + L DD+EY +I ED+ EE
Sbjct: 322 SPAPIPRPIMPATDIDS----------TQSRILLMLNMVVPEELQDDQEYADIYEDITEE 371
Query: 450 CGKYGTLVNVVIPRP--------DQNG----------GETPGVGKVFLEYYDAVGCATAK 491
CG+YG + ++ IPRP +NG E GVG+V+++Y A A
Sbjct: 372 CGRYGAVEDLRIPRPVKRDKAKWGENGMDSARAAQLADEAAGVGRVYVKYAQPNSAANAL 431
Query: 492 NALSGRKFGGNTVNAFYYPED 512
AL+GR F G ++ A +D
Sbjct: 432 KALAGRSFAGRSIIATLLSDD 452
>gi|336368252|gb|EGN96595.1| hypothetical protein SERLA73DRAFT_93106 [Serpula lacrymans var.
lacrymans S7.3]
Length = 396
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 133/367 (36%), Positives = 201/367 (54%), Gaps = 31/367 (8%)
Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
G P MPVQ +R +RR+Y+G + P NEQ +A FF+ M + + P
Sbjct: 27 GLPPPMPVQTFGMGIGSNPNLSRQSRRLYIGSITPDVNEQNLADFFNSKMIEMSIGTGAP 86
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G+ V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY G
Sbjct: 87 GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG-------GV 139
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
+P++++ G+ S + + ++VFVGGLP Y E Q+ ELL+SFG L F+LV++
Sbjct: 140 DMSAPSVHVP--GVVSTNV--PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFNLVRE 195
Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
G SKG+ F Y D +VTD+A +LNG+++GD+ L V+RA+ + T
Sbjct: 196 NGNGPSKGFAFFEYVDISVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNLPYD 255
Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
Q I + + +G N+ A++L + +T D L DD+EY ++ ED++EE
Sbjct: 256 QFPEIPR-PIMPAGENSSAD---------ARILLMLNMVTPDDLTDDQEYGDLYEDVKEE 305
Query: 450 CGKYGTLVNVVIPRPDQNGG----ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
C YG + ++ IPRP E GVG+V+++Y DA A AL+GR F G ++
Sbjct: 306 CSVYGAVEDLRIPRPSAMDAIRQDEAAGVGRVYVKYIDADSANNALKALAGRSFAGRSII 365
Query: 506 AFYYPED 512
A ED
Sbjct: 366 ATLLSED 372
>gi|76779874|gb|AAI06135.1| U2af2 protein [Mus musculus]
Length = 307
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 193/326 (59%), Gaps = 23/326 (7%)
Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++
Sbjct: 4 FFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKI 62
Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
RRP DY P PG S N ++ G+ S + + ++F+GGLP Y + Q+KE
Sbjct: 63 RRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKE 113
Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
LL SFG L F+LVKD TG SKGY FC Y D VTD A A LNG+++GDK L V+RA+
Sbjct: 114 LLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASV 173
Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADAL 433
++ T + Q + +Q L +S + +GG + +VLCL + + L
Sbjct: 174 GAKNAT----LSTINQTPVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEEL 221
Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+E+ C A
Sbjct: 222 LDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQG 280
Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDY 519
L+GRKF V Y D Y +D+
Sbjct: 281 LTGRKFANRVVVTKYCDPDSYHRRDF 306
>gi|13938661|gb|AAH07487.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2 [Mus
musculus]
Length = 306
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 193/326 (59%), Gaps = 23/326 (7%)
Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++
Sbjct: 3 FFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKI 61
Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
RRP DY P PG S N ++ G+ S + + ++F+GGLP Y + Q+KE
Sbjct: 62 RRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKE 112
Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
LL SFG L F+LVKD TG SKGY FC Y D VTD A A LNG+++GDK L V+RA+
Sbjct: 113 LLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASV 172
Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADAL 433
++ T + Q + +Q L +S + +GG + +VLCL + + L
Sbjct: 173 GAKNAT----LSTINQTPVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEEL 220
Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+E+ C A
Sbjct: 221 LDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQG 279
Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDY 519
L+GRKF V Y D Y +D+
Sbjct: 280 LTGRKFANRVVVTKYCDPDSYHRRDF 305
>gi|403413555|emb|CCM00255.1| predicted protein [Fibroporia radiculosa]
Length = 582
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/381 (35%), Positives = 204/381 (53%), Gaps = 48/381 (12%)
Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
G P MPVQ +R +RR+Y+G + P NEQ +A FF+ M + + GP
Sbjct: 202 GLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITPDINEQNLADFFNSKMKEMSIGTGGP 261
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G+ V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY
Sbjct: 262 GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG--------- 312
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
G S ++ G+ S + + +++FVGGLP Y E Q+ ELL+SFG L F+LV++
Sbjct: 313 GMDSIAPSMHVPGVVSTNV--PDSINKIFVGGLPTYLNEEQVMELLKSFGDLKAFNLVRE 370
Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
G SKG+ F Y DP VTD+A +L+G+++GDK L V+RA+ +K Q I
Sbjct: 371 NGNGPSKGFAFFEYVDPGVTDVAIQSLSGMELGDKFLVVQRASVG--AKPGQPPIPGLYD 428
Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
Q + I + L + T A++L + + + L DD+EY ++ ED++EE
Sbjct: 429 Q-VEIPRPILPAGDVEG----------TDARILLMLNMVVPEDLTDDQEYADVYEDVKEE 477
Query: 450 CGKYGTLVNVVIPRPDQ-------NGG-----------ETPGVGKVFLEYYDAVGCATAK 491
C KYG + ++ IPRP + GG E GVG+V++++ ++ A A
Sbjct: 478 CSKYGLVEDLRIPRPVKRDKAKWGEGGHESAITAQRIDEAAGVGRVYVKFTESYSAAQAL 537
Query: 492 NALSGRKFGGNTVNAFYYPED 512
AL+GR F G ++ A ED
Sbjct: 538 KALAGRSFAGRSIIATLLSED 558
>gi|195149862|ref|XP_002015874.1| GL11290 [Drosophila persimilis]
gi|194109721|gb|EDW31764.1| GL11290 [Drosophila persimilis]
Length = 487
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 136/389 (34%), Positives = 199/389 (51%), Gaps = 57/389 (14%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFA 225
TR ARR+YVG +P E I FF+Q +G + G G AV++ N +K FA
Sbjct: 117 VTRQARRLYVGNIPFGVTEDDIMAFFNQQFLLLGDDCGGQLCLDGKAVLSCQANLDKNFA 176
Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT-----------LAAALGPGQPSP 274
F+E R+++EA+ A DGI F G +++RRP DY+P + A+G S
Sbjct: 177 FIEFRSMQEATQATTFDGISFRGQVLKIRRPHDYHPVGSVGAAAGAGSIPDAVGGCASSA 236
Query: 275 NLNLAAVGLASGAIGGA-------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
+ +G++G + P ++++GGLP ETQIKELL SFG L GF+LV
Sbjct: 237 AAKSRSSSADTGSLGSQAISNLVPDSPHKIYIGGLPTCLNETQIKELLLSFGQLRGFNLV 296
Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
KD T SKGY F Y DP +T+ A LNG+++GD+ L V+R+ SG
Sbjct: 297 KDPSTTLSKGYAFFEYVDPLLTEQVIANLNGMQLGDRRLIVQRSIPSG------------ 344
Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGET-----LAKVLCLTEAITADALADDEEYEEI 442
++ IQ++ +Q G+ SL G T +VLCL + + L D+EEYE+I
Sbjct: 345 --RYAGIQQIPIQVPGLV----ATSLTGSTAGLNNATQVLCLLNMVLPEELLDNEEYEDI 398
Query: 443 LEDMREECGKYGTLVNVVIPRPD------------QNGGETPGVGKVFLEYYDAVGCATA 490
D+ +EC KYG ++++ IPRP + G GKV++ + A
Sbjct: 399 RADIEQECSKYGEVLSLKIPRPQASGGEGEGGGGGDSATRPKGCGKVYVHFGTIEDSEKA 458
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
ALSGRKF G V ++ DKY ++D+
Sbjct: 459 LGALSGRKFSGRIVIGSFFDRDKYLSEDF 487
>gi|409040470|gb|EKM49957.1| hypothetical protein PHACADRAFT_264412 [Phanerochaete carnosa
HHB-10118-sp]
Length = 575
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 150/450 (33%), Positives = 233/450 (51%), Gaps = 55/450 (12%)
Query: 81 RHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPGAAVPGQLPGVPSA 139
+HR + +R RS + S + S S+ KR+ SG+D+ P A+ + G+
Sbjct: 127 KHREGLGTPER---RSPTPSDAVSLSQRKRKASGWDIHAPGYEQY--TAMQAKQTGL-FN 180
Query: 140 VPEMAQNMLPFGATQLGAFPLMPVQV------MTQQATRHARRVYVGGLPPLANEQAIAT 193
+P + +P G P MPVQ + +R +RR+Y+G + P NEQ +A
Sbjct: 181 LPGANRTQIPPILAVPGLPPPMPVQSFGMGMGVNPNLSRQSRRLYIGSITPEINEQNLAD 240
Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
FF++ M + + PG+ V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +++
Sbjct: 241 FFNEKMKEMSIGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLSGPLKI 300
Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
RRP DY G +P++++ G+ S + + ++VFVGGLP Y E Q+ E
Sbjct: 301 RRPKDYG-------GSENLAPSMHVP--GVVSTNV--PDSINKVFVGGLPPYLNEEQVME 349
Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
LL SFG L F+LV++ G SKG+ F Y DPAVTD+A +LN +++GDK L V+RA+
Sbjct: 350 LLTSFGDLKAFNLVRENGNGPSKGFAFFEYVDPAVTDVAIQSLNEMELGDKYLVVQRASV 409
Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADAL 433
++ I QA I K L + + A++L + + + L
Sbjct: 410 GAKNG----QIPPQALYPTEIPKPILPAGDLEGVE----------ARILLMLNMVVPEDL 455
Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRP-----------------DQNGGETPGVGK 476
DD+EY +I ED+++EC K+G + ++ IPRP Q E GVG+
Sbjct: 456 NDDQEYADIYEDVKDECEKHGPIEDLRIPRPVKKDKAKWGESGLDPLSAQRVDEAAGVGR 515
Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
V++ + A G A AL+GR F G ++ A
Sbjct: 516 VYVRFVGADGAKRALKALAGRSFAGRSIIA 545
>gi|194884971|ref|XP_001976363.1| GG20057 [Drosophila erecta]
gi|190659550|gb|EDV56763.1| GG20057 [Drosophila erecta]
Length = 440
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 199/362 (54%), Gaps = 38/362 (10%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIG--GNSAGPGDAVVNVYINHEKKFAFVE 228
TR ARR+YVG +P E+ + FF+Q + A+G G G AV+ N EK FAF+E
Sbjct: 101 TRQARRLYVGNIPFGVTEEEMMKFFNQQLLALGLAGAQYMDGKAVLTCQTNLEKNFAFLE 160
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALG----------PGQPSPNLNL 278
R+++EA+ A+ DGI+F G +++RRP DY P + + P + L
Sbjct: 161 FRSMDEATQALNFDGILFRGQVLKIRRPHDYQPVPSIRVSAMESYRSFRLPDNTVTHPPL 220
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A + L+S + P+++FVGGLP + QI++LL+SFG L +LVKD +T SKG+
Sbjct: 221 ATIPLSSIV---PDSPNKIFVGGLPTCLGQDQIRDLLQSFGELKRLNLVKDTNTCLSKGF 277
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
F Y DP VTD A A L+G+++G++ L V+R+ G + ++ Q+
Sbjct: 278 AFFEYFDPTVTDHAIAGLHGMQLGNRRLVVQRSIPGG-------------KHAVSGQQPL 324
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
+Q G++TL L + +++CL + + L D+EE+E+I D+ +EC KYG + +
Sbjct: 325 VQVPGISTL-----LDPGSPTEIICLLNMVLPEELLDNEEFEDIRTDIEQECAKYGEVRS 379
Query: 459 VVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ IPRP G+ P G GKVF+++ A ALSGRKF G V +Y +KY
Sbjct: 380 IKIPRPI---GQAPKRGCGKVFVQFESVEDSQRALKALSGRKFSGRIVMTSFYDPEKYLL 436
Query: 517 KD 518
D
Sbjct: 437 DD 438
>gi|324503285|gb|ADY41429.1| Splicing factor U2AF 65 kDa subunit [Ascaris suum]
Length = 522
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 138/357 (38%), Positives = 201/357 (56%), Gaps = 32/357 (8%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ T +RR+YVG +P +E A+ FF+Q M + G + PG+ V+ +N +K
Sbjct: 198 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQMH-LCGLAQAPGNPVLACQMNLDK 256
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+++E + MA DGI F G +++RRP DY P ++ + G N+
Sbjct: 257 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQP-MSTSYDMG------NMMVSN 309
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+ A+ P ++F+GGLP Y Q+KELL SFG L F+LV D TG SKGY F
Sbjct: 310 IV------ADSPYKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTDVSTGVSKGYAFAE 363
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y DP++TD A A LNG+++GDK L V+ + A+ ++ S A Q +Q +
Sbjct: 364 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARAAM---STTAFPQ---------IQVA 411
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
G++ G +VLCL +T + L +DEEYE+ILED+REEC KYG + ++ +P
Sbjct: 412 GIDLSHG-----AGPPTEVLCLMNMVTEEELKEDEEYEDILEDIREECAKYGFVKSIEVP 466
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
R G + GVGKVF+E+ C A+ AL+GRKF TV YY D Y + +
Sbjct: 467 R-SIPGVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDLYHRRQF 522
>gi|392589921|gb|EIW79251.1| hypothetical protein CONPUDRAFT_83522 [Coniophora puteana
RWD-64-598 SS2]
Length = 411
Score = 218 bits (554), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 133/384 (34%), Positives = 201/384 (52%), Gaps = 51/384 (13%)
Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
G P +PVQ +R +RR+Y+G + P NEQ +A FF+ M + + GP
Sbjct: 27 GLPPPIPVQTFGMGIGSNPNLSRQSRRLYIGSITPDVNEQNLADFFNGKMIEMSIGTGGP 86
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
G+ V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY G
Sbjct: 87 GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG-------GD 139
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
+PN ++ G+ S + PD ++FVGGLP Y E Q+ ELL+SFG L F+L
Sbjct: 140 AIMAPNFHVP--GVVSTNV-----PDSIHKIFVGGLPPYLNEEQVMELLKSFGELKAFNL 192
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
V++ G SKG+ F Y D +VTD+A +LNG+++GD+ L V+RA+ + T
Sbjct: 193 VRENGNGPSKGFAFFEYVDSSVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNL 252
Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
Q I + + +T A++L + +T D L DD+EY +I ED+
Sbjct: 253 PYDQFPEIPRPIMPAGDGST----------EDARILLMLNMVTVDDLQDDDEYGDIYEDV 302
Query: 447 REECGKYGTLVNVVIPRPDQN------------------GGETPGVGKVFLEYYDAVGCA 488
+EEC K+G + ++ IPRP + E GVG+V++++ D G
Sbjct: 303 KEECSKHGAVEDLRIPRPIKKDKSKWGDAGQQSQTDAARADEAAGVGRVYVKFVDGDGAQ 362
Query: 489 TAKNALSGRKFGGNTVNAFYYPED 512
A +L+GR F G ++ A ED
Sbjct: 363 RAMKSLAGRSFAGRSIIATVLSED 386
>gi|336381013|gb|EGO22165.1| hypothetical protein SERLADRAFT_371639 [Serpula lacrymans var.
lacrymans S7.9]
Length = 381
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/378 (34%), Positives = 200/378 (52%), Gaps = 47/378 (12%)
Query: 161 MPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
MPVQ +R +RR+Y+G + P NEQ +A FF+ M + + PG+ V+
Sbjct: 1 MPVQTFGMGIGSNPNLSRQSRRLYIGSITPDVNEQNLADFFNSKMIEMSIGTGAPGNPVL 60
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY G +P
Sbjct: 61 AVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG-------GVDMSAP 113
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
++++ G+ S + + ++VFVGGLP Y E Q+ ELL+SFG L F+LV++ G
Sbjct: 114 SVHVP--GVVSTNV--PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFNLVRENGNGP 169
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKG+ F Y D +VTD+A +LNG+++GD+ L V+RA+ + T Q I
Sbjct: 170 SKGFAFFEYVDISVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNLPYDQFPEI 229
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
+ + +G N+ A++L + +T D L DD+EY ++ ED++EEC YG
Sbjct: 230 PR-PIMPAGENSSAD---------ARILLMLNMVTPDDLTDDQEYGDLYEDVKEECSVYG 279
Query: 455 TLVNVVIPRPDQNG--------------------GETPGVGKVFLEYYDAVGCATAKNAL 494
+ ++ IPRP + E GVG+V+++Y DA A AL
Sbjct: 280 AVEDLRIPRPVKKDKSKWAPGEVGHQSAMDAIRQDEAAGVGRVYVKYIDADSANNALKAL 339
Query: 495 SGRKFGGNTVNAFYYPED 512
+GR F G ++ A ED
Sbjct: 340 AGRSFAGRSIIATLLSED 357
>gi|195393580|ref|XP_002055432.1| GJ19364 [Drosophila virilis]
gi|194149942|gb|EDW65633.1| GJ19364 [Drosophila virilis]
Length = 476
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 150/444 (33%), Positives = 217/444 (48%), Gaps = 88/444 (19%)
Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
GQ+P S VP+ Q +P V+ TR ARR+YVG +P E+
Sbjct: 66 GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+Q M +G A G V+ IN +K FAF+E R+++E + AMA DGI +G
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQD 167
Query: 251 VRVR-----------------RPTDYNPTLAAALGPGQPS-------PN----------- 275
+++R +P + + + + P P PN
Sbjct: 168 LKIRRPHDYQPMPGITDTPAVKPAVVSSGVISTVVPDSPHKIFIGGLPNYLNDEQKEFTL 227
Query: 276 ---LNLAAVGLAS-----GAIGGAEGPDRVF------------VGGLPYYFTETQIKELL 315
L++ A + GAI E R+ L T +KELL
Sbjct: 228 NAFLDIGACKKVTPHTNTGAIASLEVDPRIVNLIDELLIRRTVKASLGSSATSRFVKELL 287
Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
SFG L F+LVKD TG SKGY FC Y D ++TD + A LNG+++GDK L V+RA+
Sbjct: 288 LSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLIVQRASVGA 347
Query: 376 QSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALAD 435
++ AQ + + Q + LQ G++T+ + +VLCL +T D L D
Sbjct: 348 KN--------AQNAANTS-QSVMLQVPGLSTV-----VTSGPPTEVLCLLNMVTPDELRD 393
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+ + C A+ AL+
Sbjct: 394 EEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDCQKAQQALT 452
Query: 496 GRKFGGNTVNAFYYPEDKYFNKDY 519
GRKF V Y+ DKY +++
Sbjct: 453 GRKFSDRVVVTSYFDPDKYHRREF 476
>gi|195057468|ref|XP_001995263.1| GH23055 [Drosophila grimshawi]
gi|193899469|gb|EDV98335.1| GH23055 [Drosophila grimshawi]
Length = 453
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/363 (34%), Positives = 198/363 (54%), Gaps = 46/363 (12%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
TR ARR+YVG +P + + FF+ + + G G AV+ N EK FAF+E+
Sbjct: 123 VTRQARRLYVGNIPFNTTDDEMRAFFNVQIQRMCGALENDGKAVLTCQTNLEKNFAFLEL 182
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNP--TLAAALGPGQPSPNLNLAAVGLASGA 287
R+++E + A++ DGI + G ++++RRP DY+ T + +G A G SGA
Sbjct: 183 RSMDETTLAISFDGINYRGQSLKIRRPHDYHAGGTTGSFVG-----------ATGYVSGA 231
Query: 288 IGGA---------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
+ + + P ++++GGLP + Q+KELL +FG L GF++VKD + G+ KGY
Sbjct: 232 VVQSNAAIATVVPDTPHKIYIGGLPTCLNDDQVKELLMTFGHLRGFNMVKD-ELGHGKGY 290
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
FC Y D ++T+ A A LNG+++G++ L V+R+ A ++ + + Q A K+
Sbjct: 291 AFCEYMDASITEQAIAGLNGMQLGERKLIVQRSLAGVRNLVTHQLPVLQVPGFPADVKVG 350
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
T +VLCL + D L DD EYE+I +D++EEC K+G +++
Sbjct: 351 KAT------------------EVLCLLNMVMPDELLDDAEYEDIRKDIKEECAKFGKVIS 392
Query: 459 VVIPRPDQNGGET--PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ IPRP GE+ PG GKV++ + C A ALSGR+F G V +Y +K+
Sbjct: 393 IKIPRP---FGESPQPGCGKVYVRFETTDVCKKALKALSGRRFSGRIVMTSFYDPNKFKR 449
Query: 517 KDY 519
KD+
Sbjct: 450 KDF 452
>gi|358059688|dbj|GAA94557.1| hypothetical protein E5Q_01209 [Mixia osmundae IAM 14324]
Length = 564
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 155/458 (33%), Positives = 228/458 (49%), Gaps = 61/458 (13%)
Query: 93 RNRSKSLSPSRSPSKSKRRS----------GFDMAPPAAAMLPGA-AVPGQLPGVPSAVP 141
R + +P + SKRR GF+ A + G +PGQ GV P
Sbjct: 133 REEVREKTPENTIPISKRRRAQTAWDVRPIGFETVSAETARMSGHFLLPGQ-NGVVRFPP 191
Query: 142 EMAQNMLPFGATQL-----GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
+ FG + A P+ VQ + A R RR+YVG + P A+EQ + FF+
Sbjct: 192 GFHEGRGAFGGLNMSGAGSAAAPMGGVQPIISFA-RQQRRLYVGNIMPTADEQNVTEFFN 250
Query: 197 QVMTAIGGN------SAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
M G + D VV+V +NHEK +AFVE R+ EEAS+AM+ DGI+F+
Sbjct: 251 AKMRENGLSLDDKKVDVQTADPVVSVQVNHEKSYAFVEFRSPEEASSAMSFDGIVFQDQQ 310
Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
+++RRP DY G S +L V ++S + P++VFVGGLP Y + Q
Sbjct: 311 LKIRRPKDYT---------GDESGGTHLPGV-ISSNV---PDTPNKVFVGGLPSYLDDEQ 357
Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
+ ELL SFG L F+LVK+ SKG+ FC Y DP VTD ACA LNG+++GD+ L V+R
Sbjct: 358 VLELLSSFGELRSFNLVKEGPQNASKGFAFCEYADPNVTDAACAGLNGMEIGDRYLVVQR 417
Query: 371 ATASGQSKTEQESILAQAQQHI--AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLT--E 426
A G + + + + A+ ++A G + ET CL
Sbjct: 418 AQV-GANVYKHPGGYGGSNPALPPALARVAPTIFGQD----------ETAPATRCLQMLN 466
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP--DQNG-------GETPGVGKV 477
+T + L DD++Y +I ED+++EC KYG +++V IPRP +NG +GK+
Sbjct: 467 MVTPEELVDDQDYADINEDIKDECSKYGEVIDVKIPRPIKTENGRMDVKASESVEHLGKI 526
Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
F+ + A +A++GR+FGG V Y E+ +
Sbjct: 527 FVMFDSTESSKKAIDAIAGRQFGGRLVICAYEKEETFL 564
>gi|219116422|ref|XP_002179006.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409773|gb|EEC49704.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 325
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 186/348 (53%), Gaps = 32/348 (9%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
Q TRHARR+YVG LPP E AI F + + I + D V++ YINHE++F FVE
Sbjct: 2 QQTRHARRLYVGNLPPHITEDAIHVEFRRAI-EIASPTPLSEDPVLSTYINHERRFCFVE 60
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+TVE A+ M LDG+ +GV V+V+RP DYN +A + P P L+++ +G+ SG +
Sbjct: 61 FKTVEMATACMNLDGLHVQGVPVKVKRPNDYNANMAPKIHPSA-LPPLDVSKLGIVSGTV 119
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV-KDRDTGNSKGYGFCVYQDPA 347
+GP+++F+GGL Y+ ++Q+ ELL++FG + F LV D ++ SKGY F Y DP
Sbjct: 120 --EDGPNKIFIGGLHYHLQDSQVMELLQAFGKIKAFHLVSNDPESNMSKGYCFVEYADPN 177
Query: 348 VTDIACAALNGLKMGD-KTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
+T IA LNG+ +G+ K LT R A + + +
Sbjct: 178 ITPIAVQGLNGMDIGNGKALTARLAGDRTGGAGGAAFLAHAMDPQNGVPNIP-------- 229
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+VL L +T + LA D EY+ + +++++EC K+G L + IPR
Sbjct: 230 ------------TRVLVLHNMVTDEDLATDTEYQGLFDEVKDECAKFGRLERLEIPR--- 274
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ P KVFL Y A++ L GR+FG N V ++PE ++
Sbjct: 275 ---QGPAARKVFLGYVTVAEAMQAQHELQGRQFGPNVVQTTFFPESEF 319
>gi|346978171|gb|EGY21623.1| splicing factor U2AF 50 kDa subunit [Verticillium dahliae VdLs.17]
Length = 589
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 139/425 (32%), Positives = 212/425 (49%), Gaps = 61/425 (14%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P T+L AF P
Sbjct: 173 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPTKLQAFMNQP 224
Query: 163 VQV----MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
V + +R A+R+ V LP A E ++A+FF+ + G N D V+ +
Sbjct: 225 GTVNSASLKPSNSRQAKRLLVSKLPSSATEDSVASFFNLQLN--GLNVIESTDPCVSCQL 282
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFE---------GVAVRVRRPTDYNPTLAAALGP 269
+++K F VE R EA+ A+ALDGI E G + +RRP DY P
Sbjct: 283 SNDKSFCVVEFRNASEATVALALDGISMEADSATDGAAGRGLEIRRPKDYIVPAVTEELP 342
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
+P G+ S + + P+++ + G P Y TE Q+ ELL SFG L F LV+D
Sbjct: 343 YEP---------GVVSSNV--VDTPNKLSITGFPPYLTEEQVTELLTSFGELKAFVLVRD 391
Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
R T S+G+ FC Y D A D+A L+G+ +G+ L +++A+
Sbjct: 392 RHTDESRGFVFCEYVDSAANDVAIQGLSGMDLGNSKLKIQKAS----------------- 434
Query: 390 QHIAIQKMA---LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
I + ++A + + M+ L G + E ++VL L +TAD L D+E+YEEI+ED+
Sbjct: 435 --IGVTQVAGVEMGVAAMSMLAGTTATDSEE-SRVLQLLNMVTADELMDNEDYEEIVEDV 491
Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
+EEC KYGT++ V +PRP ++ GVGK+F++Y A +L+GRKF TV
Sbjct: 492 QEECAKYGTVLEVKVPRPVGGSRQSAGVGKIFVKYETKEATKKALQSLAGRKFADRTVVT 551
Query: 507 FYYPE 511
Y+PE
Sbjct: 552 TYFPE 556
>gi|85111663|ref|XP_964044.1| hypothetical protein NCU03039 [Neurospora crassa OR74A]
gi|28925805|gb|EAA34808.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 584
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 141/428 (32%), Positives = 223/428 (52%), Gaps = 59/428 (13%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P T+L AF P
Sbjct: 191 RKRRMTQWDIKPPGYGNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMTQP 242
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+ A +R ++R+ V +PP A ++++ FF+ + G N D V
Sbjct: 243 GGAVNSAALKPTNSRQSKRLIVSNIPPSATDESLLGFFNLQLN--GLNVIDSADPCVQCQ 300
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------VAVRVRRPTDYNPTLAAALG 268
I+ + FA +E R +A+ A+ALDGI E +++RRP DY + A+
Sbjct: 301 ISPDHSFAMLEFRNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDY---IVPAI- 356
Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
PN + + +S I + P+++ V +P Y +E QI ELL +FG L F LVK
Sbjct: 357 --VEDPNYDPDSEVPSSIVI---DSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVLVK 411
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D+ T S+G FC Y D +VT +A LN + +GD+ L V++A+
Sbjct: 412 DKHTEESRGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKAS---------------- 455
Query: 389 QQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
IQ++A L + M+ L G SL G+ +++V+ L +TAD L D+++YEEI +D+
Sbjct: 456 ---YGIQQVAGELSVNAMSMLAGTTSLDGD-VSRVVQLLNMVTADELMDNDDYEEIRDDV 511
Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
+EEC K+GT+V++ IPRP ++ GVGK+F++Y ++ A AL+GRKF TV A
Sbjct: 512 QEECEKFGTIVSLKIPRPTGGSRQSAGVGKIFIKYENSDQATKALKALAGRKFADRTVVA 571
Query: 507 FYYPEDKY 514
Y+PE+ +
Sbjct: 572 TYFPEENF 579
>gi|392565476|gb|EIW58653.1| hypothetical protein TRAVEDRAFT_37512 [Trametes versicolor
FP-101664 SS1]
Length = 548
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 151/461 (32%), Positives = 229/461 (49%), Gaps = 75/461 (16%)
Query: 91 RFRNRSKSLSPSRSPSKSKRR---SGFDMAPPA----AAM---------LPGA---AVPG 131
R +S +PS + S+R+ SG+D+ P AM LPGA +P
Sbjct: 100 RVMTTRRSPTPSDATPLSQRKRKASGWDVHAPGYEQYTAMQAKQTGLFNLPGANRTQIPP 159
Query: 132 QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
L +P P M N G G P +R +RR+Y+G + P NEQ +
Sbjct: 160 IL-AIPGLPPPMPVNTFGMGT---GVNP---------NLSRQSRRLYIGSITPDINEQNL 206
Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAV 251
FF+ M + + PG+ V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +
Sbjct: 207 TDFFNSKMKEMNLGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLNGPL 266
Query: 252 RVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI 311
++RRP DY G P N+ G+ S + + +++FVGGLP Y E Q+
Sbjct: 267 KIRRPKDY----------GGPDMLANMHVPGVVSTNV--PDSANKIFVGGLPTYLNEEQV 314
Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
ELL SFG L F+LV++ G SKG+ F Y DP+VTD+A +L+G+++GDK L V+RA
Sbjct: 315 MELLSSFGELKAFNLVRENGNGPSKGFAFFEYVDPSVTDVAIPSLSGMELGDKYLVVQRA 374
Query: 372 TA---SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
+ GQS + A + I M + + + ++L + +
Sbjct: 375 SVGAKPGQSPIPGMGMFDMAPE-IPKPIMPVGERDLEAMQD----------RILLMLNMV 423
Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-----------------DQNGGET 471
+ L+DD+EY ++ ED++EEC KYGT+ ++ IPRP Q E
Sbjct: 424 VPEELSDDQEYGDLYEDVKEECEKYGTVEDLRIPRPVKKDKAKWGEGRESAIAAQRADEA 483
Query: 472 PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
GVG+V+++++D A A AL+GR F G ++ A +D
Sbjct: 484 AGVGRVYVKFHDPRAAANALKALAGRSFAGRSIIATLLTDD 524
>gi|336465212|gb|EGO53452.1| hypothetical protein NEUTE1DRAFT_92746 [Neurospora tetrasperma FGSC
2508]
Length = 584
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 141/428 (32%), Positives = 223/428 (52%), Gaps = 59/428 (13%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P T+L AF P
Sbjct: 191 RKRRMTQWDIKPPGYGNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMTQP 242
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+ A +R ++R+ V +PP A ++++ FF+ + G N D V
Sbjct: 243 GGAVNSTALKPTNSRQSKRLIVSNIPPSATDESLLGFFNLQLN--GLNVIDSADPCVQCQ 300
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------VAVRVRRPTDYNPTLAAALG 268
I+ + FA +E R +A+ A+ALDGI E +++RRP DY + A+
Sbjct: 301 ISPDHSFAMLEFRNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDY---IVPAI- 356
Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
PN + + +S I + P+++ V +P Y +E QI ELL +FG L F LVK
Sbjct: 357 --VEDPNYDPDSEVPSSIVI---DSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVLVK 411
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D+ T S+G FC Y D +VT +A LN + +GD+ L V++A+
Sbjct: 412 DKHTEESRGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKAS---------------- 455
Query: 389 QQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
IQ++A L + M+ L G SL G+ +++V+ L +TAD L D+++YEEI +D+
Sbjct: 456 ---YGIQQVAGELSVNAMSMLAGTTSLDGD-VSRVVQLLNMVTADELMDNDDYEEIRDDV 511
Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
+EEC K+GT+V++ IPRP ++ GVGK+F++Y ++ A AL+GRKF TV A
Sbjct: 512 QEECEKFGTIVSLKIPRPTGGSRQSAGVGKIFIKYENSDQATKALKALAGRKFADRTVVA 571
Query: 507 FYYPEDKY 514
Y+PE+ +
Sbjct: 572 TYFPEENF 579
>gi|449680331|ref|XP_002158219.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like, partial [Hydra
magnipapillata]
Length = 259
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 120/293 (40%), Positives = 161/293 (54%), Gaps = 35/293 (11%)
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R+VEE + AMA DGI+ +G A+++RRP DY P + P G+ S +
Sbjct: 1 FRSVEETTLAMAFDGIMLQGQALKIRRPKDYQPIPGISEMQATHIP-------GVVSTVV 53
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
++ ++VFVGGLP Y E Q+KELL +FG L F+LVKD TG SKGY FC Y D +
Sbjct: 54 --SDTINKVFVGGLPNYLNEDQVKELLSTFGDLRSFNLVKDSATGLSKGYAFCEYVDIGI 111
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
TD+A A +NG+++GDK L V+RA+ ++ T Q +I
Sbjct: 112 TDVAIAGMNGMQLGDKKLVVQRASVGSKTMTAQLNI------------------------ 147
Query: 409 GGMSLFGE-TLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
G L E T +LCL + AD L DDE+Y+EI ED+REEC KYG + ++ IPRP+
Sbjct: 148 PGFDLSKEITATNILCLMNMVVADELIDDEDYDEIFEDIREECSKYGRIRSMQIPRPNHE 207
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
GVGKVF+EY + A AL+GRKF V YY D Y D+S
Sbjct: 208 -FLVSGVGKVFIEYATSEESKIASEALAGRKFANRVVVTAYYDPDAYHQHDFS 259
>gi|390596686|gb|EIN06087.1| hypothetical protein PUNSTDRAFT_145447 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 602
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 125/361 (34%), Positives = 190/361 (52%), Gaps = 53/361 (14%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R +RR+Y+G + P EQ +A FF+ M + + GPG+ V+ V N+EK +AFVE R
Sbjct: 240 SRQSRRLYIGSITPEITEQNLADFFNSKMIEMSIGTGGPGNPVLAVQCNYEKNYAFVEFR 299
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+ E+A+ AMA DGIIF +++RRP DY G P P GL +
Sbjct: 300 SAEDATAAMAFDGIIFLNGPLKIRRPKDY----------GGPDP----MGAGLHVPGVVS 345
Query: 291 AEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
PD +VFVGGLP Y E Q+ ELL SFG L F+LV++ G SKG+ F Y D +
Sbjct: 346 TNVPDSINKVFVGGLPAYLNEEQVMELLTSFGELKAFNLVRENGNGPSKGFAFFEYVDES 405
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
VTD+A ALNG+++GD+ L V+RA+ + +Q + + + ++
Sbjct: 406 VTDVAIQALNGMELGDRYLVVQRASVGAKPGMPN----LPYEQFPELPRPIMPAGDVSNR 461
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
A++L + + + L DD+EY +I ED++EEC K+G + ++ IPRP +
Sbjct: 462 D----------ARILLMLSMVVPEDLVDDQEYADICEDVKEECEKFGAVEDLRIPRPAKR 511
Query: 468 ---------GGETP-------------GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
GG+ P GVG+V++++ +A A AL+GR FGG ++
Sbjct: 512 DRTKWGEGAGGQAPMSAQDYQRMDEAAGVGRVYVKFREARAAGDALKALAGRSFGGRSII 571
Query: 506 A 506
A
Sbjct: 572 A 572
>gi|332375140|gb|AEE62711.1| unknown [Dendroctonus ponderosae]
Length = 374
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/305 (39%), Positives = 174/305 (57%), Gaps = 36/305 (11%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E + +F+Q M + G + G+ V+ I
Sbjct: 102 PQAAVPVVGSTITRQARRLYVGNIPFGVTEDEMMEYFNQQM-HLSGLAQAAGNPVLACQI 160
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY QP P ++
Sbjct: 161 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMSE 209
Query: 279 AAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
++ + +G I + P ++F+GGLP Y E Q+KELL SFG L F+LVKD G SK
Sbjct: 210 NSISVPAGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSK 269
Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
GY F Y D ++TD A A LNG+++GDK L V+RA+ ++ T +L Q +
Sbjct: 270 GYAFAEYIDISMTDQAIAGLNGMQLGDKRLIVQRASVGAKNAT----VLPAVQIQVP--- 322
Query: 397 MALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
G+SL G + +VLCL +T D L D+EEYE+ILED++EEC KYG
Sbjct: 323 -------------GLSLVGASGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEECNKYG 369
Query: 455 TLVNV 459
+ ++
Sbjct: 370 VVRSI 374
>gi|350295506|gb|EGZ76483.1| hypothetical protein NEUTE2DRAFT_76972 [Neurospora tetrasperma FGSC
2509]
Length = 592
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 140/428 (32%), Positives = 223/428 (52%), Gaps = 59/428 (13%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P T+L AF P
Sbjct: 199 RKRRMTQWDIKPPGYGNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMTQP 250
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+ A +R ++R+ V +PP A ++++ FF+ + G N D V
Sbjct: 251 GGAVNSAALKPTNSRQSKRLIVSNIPPSATDESLLGFFNLQLN--GLNVIDSADPCVQCQ 308
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------VAVRVRRPTDYNPTLAAALG 268
I+ + FA +E + +A+ A+ALDGI E +++RRP DY + A+
Sbjct: 309 ISPDHSFAMLEFKNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDY---IVPAI- 364
Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
PN + + +S I + P+++ V +P Y +E QI ELL +FG L F LVK
Sbjct: 365 --VEDPNYDPDSEVPSSIVI---DSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVLVK 419
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D+ T S+G FC Y D +VT +A LN + +GD+ L V++A+
Sbjct: 420 DKHTEESRGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKAS---------------- 463
Query: 389 QQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
IQ++A L + M+ L G SL G+ +++V+ L +TAD L D+++YEEI +D+
Sbjct: 464 ---YGIQQVAGELSVNAMSMLAGTTSLDGD-VSRVVQLLNMVTADELMDNDDYEEIRDDV 519
Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
+EEC K+GT+V++ IPRP ++ GVGK+F++Y ++ A AL+GRKF TV A
Sbjct: 520 QEECEKFGTIVSLKIPRPTGGSRQSAGVGKIFIKYENSDQATKALKALAGRKFADRTVVA 579
Query: 507 FYYPEDKY 514
Y+PE+ +
Sbjct: 580 TYFPEENF 587
>gi|380481793|emb|CCF41637.1| U2 snRNP auxilliary factor [Colletotrichum higginsianum]
Length = 550
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 145/459 (31%), Positives = 225/459 (49%), Gaps = 63/459 (13%)
Query: 78 RRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA-----------AAMLP 125
RR R RS + + R + L+ S + KRR + +D+ PP + M P
Sbjct: 128 RRERQRSATPPPKK-REPTPDLTDITSVLERKRRLTQWDIKPPGYENVTAEQAKLSGMFP 186
Query: 126 GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPL 185
P Q P PS + Q ++ Q+ + L P +R A+R+ + LPP
Sbjct: 187 LPGAPRQQPIDPSKL----QAIMNQPGGQVNSAALKPSN------SRQAKRLLINNLPPS 236
Query: 186 ANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGII 245
A E++I +FF+ + G N D + ++ + FA VE R EA+ A+ALDGI
Sbjct: 237 ATEESIQSFFNLQLN--GLNIIESADPCTSCQVSKDNSFAVVEFRNASEATIALALDGIS 294
Query: 246 FEG----------VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
E + + RP DY P +P G+ S + + P
Sbjct: 295 MEADDATNGEAANQGLSIHRPKDYIVPAVVDDVPYEP---------GVVSNVV--IDTPS 343
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++ + LP Y ++ Q+ ELL SFG L F LV+DR T S+G FC Y DPA TD+A
Sbjct: 344 KLSIANLPTYLSDEQVSELLVSFGELKAFVLVRDRSTEESRGIAFCEYVDPAATDVAIQG 403
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
LNG+ +GDK L V++A+ G ++ +A + +A M T+ ++ G
Sbjct: 404 LNGMDLGDKKLRVQKASV-GVTQ------VAGVEMGVAAMSMLAGTTSTDS--------G 448
Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
ET +VL L +T + L D+++YEEI ED++EEC K+G ++++ IPRP ++ GVG
Sbjct: 449 ET--RVLQLLNMVTPEELMDNDDYEEIKEDVQEECSKFGNVLDIKIPRPVGGSRQSAGVG 506
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
K+F+ + + A AL+GRKF TV Y+PE+ +
Sbjct: 507 KIFVRFENTESAKKALQALAGRKFADRTVVTTYFPEENF 545
>gi|393236224|gb|EJD43774.1| hypothetical protein AURDEDRAFT_137718 [Auricularia delicata
TFB-10046 SS5]
Length = 389
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 193/353 (54%), Gaps = 41/353 (11%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIG-GNSAGPGDAVVNVYINHEKKFAFVEMR 230
R +RR+Y+G + P E+ + FF+Q M + G GD V+ V +N+EK +AFVE R
Sbjct: 26 RQSRRLYIGSITPEITEENLTKFFNQKMREMNLGQQNASGDPVLAVQVNYEKNYAFVEFR 85
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+ ++A+ AMA DGIIF+ +R+RRP DY +G +P+ + G+ S +
Sbjct: 86 SADDATAAMAFDGIIFQSGPLRIRRPKDY-------MGNEYSAPSA-MHVPGVVSTNV-- 135
Query: 291 AEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
PD ++FVGGLP Y E Q+ ELL+SFG L F+LV++ + G SKGY F Y D
Sbjct: 136 ---PDSLHKIFVGGLPTYLNEEQVMELLKSFGELKAFNLVRENNNGPSKGYAFFEYVDEE 192
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
VT++A LNG+++GD+ L V+RA+ ++ + I +M + L
Sbjct: 193 VTEVAIQGLNGMELGDRVLAVQRASVGSKNG------MVVPNPDIPYDQMPEVPRPIMPL 246
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
+ A++L + + + L DDEE+ E+ ED++EEC K+G + ++ IPRP +
Sbjct: 247 NEAPT----QDARILLMLNMVVPEDLVDDEEFAELYEDVKEECAKFGAVEDLRIPRPAKR 302
Query: 468 GG--------------ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
G E GVG+V+++YY + +TA +L+GR F G ++ A
Sbjct: 303 AGPKYGPAAVEAQRVDEAAGVGRVYVKYYKSSDASTALRSLAGRSFAGRSIIA 355
>gi|299745153|ref|XP_001831503.2| rRNA primary transcript binding protein [Coprinopsis cinerea
okayama7#130]
gi|298406457|gb|EAU90350.2| rRNA primary transcript binding protein [Coprinopsis cinerea
okayama7#130]
Length = 550
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 126/359 (35%), Positives = 194/359 (54%), Gaps = 42/359 (11%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R +RR+Y+G + P NEQ +A FF++ M + + G+ V+ V N+EK +AFVE R
Sbjct: 192 SRQSRRLYIGSITPDVNEQNLAEFFNKKMAEMNIGTGSTGNPVLAVQCNYEKNYAFVEFR 251
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+ ++A+ AMA DGIIF +++RRP DY + SP +++ G S +
Sbjct: 252 SADDATAAMAFDGIIFINGPLKIRRPKDYGGEVVTG------SPGIHVP--GAVSTNV-- 301
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
+ ++VFVGGLP Y E Q+ ELL+SFG L F+LV++ G SKG+ F Y D +VTD
Sbjct: 302 PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFNLVRENGNGPSKGFAFFEYVDSSVTD 361
Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
+A +LNG+++GD+ L V+RA+ + Q I + + +G
Sbjct: 362 VAIQSLNGMELGDRYLVVQRASVGAKPGAPG----LPYDQFPDIPR-PIMPAGAEV---- 412
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP------ 464
T A++L + +T D L DDEEY ++ ED++EEC KYG + ++ IPRP
Sbjct: 413 ------TDARILLMLNMVTPDDLIDDEEYGDLYEDVKEECSKYGEVEDLRIPRPVKKDKA 466
Query: 465 -----------DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
Q E GVG+V+++Y D G A N+L+GR F G ++ A E+
Sbjct: 467 KWGEGQISAQDAQRIDEAAGVGRVYVKYADTEGANKALNSLAGRSFAGRSIIATLLSEE 525
>gi|302411252|ref|XP_003003459.1| splicing factor U2AF 65 kDa subunit [Verticillium albo-atrum
VaMs.102]
gi|261357364|gb|EEY19792.1| splicing factor U2AF 65 kDa subunit [Verticillium albo-atrum
VaMs.102]
Length = 568
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 162/543 (29%), Positives = 252/543 (46%), Gaps = 110/543 (20%)
Query: 40 FKSGGDDRRRDKNYKYDREGIRDHDRTDRH--------RDYNRDK--------------- 76
++ G + D +Y R R+ +R DR+ RD++RD+
Sbjct: 63 YERGSRRQENDDSYAASRSH-REREREDRYSGRDRRAERDWDRDRGSSRRDARRDDDDRG 121
Query: 77 -----------ERRHRHRSRSHSSDRFRNRSKSLSP---SRSPS-----------KSKRR 111
+RR R R+ R R +S SP R P+ + +R
Sbjct: 122 ARRGGDRDQFDDRRRGGRDRNEEFARREQRPRSASPPPKKREPTPDLTDIVSVLDRKRRL 181
Query: 112 SGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQV-- 165
+ +D+ PP + A + G LPG P P T+L AF P V
Sbjct: 182 TQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPTKLQAFMNQPGTVNS 233
Query: 166 --MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
+ +R A+R+ V LP A E+++A+FF+ + G N D V+ ++++K
Sbjct: 234 ASLKPSNSRQAKRLLVSKLPSSATEESVASFFNLQLN--GLNVIESTDPCVSCQLSNDKS 291
Query: 224 FAFVEMRTVEEASNAMALDGIIFE---------GVAVRVRRPTDYNPTLAAALGPGQPSP 274
F VE R EA+ A+ALDGI E G + +RRP DY P +P
Sbjct: 292 FCVVEFRNASEATVALALDGISMEADSGTDGAAGRGMEIRRPKDYIVPAVTEELPYEP-- 349
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
G+ S + + P+++ + G P Y TE Q+ ELL SFG L F LV+DR T
Sbjct: 350 -------GVVSSNV--VDTPNKLSITGFPPYLTEEQVTELLTSFGELKAFVLVRDRHTDE 400
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
S+G+ FC Y D A D+A L+G+ +G+ L +++A+ I +
Sbjct: 401 SRGFVFCEYVDSAANDVAIQGLSGMDLGNSKLKIQKAS-------------------IGV 441
Query: 395 QKMA---LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG 451
++A + + M+ L G + E ++VL L +TAD L D+E+YEEI+ED++EEC
Sbjct: 442 TQVAGVEMGVAAMSMLAGTTATDSEE-SRVLQLLNMVTADELMDNEDYEEIVEDVQEECA 500
Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
KYGT++ V +PRP ++ GVGK+F++Y A +L+GRKF TV Y+PE
Sbjct: 501 KYGTVLEVKVPRPVGGSRQSAGVGKIFVKYETKEATKKALQSLAGRKFADRTVVTTYFPE 560
Query: 512 DKY 514
+ +
Sbjct: 561 ENF 563
>gi|336274240|ref|XP_003351874.1| hypothetical protein SMAC_00421 [Sordaria macrospora k-hell]
gi|380096157|emb|CCC06204.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 594
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 140/428 (32%), Positives = 223/428 (52%), Gaps = 59/428 (13%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P T+L AF P
Sbjct: 201 RKRRMTQWDIKPPGYGNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMTQP 252
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+ A +R ++R+ V +PP A ++++ FF+ + G N D V
Sbjct: 253 GGTVNSTALKPTNSRQSKRLIVSNIPPSATDESLLGFFNLQLN--GLNVIDSVDPCVQCQ 310
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVA---------VRVRRPTDYNPTLAAALG 268
I+ + FA VE R +A+ A+ALDGI E +++RRP DY + A+
Sbjct: 311 ISPDHSFAMVEFRNSPDATVALALDGITMEADDANDAASAGGLKIRRPKDY---IVPAI- 366
Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
PN + + ++ I + P+++ V +P Y TE QI ELL +FG L F LVK
Sbjct: 367 --VEDPNYDPDSEVPSNIVI---DSPNKISVTNIPAYLTEEQIMELLVAFGKLKSFVLVK 421
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D+ T S+G FC Y D +VT +A LN + +GD+ L V++A+
Sbjct: 422 DKHTEESRGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKAS---------------- 465
Query: 389 QQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
IQ++A L + M+ L G SL G+ +++V+ L +TAD L D+++YEEI +D+
Sbjct: 466 ---YGIQQVAGELSVNAMSMLAGTTSLDGD-VSRVVQLLNMVTADELMDNDDYEEIRDDV 521
Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
+EEC K+GT+V++ IPRP ++ GVGK++++Y ++ A +L+GRKF TV A
Sbjct: 522 QEECEKFGTIVSLKIPRPTGGSRQSAGVGKIYIKYENSDQATKALKSLAGRKFADRTVVA 581
Query: 507 FYYPEDKY 514
Y+PE+ +
Sbjct: 582 TYFPEENF 589
>gi|330803435|ref|XP_003289712.1| hypothetical protein DICPUDRAFT_56283 [Dictyostelium purpureum]
gi|325080222|gb|EGC33787.1| hypothetical protein DICPUDRAFT_56283 [Dictyostelium purpureum]
Length = 501
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 180/348 (51%), Gaps = 37/348 (10%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
+ +RR+YVG +PP + + FF+ + A N PG VV IN K FAF+E RT
Sbjct: 158 KQSRRIYVGNIPPGITDSELIEFFNAAVLAANLN-VKPGPPVVFCQINAPKCFAFIEFRT 216
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDY----NPTLAAALGPGQPSPNLNLAAVGLASGA 287
EEA+NAM DGI + +++RRP DY +PT +AL P+
Sbjct: 217 PEEATNAMRFDGITLKNYTLKIRRPKDYQQSNDPTNTSALPTIVPT-------------- 262
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+ ++FVGGLP E Q+K LL ++G L F+LVKD +TG SKGY FC Y DP
Sbjct: 263 -NVPDSEHKIFVGGLPSNLNEEQVKTLLSAYGKLKAFNLVKDTNTGISKGYAFCEYLDPD 321
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
VTD ACA+LNG+ + DK L V+RA+ Q+ S + + NT
Sbjct: 322 VTDQACASLNGISLADKNLIVQRASIVAQTL----STIRSTVPSSPTTSTTQTSIDNNT- 376
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
++V+ L + + L DD+EY+ IL D++EEC +G ++ +P P +N
Sbjct: 377 ---------KPSRVIQLLNLVDKEDLYDDKEYDNILIDVKEECENFGPTQSLWLPMPSKN 427
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ V ++++E+ A L G+K+ G T+ + YYPED Y+
Sbjct: 428 PLD---VTRIYIEFQQLESSQKACLGLGGKKYNGRTIFSAYYPEDLYY 472
>gi|296420976|ref|XP_002840043.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636253|emb|CAZ84234.1| unnamed protein product [Tuber melanosporum]
Length = 540
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 124/389 (31%), Positives = 197/389 (50%), Gaps = 45/389 (11%)
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA-----TRHARRVYVGGLPPLAN 187
LPG P P T+L AF P V T +R A+R+ + +PP +
Sbjct: 185 LPGAPRQAP--------MDPTRLHAFMTQPSNVATASTLKPTNSRQAKRLLMSNIPPGTD 236
Query: 188 EQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE 247
E+++ FF+Q ++++ + GP D + +V ++ K +E + +A+ +AL GI F
Sbjct: 237 EESLLQFFNQTLSSLNVTTGGP-DPITSVQLSGSKILGLLEFKNTNDATVCLALSGIEFN 295
Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFT 307
G + ++RP DY + P P + + G+ S + + P+++ + +P Y
Sbjct: 296 GGNIEIKRPRDY-------IVPIVPEDHRHQEP-GVISSDV--PDTPNKILISEIPEYLQ 345
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+ ELL+SFG L F LVKD SKG FC Y DP T+IA LN +++ D TL
Sbjct: 346 DEQVIELLKSFGDLKAFVLVKDVTDETSKGIAFCEYLDPGTTEIAVEGLNAMEIADNTLR 405
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG--GGMSLFGETLAKVLCLT 425
VRRA+ I +++ A G+N + G + ++VL L
Sbjct: 406 VRRAS-------------------IGMKQAAGVEMGVNAMAMMAGTTSVDLEASRVLQLL 446
Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
+TAD L D EEY+EI ED+R+EC K+G+LV++ +PRP + GVGK+F+ +
Sbjct: 447 NMVTADELLDPEEYDEICEDIRDECQKFGSLVDMKVPRPSGGSRQAAGVGKIFVRFETQE 506
Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ A +L+GRKF TV Y+ E+ Y
Sbjct: 507 SASNALRSLAGRKFADRTVVCTYFSEENY 535
>gi|224003073|ref|XP_002291208.1| U2 snRNP auxillary splicing factor, U2AF subunit [Thalassiosira
pseudonana CCMP1335]
gi|220972984|gb|EED91315.1| U2 snRNP auxillary splicing factor, U2AF subunit, partial
[Thalassiosira pseudonana CCMP1335]
Length = 352
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 129/361 (35%), Positives = 195/361 (54%), Gaps = 32/361 (8%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAI--GGNSAG---------PGDAVVNVYIN 219
TRHARR+YVG +P +++EQ F + ++I NS D +++VYIN
Sbjct: 1 TRHARRLYVGNIPDVSDEQLHHFFRDAIRSSIILDNNSEAHSSHKHQYVDNDPIISVYIN 60
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVA-VRVRRPTDYNPTLAAALGPGQPSPNLNL 278
E++FAF+E +T+E + MALDG+ G V+++RP DYNP +A L P L+
Sbjct: 61 RERRFAFLEFKTMEITTACMALDGLDVMGRGKVKIKRPNDYNPAVAPMLN-ASTMPVLDT 119
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK-DRDTGNSKG 337
+G+ S + +GP+++FVGGLPY+ ++Q+ ELL +FG + F+LVK D + SKG
Sbjct: 120 GKLGIISMTV--HDGPNKIFVGGLPYHLVDSQVLELLSAFGAVKAFNLVKNDPMSDTSKG 177
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y F Y DP VT IA LNG+ MG G+ +Q ++
Sbjct: 178 YCFVEYCDPNVTQIAAMGLNGMDMG-----------GGKQPYQQPIVVKDPMAVANAAAS 226
Query: 398 ALQ----TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
AL + G+ L ++ G T+ ++L L + + LA E+ + + E++REE GKY
Sbjct: 227 ALDQAFGSGGVPPLAPPTAMPGSTVTRILVLLNMVMDEDLATAEDRKFLEEEVREEVGKY 286
Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
GTL+++ IP P + G V K+FLEY A+ L GR FG N V+A Y+ E+
Sbjct: 287 GTLLSMKIPMPHE-GCAPSAVKKIFLEYATPAEAMYAEKELKGRAFGPNVVDASYFSEED 345
Query: 514 Y 514
Y
Sbjct: 346 Y 346
>gi|440633242|gb|ELR03161.1| hypothetical protein GMDG_05987 [Geomyces destructans 20631-21]
Length = 559
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 136/421 (32%), Positives = 211/421 (50%), Gaps = 52/421 (12%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 173 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMSQP 224
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+T A +R A+R+ V LP +E+ I FF+ + G N D +
Sbjct: 225 SGSITNAALKPSNSRQAKRLLVHNLPKTLSEEGIVEFFNLQLN--GLNVVEGSDPCLTAQ 282
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE----GVAVRVRRPTDYNPTLAAALGPGQPS 273
++ +K FA VE +T +A+ A+A+DGI E A+ +RRP DY + A+
Sbjct: 283 VSKDKSFALVEFKTTSDATVALAMDGIGIEENGGSRALSIRRPKDY---IVPAVDEAMHE 339
Query: 274 PNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
P G+ + + + P+++ + +P Y T+ Q+ ELL SFG L F L KD T
Sbjct: 340 P-------GVVTNVV--PDTPNKISISNVPPYLTDEQVTELLVSFGELKAFVLAKDSTTE 390
Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
S+G FC Y D A TDIA LNG+++GDK L V+RA+ G ++T A +
Sbjct: 391 ESRGIAFCEYVDAAATDIAVEGLNGMELGDKHLKVQRASI-GTTQT--------AGLEMG 441
Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+ M++ G + G +VL L +TA+ L D+E+YEEILED++EEC KY
Sbjct: 442 VNAMSML--------AGTTTDGLDEGRVLQLLNMVTAEELIDNEDYEEILEDVKEECEKY 493
Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
G ++++ IPRP ++ GVGK+F+++ A L+GRKF TV Y+ E+
Sbjct: 494 GKVLDIKIPRPSGGSRQSAGVGKIFVKFDTPASAGKALRTLAGRKFADRTVVTTYFSEEN 553
Query: 514 Y 514
+
Sbjct: 554 F 554
>gi|426196755|gb|EKV46683.1| hypothetical protein AGABI2DRAFT_206147 [Agaricus bisporus var.
bisporus H97]
Length = 558
Score = 207 bits (528), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 143/441 (32%), Positives = 225/441 (51%), Gaps = 51/441 (11%)
Query: 97 KSLSPSRSPSKSKRR---SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGAT 153
+S +P PS S+R+ SG+D+ P +A+ + G+ +P + +P +
Sbjct: 120 RSPTPENCPSLSQRKRKASGWDVHAPGYEQY--SAMQAKQTGL-FNLPGANRTQIPPILS 176
Query: 154 QLGAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
G P MPVQ +R +RR+Y+G + NEQ +A FF+ M + +
Sbjct: 177 IPGLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITQEVNEQNLADFFNAKMAEMNIGTG 236
Query: 208 GPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAAL 267
G+ V+ V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY +
Sbjct: 237 ITGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYGGDTIVSP 296
Query: 268 G---PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
G PG S N+ + ++VFVGGLP Y E Q+ ELL+SFG L F
Sbjct: 297 GVHVPGVVSTNV--------------PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAF 342
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
+LV++ TG SKG+ F Y D AVTD+A +LNG+++GD+ L V+RA+ + T
Sbjct: 343 NLVRENGTGTSKGFAFFEYVDQAVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIP 402
Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
Q I + + T + +++L + +T + L +D+EY ++ +
Sbjct: 403 NLPYDQFPEIPRPIMPAGKDQT---------SSESRILLMLNMVTPEDLHEDDEYGDLYD 453
Query: 445 DMREECGKYGTLVNVVIPRP-------------DQNGGETPGVGKVFLEYYDAVGCATAK 491
D++ EC KYG L ++ IPRP Q E GVG+V+++Y ++ + A
Sbjct: 454 DVKAECSKYGELEDLRIPRPVKKDKTSSMSAQDAQRIDEAAGVGRVYVKYTNSKSASAAL 513
Query: 492 NALSGRKFGGNTVNAFYYPED 512
NAL+GR F G ++ A +D
Sbjct: 514 NALAGRSFAGRSIIATLLSDD 534
>gi|154319442|ref|XP_001559038.1| hypothetical protein BC1G_02202 [Botryotinia fuckeliana B05.10]
Length = 596
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 133/430 (30%), Positives = 211/430 (49%), Gaps = 60/430 (13%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 200 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMAQP 251
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+T A +R ++R+ V +P E+ I +FF+ + G N D +++
Sbjct: 252 SGQVTNAALKPSNSRQSKRLLVHNIPADTKEETIVSFFNLQLN--GLNVIEGSDPLISAQ 309
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPTLA 264
++ + FA +E +T +A+ A+ALDGI +G + +RRP DY
Sbjct: 310 VSKDGSFALLEFKTQSDATVALALDGITMDGNDHMETENGSADTRGLSIRRPKDYIVPAV 369
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
P +P G+ S + A+ +++ V +P+Y + Q+ ELL SFG L F
Sbjct: 370 TDETPFEP---------GVISNVV--ADTQNKISVTNIPHYLNDEQVTELLVSFGELKAF 418
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
LVKD +T S+G FC Y DPA TDIA LNG+++GDK L V+RA+
Sbjct: 419 VLVKDSNTDESRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRASIG---------- 468
Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
+ + + + + M+ L G S E +VL L +T + L D+E+YEEI E
Sbjct: 469 ------NTQVSGLEMSVNAMSMLAGTTSQDLEN-GRVLQLLNMVTPEELIDNEDYEEICE 521
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D++EEC KYG ++ + +PRP ++ GVGK+F+++ A AL+GRKF TV
Sbjct: 522 DVKEECEKYGKVLEMKVPRPTGGSRQSTGVGKIFVKFDTPDSAGKALKALAGRKFADRTV 581
Query: 505 NAFYYPEDKY 514
Y+PE+ +
Sbjct: 582 VTTYFPEENF 591
>gi|347842431|emb|CCD57003.1| hypothetical protein [Botryotinia fuckeliana]
Length = 606
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 133/430 (30%), Positives = 211/430 (49%), Gaps = 60/430 (13%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 210 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMAQP 261
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+T A +R ++R+ V +P E+ I +FF+ + G N D +++
Sbjct: 262 SGQVTNAALKPSNSRQSKRLLVHNIPADTKEETIVSFFNLQLN--GLNVIEGSDPLISAQ 319
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPTLA 264
++ + FA +E +T +A+ A+ALDGI +G + +RRP DY
Sbjct: 320 VSKDGSFALLEFKTQSDATVALALDGITMDGNDHMETGNGSADTRGLSIRRPKDYIVPAV 379
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
P +P G+ S + A+ +++ V +P+Y + Q+ ELL SFG L F
Sbjct: 380 TDETPFEP---------GVISNVV--ADTQNKISVTNIPHYLNDEQVTELLVSFGELKAF 428
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
LVKD +T S+G FC Y DPA TDIA LNG+++GDK L V+RA+
Sbjct: 429 VLVKDSNTDESRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRASIG---------- 478
Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
+ + + + + M+ L G S E +VL L +T + L D+E+YEEI E
Sbjct: 479 ------NTQVSGLEMSVNAMSMLAGTTSQDLEN-GRVLQLLNMVTPEELIDNEDYEEICE 531
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D++EEC KYG ++ + +PRP ++ GVGK+F+++ A AL+GRKF TV
Sbjct: 532 DVKEECEKYGKVLEMKVPRPTGGSRQSTGVGKIFVKFDTPDSAGKALKALAGRKFADRTV 591
Query: 505 NAFYYPEDKY 514
Y+PE+ +
Sbjct: 592 VTTYFPEENF 601
>gi|18446992|gb|AAL68087.1| AT16577p [Drosophila melanogaster]
Length = 449
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 200/362 (55%), Gaps = 35/362 (9%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFAF 226
TR ARR+YVG +P ++ + FF+ + A+G + G+AV+ N EK FAF
Sbjct: 107 TRQARRLYVGNIPFGVTDEEMMQFFNHQIMALGFEAKSSHYMDGNAVLTCQTNLEKNFAF 166
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-----SPNLNLA-- 279
+E R+++EAS A+ DG++F G +++RRP DY P + ++ + P +N+A
Sbjct: 167 LEFRSIDEASQALNFDGMVFRGQTLKIRRPHDYQPVPSISVSAMESYRSFRVPAINVAQQ 226
Query: 280 -AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
AV L I + P++++VGGLP + Q+KELL+SFG L G +LV D +T +KG+
Sbjct: 227 PAVTLPVTTIV-PDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDTNTNLNKGF 285
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
F Y DP+VTD A A L+G+ +GD+ L V+R+ G++ H+
Sbjct: 286 AFFEYCDPSVTDHAIAGLHGMLLGDRRLVVQRSIPGGKNAFP---------GHLPT---V 333
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
LQ G++TL S + LCL + + L DDEE+E+I D+++EC K+G + +
Sbjct: 334 LQVPGISTLQDPGS-----PTETLCLLNMVRPEELLDDEEFEDIRTDIKQECAKFGEVRS 388
Query: 459 VVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ IPRP G+ P G GKVF+++ A ALSGRKF G V YY +KY
Sbjct: 389 IKIPRPI---GQFPKRGCGKVFVQFESVEDSQKALKALSGRKFSGRIVMTSYYDPEKYLA 445
Query: 517 KD 518
D
Sbjct: 446 DD 447
>gi|310793965|gb|EFQ29426.1| U2 snRNP auxilliary factor [Glomerella graminicola M1.001]
Length = 549
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 146/459 (31%), Positives = 221/459 (48%), Gaps = 63/459 (13%)
Query: 78 RRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA-----------AAMLP 125
RR R RS + + R + L+ S + KRR + +D+ PP + M P
Sbjct: 127 RRERQRSATPPPKK-REPTPDLTDVTSVLERKRRLTQWDIKPPGYENVTAEQAKLSGMFP 185
Query: 126 GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPL 185
P Q P PS + Q ++ Q+ + L P +R A+R+ + LPP
Sbjct: 186 LPGAPRQQPMDPSKL----QAIMNQPGGQVNSAALKPSN------SRQAKRLLINNLPPS 235
Query: 186 ANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGII 245
A E+ I FF+ + G N D + ++ + FA VE R EA+ A+ALDGI
Sbjct: 236 ATEEGIQNFFNLQLN--GLNIIESTDPCTSCQVSKDHSFAVVEFRNASEATVALALDGIS 293
Query: 246 FE------GVA----VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
E G A + +RRP DY P +P G+ S + + +
Sbjct: 294 MEAEDATNGAAADQGLVIRRPKDYIVPAVVDDVPYEP---------GVVSNVV--VDTHN 342
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++ V +P Y +E Q+ ELL SFG L F LV+D+ T S+G FC Y DPA TD+A
Sbjct: 343 KISVANMPVYLSEEQVSELLVSFGELKAFVLVRDKSTEESRGIAFCEYVDPAATDVAIQG 402
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
LNG+ +GDK L V++A+ + + + + M+ L G S
Sbjct: 403 LNGMDLGDKRLKVQKASVG----------------VTQVAGVEMGVAAMSMLAGTTSTDS 446
Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
E +VL L +T + L D+++YEEI ED++EEC K+GT+++V IPRP ++ GVG
Sbjct: 447 EE-TRVLQLLNMVTPEELMDNDDYEEIKEDVQEECAKFGTVLDVKIPRPVGGSRQSAGVG 505
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
K+F+++ A AL+GRKF TV Y+PE+ +
Sbjct: 506 KIFVKFETKEAAKKALQALAGRKFADRTVVTTYFPEENF 544
>gi|24659166|ref|NP_611769.2| large subunit 2 [Drosophila melanogaster]
gi|7291545|gb|AAF46969.1| large subunit 2 [Drosophila melanogaster]
gi|201066057|gb|ACH92438.1| FI08027p [Drosophila melanogaster]
Length = 449
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 200/362 (55%), Gaps = 35/362 (9%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFAF 226
TR ARR+YVG +P ++ + FF+ + A+G + G+AV+ N EK FAF
Sbjct: 107 TRQARRLYVGNIPFGVTDEEMMQFFNHQIMALGFEAKSSHYMDGNAVLTCQTNLEKNFAF 166
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-----SPNLNLA-- 279
+E R+++EAS A+ DG++F G +++RRP DY P + ++ + P +N+A
Sbjct: 167 LEFRSIDEASQALNFDGMVFRGQTLKIRRPHDYQPVPSISVSAMESYRSFRVPAINVAQQ 226
Query: 280 -AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
AV L I + P++++VGGLP + Q+KELL+SFG L G +LV D +T +KG+
Sbjct: 227 PAVTLPVTTIV-PDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDTNTNLNKGF 285
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
F Y DP+VTD A A L+G+ +GD+ L V+R+ G++ H+
Sbjct: 286 AFFEYCDPSVTDHAIAGLHGMLLGDRRLVVQRSIPGGKNAFP---------GHLPT---V 333
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
LQ G++TL S + LCL + + L DDEE+E+I D+++EC K+G + +
Sbjct: 334 LQVPGISTLQDPGS-----PTETLCLLNMVRPEELLDDEEFEDIRTDIKQECAKFGEVRS 388
Query: 459 VVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ IPRP G+ P G GKVF+++ A ALSGRKF G V YY +KY
Sbjct: 389 IKIPRPI---GQFPKRGCGKVFVQFESVEDSQKALKALSGRKFSGRIVMTSYYDPEKYLA 445
Query: 517 KD 518
D
Sbjct: 446 DD 447
>gi|402223467|gb|EJU03531.1| hypothetical protein DACRYDRAFT_77158 [Dacryopinax sp. DJM-731 SS1]
Length = 392
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 124/363 (34%), Positives = 196/363 (53%), Gaps = 46/363 (12%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
+ ARR+Y+G + P E+ + F + + +G G V ++HEK +A++E
Sbjct: 54 KQARRLYIGDITPDTTEENLTAFLKKTLPELGIKVEGEDVGFEEVRVSHEKNYAYIEFSN 113
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV--GLASGAIG 289
++A+ M LDG +F G +++RRP DY L+A +LA V G+ G +
Sbjct: 114 PDDATKTMELDGTVFLGQPLKIRRPHDY---LSAT----------DLAVVFGGIVPGVVS 160
Query: 290 GAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
PD ++FVGGLP Y E Q+ ELL++FG L F+LVKD TG SKG+ F Y DP
Sbjct: 161 -TNVPDSINKIFVGGLPTYLNEAQVMELLQTFGELRAFNLVKDGSTGVSKGFAFFEYMDP 219
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTD+AC LNG+++GD+ L V+RA+ G + T+ + + + A+ NT
Sbjct: 220 GVTDVACQGLNGMELGDRYLVVQRASI-GANPTKPN--MPNMPGTLPPPRPAILPVD-NT 275
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+ +L L +TA+ L D++YE+ILED+REE G++G ++++ IPRP +
Sbjct: 276 ---------NPPSPILLLLNMVTAEELLQDQDYEDILEDVREEMGRFGPVIDIKIPRPQR 326
Query: 467 N-----GGETP---------GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
G T GVG+V++++ D + A NA++GR+F G ++ A Y ED
Sbjct: 327 RENRWIPGSTSEPVKTDIELGVGRVYVKFADTDSASQALNAVAGRQFSGRSIIATYLQED 386
Query: 513 KYF 515
+
Sbjct: 387 PFL 389
>gi|326428095|gb|EGD73665.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 2
[Salpingoeca sp. ATCC 50818]
Length = 415
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 128/395 (32%), Positives = 193/395 (48%), Gaps = 77/395 (19%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
R++ V LP + + +F + + A PG V+ ++ + K A +E R+V+E
Sbjct: 36 RQLVVENLPEGTRDHDLMSFMNDCI-ASNKLITQPGQPVIKCTLSEDGKSAVLEFRSVDE 94
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA--- 291
A+N + D F+G +RVRRP +Y +P ++ + + SGA A
Sbjct: 95 ATNGLVFDRERFKGAQLRVRRPDNYE------------APKGHITRIPMQSGANVSAVVQ 142
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ P ++++GG+P Y +E Q+KELLE FG L F LV D TG SKG+ FC Y DPA+TD
Sbjct: 143 DSPYKIYIGGIPPYLSEEQVKELLEPFGQLKSFHLVMDTSTGQSKGFAFCEYMDPAITDT 202
Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM-ALQTSGMN----- 405
ALN L++G+K L V+RA+ + AQQ+ + + A +++GM
Sbjct: 203 MIGALNDLRIGEKRLLVQRASIGARGG---------AQQNNPMHSIPAFKSAGMQGPGGP 253
Query: 406 -------------------------------------TLGGGMSLFGETL-------AKV 421
G G+ L T+ ++
Sbjct: 254 MMPLPPMSAPLKPAAPGAPITPGPISQPPPMSLPLPPPSGAGLPLPAPTMPTQQLVATRI 313
Query: 422 LCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE--TPGVGKVFL 479
L L +T D L DDEEYE+I D+REEC K+G ++++ IPRP + E PGVGK++L
Sbjct: 314 LILMNMVTVDELRDDEEYEDICADIREECEKFGEILDMKIPRPSKENPEEQVPGVGKIYL 373
Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+Y A A ALSGR F TV ++PEDK+
Sbjct: 374 KYASADEARIAARALSGRSFAERTVVTSFWPEDKF 408
>gi|134113282|ref|XP_774666.1| hypothetical protein CNBF3460 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257310|gb|EAL20019.1| hypothetical protein CNBF3460 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 652
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 131/406 (32%), Positives = 198/406 (48%), Gaps = 70/406 (17%)
Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
P VP A +P GAFP R R+Y+GG+ EQ I FF+
Sbjct: 246 PGRVPPPAHLGIP-ATFVAGAFP-------PSNPVRQNNRLYIGGIKEDMQEQQIQDFFN 297
Query: 197 QVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP 256
+M G + G D V IN+++ FAF+E+ T E+A+ A+ LDG++ +G ++RVRRP
Sbjct: 298 NLMKE-KGMADGKEDPVKQCQINNDRNFAFIELHTPEQATAALELDGVVLDGASLRVRRP 356
Query: 257 TDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
DY +P L G PS A+ P+++F+GG+P Y + Q+ E
Sbjct: 357 KDYAGIDPLLQTFNGVVAPS----------------VADSPNKLFIGGIPTYLNDEQVME 400
Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
LL+SFG L F+LVK+ G SKG+ F Y DP VTD+A L+ +GD+ L V+RA
Sbjct: 401 LLKSFGELKSFNLVKE-SAGVSKGFAFAEYLDPEVTDMAIQGLHNFSLGDRNLVVQRAAV 459
Query: 374 SGQSKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
+ + L+QA H+ +Q A S ++V+ L +
Sbjct: 460 GRNTGVNAPIPGSAAYLSQAIPHL-MQNNADAPS----------------SRVMLLLNMV 502
Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-------------------DQNGG 469
T + L +D++Y +I+ED+ EEC KYG + V IPRP ++
Sbjct: 503 TPEELYNDDDYNDIIEDINEECSKYGEIEGVRIPRPVPKSKKWESTEAAAATAERNKRTD 562
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ GVG+V++ Y D A NA+ GR+F G T+ PE+++
Sbjct: 563 DEAGVGRVYVMYKDVESTKKAMNAIGGRQFAGRTILVANVPEEEFL 608
>gi|58268792|ref|XP_571552.1| rRNA primary transcript binding protein [Cryptococcus neoformans
var. neoformans JEC21]
gi|57227787|gb|AAW44245.1| rRNA primary transcript binding protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 651
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 131/406 (32%), Positives = 198/406 (48%), Gaps = 70/406 (17%)
Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
P VP A +P GAFP R R+Y+GG+ EQ I FF+
Sbjct: 245 PGRVPPPAHLGIP-ATFVAGAFP-------PSNPVRQNNRLYIGGIKEDMQEQQIQDFFN 296
Query: 197 QVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP 256
+M G + G D V IN+++ FAF+E+ T E+A+ A+ LDG++ +G ++RVRRP
Sbjct: 297 NLMKE-KGMADGKEDPVKQCQINNDRNFAFIELHTPEQATAALELDGVVLDGASLRVRRP 355
Query: 257 TDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
DY +P L G PS A+ P+++F+GG+P Y + Q+ E
Sbjct: 356 KDYAGIDPLLQTFNGVVAPS----------------VADSPNKLFIGGIPTYLNDEQVME 399
Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
LL+SFG L F+LVK+ G SKG+ F Y DP VTD+A L+ +GD+ L V+RA
Sbjct: 400 LLKSFGELKSFNLVKE-SAGVSKGFAFAEYLDPEVTDMAIQGLHNFSLGDRNLVVQRAAV 458
Query: 374 SGQSKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
+ + L+QA H+ +Q A S ++V+ L +
Sbjct: 459 GRNTGVNAPIPGSAAYLSQAIPHL-MQNNADAPS----------------SRVMLLLNMV 501
Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-------------------DQNGG 469
T + L +D++Y +I+ED+ EEC KYG + V IPRP ++
Sbjct: 502 TPEELYNDDDYNDIIEDINEECSKYGEIEGVRIPRPVPKSKKWESTEAAAATAERNKRTD 561
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ GVG+V++ Y D A NA+ GR+F G T+ PE+++
Sbjct: 562 DEAGVGRVYVMYKDVESTKKAMNAIGGRQFAGRTILVANVPEEEFL 607
>gi|402906865|ref|XP_003916203.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Papio anubis]
Length = 446
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 149/426 (34%), Positives = 209/426 (49%), Gaps = 67/426 (15%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
RSP K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 135
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+
Sbjct: 136 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 187
Query: 215 NVYINHEKKFAFVEMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPS 273
V IN +K FAF+E+ + + AL + F V P
Sbjct: 188 AVQINQDKNFAFLEVSWGNRSGPVLCALAMLTFPEWVVSTVVP----------------- 230
Query: 274 PNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
+ ++F+GGLP Y + Q+KELL SFG L F+LVKD TG
Sbjct: 231 ------------------DSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATG 272
Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
SKGY FC Y D VTD A A LNG+++GDK L V+RA+ ++ T + Q +
Sbjct: 273 LSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVT 328
Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+Q L +S + +GG + +VLCL + + L DDEEYEEI+ED+R+EC KY
Sbjct: 329 LQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKY 380
Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
G + ++ IPRP +G E PG GK+F+E+ C A L+GRKF V Y D
Sbjct: 381 GLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDS 439
Query: 514 YFNKDY 519
Y +D+
Sbjct: 440 YHRRDF 445
>gi|405121398|gb|AFR96167.1| rRNA primary transcript binding protein [Cryptococcus neoformans
var. grubii H99]
Length = 655
Score = 202 bits (513), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 130/406 (32%), Positives = 198/406 (48%), Gaps = 70/406 (17%)
Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
P VP A +P GAFP R R+Y+GG+ EQ I FF+
Sbjct: 249 PGRVPPPAHLGIP-ATFVAGAFP-------PSNPVRQNNRLYIGGIKEDMQEQQIQDFFN 300
Query: 197 QVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP 256
+M G + G D V IN+++ FAF+E+ T E+A+ A+ LDG++ +G ++RVRRP
Sbjct: 301 NLMKE-KGMADGKEDPVKQCQINNDRNFAFIELHTPEQATAALELDGVVLDGASLRVRRP 359
Query: 257 TDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
DY +P L G PS A+ P+++F+GG+P Y + Q+ E
Sbjct: 360 KDYAGIDPLLQTFNGVVAPS----------------VADSPNKLFIGGIPTYLNDEQVME 403
Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
LL+SFG L F+LVK+ G SKG+ F Y DP VTD+A L+ +GD+ L V+RA
Sbjct: 404 LLKSFGELKSFNLVKE-SAGVSKGFAFAEYLDPEVTDMAIQGLHNFALGDRNLVVQRAAV 462
Query: 374 SGQSKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
+ + L+QA H+ +Q A S ++V+ L +
Sbjct: 463 GRNTGVNAPIPGSAAYLSQAIPHL-MQNNADAPS----------------SRVMLLLNMV 505
Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-------------------DQNGG 469
T + L +D++Y +I+ED+ +EC KYG + V IPRP ++
Sbjct: 506 TPEELYNDDDYNDIIEDINDECSKYGEIEGVRIPRPVPKSKKWESTEAAAATAERNKRTD 565
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ GVG+V++ Y D A NA+ GR+F G T+ PE+++
Sbjct: 566 DEAGVGRVYVMYKDVESTKKAMNAIGGRQFAGRTILVANVPEEEFL 611
>gi|156061663|ref|XP_001596754.1| hypothetical protein SS1G_02977 [Sclerotinia sclerotiorum 1980]
gi|154700378|gb|EDO00117.1| hypothetical protein SS1G_02977 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 518
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 132/428 (30%), Positives = 206/428 (48%), Gaps = 60/428 (14%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 125 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMAQP 176
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+T A +R ++R+ V +P E+ + FF+ + G N D ++
Sbjct: 177 SGSVTNAALKPSNSRQSKRLLVHNIPADTKEETLVGFFNLQLN--GLNVIEGSDPCISAQ 234
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-------------VAVRVRRPTDYNPTLA 264
++ + FA +E +T +A+ A+ALDGI E + +RRP DY
Sbjct: 235 VSKDGSFALLEFKTQSDATVALALDGITMENNDHMVTGNGSADTQGLSIRRPKDYIVPAV 294
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
P +P G+ S + + +++ V +P+Y + Q+ ELL SFG L F
Sbjct: 295 TDETPFEP---------GVVSNIVPDTQ--NKISVANIPHYLNDEQVTELLVSFGELKAF 343
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
LVKD +T S+G FC Y DPA TDIA LNG+++GDK L V+RA+
Sbjct: 344 VLVKDSNTDESRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRASIG---------- 393
Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
+ + + + + M+ L G S E +VL L +T + L D+E+YEEI E
Sbjct: 394 ------NTQVSGLEMSVNAMSMLAGTTSQDLEN-GRVLQLLNMVTPEELIDNEDYEEICE 446
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D++EEC KYG ++ + +PRP ++ GVGK+F+++ A AL+GRKF TV
Sbjct: 447 DVKEECEKYGKVLEMKVPRPTGGSRQSTGVGKIFVKFDTPDSAGKALKALAGRKFADRTV 506
Query: 505 NAFYYPED 512
Y+PED
Sbjct: 507 VTTYFPED 514
>gi|321260434|ref|XP_003194937.1| splicing factor (U2 snRNP auxiliary factor large subunit)
[Cryptococcus gattii WM276]
gi|317461409|gb|ADV23150.1| Splicing factor (U2 snRNP auxiliary factor large subunit), putative
[Cryptococcus gattii WM276]
Length = 654
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 198/403 (49%), Gaps = 64/403 (15%)
Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
P VP A +P GAFP R R+Y+GG+ E I FF+
Sbjct: 248 PGRVPPPAHLGIP-ATFVAGAFP-------PSNPVRQNNRLYIGGIKEDMQENQIQDFFN 299
Query: 197 QVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP 256
+M G + G D V IN+++ FAF+E+ T E+A+ A+ LDG++ +G ++RVRRP
Sbjct: 300 NLMKE-KGMADGKEDPVKQCQINNDRNFAFIELHTPEQATAALELDGVVLDGASLRVRRP 358
Query: 257 TDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLE 316
DY A + P + N G+ + ++ A+ P+++F+GG+P Y + Q+ ELL+
Sbjct: 359 KDY-----AGIDPLLQTFN------GIVAPSV--ADSPNKLFIGGIPTYLNDEQVMELLK 405
Query: 317 SFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ 376
SFG L F+LVK+ G SKG+ F Y DP VTD+A L+ +GD+ L V+RA
Sbjct: 406 SFGELKSFNLVKE-SAGVSKGFAFAEYLDPEVTDMAIQGLHNFALGDRNLVVQRAAVGRN 464
Query: 377 SKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITAD 431
+ + L+QA H+ TS +V+ L +T +
Sbjct: 465 TGVNAPIPGSAAYLSQAIPHLMQNNADAPTS-----------------RVMLLLNMVTPE 507
Query: 432 ALADDEEYEEILEDMREECGKYGTLVNVVIPRP-------------------DQNGGETP 472
L +D++Y +I+ED+ +EC KYG + V IPRP ++ +
Sbjct: 508 ELYNDDDYNDIIEDINDECSKYGEIEGVRIPRPVPKSKKWESTEAAAATAERNKRTDDEA 567
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
GVG+V++ Y D A +A+ GR+F G T+ PE+++
Sbjct: 568 GVGRVYVMYKDVESTKKAMDAIGGRQFAGRTILVANVPEEEFL 610
>gi|353240191|emb|CCA72072.1| related to pre-mRNA splicing factor U2AF large chain
[Piriformospora indica DSM 11827]
Length = 403
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 140/396 (35%), Positives = 198/396 (50%), Gaps = 76/396 (19%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMT--AIGGNSAGPGDAVVNV 216
P+ V R +RR+Y+G + PLA+E++IA FF+ M + +SA P V+ V
Sbjct: 31 PVATGMVSNSNLARQSRRLYLGSITPLADEESIALFFNSQMRERKLTTSSAPP---VLAV 87
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
+N EK +AFVE R+ E+A+ MALDG +F +RVRRP DY GP
Sbjct: 88 QVNREKNYAFVEFRSAEDATAGMALDGTVFLDGPLRVRRPHDYA-------GPE------ 134
Query: 277 NLAAVGLASGA-IGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
A G+A A + A PD ++F+G LP + TE QI ELL+SFG L F+LV++ T
Sbjct: 135 --AMTGVAGFATLLPATMPDSVNKIFIGNLPTHLTEDQIVELLKSFGELKAFNLVREHGT 192
Query: 333 GNSK----------------GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ 376
SK G+ F Y DPAVTDIA +LNG+ +GDK L V+RA
Sbjct: 193 NVSKVFTVRITLSMNLTGSQGFAFVEYADPAVTDIATESLNGMDLGDKKLVVQRA----- 247
Query: 377 SKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADD 436
S+ A+ I + M + + + L E +V+ + + A+ L DD
Sbjct: 248 ------SVGAKGGVPIPPEAMDIPAPIV-----AVDLNKEANGRVVLMLNMVVAEDLMDD 296
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRP-----------DQN---------GGETPGVGK 476
EYEEIL+D+R EC +G ++ V +PRP D N E GVG+
Sbjct: 297 VEYEEILDDIRSECSGFGQVLGVYVPRPLKKDKSRWDTMDPNTMSPAEIAKADEIAGVGR 356
Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
V++++ D G TA L+GR F G T+ ED
Sbjct: 357 VYVKFSDTEGATTAVRQLAGRSFAGRTIITTLMRED 392
>gi|110741990|dbj|BAE98934.1| splicing factor like protein [Arabidopsis thaliana]
Length = 341
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/149 (70%), Positives = 118/149 (79%), Gaps = 4/149 (2%)
Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
+R SGFDMAPPA+AML A V GQ+P P +P M NM P Q G +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRV 253
AFVEMR+VEEASNAM+LDGIIFEG V+V
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKV 317
>gi|440792998|gb|ELR14199.1| U2 snRNP auxilliary factor, large subunit, splicing factor
subfamily protein [Acanthamoeba castellanii str. Neff]
Length = 462
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 126/350 (36%), Positives = 180/350 (51%), Gaps = 62/350 (17%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
R ARR+YVG +PP G S VV+ +N +K F+F+E T
Sbjct: 172 RQARRLYVGSIPP-------------------GVSDPDHRPVVSSQLNPDKSFSFIEFST 212
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
++EA+ MALDGI G+ ++VRRP DY +P A A G + G+ S +
Sbjct: 213 IDEATAGMALDGITMNGMTLKVRRPKDYVSPPTAQAPASG------GIHIPGIVSTNV-- 264
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
+ P+++F+GGLP Y E Q+KELL +FG L F+LVKD TGNSKGY F Y D +VTD
Sbjct: 265 PDSPNKIFIGGLPSYLNEAQVKELLTAFGPLKAFNLVKDTATGNSKGYAFFEYLDASVTD 324
Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
AC LNG+K+GDKTL V+RA + I+ M + SGM L
Sbjct: 325 RACQGLNGMKLGDKTLLVQRANIGAKQDGTGGLIM-----------MPMDPSGM--LNAS 371
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
S ++ + + L DE++ +I+ED+R+EC KYG +++
Sbjct: 372 PSAASLLNLQL---LNLVRPEELVSDEDHADIVEDVRQECEKYGNVMS------------ 416
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
VF+E+ D A A++ L+GRKF G+ V Y+ ED Y +D+S
Sbjct: 417 ------VFVEFADREDAARAQSNLAGRKFNGHVVLTSYFDEDDYEQRDFS 460
>gi|406867695|gb|EKD20733.1| U2 snRNP auxilliary factor [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 613
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 137/443 (30%), Positives = 209/443 (47%), Gaps = 76/443 (17%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 201 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMSQP 252
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+T A +R A+R+ V LPP NE+AI FF+ + G N D V+
Sbjct: 253 SGSVTNAALKPSNSRQAKRLLVSNLPPTVNEEAIVNFFNLQLN--GLNVIEGSDPCVSAQ 310
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE-----GVA-------VRVRRPTDYNPTLAA 265
I+ +K FA +E + +A+ A+ALDGI E G A + +RRP DY +
Sbjct: 311 ISKDKTFALLEFKQTSDATIALALDGITMEDEHMNGSATNGDTQGLSIRRPKDY---IVP 367
Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
+ P G+ S + + +++ + +P Y T+ Q+ ELL SFG L F
Sbjct: 368 TITDETP------FEAGVISNVV--VDTQNKISISNIPLYLTDEQVTELLVSFGELKAFV 419
Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
LVKD T S+G FC Y DP TDIA LNG+++GDK L V+RA+
Sbjct: 420 LVKDNGTDESRGIAFCEYVDPVATDIAVEGLNGMELGDKHLKVQRASIG----------- 468
Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE----- 440
H + + + + M+ L G S G +VL L +T + L D+E+YE
Sbjct: 469 -----HTQVSGLEMSVNAMSMLAGTTSQ-GLEEGRVLQLLNMVTPEELIDNEDYEGTILL 522
Query: 441 ------------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA 488
EI ED++EEC KYG ++++ +PRP ++ GVGK+++++ ++
Sbjct: 523 TTLIQSLLTMYPEICEDVKEECEKYGKVLDMKVPRPTGGSRQSNGVGKIYVKFDNSDSAG 582
Query: 489 TAKNALSGRKFGGNTVNAFYYPE 511
A L+GRKF TV Y+ E
Sbjct: 583 KALRTLAGRKFADRTVVVTYFSE 605
>gi|443924699|gb|ELU43686.1| rRNA primary transcript binding protein [Rhizoctonia solani AG-1
IA]
Length = 678
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 134/382 (35%), Positives = 198/382 (51%), Gaps = 70/382 (18%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
R +RR+YVG + ANE + FF++ M + + G GD V+ V INHEK +AFVE R+
Sbjct: 233 RQSRRLYVGNITYEANENNLQEFFNRKMVEMKIGTGGGGDPVIGVQINHEKSYAFVEFRS 292
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPT-LAAALG---PGQPSPNLNLAAVGLASGA 287
E+A+ AMA DGI+F+ +++RRP DY + L+A +G PG S N+
Sbjct: 293 AEDATAAMAFDGIMFQSGPLKIRRPKDYTGSDLSAPMGVHVPGVVSTNV----------- 341
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC-VYQDP 346
+ +++FVGGLP Y E Q+ ELL+SFG L F+LV R+ GN C VY DP
Sbjct: 342 ---PDSINKIFVGGLPTYLDENQVMELLKSFGELKAFNLV--RENGNGAFRRDCQVYVDP 396
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTE--QESILAQAQQHIAIQKMAL-QTSG 403
+VTDIA LNG+++GD+ L V+RA+ +S + A A I M + +T+
Sbjct: 397 SVTDIAIQGLNGMELGDRFLVVQRASVGAKSGIPGVPPELFAPA---IPRPIMPITETAD 453
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
N + + +L L + + L DD EY EI+ED+REEC K+G + ++ IPR
Sbjct: 454 PN----------PSDSTILLLLNMVAPEDLTDDGEYTEIVEDVREECSKFGPVRSLAIPR 503
Query: 464 P--------DQNGG-------------------------ETPGVGKVFLEYYDAVGCATA 490
P D N G E GVG+V++++ G ++A
Sbjct: 504 PAKKEKSKWDANAGALVTAGGAVLAPGAVGATAGDGKTDEQRGVGRVYIKFETPEGASSA 563
Query: 491 KNALSGRKFGGNTVNAFYYPED 512
L+GR F G ++ A +D
Sbjct: 564 LRGLAGRAFAGRSIIATILADD 585
>gi|297706019|ref|XP_002829852.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Pongo abelii]
Length = 352
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 124/289 (42%), Positives = 177/289 (61%), Gaps = 23/289 (7%)
Query: 188 EQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE 247
++A+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+
Sbjct: 74 KEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQ 132
Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFT 307
G ++++RRP DY P PG S N ++ G+ S + + ++F+GGLP Y
Sbjct: 133 GQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLN 183
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
+ Q+KELL SFG L F+LVKD TG SKGY FC Y D VTD A A LNG+++GDK L
Sbjct: 184 DDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLL 243
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
V+RA+ ++ T + Q + +Q L +S + +GG + +VLCL
Sbjct: 244 VQRASVGAKNAT----LSTINQTPVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNM 291
Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
+ + L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK
Sbjct: 292 VLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGK 339
>gi|295663747|ref|XP_002792426.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226279096|gb|EEH34662.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 567
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 136/465 (29%), Positives = 227/465 (48%), Gaps = 85/465 (18%)
Query: 93 RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGV 136
R+R +S SP R P+ + +R + +D+ PP + A + G P +
Sbjct: 140 RDRKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFP-L 198
Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPV------QVMTQQATRHARRVYVGGLPPLANEQA 190
P A + A + ++L AF PV V+ +R A+R++V +PP A E++
Sbjct: 199 PGAPRQQAVD-----PSRLQAFMNQPVPGISTNTVLRPSNSRQAKRLFVHNIPPSATEES 253
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+ + G N D V+ ++ ++ F +E ++ +A+ A+A DGI E
Sbjct: 254 LVQFFNLQLN--GLNVIKGVDPCVSAQLSKDRSFGLLEFKSSADATVALAFDGITMEDTG 311
Query: 251 ---------------VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
+ +RRP DY +G G+ P+ G+ S + + P+
Sbjct: 312 DMDTSNGEANGSNQGLSLRRPKDY----ILPVG-GEEEPHRE----GVVSNVV--PDTPN 360
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++ V +P + E + LL SFG L F LVKD +TG S+G FC Y DP TDIA
Sbjct: 361 KICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYLDPMSTDIAVEN 420
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
LNG+++G+K L V RA+ +Q +G++ MS++
Sbjct: 421 LNGMELGNKRLKVVRASIG-----------------------TIQATGLDMGINAMSMYA 457
Query: 416 ETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
+T + +VL L +T D L D+++YEEI +D+R+EC KYG +V + +PRP N
Sbjct: 458 KTTSQDIEAGRVLQLLNMVTTDELLDNDDYEEICDDVRDECSKYGQVVELKVPRPTGNNK 517
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ GVGK+++++ ++ + A AL+GRKF TV Y+ E+ +
Sbjct: 518 QSAGVGKIYVKFDNSESASKALKALAGRKFQDRTVVTTYFSEENF 562
>gi|409081516|gb|EKM81875.1| hypothetical protein AGABI1DRAFT_70425 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 375
Score = 197 bits (502), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 127/371 (34%), Positives = 196/371 (52%), Gaps = 39/371 (10%)
Query: 161 MPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
MPVQ +R +RR+Y+G + NEQ +A FF+ M + + G+ V+
Sbjct: 1 MPVQSFGMGIGGNPNLSRQSRRLYIGSITQEVNEQNLADFFNAKMAEMNIGTGITGNPVL 60
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
V N+EK +AFVE R+ E+A+ AMA DGIIF +++RRP DY + PG P
Sbjct: 61 AVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYGGD--TIVSPGVHVP 118
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
G+ S + + ++VFVGGLP Y E Q+ ELL+SFG L F+LV++ TG
Sbjct: 119 -------GVVSTNV--PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFNLVRENGTGT 169
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
SKG+ F Y D AVTD+A +LNG+++GD+ L V+RA+ + T Q I
Sbjct: 170 SKGFAFFEYVDQAVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNLPYDQFPEI 229
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
+ + T + +++L + +T + L +D+EY ++ +D++ EC KYG
Sbjct: 230 PRPIMPAGKDQT---------SSESRILLMLNMVTPEDLHEDDEYGDLYDDVKAECSKYG 280
Query: 455 TLVNVVIPRP-------------DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
L ++ IPRP Q E GVG+V+++Y ++ + A NAL+GR F G
Sbjct: 281 ELEDLRIPRPVKKDKTSSMSAQDAQRIDEAAGVGRVYVKYTNSKSASAALNALAGRSFAG 340
Query: 502 NTVNAFYYPED 512
++ A +D
Sbjct: 341 RSIIATLLSDD 351
>gi|225677913|gb|EEH16197.1| splicing factor U2AF 65 kDa subunit ) [Paracoccidioides
brasiliensis Pb03]
gi|226287345|gb|EEH42858.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides brasiliensis
Pb18]
Length = 567
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 136/465 (29%), Positives = 227/465 (48%), Gaps = 85/465 (18%)
Query: 93 RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGV 136
R+R +S SP R P+ + +R + +D+ PP + A + G P +
Sbjct: 140 RDRKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFP-L 198
Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPV------QVMTQQATRHARRVYVGGLPPLANEQA 190
P A + A + ++L AF PV V+ +R A+R++V +PP A E++
Sbjct: 199 PGAPRQQAVD-----PSRLQAFMNQPVPGTSINTVLRPSNSRQAKRLFVHNIPPSATEES 253
Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
+ FF+ + G N D V+ ++ ++ F +E ++ +A+ A+A DGI E
Sbjct: 254 LVQFFNLQLN--GLNVIKGVDPCVSAQLSKDRSFGLLEFKSSADATVALAFDGITMEDTG 311
Query: 251 ---------------VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
+ +RRP DY +G G+ P+ G+ S + + P+
Sbjct: 312 DMDTSNGEANGSNQGLSLRRPKDY----ILPVG-GEEEPHRE----GVVSNVV--PDTPN 360
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++ V +P + E + LL SFG L F LVKD +TG S+G FC Y DP TDIA
Sbjct: 361 KICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYLDPMSTDIAVEN 420
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
LNG+++G+K L V RA+ +Q +G++ MS++
Sbjct: 421 LNGMELGNKRLRVVRASIG-----------------------TIQATGLDMGINAMSMYA 457
Query: 416 ETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
+T + +VL L +T D L D+++YEEI +D+R+EC KYG +V + +PRP N
Sbjct: 458 KTTSQDIEAGRVLQLLNMVTTDELLDNDDYEEICDDVRDECSKYGQVVELKVPRPTGNNK 517
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ GVGK+++++ ++ + A AL+GRKF TV Y+ E+ +
Sbjct: 518 QSAGVGKIYVKFDNSESASKALKALAGRKFQDRTVVTTYFSEENF 562
>gi|240279650|gb|EER43155.1| splicing factor u2af large subunit [Ajellomyces capsulatus H143]
gi|325092783|gb|EGC46093.1| splicing factor u2af large subunit [Ajellomyces capsulatus H88]
Length = 572
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 137/464 (29%), Positives = 221/464 (47%), Gaps = 78/464 (16%)
Query: 93 RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LP 134
R R +S SP R P+ + +R + +D+ PP + A + G LP
Sbjct: 140 RERKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFPLP 199
Query: 135 GVP---SAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
G P + P Q + T A V+ +R A+R++V LP A E+++
Sbjct: 200 GAPRQQAVDPSRLQAFIHPPTTTSTAPGTSTNTVLKPSNSRQAKRLFVHNLPSSATEESL 259
Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE---- 247
FF+ + G N D V ++++K FA +E R + + A+A DGI E
Sbjct: 260 VQFFNLQLN--GLNVIKGVDPCVTAQLSNDKTFALLEFRNAADTTVALAFDGITMEDNDE 317
Query: 248 -----------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDR 296
+ +RRP DY L +A+ G+P+ G+ S + + P++
Sbjct: 318 MDTTNGDSNGSNQGLSIRRPKDY--ILPSAVE-GEPNQE------GVVSNVV--PDSPNK 366
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+ V +P + E + LL SFG L F LVKD +TG S+G FC Y+DP TDIA L
Sbjct: 367 ICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYRDPMSTDIAVENL 426
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGE 416
NG+++G+K L V RA+ Q +G++ MS++ +
Sbjct: 427 NGMELGNKKLKVVRASIG-----------------------TTQAAGLDMGVNAMSMYAK 463
Query: 417 TLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
T + +VL L +T + L D+++YEEI +D+R+EC KYG +V + +PRP N +
Sbjct: 464 TTSQDIEASRVLQLLNMVTTEELIDNDDYEEICDDVRDECSKYGEVVELKVPRPTGNNKQ 523
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ GVGK+++++ ++ A AL+GRKF TV Y+ E+ +
Sbjct: 524 SAGVGKIYVKFDNSESATKALRALAGRKFQDRTVVTTYFSEENF 567
>gi|388580158|gb|EIM20475.1| hypothetical protein WALSEDRAFT_55089 [Wallemia sebi CBS 633.66]
Length = 391
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/382 (32%), Positives = 193/382 (50%), Gaps = 61/382 (15%)
Query: 157 AFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
+F +P + R +RRV++G L P NE+ + F+ M+ I + PG+ VVNV
Sbjct: 15 SFSNLPPTLEDFNGARASRRVFIGDLKPNHNEENLTKLFNDKMSTIDQVAKIPGEPVVNV 74
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
+ H++ +A++E R +EA+ A+ DG IF+G ++++RP + L G
Sbjct: 75 TVKHDRGYAYIEFRNTDEAAYALQFDGTIFQGEGIQIKRPQEVLDELQRKQG-------- 126
Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
SG + ++ ++FVG LP + + Q+ ELL SFG L F+LVK+ + SK
Sbjct: 127 -----HTVSGTVPDSD--QKIFVGSLPTFLNDEQVMELLGSFGELRSFNLVKEGTSDVSK 179
Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
G+ FC Y DPA+TDIA LNG+++GD+ L V+R++ K
Sbjct: 180 GFAFCEYMDPALTDIAIQGLNGMEVGDRKLVVQRSSTGPMGK------------------ 221
Query: 397 MALQTSGMNTLGGGMSLFGETLA---KVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+ G +++ + L ET A VL L +TA+ L DD +Y+EI ED++EEC +Y
Sbjct: 222 --IGVGGTSSIAQILPLASETQAYRTNVLLLLNMVTAEELKDDLDYQEICEDIQEECSQY 279
Query: 454 GTLVNVVIPRPD----------------------QNGGETPGVGKVFLEYYDAVGCATAK 491
G ++ + IPRP + E GVGK+F+ Y + A
Sbjct: 280 GEIIKIKIPRPPRPDDPVFSTPGVTLSSGEDLRFEAASEELGVGKIFILYKTEEQASKAL 339
Query: 492 NALSGRKFGGNTV-NAFYYPED 512
AL+GR FGG TV A+ PED
Sbjct: 340 KALAGRVFGGRTVIGAYGKPED 361
>gi|154280004|ref|XP_001540815.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150412758|gb|EDN08145.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 571
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 139/466 (29%), Positives = 222/466 (47%), Gaps = 83/466 (17%)
Query: 93 RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LP 134
R R +S SP R P+ + +R + +D+ PP + A + G LP
Sbjct: 140 RERKRSASPPPKKREPTPDLTDVVPVLERKRRLTQWDIKPPGYEHVTAEQAKLSGMFPLP 199
Query: 135 GVP---SAVPEMAQNML--PFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQ 189
G P + P Q + P +T G V+ +R A+R++V LP A E+
Sbjct: 200 GAPRQQAVDPSRLQAFIHPPTTSTAPGT---STNTVLKPSNSRQAKRLFVHNLPSSATEE 256
Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE-- 247
++ FF+ + G N D V ++++K FA VE R + + A+A DGI E
Sbjct: 257 SLVQFFNLQLN--GLNVIKGVDPCVTAQLSNDKTFALVEFRNAADTTVALAFDGITMEDN 314
Query: 248 -------------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
+ +RRP DY L +A+ G+P G+ S + + P
Sbjct: 315 DEMDTTNGNSNGSNQGLSIRRPKDY--ILPSAVE-GEPHQE------GVVSNVV--PDSP 363
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
+++ V +P + E + LL SFG L F LVKD +TG S+G FC Y+DP TDIA
Sbjct: 364 NKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYRDPMSTDIAVE 423
Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
LNG+++G+K L V RA+ Q +G++ MS++
Sbjct: 424 NLNGMELGNKKLKVVRASIG-----------------------TTQAAGLDMGVNAMSMY 460
Query: 415 GETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
+T + +VL L +T + L D+++YEEI +D+R+EC KYG +V + +PRP N
Sbjct: 461 AKTTSQDIEASRVLQLLNMVTTEELIDNDDYEEICDDVRDECSKYGEVVELKVPRPTGNN 520
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ GVGK+++++ ++ A AL+GRKF TV Y+ E+ +
Sbjct: 521 KQSAGVGKIYVKFDNSESATKALRALAGRKFQDRTVVTTYFSEENF 566
>gi|367020820|ref|XP_003659695.1| hypothetical protein MYCTH_2297048 [Myceliophthora thermophila ATCC
42464]
gi|347006962|gb|AEO54450.1| hypothetical protein MYCTH_2297048 [Myceliophthora thermophila ATCC
42464]
Length = 567
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 132/430 (30%), Positives = 213/430 (49%), Gaps = 59/430 (13%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P T+L AF P
Sbjct: 170 RKRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMNHP 221
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+ A +R ++R+ V LPP A E+++ FF+ + G N D + +
Sbjct: 222 GGAVNSAALKPTNSRQSKRLIVSNLPPSATEESLVNFFNLQLN--GLNVIETADPCLQAH 279
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-------------VAVRVRRPTDYNPTLA 264
I ++ FA +E R +A+ A+ALDGI E + +RRP DY +
Sbjct: 280 IAPDRSFAMLEFRHNTDATVALALDGITMEAEDADAANGNGAATQGLHLRRPKDY--IVP 337
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
A + PN + + +S + + P+++ V LP Y T+ Q+ ELL SFG L F
Sbjct: 338 AVVE----DPNYDPDSDTPSSVVL---DSPNKISVTNLPLYLTDDQVMELLVSFGKLKSF 390
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
LVKD T S+G F Y DP+ T++A LN + +G++ L V++A+
Sbjct: 391 VLVKDNGTQESRGIAFLEYADPSATNVAVQGLNNMMLGERALKVQKASIG---------- 440
Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
+ Q + + M++ M+ GG ++VL L +T + L D+++YEEI E
Sbjct: 441 ITQVSGEMGVNAMSMLAGTMSADAGG--------SRVLQLLNMVTPEELMDNDDYEEIRE 492
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D++EEC K+G ++++ IPRP ++ GVGK++++Y A A AL+GRKF TV
Sbjct: 493 DVQEECQKFGKILSLKIPRPVGGSRQSAGVGKIYIKYETAESATKALRALAGRKFADRTV 552
Query: 505 NAFYYPEDKY 514
Y+PE+ +
Sbjct: 553 VTTYFPEENF 562
>gi|195585954|ref|XP_002082743.1| GD25073 [Drosophila simulans]
gi|194194752|gb|EDX08328.1| GD25073 [Drosophila simulans]
Length = 445
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 125/367 (34%), Positives = 192/367 (52%), Gaps = 45/367 (12%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFAF 226
TR ARR+YVG +P E+ + FF+ +TA+G + G AV+ N EK FAF
Sbjct: 103 TRQARRLYVGNIPFGVTEEEMMQFFNHRITALGYEAKSSHYMDGKAVLTCQTNLEKNFAF 162
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-------------S 273
+E R+++EA+ A+ DG++F G +++RRP DY P + + +
Sbjct: 163 LEFRSIDEATQALNFDGMVFRGQTLKIRRPHDYQPVPSISFSAMENYRSFRVPDTTIANP 222
Query: 274 PNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
PN+ + + + P++++VGGLP + Q+KELL+SFG L G +LV D +T
Sbjct: 223 PNVTIPVTTIV------PDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDGNTS 276
Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
+KG+ F Y DP VTD A A L+G+ +GD+ L V+R+ G++ +
Sbjct: 277 LNKGFAFFEYYDPLVTDHAIAGLHGMLLGDRRLVVQRSIPGGKNAFPGHT---------- 326
Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+Q G++TL L + + LCL + + L DDEE+E+I D+++EC K+
Sbjct: 327 --APVVQVPGISTL-----LDPGSPTETLCLLNMVRPEELLDDEEFEDIRTDIKQECAKF 379
Query: 454 GTLVNVVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
G + ++ IPRP G P G GKVF+++ A ALSGRKF V YY
Sbjct: 380 GEVRSIKIPRPI---GPFPKRGCGKVFVQFESVEDSQKALKALSGRKFSDRIVMTSYYDP 436
Query: 512 DKYFNKD 518
+KY D
Sbjct: 437 EKYLADD 443
>gi|70946422|ref|XP_742927.1| U2 snRNP auxiliary factor [Plasmodium chabaudi chabaudi]
gi|56522174|emb|CAH84932.1| U2 snRNP auxiliary factor, putative [Plasmodium chabaudi chabaudi]
Length = 561
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 141/490 (28%), Positives = 229/490 (46%), Gaps = 53/490 (10%)
Query: 64 DRTDRHRDYNRDKERRHRHRSR---SHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
D +D++ + +ER+ R + S S + + +K + P R SK D + A
Sbjct: 88 DNSDKNYSDDSTRERKKNARDKNDISMSEEDSKKENKEIKPKRKKSK---WDTVDESLLA 144
Query: 121 AAMLPGAAVPGQLPGVPS-AVPEMAQNMLPFGAT-QLGAFPLMPVQVMTQQATRHARRVY 178
ML + L GV + N+LP QLG P + + R++Y
Sbjct: 145 NNMLIDS---NNLSGVLQYQRLSLNGNLLPGNKMPQLGRNP------YELEGDKKQRKLY 195
Query: 179 VGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDA----VVNVYI-NHEKKFAFVEMR 230
+G LPP + ++ I FF+ +++I S+ GD VV I N + +F F+E R
Sbjct: 196 IGNLPPNSKQEEIVEFFNNTLSSIIKGSSLEVKIGDVQLLPVVKCEIFNPDSRFCFLEFR 255
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
T++ ++ LD + + +R+ RP DY P P + P L + + G +
Sbjct: 256 TMDITWLSLKLDSMSYNNYCLRINRPHDYMP-------PPEGDPALTVVFPDIDMGLLES 308
Query: 291 AEGP------------DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
+ P +++++ LP+ + QI +LL FG L GF+++KD +TG +KGY
Sbjct: 309 FKPPKIAPVRSTGDDDNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNIIKDLNTGLNKGY 368
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI------ 392
GF Y+D + T +A ALNG G L V++AT + ++
Sbjct: 369 GFFEYEDSSCTQVAIHALNGFVCGKNILNVKKATFNKNPNNIPNPNNIALANNVDVPVSL 428
Query: 393 ---AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
+I + L S + GE ++V+ LT A+ + L + +YEEIL+D++EE
Sbjct: 429 LPNSISQKILSNSIIGLQIQASRKIGEKSSRVIQLTNAVFQEDLIINSQYEEILKDVKEE 488
Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
KYGTL ++VIP+P+ + T GVGK+FL Y D A+ +GR F V A +Y
Sbjct: 489 AEKYGTLQSIVIPKPNIDLSYTEGVGKIFLHYADETAARKAQYMFNGRLFEKRVVCASFY 548
Query: 510 PEDKYFNKDY 519
EDK+ Y
Sbjct: 549 SEDKFLEGKY 558
>gi|261196608|ref|XP_002624707.1| splicing factor u2af large subunit [Ajellomyces dermatitidis
SLH14081]
gi|239595952|gb|EEQ78533.1| splicing factor u2af large subunit [Ajellomyces dermatitidis
SLH14081]
gi|239609528|gb|EEQ86515.1| splicing factor u2af large subunit [Ajellomyces dermatitidis ER-3]
gi|327350238|gb|EGE79095.1| splicing factor u2af large subunit [Ajellomyces dermatitidis ATCC
18188]
Length = 570
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 136/469 (28%), Positives = 217/469 (46%), Gaps = 89/469 (18%)
Query: 93 RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGV 136
R R +S SP R P+ + +R + +D+ PP + A + G P +
Sbjct: 139 RERKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFP-L 197
Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQ----------VMTQQATRHARRVYVGGLPPLA 186
P A + A ++L AF P V+ +R A+R++V LPP A
Sbjct: 198 PGAPRQQA-----VDPSRLQAFIHPPTTITAPGSSTNTVLKPSNSRQAKRLFVHNLPPSA 252
Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
E+ + FF+ + G N D ++ ++ +K FA +E R + + A+A DGI
Sbjct: 253 TEERLVQFFNLQLN--GLNVIKGVDPCLSAQLSRDKTFALLEFRNAADTTVALAFDGITM 310
Query: 247 E---------------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
E + +RRP DY A P Q G+ S +
Sbjct: 311 EDSGEMDTSNGDVDGPSQGLSIRRPKDYILPSAVEEEPQQE---------GVVSNVV--P 359
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ P+++ V +P + E + LL SFG L F LVKD +TG S+G FC Y DP T+I
Sbjct: 360 DSPNKICVSNIPPFIEEEPVTMLLVSFGELKSFILVKDSETGESRGIAFCEYLDPTSTEI 419
Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGM 411
A LNG+++G+K L V RA+ Q +G++ M
Sbjct: 420 AVENLNGMELGNKRLKVVRASVG-----------------------TTQAAGLDMGVNAM 456
Query: 412 SLFGETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
S++ +T + +VL L +T + L D+++YEEI +D+R+EC KYG +V + +PRP
Sbjct: 457 SMYAKTTSQDIEASRVLQLLNMVTTEELMDNDDYEEICDDVRDECSKYGEVVELKVPRPT 516
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
N ++ GVGK+++E+ ++ A AL+GRKF TV Y+ E+ +
Sbjct: 517 GNNKQSAGVGKIYVEFDNSESATKALKALAGRKFQDRTVVTTYFSEENF 565
>gi|241166827|ref|XP_002409934.1| splicing factor u2af large subunit, putative [Ixodes scapularis]
gi|215494685|gb|EEC04326.1| splicing factor u2af large subunit, putative [Ixodes scapularis]
Length = 444
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 131/361 (36%), Positives = 187/361 (51%), Gaps = 40/361 (11%)
Query: 171 TRHARRVYVGGLP--------PLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
TR ARR+YVG +P PL + + +F+ M A G + A PG+ V+ IN +K
Sbjct: 112 TRQARRLYVGNIPFGCSEASRPLLLREEMMDYFNAQMHACGFSQA-PGNPVLACQINLDK 170
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY--NPTLAAALGPGQPSPNLNLAA 280
FAF+E+ ++ + DY P LG G +P
Sbjct: 171 NFAFLEVSALDTDLGTPCCPTFVL----------YDYLSPPFSGNCLGAGNGTPGDTW-- 218
Query: 281 VGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
+G SG I + P ++F+GGLP Y E Q++ELL SFG L F+LVKD TG SKGY
Sbjct: 219 LGFLSGVISTVVQDSPHKIFIGGLPNYLNEDQVRELLMSFGQLRAFNLVKDSATGLSKGY 278
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
FC Y + TD A LNG+++GDK L V+RA+ +K Q ++ A I + +
Sbjct: 279 AFCEYVEVTTTDQAIMGLNGMQLGDKKLIVQRASVG--AKNSQMNVSRDAPVQIQVPGLQ 336
Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
LQ G G +VLCL + + L D+EEYE+ILED+ EEC KYG + +
Sbjct: 337 LQG------GAGPP------TEVLCLMNLVCPEELKDEEEYEDILEDIHEECNKYGVVKS 384
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
+ IPRP + G + PG GK F+E+ + C A+ +L+GRKF V Y+ DKY ++
Sbjct: 385 IEIPRPIE-GVDVPGCGKAFVEFNSVIDCQKAQQSLTGRKFSNRVVVTSYFDPDKYHRRE 443
Query: 519 Y 519
+
Sbjct: 444 F 444
>gi|225562835|gb|EEH11114.1| splicing factor u2af large subunit [Ajellomyces capsulatus G186AR]
Length = 571
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 138/466 (29%), Positives = 222/466 (47%), Gaps = 83/466 (17%)
Query: 93 RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LP 134
R R +S SP R P+ + +R + +D+ PP + A + G LP
Sbjct: 140 RERKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFPLP 199
Query: 135 GVP---SAVPEMAQNML--PFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQ 189
G P + P Q + P +T G V+ +R A+R++V LP A E+
Sbjct: 200 GAPRQQAVDPSRLQAFIHPPTTSTAPGT---STNTVLKPSNSRQAKRLFVHNLPSSATEE 256
Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE-- 247
++ FF+ + G N D V ++++K FA +E R + + A+A DGI E
Sbjct: 257 SLVQFFNLQLN--GLNVIKGVDPCVTAQLSNDKTFALLEFRNAADTTVALAFDGITMEDN 314
Query: 248 -------------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
+ +RRP DY L +A+ G+P G+ S + + P
Sbjct: 315 DDMDTTNGDSNGSNQGLSIRRPKDY--ILPSAVE-GEPHQE------GVVSNVV--PDSP 363
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
+++ V +P + E + LL SFG L F LVKD +TG S+G FC Y+DP TDIA
Sbjct: 364 NKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYRDPMSTDIAVE 423
Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
LNG+++G+K L V RA+ Q +G++ MS++
Sbjct: 424 NLNGMELGNKKLKVVRASIG-----------------------TTQAAGLDMGVNAMSMY 460
Query: 415 GETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
+T + +VL L +T + L D+++YEEI +D+R+EC KYG +V + +PRP N
Sbjct: 461 AKTTSQDIEASRVLQLLNMVTTEELIDNDDYEEICDDVRDECSKYGEVVELKVPRPTGNN 520
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ GVGK+++++ ++ A AL+GRKF TV Y+ E+ +
Sbjct: 521 KQSAGVGKIYVKFDNSESATKALRALAGRKFQDRTVVTTYFSEENF 566
>gi|119592808|gb|EAW72402.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_a [Homo
sapiens]
Length = 376
Score = 194 bits (494), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 173/311 (55%), Gaps = 21/311 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T +++ QH
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LVSPPAQHHQSDACDP 359
Query: 400 QTSGMNTLGGG 410
++G++ L G
Sbjct: 360 ASAGLDELPGA 370
>gi|345569109|gb|EGX51978.1| hypothetical protein AOL_s00043g712 [Arthrobotrys oligospora ATCC
24927]
Length = 569
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 132/450 (29%), Positives = 216/450 (48%), Gaps = 54/450 (12%)
Query: 81 RHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA--AAMLPGAAVPGQLPGVP 137
R RS S + + + L+ S+ KRR + +D+ PP + A + G P +P
Sbjct: 153 RQPRRSPSPPKVKEPTPDLTDIIPVSERKRRLTMWDIKPPGYESVTAEQAKLSGMFP-LP 211
Query: 138 SAVPEMAQNMLPFGATQLGAFPLMPVQ--------VMTQQATRHARRVYVGGLPPLANEQ 189
A + A +++ AF P + + +R A+R+ LPP+ E+
Sbjct: 212 GAPRQTA-----LDPSRMAAFVNHPPKEGSTPQPTALKPSNSRQAKRLLCQNLPPMCTEE 266
Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
I +FFS + ++ + + ++ VY+N A +E R+ A+ +A DG+ F+
Sbjct: 267 TIYSFFSSFLKSLNAVDSE-NEPLITVYLNPTGTMAMLEFRSTAYATLCLAFDGMEFDDT 325
Query: 250 AVRVR--RPTDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
V++R RP DY + ++ G SPN+ + +++ V +P
Sbjct: 326 EVKIRLSRPKDYIIPQYSESSESHNGDISPNV--------------PDSINKICVSNIPT 371
Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
+ + Q+ ELL++FG L F LVKD++ SKG FC Y DP + +IA LNGL + ++
Sbjct: 372 HLADQQVMELLQTFGPLKSFFLVKDKEMDESKGVAFCEYLDPNIAEIAIEGLNGLDINEQ 431
Query: 365 TLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCL 424
L V+RA+ + A A+ I + T+ GG +VL L
Sbjct: 432 LLNVKRASIGVKQS-------AGAEAGIPAMTVIAATTSAEMEGG----------RVLQL 474
Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
+TAD L D EEYEEILED+ +EC K+G ++++ IPRP N GVGK+++ + +
Sbjct: 475 LNMVTADELLDQEEYEEILEDVTDECNKFGPIIDIKIPRPSGNQRAAAGVGKIYVRFEEH 534
Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A +L+GRKF TV Y+ E+ Y
Sbjct: 535 ESAEKALKSLAGRKFADRTVIVSYFSEENY 564
>gi|148699340|gb|EDL31287.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2,
isoform CRA_b [Mus musculus]
Length = 356
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 117/285 (41%), Positives = 162/285 (56%), Gaps = 27/285 (9%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAML-----PGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
RSP K K R +D+ PP + GQ+P + +P M + L T
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 135
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+
Sbjct: 136 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 187
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S
Sbjct: 188 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 240
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
N ++ G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG
Sbjct: 241 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 298
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT 379
SKGY FC Y D VTD A A LNG+++GDK L V+RA+ ++ T
Sbjct: 299 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT 343
>gi|119592809|gb|EAW72403.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_b [Homo
sapiens]
Length = 356
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/280 (41%), Positives = 161/280 (57%), Gaps = 17/280 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT 379
FC Y D VTD A A LNG+++GDK L V+RA+ ++ T
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT 343
>gi|221059061|ref|XP_002260176.1| U2 snRNP auxiliary factor [Plasmodium knowlesi strain H]
gi|193810249|emb|CAQ41443.1| U2 snRNP auxiliary factor, putative [Plasmodium knowlesi strain H]
Length = 865
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 116/375 (30%), Positives = 193/375 (51%), Gaps = 25/375 (6%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDAVVNVYI-----NH 220
+ + R++Y+G +PP + ++ + FF+ + +I +S+ GD V+ + N
Sbjct: 489 EGDKKQRKLYIGNIPPNSKQEELIDFFNNTLGSIIKDSSLEIKIGDIVLMPILKCEIFNV 548
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY------NPTLAAALGPGQPSP 274
E +F F+E R++E + LD I F A+R+ RP D+ +P L Q
Sbjct: 549 ESRFCFLEFRSLEITWLCLRLDAITFNNYALRIARPHDFVPPPGGDPALTVVFTDIQHEV 608
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
+ + +A G + +++++ LP+ + QI++LL+ FG L GF+++KD+ TG
Sbjct: 609 FEMVKPIKIAPVRSTG-DDDNKLYIQNLPHDLGDVQIRDLLQQFGKLKGFNVIKDQSTGL 667
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHI- 392
+KGYGF Y+D T IA ALNG G L+V++AT Q+ T+ + ++ +
Sbjct: 668 NKGYGFFEYEDSNCTPIAMHALNGFVCGQNILSVKKATFGKSQNSTQNANTISLGSGSVD 727
Query: 393 --------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
+I + L S + GE ++V+ LT A+ + L D +YEEIL
Sbjct: 728 LPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVVQLTNAVFQEDLIIDSQYEEILR 787
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D++EE KYG L N+VIP+P+++ T GVGK+FL Y D A+ +GR F V
Sbjct: 788 DIKEEAEKYGPLQNIVIPKPNKDLSYTEGVGKIFLHYADETTARKAQYMFNGRLFEKRVV 847
Query: 505 NAFYYPEDKYFNKDY 519
A +Y E+K+ Y
Sbjct: 848 CAAFYSEEKFLAGKY 862
>gi|361132025|gb|EHL03640.1| putative Splicing factor U2AF 50 kDa subunit [Glarea lozoyensis
74030]
Length = 568
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 133/407 (32%), Positives = 202/407 (49%), Gaps = 59/407 (14%)
Query: 124 LPGAAVPGQLPGVPSAVPE-MAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGL 182
LPGA P Q P PS + MAQ P G+ A + +R A+R+ V L
Sbjct: 200 LPGA--PRQQPMDPSKLQAIMAQ---PSGSVTNAA--------LKPSNSRQAKRLLVHNL 246
Query: 183 PPLA-NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMAL 241
P + +E+AI FF+ M G N D ++ I+ +K FA +E +T +A+ A+A
Sbjct: 247 PSTSISEEAIINFFNLQMN--GLNIVEGSDPCISAQISKDKSFALLEFKTPSDATLALAF 304
Query: 242 DGIIFEGV-------------AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
DGI + + +RRP DY + P +P G+ S +
Sbjct: 305 DGITMDDSEYVNREANGGDTKGLSIRRPKDYIVPAVSDETPQEP---------GVVSSVV 355
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+ +++ + +P Y T+ Q+ ELL SFG L F LVKD TG S+G FC Y DPA
Sbjct: 356 --VDTQNKICMSNVPLYLTDEQVIELLTSFGELKAFVLVKDNSTGESRGIAFCEYADPAA 413
Query: 349 -TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
TDIA LNG+++GDK L V+RA+ + + + + + M+ L
Sbjct: 414 ATDIAVEGLNGMELGDKHLRVQRASIG----------------NTQVSGLEMGVNAMSML 457
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
G S G +VL L +T + L D+++YEEI ED++EEC KYG ++++ +PRP
Sbjct: 458 AGTTSA-GLEDGRVLQLLNMVTPEELVDNDDYEEICEDVKEECEKYGKVLDMKVPRPTGG 516
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ GVGK+F+++ A AL+GRKF TV Y+ E+ +
Sbjct: 517 SRQSNGVGKIFVKFDTPESAGKALRALAGRKFADRTVVTTYFSEENF 563
>gi|307106531|gb|EFN54776.1| hypothetical protein CHLNCDRAFT_58050 [Chlorella variabilis]
Length = 354
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/228 (43%), Positives = 139/228 (60%), Gaps = 28/228 (12%)
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+ GA+ +R+F+GGLPYY TE Q +ELL SFG + FDLVKD++TG SKGYGFCVY+DP
Sbjct: 148 MSGADAAERIFIGGLPYYLTEEQCRELLASFGAIKSFDLVKDKETGQSKGYGFCVYEDPR 207
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
VTD+AC LNG++MGD+TLTVRRAT + + EQ+
Sbjct: 208 VTDVACQGLNGMRMGDRTLTVRRATEGQRQQAEQKQ------------------------ 243
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV-VIPRPDQ 466
++ A+V+ L+ A+T + L DD+EY +I+EDM+EECGKYGT+V V +
Sbjct: 244 ---QDMYVGNTARVVKLSHAVTLEELGDDQEYGDIMEDMKEECGKYGTVVQVHIPRPAPP 300
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ PG+GK+ +E+ + A+NA+ GRKFGG V A + Y
Sbjct: 301 SAPPPPGLGKIIIEFAETPAAMAARNAMHGRKFGGRVVEAVMMGDSDY 348
>gi|258576333|ref|XP_002542348.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902614|gb|EEP77015.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 621
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 134/431 (31%), Positives = 209/431 (48%), Gaps = 70/431 (16%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q + P ++L AF P
Sbjct: 155 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQ-----QTVDP---SRLQAFMNQP 206
Query: 163 V-----QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
++ +R A+R++V LPP +E +A FF+ + G N D ++
Sbjct: 207 AGNANSTLLKPSNSRQAKRLFVHNLPPSVSEDTLAQFFNLQLN--GLNVISGVDPCISAQ 264
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAA 266
++ + KFA +E +T +A+ A+ALDGI E G + ++RP DY A
Sbjct: 265 VSSDGKFALLEFKTASDATVALALDGISLEHDDANGTSSAPGQGLSLKRPKDYIVPSEAD 324
Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
Q G+ S + + P+++ V +P + E Q+ LL SFG L F L
Sbjct: 325 DSNRQD---------GVVSNEV--PDSPNKICVTNIPPFIQEEQVTMLLVSFGELKSFVL 373
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
VKD T S+G FC Y DP+ T+IA LNG+++GDK L V RA+
Sbjct: 374 VKDSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRASIG------------ 421
Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYE 440
A Q +G++ MS+F +T + +VL L +TA+ L D +EYE
Sbjct: 422 -----------ATQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTAEELMDSDEYE 470
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
EI +D+REEC KYG ++++ IPRP + GVGK+++++ + A AL+GRKF
Sbjct: 471 EICDDVREECSKYGQVLDLKIPRPTGGSRQAAGVGKIYVKFDSYDSASKAMKALAGRKFQ 530
Query: 501 GNTVNAFYYPE 511
TV ++ E
Sbjct: 531 DRTVVTTFFSE 541
>gi|259480265|tpe|CBF71237.1| TPA: splicing factor u2af large subunit (AFU_orthologue;
AFUA_7G05310) [Aspergillus nidulans FGSC A4]
Length = 547
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 116/360 (32%), Positives = 179/360 (49%), Gaps = 53/360 (14%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R A+R++V LPP A + + +FF+ + G N D ++ I+ + FA +E +
Sbjct: 220 SRQAKRLFVYNLPPNATVENLVSFFNLQLN--GLNVIQSVDPCISAQISDDHSFALLEFK 277
Query: 231 TVEEASNAMALDGIIF--------EGVA--VRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+ + + A+ALDGI G A + VRRP DY PNL
Sbjct: 278 SPNDTTVALALDGITMGEHESNGENGAAKGLEVRRPKDYI------------VPNLAEQD 325
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
+ ASG + P+++ V +P Y E + LL+SFG L F LVKD T S+G F
Sbjct: 326 LEGASGMKDVPDSPNKICVSNIPQYIPEEPVTMLLKSFGELKSFVLVKDSSTEESRGIAF 385
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
C Y DP T IA LNG+++GD+ L V RA+ Q
Sbjct: 386 CEYADPNTTTIAVQGLNGMELGDRHLKVVRASIG-----------------------MTQ 422
Query: 401 TSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
+G++ MS+F +T + +VL L +T + L D+E+YEEI +D+R+EC K+G
Sbjct: 423 AAGLDMGVNAMSMFAKTTSQDLESSRVLQLLNMVTPEELMDNEDYEEICDDVRDECSKFG 482
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ + IPRP ++PGVGK+F+++ A +L+GRKF TV Y+ E+ +
Sbjct: 483 RVLELKIPRPTGGSRQSPGVGKIFVKFETIEATTAALKSLAGRKFSDRTVVTTYFSEENF 542
>gi|124810295|ref|XP_001348830.1| U2 snRNP auxiliary factor, putative [Plasmodium falciparum 3D7]
gi|23497731|gb|AAN37269.1|AE014827_12 U2 snRNP auxiliary factor, putative [Plasmodium falciparum 3D7]
Length = 833
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 121/394 (30%), Positives = 193/394 (48%), Gaps = 54/394 (13%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDA----VVNVYI-N 219
Q + R++Y+G +PP + ++ + FF+ + A+ +S+ GD V+ I N
Sbjct: 449 QDTDKKQRKLYIGNIPPNSKQEDVVDFFNNSILAVIKDSSLDVKIGDVQLMPVIKCEIFN 508
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-----------------T 262
+ +F F+E RTV+ + LD I + +R+ RP DY P
Sbjct: 509 SDSRFCFLEFRTVQITWLCLKLDSIPYNNYCLRIGRPHDYIPPPEGDPAFTTVFTDINMD 568
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
+ L P +P +N+ ++ +R+++ LP+ + QIK+LLE FG L
Sbjct: 569 VFEKLRPSKP---VNVKT---------SSDEENRLYIQNLPHDLKDEQIKDLLEQFGDLK 616
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ--SKTE 380
F+++KD +TG +KGYGF Y+D + T +A ALNG G L V++AT + Q + T
Sbjct: 617 AFNIIKDLNTGLNKGYGFFEYEDSSCTQLAIHALNGFVCGQNILNVKKATFNKQPTTITT 676
Query: 381 QESILAQAQQHIA---------------IQKMALQTSGMNTLGGGMSLFGETLAKVLCLT 425
++ Q IA I + L S + GE +KV+ LT
Sbjct: 677 NNNMNNQNPNFIALPNNSDVPVTLLPSSISQKILSNSIIGLQVQASRKIGEKSSKVVQLT 736
Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
A+ + L D +YEEIL++++EE KYGTL N+VIP+P+++ T GVGK+FL Y D
Sbjct: 737 NAVFQEDLIVDSQYEEILKEVKEEAEKYGTLQNIVIPKPNKDLSYTEGVGKIFLHYADEA 796
Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A+ +GR F V A +Y E+ + Y
Sbjct: 797 TARKAQYMFNGRLFEKRVVCAAFYSEEHFLKGKY 830
>gi|242803779|ref|XP_002484243.1| splicing factor u2af large subunit [Talaromyces stipitatus ATCC
10500]
gi|218717588|gb|EED17009.1| splicing factor u2af large subunit [Talaromyces stipitatus ATCC
10500]
Length = 543
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 135/438 (30%), Positives = 209/438 (47%), Gaps = 76/438 (17%)
Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
+ +R + +D+ PP + M P P Q PS + + N G T+
Sbjct: 147 RKRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQQAVDPSRLQALV-NQPAAGTTEN 205
Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
A L P +R A+R++ LPP E A+ +FF+ + G N D V+
Sbjct: 206 SA--LRPAN------SRQAKRLFAHNLPPNVTEAALVSFFNLQLN--GLNVIEGIDPCVS 255
Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDY-NPTL 263
I+ + FA +E + E + A+ALDGI E + +RRP DY P++
Sbjct: 256 AQISKDHSFALLEFKGANETTVALALDGITMEEHESAATANGGARGLELRRPKDYIVPSV 315
Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHG 323
P Q S N + P+++ + +P Y E + LL+S G L
Sbjct: 316 PEDQQPHQESVISNHVP-----------DSPNKLCITNIPLYIPEEPVTMLLKSIGELKA 364
Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
F LVKD T S+G FC Y D A T IA +LNG+++GDK L + A+
Sbjct: 365 FVLVKDSGTDESRGIAFCEYVDAASTAIAVESLNGMELGDKHLKITHASIG--------- 415
Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDE 437
A Q +G++ MS+F +T + ++L L +TAD L +++
Sbjct: 416 --------------ATQAAGLDMGVNAMSMFAKTTSADLETSRILQLLNMVTADELINND 461
Query: 438 EYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALSG 496
+YEEILED+++EC KYG +++V IPRP ++ GVGK++++ +D+V AT A AL+G
Sbjct: 462 DYEEILEDVQDECSKYGQVLDVKIPRPAGGSRQSAGVGKIYVK-FDSVESATNALKALAG 520
Query: 497 RKFGGNTVNAFYYPEDKY 514
RKF TV Y+PE+ +
Sbjct: 521 RKFSDRTVVTTYFPEESF 538
>gi|367042858|ref|XP_003651809.1| hypothetical protein THITE_2112508 [Thielavia terrestris NRRL 8126]
gi|346999071|gb|AEO65473.1| hypothetical protein THITE_2112508 [Thielavia terrestris NRRL 8126]
Length = 563
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 135/432 (31%), Positives = 216/432 (50%), Gaps = 63/432 (14%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P T+L AF P
Sbjct: 166 RKRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMNQP 217
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+ A +R ++R+ V LP A E+++ F + + G N D +
Sbjct: 218 GGSVNSAALRPTNSRQSKRLVVENLPASATEESMVNFINLQLN--GLNVIENTDPCLQCL 275
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-------------VAVRVRRPTDYNPTLA 264
I ++ FA +E R +A+ A+A DGI E +R+RRP DY +
Sbjct: 276 IAPDRSFAMLEFRNSPDATVALAFDGISMEADDAHAANGNGAAPAGLRIRRPKDY--IVP 333
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
A + PN + + +S I + P+++ V LP Y T+ Q+ ELL SFG L F
Sbjct: 334 AVVE----DPNYDPDSDVPSSVVI---DSPNKISVTNLPLYLTDDQVMELLVSFGKLKSF 386
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
LVKD T S+G F Y DP+VT++A LN + +G++ L V++A+
Sbjct: 387 VLVKDNGTQESRGIAFLEYVDPSVTNVAIQGLNNMMLGERALKVQKAS------------ 434
Query: 385 LAQAQQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
I I ++A + + M+ L G S + +++VL L +TAD L D+++YEEI
Sbjct: 435 -------IGITQVAGEMGVNAMSMLAGTTSTDSD-VSRVLQLLNMVTADELMDNDDYEEI 486
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+++EC K+GT++++ IPRP G ++ GVGK+F+++ A L+GRKF
Sbjct: 487 RDDVQDECEKFGTVLSLKIPRPTGGGRQSAGVGKIFIKFETPEVATKALRGLAGRKFADR 546
Query: 503 TVNAFYYPEDKY 514
TV A Y+PE+ +
Sbjct: 547 TVVATYFPEENF 558
>gi|341038664|gb|EGS23656.1| hypothetical protein CTHT_0003520 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 584
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 185/358 (51%), Gaps = 44/358 (12%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R A+R+ V LPP A E+++ FF+ + G N D + +I ++ FA +E R
Sbjct: 252 SRQAKRLVVRNLPPSATEESLVNFFNLQLN--GLNVIETTDPCLQAHIAPDRSFAMLEFR 309
Query: 231 TVEEASNAMALDGIIF----------EGV--AVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
EA+ A+A DGI E V +++ RP DY + A + PN +
Sbjct: 310 NSSEATVALAFDGISMDADDAGANGAEAVHGGLQITRPKDY--IVPAVVE----DPNYDP 363
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
+ +S I + P+++ V +P Y E Q+ ELL SFG L F LVKD T S+G
Sbjct: 364 DSDVPSSVVI---DSPNKISVANIPPYLNEDQVMELLVSFGKLKSFVLVKDNGTQESRGI 420
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
F Y DP+V+++A LN + +G++ L V++A+ I I ++A
Sbjct: 421 AFLEYVDPSVSNVAIQGLNDMPLGEQKLKVKKAS-------------------IGITQVA 461
Query: 399 --LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
+ + M+ L G S E ++VL L +T + L D+++YEEI ED+ EEC K+G +
Sbjct: 462 GEMSVNAMSMLAGTTSTHAEASSRVLQLLNMVTPEELMDNDDYEEIREDVLEECKKFGNV 521
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+++ IPRP ++ GVGK+++++ A AL+GRKF TV Y+PE+ Y
Sbjct: 522 LSLKIPRPIGGNRQSAGVGKIYVKFEQVESATKALRALAGRKFSDRTVVTTYFPEENY 579
>gi|453082700|gb|EMF10747.1| hypothetical protein SEPMUDRAFT_48483 [Mycosphaerella populorum
SO2202]
Length = 432
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 144/471 (30%), Positives = 231/471 (49%), Gaps = 72/471 (15%)
Query: 74 RDKERR-HRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA----------- 120
R ER+ R S S + R + L+ S + KRR + +D+ PP
Sbjct: 2 RQLERQIKREASGSPPPKKPREPTPDLTDVVSVLERKRRLTQWDIKPPGYDNVTAEQAKL 61
Query: 121 AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVG 180
+ M P P Q P P+ + Q + Q + L P + R ++RV V
Sbjct: 62 SGMFPLPGAPRQQPMDPAKL----QAFMNQPGNQASSSALKP------SSARQSKRVMVH 111
Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMA 240
LPP A ++++ FF+ + G N D ++ + +K +A VE +T E+A+NAMA
Sbjct: 112 NLPPSATDESMVDFFNLQLN--GLNITRGVDPCISAQCSKDKTYALVEFKTPEDATNAMA 169
Query: 241 LDGIIFEGVA-------------VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
LDGI + A ++++RP DY + P N + GL S
Sbjct: 170 LDGITMDHDAMDTSGASNGAPKGLQIKRPRDY-------IVPNVIDETENES--GLLSNT 220
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+ + +++ + LP + E QI+ELL SFG L F LV+++ +G S+G FC Y+DP+
Sbjct: 221 VPDTQ--NKISITNLPSFLAEEQIQELLMSFGELKSFVLVRNQSSGESRGIAFCEYKDPS 278
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA--LQTSGMN 405
VT +A +LNG+++ D + V+ A+ I IQ+++ + + M+
Sbjct: 279 VTKVAVDSLNGMELADTAMRVKLAS-------------------IGIQQVSSEMSVNAMS 319
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
+ G S + +VL L IT + L D ++ +EILED++EEC KYG L+ V +PRP
Sbjct: 320 LMAGAKSTDADN-GRVLALMNMITPEELMDPDDADEILEDVKEECAKYGPLLEVKMPRPT 378
Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ GVGK++L+Y + A A AL+GRKF TV Y+ E+ YF+
Sbjct: 379 GGSRQSTGVGKIYLKYETSEHAAKALAALAGRKFADRTVVVTYFGEE-YFD 428
>gi|195346998|ref|XP_002040041.1| GM15574 [Drosophila sechellia]
gi|194135390|gb|EDW56906.1| GM15574 [Drosophila sechellia]
Length = 445
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 191/361 (52%), Gaps = 33/361 (9%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFAF 226
TR ARR+YVG +P E+ + FF+ + A G + G AV+ + EK FAF
Sbjct: 103 TRQARRLYVGNIPFGVTEEEMMQFFNHRIMAQGYEAKSSHYMDGKAVLTCQTHLEKNFAF 162
Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
+E R+++EA+ A+ DG+++ G +++RRP DY P + + + + + A +A+
Sbjct: 163 LEFRSIDEATQALNFDGMVYRGQTLKIRRPHDYQPVPSISFSAMENYRSFRVPATTIANP 222
Query: 287 -------AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
+ P++++VGGLP + Q+KELL+SFG L G +LV D +T +KG+
Sbjct: 223 PNVTIPVTTIVPDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDGNTSLNKGFA 282
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
F Y DP VTD A A L+G+ +GD+ L V+R+ G++ + +
Sbjct: 283 FFEYYDPLVTDHAIAGLHGMLLGDRRLVVQRSIPGGKNAFPGHT------------GPVV 330
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
Q G++TL L + + LCL + + L DDEE+E+I D+++EC K+G + ++
Sbjct: 331 QVPGISTL-----LDPGSPTETLCLLNMVRPEELLDDEEFEDIRTDIKQECAKFGEVRSI 385
Query: 460 VIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
IPRP G P G GKVF+++ A ALSGRKF V YY +KY
Sbjct: 386 KIPRPI---GPFPKRGCGKVFVQFESVEDSQKALKALSGRKFSDRIVMTSYYDPEKYLAD 442
Query: 518 D 518
D
Sbjct: 443 D 443
>gi|67541022|ref|XP_664285.1| hypothetical protein AN6681.2 [Aspergillus nidulans FGSC A4]
gi|40738434|gb|EAA57624.1| hypothetical protein AN6681.2 [Aspergillus nidulans FGSC A4]
Length = 624
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 177/357 (49%), Gaps = 53/357 (14%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R A+R++V LPP A + + +FF+ + G N D ++ I+ + FA +E +
Sbjct: 220 SRQAKRLFVYNLPPNATVENLVSFFNLQLN--GLNVIQSVDPCISAQISDDHSFALLEFK 277
Query: 231 TVEEASNAMALDGIIF--------EGVA--VRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+ + + A+ALDGI G A + VRRP DY PNL
Sbjct: 278 SPNDTTVALALDGITMGEHESNGENGAAKGLEVRRPKDYI------------VPNLAEQD 325
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
+ ASG + P+++ V +P Y E + LL+SFG L F LVKD T S+G F
Sbjct: 326 LEGASGMKDVPDSPNKICVSNIPQYIPEEPVTMLLKSFGELKSFVLVKDSSTEESRGIAF 385
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
C Y DP T IA LNG+++GD+ L V RA+ Q
Sbjct: 386 CEYADPNTTTIAVQGLNGMELGDRHLKVVRASIG-----------------------MTQ 422
Query: 401 TSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
+G++ MS+F +T + +VL L +T + L D+E+YEEI +D+R+EC K+G
Sbjct: 423 AAGLDMGVNAMSMFAKTTSQDLESSRVLQLLNMVTPEELMDNEDYEEICDDVRDECSKFG 482
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
++ + IPRP ++PGVGK+F+++ A +L+GRKF TV Y+ E
Sbjct: 483 RVLELKIPRPTGGSRQSPGVGKIFVKFETIEATTAALKSLAGRKFSDRTVVTTYFSE 539
>gi|391864554|gb|EIT73849.1| splicing factor U2AF, large subunit [Aspergillus oryzae 3.042]
Length = 538
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 204/438 (46%), Gaps = 76/438 (17%)
Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
+ +R + +D+ PP + M P P Q P PS + M GA
Sbjct: 142 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPS---RLQAFMSQPGAGTA 198
Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
+ L P +R A+R++V LP A + + +FF+ + G N D ++
Sbjct: 199 ESASLKPSN------SRQAKRLFVSNLPASATGENLLSFFNLQLN--GLNVIHSVDPCIS 250
Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPT 262
++ ++ FA +E +T +A+ A+A DGI + + VRRP DY
Sbjct: 251 AQVSDDRSFALLEFKTPNDATVALAFDGITMDESEAAGNGAANGAPQGLEVRRPKDY--- 307
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
PS N G+ + + P+++ V +P+Y E + LL+SFG L
Sbjct: 308 -------IVPSGNEQEYQEGVLLNEV--PDSPNKICVSNIPHYIPEEPVTMLLKSFGELK 358
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DP T IA LNG+++GD+ L V RA+
Sbjct: 359 SFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLKVVRAS---------- 408
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
I I Q +G++ MS+F +T + +VL L +T + L D+
Sbjct: 409 ---------IGIT----QAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMDN 455
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
++Y+EI +D+REEC KYG +V + IPRP ++PGVGK+F+++ A AL+G
Sbjct: 456 DDYDEICDDVREECAKYGQVVELKIPRPSGGSRQSPGVGKIFVKFDSVESTTNALKALAG 515
Query: 497 RKFGGNTVNAFYYPEDKY 514
RKF TV Y+ E+ +
Sbjct: 516 RKFSDRTVVTTYFSEENF 533
>gi|317139209|ref|XP_001817348.2| splicing factor u2af large subunit [Aspergillus oryzae RIB40]
Length = 538
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 204/438 (46%), Gaps = 76/438 (17%)
Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
+ +R + +D+ PP + M P P Q P PS + M GA
Sbjct: 142 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPS---RLQAFMSQPGAGTA 198
Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
+ L P +R A+R++V LP A + + +FF+ + G N D ++
Sbjct: 199 ESASLKPSN------SRQAKRLFVSNLPASATGENLLSFFNLQLN--GLNVIHSVDPCIS 250
Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPT 262
++ ++ FA +E +T +A+ A+A DGI + + VRRP DY
Sbjct: 251 AQVSDDRSFALLEFKTPNDATVALAFDGITMDESEAAGNGAANGAPQGLEVRRPKDY--- 307
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
PS N G+ + + P+++ V +P+Y E + LL+SFG L
Sbjct: 308 -------IVPSGNEQEYQEGVLLNEV--PDSPNKICVSNIPHYIPEEPVTMLLKSFGELK 358
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DP T IA LNG+++GD+ L V RA+
Sbjct: 359 SFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLKVVRAS---------- 408
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
I I Q +G++ MS+F +T + +VL L +T + L D+
Sbjct: 409 ---------IGIT----QAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMDN 455
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
++Y+EI +D+REEC KYG +V + IPRP ++PGVGK+F+++ A AL+G
Sbjct: 456 DDYDEICDDVREECAKYGQVVELKIPRPSGGSRQSPGVGKIFVKFDSVESTTNALKALAG 515
Query: 497 RKFGGNTVNAFYYPEDKY 514
RKF TV Y+ E+ +
Sbjct: 516 RKFSDRTVVTTYFSEENF 533
>gi|238482353|ref|XP_002372415.1| splicing factor u2af large subunit [Aspergillus flavus NRRL3357]
gi|220700465|gb|EED56803.1| splicing factor u2af large subunit [Aspergillus flavus NRRL3357]
Length = 556
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 204/438 (46%), Gaps = 76/438 (17%)
Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
+ +R + +D+ PP + M P P Q P PS + M GA
Sbjct: 160 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPS---RLQAFMSQPGAGTA 216
Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
+ L P +R A+R++V LP A + + +FF+ + G N D ++
Sbjct: 217 ESASLKPSN------SRQAKRLFVSNLPASATGENLLSFFNLQLN--GLNVIHSVDPCIS 268
Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPT 262
++ ++ FA +E +T +A+ A+A DGI + + VRRP DY
Sbjct: 269 AQVSDDRSFALLEFKTPNDATVALAFDGITMDESEAAGNGAANGAPQGLEVRRPKDY--- 325
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
PS N G+ + + P+++ V +P+Y E + LL+SFG L
Sbjct: 326 -------IVPSGNEQEYQEGVLLNEV--PDSPNKICVSNIPHYIPEEPVTMLLKSFGELK 376
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DP T IA LNG+++GD+ L V RA+
Sbjct: 377 SFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLKVVRAS---------- 426
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
I I Q +G++ MS+F +T + +VL L +T + L D+
Sbjct: 427 ---------IGIT----QAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMDN 473
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
++Y+EI +D+REEC KYG +V + IPRP ++PGVGK+F+++ A AL+G
Sbjct: 474 DDYDEICDDVREECAKYGQVVELKIPRPSGGSRQSPGVGKIFVKFDSVESTTNALKALAG 533
Query: 497 RKFGGNTVNAFYYPEDKY 514
RKF TV Y+ E+ +
Sbjct: 534 RKFSDRTVVTTYFSEENF 551
>gi|68068227|ref|XP_676023.1| U2 snRNP auxiliary factor [Plasmodium berghei strain ANKA]
gi|56495523|emb|CAI00540.1| U2 snRNP auxiliary factor, putative [Plasmodium berghei]
Length = 630
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 115/380 (30%), Positives = 188/380 (49%), Gaps = 36/380 (9%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDA----VVNVYI-NH 220
+ + R++Y+G LPP + ++ I FF+ +++I S+ GD VV I N
Sbjct: 255 EGDKKQRKLYIGNLPPNSKQEEIVEFFNNTISSIIKGSSLEVKIGDVQLLPVVKCEIFNA 314
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+ +F F+E RT++ + LD + + +R+ RP DY P P + P L +
Sbjct: 315 DSRFCFLEFRTMDITWLCLKLDSMSYNNYCLRINRPHDYMP-------PPEGDPALTVVF 367
Query: 281 VGLASGAIGGAEGP------------DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
+ G + + P +++++ LP+ + QI +LL FG L GF+++K
Sbjct: 368 PDIDMGLLESFKPPKIAPVRSTGDDDNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNIIK 427
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D +TG +KGYGF Y+D + T +A ALNG G L V++AT + S S
Sbjct: 428 DLNTGLNKGYGFFEYEDSSCTQVAIHALNGFVCGKNILNVKKATFNKNSNNAPNSNNIVL 487
Query: 389 QQHI---------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEY 439
++ +I + L S + GE ++V+ LT A+ + L D +Y
Sbjct: 488 ANNVDVPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVIQLTNAVFQEDLIIDSQY 547
Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
+EIL+D++EE KYG L ++VIP+P+ + T GVGK+FL Y D A+ +GR F
Sbjct: 548 DEILKDVKEEAEKYGPLQSIVIPKPNTDLSYTEGVGKIFLHYVDETAARKAQYMFNGRLF 607
Query: 500 GGNTVNAFYYPEDKYFNKDY 519
V A +Y E+K+ Y
Sbjct: 608 EKRVVCASFYSEEKFLKGKY 627
>gi|357155772|ref|XP_003577233.1| PREDICTED: splicing factor U2af large subunit B-like [Brachypodium
distachyon]
Length = 446
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 96/150 (64%), Positives = 111/150 (74%), Gaps = 7/150 (4%)
Query: 113 GFDMAPPAA----AMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
GFD+ P +A P P QLPG S++P M NMLPF Q + P Q MTQ
Sbjct: 141 GFDLGPTSAQAVVPQFPTIPAPSQLPG--SSIPGMFPNMLPFAVGQFNPLVMQP-QAMTQ 197
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
QATRHARRVYVGGLPP ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVE
Sbjct: 198 QATRHARRVYVGGLPPTANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 257
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTD 258
MR+VEEASNAMALDGI+FEG V+ TD
Sbjct: 258 MRSVEEASNAMALDGILFEGAPVKDLNVTD 287
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/179 (56%), Positives = 124/179 (69%), Gaps = 14/179 (7%)
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
+D VTDIACAALNG+KMGDKTLTVRRA S Q + EQE+IL QAQQ + +QK+ Q
Sbjct: 281 KDLNVTDIACAALNGIKMGDKTLTVRRANQGSAQPRPEQENILLQAQQQVQLQKLVYQVG 340
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
+ T KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYGTLV VVIP
Sbjct: 341 ALPT-------------KVICLTQVVTADELKDDEEYEDIMEDMRLEAGKYGTLVKVVIP 387
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
RP +G GVGKVFLEY D G AK A+ GRKFGGN V A +YPE+K+ ++++ A
Sbjct: 388 RPHPSGEPVAGVGKVFLEYADVDGSTKAKTAMHGRKFGGNPVVAVFYPENKFSDEEFDA 446
>gi|384246661|gb|EIE20150.1| hypothetical protein COCSUDRAFT_30785 [Coccomyxa subellipsoidea
C-169]
Length = 212
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 133/206 (64%), Gaps = 3/206 (1%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
A R ARR+YVGGLPP + + + +++M + GG A G + + I EK +AF+E
Sbjct: 6 AARPARRIYVGGLPPETTDADLRQYINELMVSTGG-CAATGYPIASCKIYTEKSYAFLEF 64
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R+VEEASN MA DG+ F+ +RVRRP +Y+ +A LGP P P ++++ + + +
Sbjct: 65 RSVEEASNCMAFDGVAFKDSYLRVRRPNNYDINVAVMLGPTDPDPTMDVSNLDIVKTVV- 123
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
+ P ++F+GGLP +TE Q+KELL FG+L F+LV D++TGNSKGY FC YQD +T
Sbjct: 124 -QDSPHKLFIGGLPCDWTEDQVKELLLPFGSLKAFNLVMDKNTGNSKGYAFCEYQDIGLT 182
Query: 350 DIACAALNGLKMGDKTLTVRRATASG 375
D LNG ++G+K LTV+RA G
Sbjct: 183 DYVIQNLNGKQIGNKFLTVKRALQPG 208
>gi|156099808|ref|XP_001615700.1| U2 snRNP auxiliary factor [Plasmodium vivax Sal-1]
gi|148804574|gb|EDL45973.1| U2 snRNP auxiliary factor, putative [Plasmodium vivax]
Length = 914
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 189/375 (50%), Gaps = 25/375 (6%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDAVVNVYI-----NH 220
+ + R++Y+G +PP + ++ + FF+ + +I +S+ GD V+ + N
Sbjct: 538 EGDKKQRKLYIGNIPPNSKQEELIDFFNNTLASIIKDSSLEIKIGDIVLLPILKCEIFNV 597
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY------NPTLAAALGPGQPSP 274
E +F F+E R++E + LD I F +R+ RP D+ +P L
Sbjct: 598 ESRFCFLEFRSLEITWLCLRLDAISFNNYCLRIARPHDFVPPPGGDPALTVVFTDINHEV 657
Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
+ V +A G + +++++ LP+ + QI++LL+ FG L GF+++KD +TG
Sbjct: 658 FEMVKPVKIAPVRSTG-DDDNKLYIQNLPHDLRDDQIRDLLQQFGKLKGFNIIKDLNTGL 716
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHI- 392
+KGYGF Y+D T IA ALNG G L V++AT Q+ T+ + ++ +
Sbjct: 717 NKGYGFFEYEDSNCTPIAMHALNGFVCGQNILNVKKATFGKSQNSTQNANTISLPTGSVD 776
Query: 393 --------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
+I + L S + GE ++V+ LT A+ + L D +YEEIL
Sbjct: 777 LPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVVQLTNAVFQEDLLIDSQYEEILR 836
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D++EE KYG L N+VIP+P ++ T GVGK+FL Y D A+ +GR F V
Sbjct: 837 DIKEEAEKYGPLQNIVIPKPSKDLSYTEGVGKIFLHYADETTARKAQYMFNGRLFEKRVV 896
Query: 505 NAFYYPEDKYFNKDY 519
A +Y E+K+ Y
Sbjct: 897 CAAFYSEEKFLAGKY 911
>gi|115400045|ref|XP_001215611.1| hypothetical protein ATEG_06433 [Aspergillus terreus NIH2624]
gi|114191277|gb|EAU32977.1| hypothetical protein ATEG_06433 [Aspergillus terreus NIH2624]
Length = 413
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 136/435 (31%), Positives = 205/435 (47%), Gaps = 76/435 (17%)
Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
+ +R + +D+ PP + M P P Q P PS + + N G+
Sbjct: 22 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRL-QAFMNQSTTGSAD- 79
Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
A L P +R A+R++V +PP +A+ +FF+ + G N D ++
Sbjct: 80 -AASLKPSH------SRQAKRLFVYNIPPNVTGEALLSFFNLQLN--GLNVVQSVDPCIS 130
Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA------------VRVRRPTDYNPTL 263
++ + FA +E ++ EA+ A+A DGI + A + VRRP DY
Sbjct: 131 AQVSDDHSFALLEFKSPNEATVALAFDGITMDEHASMDGAGKGEVKGLEVRRPKDYIVPN 190
Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHG 323
+A Q LN + P+++ V +P Y E + LL+SFG L
Sbjct: 191 GSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIQEEAVIMLLKSFGELKS 239
Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
F LVKD T S+G FC Y DP T IA LNG+++GD+ L V RA+
Sbjct: 240 FVLVKDASTEESRGIAFCEYADPTATSIAVEGLNGMEIGDRPLKVVRASIG--------- 290
Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDE 437
Q +G++ MS+F +T + +VL L +TA+ L D++
Sbjct: 291 --------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTAEELIDND 336
Query: 438 EYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALSG 496
+YEEI ED+REEC KYG ++ + +PRP ++PGVGK+F++ +D V AT A AL+G
Sbjct: 337 DYEEICEDVREECSKYGQVLELKVPRPSGGSRQSPGVGKIFVK-FDTVESATKALKALAG 395
Query: 497 RKFGGNTVNAFYYPE 511
RKF TV YY E
Sbjct: 396 RKFSDRTVVTTYYGE 410
>gi|212539736|ref|XP_002150023.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
18224]
gi|210067322|gb|EEA21414.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
18224]
Length = 551
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 130/433 (30%), Positives = 207/433 (47%), Gaps = 66/433 (15%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
+ +R + +D+ PP + A + G P +P A + A + ++L A P
Sbjct: 155 RKRRMTQWDIKPPGYDNVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQALVNQPSA 208
Query: 165 VMTQQAT------RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
T+ +T R A+R++ LPP + A+ +FF+ + G N D V+ I
Sbjct: 209 TTTESSTLRPANSRQAKRLFAYNLPPNVTDAALISFFNLQLN--GLNVIEGIDPCVSSQI 266
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAAL 267
+ + FA +E + EA+ A+ALDGI E + +RRP DY +
Sbjct: 267 SKDHAFALLEFKGPNEATVALALDGISMEEHEAAATTNGGARGLELRRPKDY-------I 319
Query: 268 GPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
P P + G+ S + + P+++ V +P Y E + LL+S G L F LV
Sbjct: 320 VPSSPE-DQQPYQEGVISNQV--PDSPNKLCVTNIPLYIPEEPVTMLLKSIGELRAFVLV 376
Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
KD T S+G FC Y D T IA +LNG+++GDK L + A+
Sbjct: 377 KDSGTDESRGIAFCEYVDATATAIAVESLNGMELGDKHLKITHASIG------------- 423
Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEE 441
Q +G++ MS+F +T + +VL L +TAD L ++E+YEE
Sbjct: 424 ----------VTQAAGLDMGVNAMSMFAKTTSADLETTRVLQLLNMVTADELINNEDYEE 473
Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
ILED+++EC KYG ++++ IPRP ++ GVGK+F+++ A AL+GRKF
Sbjct: 474 ILEDVQDECSKYGQVLDLKIPRPAGGSRQSAGVGKIFVKFDTVESATNALKALAGRKFSD 533
Query: 502 NTVNAFYYPEDKY 514
TV Y+PE+ +
Sbjct: 534 RTVVTTYFPEESF 546
>gi|83765203|dbj|BAE55346.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 563
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 131/435 (30%), Positives = 202/435 (46%), Gaps = 76/435 (17%)
Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
+ +R + +D+ PP + M P P Q P PS + M GA
Sbjct: 163 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPS---RLQAFMSQPGAGTA 219
Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
+ L P +R A+R++V LP A + + +FF+ + G N D ++
Sbjct: 220 ESASLKPSN------SRQAKRLFVSNLPASATGENLLSFFNLQLN--GLNVIHSVDPCIS 271
Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPT 262
++ ++ FA +E +T +A+ A+A DGI + + VRRP DY
Sbjct: 272 AQVSDDRSFALLEFKTPNDATVALAFDGITMDESEAAGNGAANGAPQGLEVRRPKDY--- 328
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
PS N G+ + + P+++ V +P+Y E + LL+SFG L
Sbjct: 329 -------IVPSGNEQEYQEGVLLNEV--PDSPNKICVSNIPHYIPEEPVTMLLKSFGELK 379
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DP T IA LNG+++GD+ L V RA+
Sbjct: 380 SFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLKVVRAS---------- 429
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
I I Q +G++ MS+F +T + +VL L +T + L D+
Sbjct: 430 ---------IGIT----QAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMDN 476
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
++Y+EI +D+REEC KYG +V + IPRP ++PGVGK+F+++ A AL+G
Sbjct: 477 DDYDEICDDVREECAKYGQVVELKIPRPSGGSRQSPGVGKIFVKFDSVESTTNALKALAG 536
Query: 497 RKFGGNTVNAFYYPE 511
RKF TV Y+ E
Sbjct: 537 RKFSDRTVVTTYFSE 551
>gi|171684585|ref|XP_001907234.1| hypothetical protein [Podospora anserina S mat+]
gi|170942253|emb|CAP67905.1| unnamed protein product [Podospora anserina S mat+]
Length = 585
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 130/431 (30%), Positives = 212/431 (49%), Gaps = 62/431 (14%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P T+L AF P
Sbjct: 189 RRRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMSQP 240
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+ A +R A+R+ + +P A + +I FF+ + G N D +
Sbjct: 241 GGAVNSAALKPTNSRQAKRLILSNIPASATDDSIVNFFNLQLN--GLNVIEQTDPCLLCN 298
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPTLAA 265
I+ ++ FA +E R +A+ A+ALDGI + +++RRP DY + A
Sbjct: 299 ISPDRSFAMLEFRNNTDATVALALDGITMDADDHQANGNGAAATGLKIRRPKDY--IVPA 356
Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
+ P+ ++ + + +GP+++ V +P Y TE Q+ ELL SFG L F
Sbjct: 357 IVEDPNYDPDSSVPSTNVV-------DGPNKISVTNIPPYLTEDQVMELLVSFGKLKSFV 409
Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
VKD T +G F Y D +VTD+A + LN + +G+K L V++A+
Sbjct: 410 FVKDNGTQEPRGIAFLEYADSSVTDVAISGLNNMMLGEKALKVQKAS------------- 456
Query: 386 AQAQQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEIL 443
I I ++A L + M+ L G + +VL L +TAD L D+++YEEI
Sbjct: 457 ------IGITQVAGELSVNAMSMLAGTTPSDNDA-GRVLQLLNMVTADELMDNDDYEEIR 509
Query: 444 EDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNT 503
+D++EEC K+G ++++ IPRP ++ GVGK+F+++ + A AL+GRKF T
Sbjct: 510 DDVQEECEKFGKILSLKIPRPVGGSRQSAGVGKIFIKFENHEAANKALRALAGRKFADRT 569
Query: 504 VNAFYYPEDKY 514
V Y+PE+ +
Sbjct: 570 VVTTYFPEENF 580
>gi|326475623|gb|EGD99632.1| splicing factor u2af large subunit [Trichophyton tonsurans CBS
112818]
gi|326483752|gb|EGE07762.1| splicing factor U2AF subunit [Trichophyton equinum CBS 127.97]
Length = 565
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 133/442 (30%), Positives = 203/442 (45%), Gaps = 72/442 (16%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
+ +R + +D+ PP + A V G P +P A + A + ++L AF P
Sbjct: 156 RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 209
Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
++ +R ++R++ +PP E + FF+ + G N D +V
Sbjct: 210 SGSSNNTLLKPSNSRQSKRLFAHNIPPSVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 267
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
I+ + KFA +E T +A+ A+A DGI E + + RP DY
Sbjct: 268 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRESNGESNGEVKGLTIVRPKDYIVP 327
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
L P Q G+ S + + P+++ V +P + E Q+ LL SFG L
Sbjct: 328 LPTDEEPRQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 376
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DPA T IA LNG+++GD+ L V RA+
Sbjct: 377 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 428
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
+Q +G++ MS+F +T + +VL L +TAD L D+
Sbjct: 429 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 473
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
E+YEEI ED++EEC KYG + + IPRP + GVGK+++++ A AL+G
Sbjct: 474 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYVKFDTPEAATKALQALAG 533
Query: 497 RKFGGNTVNAFYYPEDKYFNKD 518
RKF TV Y+ E Y N +
Sbjct: 534 RKFQDRTVVTTYFSEASYPNSN 555
>gi|358369529|dbj|GAA86143.1| splicing factor u2af large subunit [Aspergillus kawachii IFO 4308]
Length = 571
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 127/439 (28%), Positives = 200/439 (45%), Gaps = 78/439 (17%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 175 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQP 226
Query: 163 ------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
+ +R A+R++V +P + + FF+ + G N D ++
Sbjct: 227 AGGNADTSTLKPSNSRQAKRLFVYNIPQTVTGETLLAFFNVQLN--GLNVIESVDPCISA 284
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNPTL 263
+ + FA +E ++ +A+ A+A DGI E + VRRP DY
Sbjct: 285 QVAQDHSFALLEFKSPNDATVALAFDGIAMEEHEAAGNGAANGAAQGLEVRRPKDY---- 340
Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTL 321
+ PG A G + + P+++ V +P+Y E + LL+SFG L
Sbjct: 341 ---IVPGG-------AEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTMLLKSFGEL 390
Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
F LVKD T S+G FC Y DP+ T IA LNG+++GD+ L V RA+
Sbjct: 391 KSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASIG------- 443
Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
Q +G++ MS+F +T + +VL L +T + L D
Sbjct: 444 ----------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMD 487
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
E+YEEI +D+R+EC KYGT+V + +PRP ++PGVGK+F+++ A AL+
Sbjct: 488 PEDYEEICDDVRDECSKYGTVVELKVPRPTGGSRQSPGVGKIFVKFDTVESTTNALKALA 547
Query: 496 GRKFGGNTVNAFYYPEDKY 514
GRKF TV Y+ E+ +
Sbjct: 548 GRKFSDRTVVTTYFSEENF 566
>gi|159123253|gb|EDP48373.1| splicing factor u2af large subunit [Aspergillus fumigatus A1163]
Length = 567
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 131/439 (29%), Positives = 206/439 (46%), Gaps = 76/439 (17%)
Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
++ +R + +D+ PP + M P P Q P PS + + N G+
Sbjct: 170 TRKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRL-QAFMNQSGGGSAD 228
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
A + +R ARR++V LPP + + + + F+ + G N D +
Sbjct: 229 NSA--------LKPSNSRQARRLFVYNLPPGVSSEHLVSLFNLQLN--GLNVIHHVDPCI 278
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPT 262
+ I+ + FA +E +T +A+ A+A DGI E + VRRP DY
Sbjct: 279 SAQISEDHSFALLEFKTPNDATVALAFDGITMEEHEPVSGAENGAPKGLEVRRPKDYIVP 338
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
+A Q LN + P+++ V +P Y E + LL+SFG L
Sbjct: 339 NGSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIPEEPVTMLLKSFGELK 387
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DP+ T IA LNG+++GD+ L V RA+
Sbjct: 388 SFVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASIG-------- 439
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
Q +G++ MS+F +T + +VL L +T + L D+
Sbjct: 440 ---------------MTQAAGLDMGVNAMSMFAKTTSQDLESSRVLQLLNMVTPEELLDN 484
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALS 495
++YEEI +D+REEC KYG ++++ +PRP ++PGVGK++++ +D V AT A AL+
Sbjct: 485 DDYEEICDDVREECFKYGKVLDLKVPRPSGGSRQSPGVGKIYVK-FDTVEAATNALKALA 543
Query: 496 GRKFGGNTVNAFYYPEDKY 514
GRKF TV Y+ E+ +
Sbjct: 544 GRKFSDRTVVTTYFSEENF 562
>gi|146324846|ref|XP_748978.2| splicing factor u2af large subunit [Aspergillus fumigatus Af293]
gi|129556630|gb|EAL86940.2| splicing factor u2af large subunit [Aspergillus fumigatus Af293]
Length = 563
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 131/439 (29%), Positives = 206/439 (46%), Gaps = 76/439 (17%)
Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
++ +R + +D+ PP + M P P Q P PS + + N G+
Sbjct: 166 TRKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRL-QAFMNQSGGGSAD 224
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
A + +R ARR++V LPP + + + + F+ + G N D +
Sbjct: 225 NSA--------LKPSNSRQARRLFVYNLPPGVSSEHLVSLFNLQLN--GLNVIHHVDPCI 274
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPT 262
+ I+ + FA +E +T +A+ A+A DGI E + VRRP DY
Sbjct: 275 SAQISEDHSFALLEFKTPNDATVALAFDGITMEEHEPVSGAENGAPKGLEVRRPKDYIVP 334
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
+A Q LN + P+++ V +P Y E + LL+SFG L
Sbjct: 335 NGSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIPEEPVTMLLKSFGELK 383
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DP+ T IA LNG+++GD+ L V RA+
Sbjct: 384 SFVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASIG-------- 435
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
Q +G++ MS+F +T + +VL L +T + L D+
Sbjct: 436 ---------------MTQAAGLDMGVNAMSMFAKTTSQDLESSRVLQLLNMVTPEELLDN 480
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALS 495
++YEEI +D+REEC KYG ++++ +PRP ++PGVGK++++ +D V AT A AL+
Sbjct: 481 DDYEEICDDVREECFKYGKVLDLKVPRPSGGSRQSPGVGKIYVK-FDTVEAATNALKALA 539
Query: 496 GRKFGGNTVNAFYYPEDKY 514
GRKF TV Y+ E+ +
Sbjct: 540 GRKFSDRTVVTTYFSEENF 558
>gi|82540696|ref|XP_724646.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23479360|gb|EAA16211.1| splicing factor-like protein, putative [Plasmodium yoelii yoelii]
Length = 714
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 114/380 (30%), Positives = 188/380 (49%), Gaps = 36/380 (9%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDA----VVNVYI-NH 220
+ + R++Y+G LPP + ++ I FF+ +++I S+ GD VV I N
Sbjct: 339 EGDKKQRKLYIGNLPPNSKQEEIVEFFNNTISSIIKGSSLEVKIGDVQLLPVVKCEIFNA 398
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+ +F F+E RT++ + LD + + +R+ RP DY P P + P L +
Sbjct: 399 DSRFCFLEFRTMDITWLCLKLDSMSYNNYCLRINRPHDYMP-------PPEGDPALTVVF 451
Query: 281 VGLASGAIGGAEGP------------DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
+ G + + P +++++ LP+ + QI +LL FG L GF+++K
Sbjct: 452 PDIDMGLLESFKPPKIAPVRSTGDDDNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNIIK 511
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D +TG +KGYGF Y+D + T +A ALNG G L V++AT + S S
Sbjct: 512 DLNTGLNKGYGFFEYEDSSCTQVAIHALNGFVCGKNILNVKKATFNKNSNNAPNSNNIAL 571
Query: 389 QQHI---------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEY 439
++ +I + L S + GE ++V+ LT A+ + L + +Y
Sbjct: 572 ANNVDVPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVIQLTNAVFQEDLIINSQY 631
Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
+EIL+D++EE KYG L ++VIP+P+ + T GVGK+FL Y D A+ +GR F
Sbjct: 632 DEILKDVKEEAEKYGPLQSIVIPKPNTDLSYTEGVGKIFLHYVDETAARKAQYMFNGRLF 691
Query: 500 GGNTVNAFYYPEDKYFNKDY 519
V A +Y E+K+ Y
Sbjct: 692 EKRVVCASFYSEEKFLEGKY 711
>gi|121711505|ref|XP_001273368.1| splicing factor u2af large subunit [Aspergillus clavatus NRRL 1]
gi|119401519|gb|EAW11942.1| splicing factor u2af large subunit [Aspergillus clavatus NRRL 1]
Length = 583
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 130/437 (29%), Positives = 202/437 (46%), Gaps = 77/437 (17%)
Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
++ +R + +D+ PP + M P P Q P PS + P G +
Sbjct: 167 TRKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRLQAFMNQ--PGGGSA 224
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
A + +R A+R++V LPP + + + +FF+ + G N D +
Sbjct: 225 DNA-------ALKPSNSRQAKRLFVYNLPPGVSNEHLVSFFNLQLN--GLNVIHNVDPCI 275
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNP 261
+ I+ + FA +E ++ + + A+A DGI E + VRRP DY
Sbjct: 276 SAQISEDHTFALLEFKSPNDTTVALAFDGITMEEHEAMGAGAENGASKGLEVRRPKDYVV 335
Query: 262 TLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTL 321
+A Q LN + P+++ V +P Y E + LL+SFG L
Sbjct: 336 PNGSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIPEEPVTMLLKSFGEL 384
Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
F LVKD T S+G FC Y DP+ T IA LNG+++GD+ L V RA+
Sbjct: 385 KSFVLVKDSSTEESRGIAFCEYADPSATPIAVEGLNGMELGDRHLKVVRASIG------- 437
Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
Q G++ MS+F +T + +VL L +T + L D
Sbjct: 438 ----------------MTQAVGLDMGVNAMSMFAKTTSQDLESGRVLQLLNMVTPEELMD 481
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNAL 494
+++YEEI +D+REEC KYG ++++ +PRP ++PGVGK+F++ +D V AT A AL
Sbjct: 482 NDDYEEICDDVREECSKYGKVLDLKVPRPSGGSRQSPGVGKIFVK-FDTVESATNALKAL 540
Query: 495 SGRKFGGNTVNAFYYPE 511
+GRKF TV Y+ E
Sbjct: 541 AGRKFSDRTVVTTYFAE 557
>gi|452979953|gb|EME79715.1| hypothetical protein MYCFIDRAFT_81194 [Pseudocercospora fijiensis
CIRAD86]
Length = 552
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 145/461 (31%), Positives = 220/461 (47%), Gaps = 59/461 (12%)
Query: 77 ERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--AAVPGQ- 132
ER+ +S S + R + L+ S + KRR + +D+ PP + A + G
Sbjct: 126 ERQTARKSASPPPKKPREPTPDLTDVVSILERKRRLTQWDIKPPGYENVTAEQAKLSGMF 185
Query: 133 -LPGVPSAVPEMAQNMLPF---GATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANE 188
LPG P P Q + F Q + L P + R ++R+ + +P A +
Sbjct: 186 PLPGAPRQQPMDPQKLQAFMNQPGNQASSTALKP------SSARQSKRLLIHNIPAAATD 239
Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF-- 246
I FF+ + G N D V+ I+ E +A VE +T E+A+NAMA DGI
Sbjct: 240 DNIVDFFNLQLN--GLNVTRGQDPCVSAQISKENGYALVEFKTPEDATNAMAFDGINMMP 297
Query: 247 ---------EGVA--VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
G A ++++RP DY + P N + G+ SG + + +
Sbjct: 298 DAMDTNGDSNGTAKGLQIKRPKDY-------IVPNVTDETENPS--GILSGVVPDTQ--N 346
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++ + LP + E QI+ELL SFG L F LVKD T S+G FC Y+DP+VT A +
Sbjct: 347 KISITNLPTFLGEDQIQELLNSFGELRNFVLVKDTSTEESRGIAFCEYKDPSVTKTAVES 406
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
LNG+++GD + V+ A+ Q Q +++ M+L G G
Sbjct: 407 LNGMELGDAAMKVKLASIGIQ----------QVPGEMSVNAMSLM--------AGTQAEG 448
Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
+VLCL IT + L D +E +EIL D++EE KYG L++V +PRP + G+G
Sbjct: 449 TEKGRVLCLMNMITPEELMDADEADEILVDVKEEVSKYGPLLDVKMPRPTGGSRQNNGIG 508
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
K++L+Y A A AL+GRKF TV Y+ E+ YF+
Sbjct: 509 KIYLKYESPDSAAKALAALAGRKFADRTVVVTYFGEE-YFD 548
>gi|212539738|ref|XP_002150024.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
18224]
gi|210067323|gb|EEA21415.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
18224]
Length = 556
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 130/435 (29%), Positives = 207/435 (47%), Gaps = 66/435 (15%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
+ +R + +D+ PP + A + G P +P A + A + ++L A P
Sbjct: 155 RKRRMTQWDIKPPGYDNVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQALVNQPSA 208
Query: 165 VMTQQAT------RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
T+ +T R A+R++ LPP + A+ +FF+ + G N D V+ I
Sbjct: 209 TTTESSTLRPANSRQAKRLFAYNLPPNVTDAALISFFNLQLN--GLNVIEGIDPCVSSQI 266
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAAL 267
+ + FA +E + EA+ A+ALDGI E + +RRP DY +
Sbjct: 267 SKDHAFALLEFKGPNEATVALALDGISMEEHEAAATTNGGARGLELRRPKDY-------I 319
Query: 268 GPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
P P + G+ S + + P+++ V +P Y E + LL+S G L F LV
Sbjct: 320 VPSSPE-DQQPYQEGVISNQV--PDSPNKLCVTNIPLYIPEEPVTMLLKSIGELRAFVLV 376
Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
KD T S+G FC Y D T IA +LNG+++GDK L + A+
Sbjct: 377 KDSGTDESRGIAFCEYVDATATAIAVESLNGMELGDKHLKITHASIG------------- 423
Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEE 441
Q +G++ MS+F +T + +VL L +TAD L ++E+YEE
Sbjct: 424 ----------VTQAAGLDMGVNAMSMFAKTTSADLETTRVLQLLNMVTADELINNEDYEE 473
Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
ILED+++EC KYG ++++ IPRP ++ GVGK+F+++ A AL+GRKF
Sbjct: 474 ILEDVQDECSKYGQVLDLKIPRPAGGSRQSAGVGKIFVKFDTVESATNALKALAGRKFSD 533
Query: 502 NTVNAFYYPEDKYFN 516
TV Y+PE + +
Sbjct: 534 RTVVTTYFPEVSFLS 548
>gi|315044445|ref|XP_003171598.1| splicing factor U2AF subunit [Arthroderma gypseum CBS 118893]
gi|311343941|gb|EFR03144.1| splicing factor U2AF subunit [Arthroderma gypseum CBS 118893]
Length = 565
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 202/438 (46%), Gaps = 72/438 (16%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
+ +R + +D+ PP + A V G P +P A + A + ++L AF P
Sbjct: 165 RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 218
Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
++ +R ++R++ +PP E + FF+ + G N D +V
Sbjct: 219 SGSNTNTLLKPSNSRQSKRLFTHNIPPSVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 276
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
I+ + KFA +E T +A+ A+A DGI E + + RP DY
Sbjct: 277 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRENNGESNGEVKGLSIIRPKDYIVP 336
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
L P Q G+ S + + P+++ V +P + E Q+ LL SFG L
Sbjct: 337 LPTDEEPHQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 385
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DPA T IA LNG+++GD+ L V RA+
Sbjct: 386 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 437
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
+Q +G++ MS+F +T + +VL L +TAD L D+
Sbjct: 438 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 482
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
E+YEEI ED++EEC KYG + + IPRP + GVGK+++++ A AL+G
Sbjct: 483 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYIKFDTPESATKALQALAG 542
Query: 497 RKFGGNTVNAFYYPEDKY 514
RKF TV Y+ E+ +
Sbjct: 543 RKFQDRTVVTTYFSEENF 560
>gi|281207514|gb|EFA81697.1| RNA-binding region RNP-1 domain-containing protein [Polysphondylium
pallidum PN500]
Length = 682
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 181/368 (49%), Gaps = 32/368 (8%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
VQ + + +RR+YVG +PP E I FF+ + A + PG V+ I K
Sbjct: 287 VQSASAALAKQSRRLYVGNIPPNVTEAQIVEFFNAAIIAAALTTK-PGQPVLLCQITTGK 345
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+ EEA+ M LDGI G ++++RRP DY G +P P G
Sbjct: 346 SFAFIEFRSSEEATLGMGLDGISLSGYSLKIRRPKDYQS------GSNEPMP------TG 393
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
L+ + + +++F+GGLP E QIK +L + G L F+LVKD TG SKG+ FC
Sbjct: 394 LSIVSTNVPDSENKIFLGGLPPTLNEEQIKSMLSAIGRLKAFNLVKDTKTGISKGFAFCE 453
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
+ DP TD ACA LNG K GDK+L V++A+ ++ + Q K+ + +S
Sbjct: 454 FLDPENTDKACAELNGTKFGDKSLLVQKASLGKEAIANNNNNNNNNQSLNNPSKVKVDSS 513
Query: 403 GMNTLGGGMSLFGETL---------------AKVLCLTEAITADALADDEEYEEILEDMR 447
+++L + + L ++V+ L + + DD YE +L D +
Sbjct: 514 -VSSLLNLTTALPQVLGAIRSNVSSDNNSKPSRVVQLLNMTDKEEIQDDNNYENLLLDTK 572
Query: 448 EECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAF 507
+ C ++G + ++ I RP + + + KVF+ + A L GRK+ T+
Sbjct: 573 DACEEFGEIESIFISRPKDSPLD---IIKVFVCFSQLESAQKAWVGLGGRKYNYRTIITA 629
Query: 508 YYPEDKYF 515
+YPED Y
Sbjct: 630 FYPEDLYI 637
>gi|221483471|gb|EEE21790.1| U2 snRNP auxiliary factor, putative [Toxoplasma gondii GT1]
gi|221507941|gb|EEE33528.1| U2 snRNP splicing factor, putative [Toxoplasma gondii VEG]
Length = 553
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 184/366 (50%), Gaps = 28/366 (7%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS-------AGPGDAVVNV----YINH 220
R +R+YVG LPP + + + FF+ + A+ + A G+ ++ V N
Sbjct: 185 RKQKRLYVGNLPPGSTQPDVVGFFNGALLAVNAQTGFVKEDEATAGEQLLPVERCEVFNE 244
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+F F+E+R + A + LDGI + G ++RV RP DY P G P+ +
Sbjct: 245 SSRFCFIELRNEQYAILCVKLDGITYNGYSLRVGRPHDYVPPPG-----GDPAHQAYIPL 299
Query: 281 VGLASGAIGGAE---------GPD-RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDR 330
+ A + GPD ++++ LP E Q+++LLE FGTL +L+++
Sbjct: 300 LDDAKKVKREEKREKPSRPETGPDNKIYIQNLPPEMGEEQVRDLLEQFGTLRVLNLIRNV 359
Query: 331 DTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-EQESILAQAQ 389
TG KGYGF Y+DP VTD A ALNG G L+V+RA SG T E +A
Sbjct: 360 QTGQHKGYGFFEYEDPEVTDQAILALNGFVCGANMLSVQRANFSGSVVTRETNRTMAVTS 419
Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
++ + L + GE +KV+ L + + L D +EYE I +D+++E
Sbjct: 420 LPNSMTQKLLSDPLVAVQVQAARKIGERPSKVVQLLNCVYQEDLIDPKEYEAICDDIKQE 479
Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFY 508
K+G L V++P+P+++ GVGKVFL Y D A+ L+GR+F N V A +
Sbjct: 480 AEKHGALEEVLVPKPNEDLSYREGVGKVFLRYSDVTAARKAQLMLNGRRFDSNRVVCAAF 539
Query: 509 YPEDKY 514
+PE+K+
Sbjct: 540 FPEEKF 545
>gi|116192501|ref|XP_001222063.1| hypothetical protein CHGG_05968 [Chaetomium globosum CBS 148.51]
gi|88181881|gb|EAQ89349.1| hypothetical protein CHGG_05968 [Chaetomium globosum CBS 148.51]
Length = 566
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 130/430 (30%), Positives = 208/430 (48%), Gaps = 59/430 (13%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P T+L AF P
Sbjct: 169 RRRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMNQP 220
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+ A +R ++R+ + LP ++++ FF+ + G N D + +
Sbjct: 221 GGAVNSAALKPSNSRQSKRLIISNLPASVTDESLTNFFNLQLN--GLNVIETADPCLQAH 278
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE----------GVA---VRVRRPTDYNPTLA 264
I E+ FA VE R +A+ A+ALDGI E G A + +RRP DY +
Sbjct: 279 IAAERAFAMVEFRNNTDATVALALDGISMEADDAHAANGNGTAPQGLHIRRPKDY--IVP 336
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
A + PN + + +S + + P+++ V LP Y TE Q+ ELL SFG L F
Sbjct: 337 AVV----EDPNYDPDSDRPSSVVV---DSPNKISVTNLPLYLTEDQVMELLVSFGKLKSF 389
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
LVKD T S+G F Y DP VT +A L+ + +G++ L V++A+
Sbjct: 390 VLVKDNGTEESRGIAFLEYADPGVTTVAIQGLHNMMLGERALKVQKASIG---------- 439
Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
+ Q + + M++ + G +VL L +T D L D+++YEEI +
Sbjct: 440 ITQVSGEMGVNAMSMLAGTTSADAGA--------GRVLQLLNMVTPDELMDNDDYEEIRD 491
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D++EEC K+G ++++ IPRP ++ GVGK+F+++ A AL+GRKF TV
Sbjct: 492 DVQEECEKFGKILSIKIPRPAGGSRQSAGVGKIFIKFETPETATKALQALAGRKFADRTV 551
Query: 505 NAFYYPEDKY 514
Y+PE+ +
Sbjct: 552 VTTYFPEENF 561
>gi|317029342|ref|XP_001391373.2| splicing factor u2af large subunit [Aspergillus niger CBS 513.88]
Length = 561
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 126/439 (28%), Positives = 200/439 (45%), Gaps = 78/439 (17%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 165 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQP 216
Query: 163 ------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
+ +R A+R++V +P + + FF+ + G N D ++
Sbjct: 217 AGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAFFNVQLN--GLNVIQSVDPCISA 274
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNPTL 263
+ + FA +E ++ +A+ A+A DGI E + VRRP DY
Sbjct: 275 QVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGNGAANGAAQGLEVRRPKDY---- 330
Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTL 321
+ PG A G + + P+++ V +P+Y E + LL+SFG L
Sbjct: 331 ---IVPGG-------AEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTMLLKSFGEL 380
Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
F LVKD T S+G FC Y DP+ T IA LNG+++GD+ L V RA+
Sbjct: 381 KSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASIG------- 433
Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
Q +G++ MS+F +T + +VL L +T + L D
Sbjct: 434 ----------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMD 477
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
E+Y+EI +D+R+EC KYGT+V + +PRP ++PGVGK+F+++ A AL+
Sbjct: 478 PEDYDEICDDVRDECSKYGTVVELKVPRPTGGSRQSPGVGKIFVKFDTVESTTNALKALA 537
Query: 496 GRKFGGNTVNAFYYPEDKY 514
GRKF TV Y+ E+ +
Sbjct: 538 GRKFSDRTVVTTYFSEENF 556
>gi|119189253|ref|XP_001245233.1| hypothetical protein CIMG_04674 [Coccidioides immitis RS]
gi|392868136|gb|EAS33879.2| U2 snRNP auxilliary factor, large subunit, splicing factor
[Coccidioides immitis RS]
Length = 545
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 126/432 (29%), Positives = 211/432 (48%), Gaps = 66/432 (15%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP-- 162
+ +R + +D+ PP + A + G P +P A + A + ++L AF P
Sbjct: 151 RKRRLTQWDIKPPGYENVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQAFMNQPGG 204
Query: 163 ---VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
++ +R A+R++V L P +E +IA FF+ + G N D V+ ++
Sbjct: 205 NASNTLLKPSNSRQAKRLFVYNLSPSLSEDSIAQFFNLQLN--GLNVVSGVDPCVSAQLS 262
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAALG 268
+ FA +E +T +A+ A+A DG+ E + +RRP DY +
Sbjct: 263 TDGTFALLEFKTAADATVALAFDGVSMEPDDANGHTNGSSQGLSIRRPKDY-------IV 315
Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
P + + G+ S + + P+++ V +P + E Q+ LL SFG L F LVK
Sbjct: 316 PSETDDSNRQE--GVVSNEV--PDSPNKICVTNIPPFIQEEQVTMLLVSFGELKSFVLVK 371
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D T S+G FC Y DP+ T+IA LNG+++GDK L V RA+
Sbjct: 372 DSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRASIG-------------- 417
Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEI 442
A Q +G++ MS+F +T + +VL L +TA+ L D+++YEEI
Sbjct: 418 ---------ATQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTAEELMDNDDYEEI 468
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+R+EC KYG ++ + +PRP ++ GVGK+++++ + A AL+GRKF
Sbjct: 469 CDDVRDECSKYGQILEMKVPRPTGGSRQSAGVGKIYVKFDNYESAYKAMKALAGRKFQDR 528
Query: 503 TVNAFYYPEDKY 514
TV ++ E+ +
Sbjct: 529 TVVTTFFSEENF 540
>gi|400602736|gb|EJP70338.1| splicing factor U2AF 65 kDa subunit [Beauveria bassiana ARSEF 2860]
Length = 576
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 158/521 (30%), Positives = 246/521 (47%), Gaps = 87/521 (16%)
Query: 33 RDRHHRDFKSGGDDRRRDKNYKYDR-------------EGIRDHDRTDRHRD-------- 71
RDR + +GG DRR D+ + DR G RD D D +R
Sbjct: 99 RDREREERYTGGRDRRGDREWDRDRGSSRRDARRDDDGHGRRDRDGFDENRRGGRDRRDD 158
Query: 72 -YNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA--AAMLPGA 127
+ R +ERR S S + R + L+ + KRR + +D+ PP A A
Sbjct: 159 GFARQQERR------SPSPPKRREPTPDLTDVIPVLERKRRMTQWDIKPPGYEAVTSEQA 212
Query: 128 AVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPV--QV----MTQQATRHARRVYV 179
+ G LPG P Q M P T+L AF P QV + +R A+R+ V
Sbjct: 213 KMSGMFPLPGAPRQ-----QQMDP---TKLQAFMNQPAGGQVSSAGLKASNSRQAKRLLV 264
Query: 180 GGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAM 239
LP + A+ FF+ + + SA D ++++K FA +E + +A+ A+
Sbjct: 265 SNLPSGTTDDALVAFFNLQLNGLNVISAT--DPCALSQLSNDKSFAVLEFKNTSDATVAL 322
Query: 240 ALDGIIFE--GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD-- 295
ALDGI E G + +RRP DY + A P+ + + S ++ PD
Sbjct: 323 ALDGISMEANGPGLSIRRPKDY--VMPAV-------PDDIMYNPDVVSDSV-----PDTI 368
Query: 296 -RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP-AVTDIAC 353
++ + LP + TE Q+ ELL +FG F LVKDR T S+G F Y +P + + A
Sbjct: 369 HKLSITNLPPFLTEEQVLELLAAFGKPKAFVLVKDRTTEESRGIAFAEYAEPGSANEAAL 428
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
AL+G+ +G K L + +A G ++ + AI +A Q +G
Sbjct: 429 KALSGMDVGGKPLKITKACIGGTQVANFDAGIN------AISNLAGQGNG---------- 472
Query: 414 FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
GE +VL L +TA+ L D+++YEEI +D+R+EC KYG ++++ +PRP ++ G
Sbjct: 473 -GEA-TRVLQLLNMVTAEELLDNDDYEEICDDVRDECSKYGKILDLKVPRPAGGSRQSAG 530
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
VG++F+++ +A AL+GRKF TV Y+PE+ +
Sbjct: 531 VGRIFVKFESVDATTSALKALAGRKFADRTVVTTYFPEENF 571
>gi|378731414|gb|EHY57873.1| U2AF domain-containing protein (UHM) kinase 1 [Exophiala
dermatitidis NIH/UT8656]
Length = 574
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 133/439 (30%), Positives = 203/439 (46%), Gaps = 79/439 (17%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P Q M P ++L AF +P
Sbjct: 174 RKRRLTQWDIKPPGYENVTAEQAKMSGMFPLPGAPRQ-----QQMDP---SRLQAFMNLP 225
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
A +R ARR++V LP +E+A+ FF+ + G N D
Sbjct: 226 SSSSNNTALKPSNSRQARRLFVHNLPASVSEEALVQFFNLQLN--GLNVTKAVDPCAQAN 283
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-------------------VAVRVRRPTD 258
I ++ FA VE + +A+ A+ALDGI + +RRP D
Sbjct: 284 IAEDRSFALVEFKNASDATLALALDGITMPEHHSEMNGNGDANGNGTAAPKGLEIRRPKD 343
Query: 259 YNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESF 318
Y PS + A G S + + +++ V LP + T+ Q+ ELL++F
Sbjct: 344 Y----------IVPSADEATYAEGEISSEV--PDTANKLAVTNLPPFLTDDQVIELLKAF 391
Query: 319 GTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
G + F LV++ D+ S+G FC Y DPA T +A LNG+ + ++ V RA+ Q
Sbjct: 392 GEVKAFVLVREPDSQESRGIAFCEYADPASTAVAIEGLNGMDLAGNSIKVTRASIGYQ-- 449
Query: 379 TEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADA 432
Q +G++ MS+F T + +VL L +T +
Sbjct: 450 ---------------------QAAGLDMGVNAMSMFAGTTSDAHDEGRVLQLLNMVTPED 488
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
L D+++YEEI ED+ EEC KYG ++++ IPRP ++ GVGK+FL+Y DA A
Sbjct: 489 LMDNDDYEEICEDVMEECSKYGKILSMKIPRPSGGSRQSAGVGKIFLKYEDAESAKKALQ 548
Query: 493 ALSGRKFGGNTVNAFYYPE 511
AL+GRKF TV Y+ E
Sbjct: 549 ALAGRKFADRTVVTTYFDE 567
>gi|320031290|gb|EFW13263.1| splicing factor u2af large subunit [Coccidioides posadasii str.
Silveira]
Length = 545
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 126/432 (29%), Positives = 211/432 (48%), Gaps = 66/432 (15%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP-- 162
+ +R + +D+ PP + A + G P +P A + A + ++L AF P
Sbjct: 151 RKRRLTQWDIKPPGYENVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQAFMNQPGG 204
Query: 163 ---VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
++ +R A+R++V L P +E +IA FF+ + G N D V+ ++
Sbjct: 205 NASNTLLKPSNSRQAKRLFVYNLSPSLSEDSIAQFFNLQLN--GLNVVSGVDPCVSAQLS 262
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAALG 268
+ FA +E +T +A+ A+A DG+ E + +RRP DY +
Sbjct: 263 TDGTFALLEFKTAADATVALAFDGVSMEPDDANGHTNGSSQGLSIRRPKDY-------IV 315
Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
P + + G+ S + + P+++ V +P + E Q+ LL SFG L F LVK
Sbjct: 316 PSETDDSNRQE--GVVSNEV--PDSPNKICVTNIPPFIQEEQVTMLLVSFGELKSFILVK 371
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D T S+G FC Y DP+ T+IA LNG+++GDK L V RA+
Sbjct: 372 DSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRASIG-------------- 417
Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEI 442
A Q +G++ MS+F +T + +VL L +TA+ L D+++YEEI
Sbjct: 418 ---------ATQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTAEELMDNDDYEEI 468
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+R+EC KYG ++ + +PRP ++ GVGK+++++ + A AL+GRKF
Sbjct: 469 CDDVRDECSKYGQVLEMKVPRPTGGSRQSAGVGKIYVKFDNYESAYKAMKALAGRKFQDR 528
Query: 503 TVNAFYYPEDKY 514
TV ++ E+ +
Sbjct: 529 TVVTTFFSEENF 540
>gi|237839189|ref|XP_002368892.1| U2 snRNP auxiliary factor or splicing factor, putative [Toxoplasma
gondii ME49]
gi|211966556|gb|EEB01752.1| U2 snRNP auxiliary factor or splicing factor, putative [Toxoplasma
gondii ME49]
Length = 553
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/366 (31%), Positives = 184/366 (50%), Gaps = 28/366 (7%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS-------AGPGDAVVNV----YINH 220
R +R+YVG LPP + + + FF+ + A+ + A G+ ++ V N
Sbjct: 185 RKQKRLYVGNLPPGSTQPDVVGFFNGALLAVNAQTGFVKEDEATAGEQLLPVERCEVFNE 244
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+F F+E+R + A + LDGI + G ++RV RP DY P G P+ +
Sbjct: 245 SSRFCFIELRNEQYAILCVKLDGITYNGYSLRVGRPHDYVPPPG-----GDPAHQAYIPL 299
Query: 281 VGLASGAIGGAE---------GP-DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDR 330
+ A + GP +++++ LP E Q+++LLE FGTL +L+++
Sbjct: 300 LDDAKKVKREEKREKPSRPETGPNNKIYIQNLPPEMGEEQVRDLLEQFGTLRVLNLIRNV 359
Query: 331 DTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-EQESILAQAQ 389
TG KGYGF Y+DP VTD A ALNG G L+V+RA SG T E +A
Sbjct: 360 QTGQHKGYGFFEYEDPEVTDQAILALNGFVCGANMLSVQRANFSGSVVTRETNRTMAVTS 419
Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
++ + L + GE +KV+ L + + L D +EYE I +D+++E
Sbjct: 420 LPNSMTQKLLSDPLVAVQVQAARKIGERPSKVVQLLNCVYQEDLIDPKEYEAICDDIKQE 479
Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFY 508
K+G L V++P+P+++ GVGKVFL Y D A+ L+GR+F N V A +
Sbjct: 480 AEKHGALEEVLVPKPNEDLSYREGVGKVFLRYSDVTAARKAQLMLNGRRFDSNRVVCAAF 539
Query: 509 YPEDKY 514
+PE+K+
Sbjct: 540 FPEEKF 545
>gi|296811258|ref|XP_002845967.1| splicing factor U2AF subunit [Arthroderma otae CBS 113480]
gi|238843355|gb|EEQ33017.1| splicing factor U2AF subunit [Arthroderma otae CBS 113480]
Length = 557
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 129/438 (29%), Positives = 203/438 (46%), Gaps = 72/438 (16%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
+ +R + +D+ PP + A + G P +P A + A + ++L AF P
Sbjct: 157 RKRRLTQWDIKPPGYENVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQAFINPPTA 210
Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
++ +R ++R++ +PP E + FF+ + G N D +V
Sbjct: 211 SGSSNNTLLKPSNSRQSKRLFAHNIPPSVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 268
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
I+ + KFA +E T +A+ A+A DGI E + + RP DY
Sbjct: 269 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANQESNGESNGQVKGLSIVRPKDYIVP 328
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
L P Q G+ S + + P+++ V +P + E Q+ LL SFG L
Sbjct: 329 LPTEEEPRQE---------GVLSSNV--PDSPNKICVSNIPPFIQEDQVTMLLISFGELK 377
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DPA T IA LNG+++GD+ L V RA+
Sbjct: 378 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 429
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
+Q +G++ MS+F +T + +VL L +TAD L D+
Sbjct: 430 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 474
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
++YEEI ED+++EC KYG + + IPRP + GVGK+++++ A A AL+G
Sbjct: 475 DDYEEICEDVQDECSKYGVVEELKIPRPSGGSRQAAGVGKIYVKFDTAESATKALQALAG 534
Query: 497 RKFGGNTVNAFYYPEDKY 514
RKF TV Y+ E+ +
Sbjct: 535 RKFQDRTVVTTYFSEENF 552
>gi|350635494|gb|EHA23855.1| hypothetical protein ASPNIDRAFT_209800 [Aspergillus niger ATCC
1015]
Length = 566
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 126/436 (28%), Positives = 198/436 (45%), Gaps = 78/436 (17%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 150 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQP 201
Query: 163 ------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
+ +R A+R++V +P + + FF+ + G N D ++
Sbjct: 202 AGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAFFNVQLN--GLNVIQSVDPCISA 259
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNPTL 263
+ + FA +E ++ +A+ A+A DGI E + VRRP DY
Sbjct: 260 QVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGNGAANGAAQGLEVRRPKDY---- 315
Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTL 321
+ PG A G + + P+++ V +P+Y E + LL+SFG L
Sbjct: 316 ---IVPGG-------AEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTMLLKSFGEL 365
Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
F LVKD T S+G FC Y DP+ T IA LNG+++GD+ L V RA+
Sbjct: 366 KSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASIG------- 418
Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
Q +G++ MS+F +T + +VL L +T + L D
Sbjct: 419 ----------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMD 462
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
E+Y+EI +D+R+EC KYGT+V + +PRP ++PGVGK+F+++ A AL+
Sbjct: 463 PEDYDEICDDVRDECSKYGTVVELKVPRPTGGSRQSPGVGKIFVKFDTVESTTNALKALA 522
Query: 496 GRKFGGNTVNAFYYPE 511
GRKF TV Y+ E
Sbjct: 523 GRKFSDRTVVTTYFSE 538
>gi|303323229|ref|XP_003071606.1| splicing factor, putative [Coccidioides posadasii C735 delta SOWgp]
gi|240111308|gb|EER29461.1| splicing factor, putative [Coccidioides posadasii C735 delta SOWgp]
Length = 545
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 126/432 (29%), Positives = 210/432 (48%), Gaps = 66/432 (15%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP-- 162
+ +R + +D+ PP + A + G P +P A + A + ++L AF P
Sbjct: 151 RKRRLTQWDIKPPGYENVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQAFMNQPGG 204
Query: 163 ---VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
++ +R A+R++V L P +E +IA FF+ + G N D V+ ++
Sbjct: 205 NASNTLLKPSNSRQAKRLFVYNLSPSLSEDSIAQFFNLQLN--GLNVVSGVDPCVSAQLS 262
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAALG 268
+ FA +E +T +A+ A+A DG+ E + +RRP DY +
Sbjct: 263 TDGTFALLEFKTAADATVALAFDGVSMEPDDANGHTNGSSQGLSIRRPKDY-------IV 315
Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
P + + G+ S + + P ++ V +P + E Q+ LL SFG L F LVK
Sbjct: 316 PSETDDSNRQE--GVVSNEV--PDSPSKICVTNIPPFIQEEQVTMLLVSFGELKSFILVK 371
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
D T S+G FC Y DP+ T+IA LNG+++GDK L V RA+
Sbjct: 372 DSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRASIG-------------- 417
Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEI 442
A Q +G++ MS+F +T + +VL L +TA+ L D+++YEEI
Sbjct: 418 ---------ATQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTAEELMDNDDYEEI 468
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+R+EC KYG ++ + +PRP ++ GVGK+++++ + A AL+GRKF
Sbjct: 469 CDDVRDECSKYGQVLEMKVPRPTGGSRQSAGVGKIYVKFDNYESAYKAMKALAGRKFQDR 528
Query: 503 TVNAFYYPEDKY 514
TV ++ E+ +
Sbjct: 529 TVVTTFFSEENF 540
>gi|134075845|emb|CAL00224.1| unnamed protein product [Aspergillus niger]
Length = 598
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 126/436 (28%), Positives = 198/436 (45%), Gaps = 78/436 (17%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 182 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQP 233
Query: 163 ------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
+ +R A+R++V +P + + FF+ + G N D ++
Sbjct: 234 AGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAFFNVQLN--GLNVIQSVDPCISA 291
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNPTL 263
+ + FA +E ++ +A+ A+A DGI E + VRRP DY
Sbjct: 292 QVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGNGAANGAAQGLEVRRPKDY---- 347
Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTL 321
+ PG A G + + P+++ V +P+Y E + LL+SFG L
Sbjct: 348 ---IVPGG-------AEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTMLLKSFGEL 397
Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
F LVKD T S+G FC Y DP+ T IA LNG+++GD+ L V RA+
Sbjct: 398 KSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASIG------- 450
Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
Q +G++ MS+F +T + +VL L +T + L D
Sbjct: 451 ----------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMD 494
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
E+Y+EI +D+R+EC KYGT+V + +PRP ++PGVGK+F+++ A AL+
Sbjct: 495 PEDYDEICDDVRDECSKYGTVVELKVPRPTGGSRQSPGVGKIFVKFDTVESTTNALKALA 554
Query: 496 GRKFGGNTVNAFYYPE 511
GRKF TV Y+ E
Sbjct: 555 GRKFSDRTVVTTYFSE 570
>gi|398403643|ref|XP_003853288.1| hypothetical protein MYCGRDRAFT_100024 [Zymoseptoria tritici
IPO323]
gi|339473170|gb|EGP88264.1| hypothetical protein MYCGRDRAFT_100024 [Zymoseptoria tritici
IPO323]
Length = 544
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 148/462 (32%), Positives = 217/462 (46%), Gaps = 79/462 (17%)
Query: 93 RNRSKSLSPSRSP--------------SKSKRRSGFDMAPPAAAMLPG--AAVPGQ--LP 134
+++ KS SP R P K +R + +D+ PP + A + G LP
Sbjct: 120 QSKRKSASPPRKPKEPTPDLTDIVPILEKPRRMTQWDVKPPGYENVTAEQAKLSGMFPLP 179
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
G P VP M N L Q G+ + + R ++R+ V LP A + ++ F
Sbjct: 180 GAPR-VPAMDANRLKEFMAQPGS--QANTSALKPSSARQSKRLLVYNLPASATDDSLMDF 236
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG------ 248
F+ + G N D ++ I+ +A +E +T E+A+NAMA+DGI E
Sbjct: 237 FNLQLN--GLNVTKGADPCISANISQGNGYALLEFKTPEDATNAMAMDGIKMEADVDMGN 294
Query: 249 -------VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGG 301
+ ++RP DY + P N + GL S + + +++ +
Sbjct: 295 GESNGTSKGLEIKRPKDY-------IVPTVSDETENTS--GLFSSIVPDTQ--NKISITN 343
Query: 302 LPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP-AVTDIACAALNGLK 360
+P Y E Q+ ELL SFG L F LVKD+ T S+G F Y+DP + T IA ALNG+
Sbjct: 344 IPVYLQEEQVVELLTSFGQLKNFVLVKDKSTEESRGIAFVEYKDPDSTTKIALEALNGMD 403
Query: 361 MGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA- 419
+GD L V+ A+ I IQ Q SG T+G M L T +
Sbjct: 404 LGDAALKVKLAS-------------------IGIQ----QVSGEMTVGA-MGLIAGTKST 439
Query: 420 -----KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
+VLCL IT + L D +E +EILED++EEC KYG L+ V +PRP + G+
Sbjct: 440 DADNGRVLCLMNMITPEELMDADEADEILEDVKEECAKYGELMEVKMPRPTGGSRQNNGI 499
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
GK++L+Y A A AL+GRKF TV Y+ E+ YF+
Sbjct: 500 GKIYLKYKAPDSAAKALGALAGRKFADRTVVVTYFGEE-YFD 540
>gi|237838479|ref|XP_002368537.1| RNA binding motif-containing protein [Toxoplasma gondii ME49]
gi|211966201|gb|EEB01397.1| RNA binding motif-containing protein [Toxoplasma gondii ME49]
gi|221505828|gb|EEE31473.1| RNA binding motif-containing protein, putative [Toxoplasma gondii
VEG]
Length = 816
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 93/230 (40%), Positives = 137/230 (59%), Gaps = 23/230 (10%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+RHAR+VYVG LP + + +F++++T + PGD +V+VY+N ++FAF+E R
Sbjct: 275 SRHARKVYVGNLPVPVTQAEVQQYFNELLTTLLPKKV-PGDTIVHVYVNPSRRFAFLEHR 333
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA--------AALG---------PGQPS 273
++EEA+ + LDG+ + A+ +RRP DYNPTLA A LG P Q +
Sbjct: 334 SIEEANFTLGLDGVSWRNCALSLRRPQDYNPTLAEQQYREERARLGSMTGFAVPPPSQAA 393
Query: 274 PNLNLAAVGLASGAIGGA-----EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
+ A L +GA+G + P ++F+GGLP+ TE K+LLE+FG L +VK
Sbjct: 394 TPASPAESSLIAGALGIVSTTVPDSPHKIFIGGLPHSITEQGCKQLLEAFGQLRALHVVK 453
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
D+ G+ KG+ FC Y DP VTD+A A LN +++ D+ L VRRA GQ K
Sbjct: 454 DQQRGDCKGFAFCEYLDPNVTDVAVAGLNNMRIADRVLQVRRAMPHGQMK 503
>gi|302511201|ref|XP_003017552.1| hypothetical protein ARB_04434 [Arthroderma benhamiae CBS 112371]
gi|291181123|gb|EFE36907.1| hypothetical protein ARB_04434 [Arthroderma benhamiae CBS 112371]
Length = 501
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 131/442 (29%), Positives = 203/442 (45%), Gaps = 72/442 (16%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
+ +R + +D+ PP + A V G P +P A + A + ++L AF P
Sbjct: 94 RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 147
Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
++ +R ++R++ +PP E + FF+ + G N D +V
Sbjct: 148 SGSSNNTLLKPSNSRQSKRLFAHNIPPNVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 205
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
I+ + KFA +E T +A+ A+A DGI E + + RP DY
Sbjct: 206 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRESNGESNGEVKGLTIVRPKDYIVP 265
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
+ P Q G+ S + + P+++ V +P + E Q+ LL SFG L
Sbjct: 266 IPTDEEPRQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 314
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DPA T IA LNG+++GD+ L V RA+
Sbjct: 315 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 366
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
+Q +G++ MS+F +T + +VL L +TAD L D+
Sbjct: 367 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 411
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
E+YEEI ED++EEC KYG + + IPRP + GVGK+++++ A AL+G
Sbjct: 412 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYVKFDSPESATKALQALAG 471
Query: 497 RKFGGNTVNAFYYPEDKYFNKD 518
RKF TV Y+ E + N +
Sbjct: 472 RKFQDRTVVTTYFSEASHPNSN 493
>gi|119482894|ref|XP_001261475.1| splicing factor u2af large subunit [Neosartorya fischeri NRRL 181]
gi|119409630|gb|EAW19578.1| splicing factor u2af large subunit [Neosartorya fischeri NRRL 181]
Length = 563
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 133/443 (30%), Positives = 208/443 (46%), Gaps = 84/443 (18%)
Query: 106 SKSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
++ +R + +D+ PP + A + G LPG P P ++L AF
Sbjct: 166 TRKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQ 217
Query: 162 P------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
P + +R ARR++V LP + + + + +FF+ + G N D ++
Sbjct: 218 PGGGSADNSALKPSNSRQARRLFVYNLPSVVSSEHLVSFFNLQLN--GLNVIHSVDPCIS 275
Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPTL 263
I+ + FA +E +T + + A+A DGI E + VRRP DY
Sbjct: 276 AQISEDHSFALLEFKTPNDTTVALAFDGITMEEHEPASGTENGAPKGLEVRRPKDYIVPN 335
Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHG 323
+A Q LN + P+++ V +P Y E + LL+SFG L
Sbjct: 336 GSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIPEEPVTMLLKSFGELKS 384
Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-----ASGQSK 378
F LVKD T S+G FC Y DP+ T IA LNG+++GD+ L V RA+ A+G
Sbjct: 385 FVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASIGMTQAAGLDM 444
Query: 379 TEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL------AKVLCLTEAITADA 432
G+N MS+F +T ++VL L +T +
Sbjct: 445 ------------------------GVN----AMSMFAKTTSQDLESSRVLQLLNMVTPEE 476
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AK 491
L D+++YEEI +D+REEC KYG ++++ +PRP ++PGVGK++++ +D V AT A
Sbjct: 477 LLDNDDYEEICDDVREECSKYGKVLDLKVPRPSGGSRQSPGVGKIYVK-FDTVESATNAL 535
Query: 492 NALSGRKFGGNTVNAFYYPEDKY 514
AL+GRKF TV Y+ E+ +
Sbjct: 536 KALAGRKFSDRTVVTTYFSEENF 558
>gi|221484193|gb|EEE22489.1| RNA binding motif-containing protein, putative [Toxoplasma gondii
GT1]
Length = 820
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 93/230 (40%), Positives = 137/230 (59%), Gaps = 23/230 (10%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+RHAR+VYVG LP + + +F++++T + PGD +V+VY+N ++FAF+E R
Sbjct: 279 SRHARKVYVGNLPVPVTQAEVQQYFNELLTTLLPKKV-PGDTIVHVYVNPSRRFAFLEHR 337
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA--------AALG---------PGQPS 273
++EEA+ + LDG+ + A+ +RRP DYNPTLA A LG P Q +
Sbjct: 338 SIEEANFTLGLDGVSWRNCALSLRRPQDYNPTLAEQQYREERARLGSMTGFAVPPPSQAA 397
Query: 274 PNLNLAAVGLASGAIGGA-----EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
+ A L +GA+G + P ++F+GGLP+ TE K+LLE+FG L +VK
Sbjct: 398 TPASPAESSLIAGALGIVSTTVPDSPHKIFIGGLPHSITEQGCKQLLEAFGQLRALHVVK 457
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
D+ G+ KG+ FC Y DP VTD+A A LN +++ D+ L VRRA GQ K
Sbjct: 458 DQQRGDCKGFAFCEYLDPNVTDVAVAGLNNMRIADRVLQVRRAMPHGQMK 507
>gi|389642205|ref|XP_003718735.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae 70-15]
gi|351641288|gb|EHA49151.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae 70-15]
Length = 620
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 175/363 (48%), Gaps = 55/363 (15%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R ++R+ + LP E ++ +F + + + N D + + + FA VE R
Sbjct: 289 SRQSKRLILSNLPAGTTEDSLISFLNLQLNGL--NVIEASDPCLACQMAPDGSFAMVEFR 346
Query: 231 TVEEASNAMALDGIIFE----------GVAVR---VRRPTDYNPTLAAALGPGQPSPNLN 277
+ + + A ALDGI E G A + +RRP DY + A + P
Sbjct: 347 SPSDTTVAYALDGISMEAEDAGNGDANGAASKGLAMRRPKDY--IVPAVVDDTGYEP--- 401
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
G+ S + + P ++ V LP Y T+ Q+ ELL SFG L L KD T S+G
Sbjct: 402 ----GVVSSRV--VDTPHKISVTNLPAYLTDEQVVELLSSFGELKALVLAKDSSTEESRG 455
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
FC Y D TD+A LNG+++GDK L VR+A+ I I
Sbjct: 456 IAFCEYVDVTNTDVAIEGLNGMELGDKRLKVRKAS-------------------IGIT-- 494
Query: 398 ALQTSGMNTLGGGMSLFGETLAK------VLCLTEAITADALADDEEYEEILEDMREECG 451
Q SGM MS+ T+A+ VL L +TAD L D+++YEEI ED++EEC
Sbjct: 495 --QVSGMEMGVNAMSMLAGTVAQDPDLSPVLQLLNMVTADELMDNDDYEEICEDVQEECA 552
Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
KYGT++ + +PRP + GVGK+++++ A AL GRKF TV Y+PE
Sbjct: 553 KYGTVIELKVPRPSSGAKQAAGVGKIYVKFDSIESSTKALKALGGRKFADRTVVTTYFPE 612
Query: 512 DKY 514
+ +
Sbjct: 613 ENF 615
>gi|440468063|gb|ELQ37246.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae Y34]
gi|440489023|gb|ELQ68704.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae P131]
Length = 640
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 175/363 (48%), Gaps = 55/363 (15%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R ++R+ + LP E ++ +F + + + N D + + + FA VE R
Sbjct: 289 SRQSKRLILSNLPAGTTEDSLISFLNLQLNGL--NVIEASDPCLACQMAPDGSFAMVEFR 346
Query: 231 TVEEASNAMALDGIIFE----------GVAVR---VRRPTDYNPTLAAALGPGQPSPNLN 277
+ + + A ALDGI E G A + +RRP DY + A + P
Sbjct: 347 SPSDTTVAYALDGISMEAEDAGNGDANGAASKGLAMRRPKDY--IVPAVVDDTGYEP--- 401
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
G+ S + + P ++ V LP Y T+ Q+ ELL SFG L L KD T S+G
Sbjct: 402 ----GVVSSRV--VDTPHKISVTNLPAYLTDEQVVELLSSFGELKALVLAKDSSTEESRG 455
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
FC Y D TD+A LNG+++GDK L VR+A+ I I
Sbjct: 456 IAFCEYVDVTNTDVAIEGLNGMELGDKRLKVRKAS-------------------IGIT-- 494
Query: 398 ALQTSGMNTLGGGMSLFGETLAK------VLCLTEAITADALADDEEYEEILEDMREECG 451
Q SGM MS+ T+A+ VL L +TAD L D+++YEEI ED++EEC
Sbjct: 495 --QVSGMEMGVNAMSMLAGTVAQDPDLSPVLQLLNMVTADELMDNDDYEEICEDVQEECA 552
Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
KYGT++ + +PRP + GVGK+++++ A AL GRKF TV Y+PE
Sbjct: 553 KYGTVIELKVPRPSSGAKQAAGVGKIYVKFDSIESSTKALKALGGRKFADRTVVTTYFPE 612
Query: 512 DKY 514
+ +
Sbjct: 613 ENF 615
>gi|164657478|ref|XP_001729865.1| hypothetical protein MGL_2851 [Malassezia globosa CBS 7966]
gi|159103759|gb|EDP42651.1| hypothetical protein MGL_2851 [Malassezia globosa CBS 7966]
Length = 473
Score = 181 bits (459), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 119/399 (29%), Positives = 196/399 (49%), Gaps = 75/399 (18%)
Query: 128 AVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLAN 187
A+PG +P PSA+ E+ T AF RR++V + +
Sbjct: 124 AMPGTIP--PSAMAELE------ATTSAAAF--------NASMYLETRRLHVSPVSSVKT 167
Query: 188 EQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE 247
Q + F + M S+G + V ++ ++ +A++E R +EASNA+ LDG+ F
Sbjct: 168 SQQLRIFINAKMNERLLCSSGSLEPCYAVDMHLDEGYAYLEFRNPDEASNALLLDGVAFL 227
Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYY 305
G + + RP Y A P+P GAI + +GP+++++G +P +
Sbjct: 228 GHRLHIERPKGYVGQDAV------PAP-----------GAIETSVPDGPNKLYIGNVPVF 270
Query: 306 FTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKT 365
E Q+ ELL++FG + FDL++D +T S+G FC + + AVTD+AC L+GL++G++
Sbjct: 271 LNEQQVMELLKAFGDVRHFDLIRDPETQRSRGMAFCEFHEDAVTDLACEGLDGLEVGEQR 330
Query: 366 LTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLT 425
L VRR AS + T +++ +TS +T + + +
Sbjct: 331 LMVRRVNASTNTHTHEDT---------------QETS-------------DTPTRAMLML 362
Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP------------DQNGGETPG 473
+T D L DD EY++I ED+ EC ++GT+ +V IPRP +G E G
Sbjct: 363 NMVTTDELLDDTEYQDIKEDVHSECSRHGTVTSVYIPRPLAAAAGHATSADAASGMEPKG 422
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
VG+V++++ A C A A++GR+F G TV Y +D
Sbjct: 423 VGRVYVQFVHADECEAALRAIAGRQFDGRTVICAYVRDD 461
>gi|302656965|ref|XP_003020217.1| hypothetical protein TRV_05722 [Trichophyton verrucosum HKI 0517]
gi|291184026|gb|EFE39599.1| hypothetical protein TRV_05722 [Trichophyton verrucosum HKI 0517]
Length = 486
Score = 181 bits (459), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 130/435 (29%), Positives = 200/435 (45%), Gaps = 72/435 (16%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
+ +R + +D+ PP + A V G P +P A + A + ++L AF P
Sbjct: 94 RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 147
Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
++ +R ++R++ +PP E + FF+ + G N D +V
Sbjct: 148 SGSSNNTLLKPSNSRQSKRLFAHNIPPNVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 205
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
I+ + KFA +E T +A+ A+A DGI E + + RP DY
Sbjct: 206 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRESNGESNGDVKGLTIVRPKDYIVP 265
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
+ P Q G+ S + + P+++ V +P + E Q+ LL SFG L
Sbjct: 266 IPTDEEPRQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 314
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y DPA T IA LNG+++GD+ L V RA+
Sbjct: 315 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 366
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
+Q +G++ MS+F +T + +VL L +TAD L D+
Sbjct: 367 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 411
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
E+YEEI ED++EEC KYG + + IPRP + GVGK+++++ A AL+G
Sbjct: 412 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYVKFDSPESATKALQALAG 471
Query: 497 RKFGGNTVNAFYYPE 511
RKF TV Y+ E
Sbjct: 472 RKFQDRTVVTTYFSE 486
>gi|327297188|ref|XP_003233288.1| splicing factor u2af large subunit [Trichophyton rubrum CBS 118892]
gi|326464594|gb|EGD90047.1| splicing factor u2af large subunit [Trichophyton rubrum CBS 118892]
Length = 563
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 131/442 (29%), Positives = 202/442 (45%), Gaps = 72/442 (16%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
+ +R + +D+ PP + A V G P +P A + A + ++L AF P
Sbjct: 156 RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 209
Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
++ +R ++R++ +PP E + FF+ + G N D +V
Sbjct: 210 SGSGNNTLLKPSNSRQSKRLFAHNIPPNVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 267
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
I+ + KFA +E T +A+ A+A DGI E + + RP DY
Sbjct: 268 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRESNGESNGNVKGLTIVRPKDYIVP 327
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
L P Q G+ S + + P+++ V +P + E Q+ LL SFG L
Sbjct: 328 LPTDEEPRQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 376
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
F LVKD T S+G FC Y D A T IA LNG+++GD+ L V RA+
Sbjct: 377 SFVLVKDVGTDESRGIAFCEYLDSASTGIAVEGLNGMELGDRRLKVNRASIG-------- 428
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
+Q +G++ MS+F +T + +VL L +TAD L D+
Sbjct: 429 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 473
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
E+YEEI ED++EEC KYG + + IPRP + GVGK+++++ A AL+G
Sbjct: 474 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYVKFDTPESATKALQALAG 533
Query: 497 RKFGGNTVNAFYYPEDKYFNKD 518
RKF TV Y+ E + N +
Sbjct: 534 RKFQDRTVVTTYFSEASHSNSN 555
>gi|449299113|gb|EMC95127.1| hypothetical protein BAUCODRAFT_526859 [Baudoinia compniacensis
UAMH 10762]
Length = 432
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 134/438 (30%), Positives = 209/438 (47%), Gaps = 72/438 (16%)
Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
+ +R + +D+ PP + M P P Q P PS + Q + Q
Sbjct: 35 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRL----QAFMNQPGNQA 90
Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
L P R ++R++ +PP NE I+ FF+ + G N D ++
Sbjct: 91 NTSALKP------STARQSKRLFAYNIPPNVNESMISDFFNLQLN--GLNVTRGVDPCIS 142
Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGI-------IFEGVA------VRVRRPTDYNPT 262
++ + +A ++ +T E+A+NAMALDGI + G A + ++RP DY
Sbjct: 143 AQLSQDLTYALLDFKTSEDATNAMALDGITMPEHMEVMNGSANGNSQGLIIQRPKDY--I 200
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVF---VGGLPYYFTETQIKELLESFG 319
+ A + + G+ S + PD F + +P Y TE Q++ELL SFG
Sbjct: 201 VPAVVDDTE-------HEAGVLSSTV-----PDTQFKISITHIPSYLTEEQVQELLVSFG 248
Query: 320 TLHGFDLVKDRDTGNSKGYGFCVYQDPA-VTDIACAALNGLKMGDKTLTVRRATASGQSK 378
L F LVKD T S+G FC Y+D TDIA +LNG+++GD L V+RA+ Q
Sbjct: 249 ELKNFVLVKDAGTDQSRGIAFCEYKDAKNTTDIAVESLNGMELGDSHLKVQRASIGTQ-- 306
Query: 379 TEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEE 438
Q + + M++ S GG ++VLCL IT + L D +E
Sbjct: 307 --------QVGGEMTVNAMSMMASA----AGGAD---RDASRVLCLMNMITPEELMDADE 351
Query: 439 YEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
+EILED++EEC KYG +++V +PRP ++ G+GK++++Y A AL+GRK
Sbjct: 352 ADEILEDVKEECAKYGAIIDVKMPRPSSGSRQSNGIGKIYVKYEKPEAAQKALAALAGRK 411
Query: 499 FGGNTVNAFYYPEDKYFN 516
F TV ++ E+ YF+
Sbjct: 412 FADRTVVVTFFGEE-YFD 428
>gi|95103124|gb|ABF51503.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 2
[Bombyx mori]
Length = 306
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 96/221 (43%), Positives = 136/221 (61%), Gaps = 10/221 (4%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ FF+Q M + G + G+ V+ I
Sbjct: 83 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQMH-LSGLAQAAGNPVLACQI 141
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P PG +P +N+
Sbjct: 142 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGTENPAINV 195
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A G+ S + + P ++F+GGLP Y E Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 196 PA-GVISTVV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGY 252
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT 379
F Y D ++TD A A LNG+++GDK L V+RA+ ++ T
Sbjct: 253 AFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNST 293
>gi|452841884|gb|EME43820.1| hypothetical protein DOTSEDRAFT_71600 [Dothistroma septosporum
NZE10]
Length = 433
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 139/463 (30%), Positives = 217/463 (46%), Gaps = 55/463 (11%)
Query: 74 RDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--AAVP 130
R ER+ +S S + + + L+ S + KRR + +D+ PP + A +
Sbjct: 2 RQLERQTARKSASPPPRKPKEPTPDLTEVTSVLERKRRLTQWDIKPPGYENVTAEQAKLS 61
Query: 131 GQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANE 188
G LPG P P Q + F G ++ T R ++R+ + +P A E
Sbjct: 62 GMFPLPGAPRQQPMDPQKLQAFMNQPGGEANKTALKPST---ARQSKRLLIYNIPASATE 118
Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG 248
I FF+ + G N D ++ ++ +K +A +E +T E+A+NAMA DGI E
Sbjct: 119 DTIMDFFNLQLN--GLNVTRGADPCISAQLSQDKAYALLEFKTPEDATNAMAFDGINMEP 176
Query: 249 VA---------------VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
A + ++RP DY + + G + G+ S + +
Sbjct: 177 EAMVTSGNEDENGGARGLDIKRPKDY---IVPVVTDGTEN------DAGVLSNVVPDTQ- 226
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
+++ + +P Y E Q ELL SFG L F LVKD T S+G FC Y+DP T +A
Sbjct: 227 -NKISITNIPAYVDEEQTMELLNSFGELKNFVLVKDASTEESRGIAFCEYKDPNSTKVAV 285
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
+L+G+ +GD + VR A+ Q Q +++ M+L G G
Sbjct: 286 ESLHGMTLGDAAMKVRLASIGIQ----------QVSGEMSVNAMSLMAGTARADGEG--- 332
Query: 414 FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
+VL L IT + L D +E +EILED++EEC KYG L++V +PRP ++ G
Sbjct: 333 -----GRVLSLMNMITPEELMDPDEADEILEDVKEECAKYGPLLDVKMPRPTGGSRQSNG 387
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+GK++L+Y A A AL+GRKF TV Y+ E+ YF+
Sbjct: 388 IGKIYLKYESTESAAKALAALAGRKFADRTVVVTYFGEE-YFD 429
>gi|255931767|ref|XP_002557440.1| Pc12g05960 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211582059|emb|CAP80223.1| Pc12g05960 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 554
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 126/429 (29%), Positives = 195/429 (45%), Gaps = 72/429 (16%)
Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
++ +R + +DM PP + M P P Q P PS + + P G ++
Sbjct: 173 TRKRRLTQWDMKPPGYENVTAEQAKISGMFPLPGAPRQQPMDPSRMKDFLNP--PTGDSE 230
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
A + +R ++R++V +PP + A+ FF+ + G N D +
Sbjct: 231 NAA--------LKPSNSRQSKRLFVYNIPPGVSGDAVIAFFNLQLN--GLNVIRSVDPCI 280
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFE---GVAVRVRRPTDYNPTLAAALGPGQ 271
+ ++ +K FA +E + +A+ A+ALDGI + VRRP DY +A P Q
Sbjct: 281 SAQVSEDKTFALLEFKDPNDATVALALDGITMPESGDKGLEVRRPKDYIVPDGSAAQPVQ 340
Query: 272 PSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
P LN + P+++ + +P Y E I LL+SFG L F LVKD
Sbjct: 341 PGVVLNEVP-----------DSPNKICISNIPTYINEEAIIMLLKSFGDLKSFILVKDAA 389
Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
T S+G F Y DP T +A LNG+++ D+ L RA+
Sbjct: 390 TEESRGIAFYEYVDPNNTALAVEGLNGMELADRRLKFVRASIG----------------- 432
Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILED 445
Q +G++ M +F +T + +VL L +T D L +DE+YEEILED
Sbjct: 433 ------TTQATGLDMGVNAMQMFAKTTSQDLETTQVLQLLNMVTLDELLNDEDYEEILED 486
Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
+ EEC K+G ++ + IPR G GK+F++Y A A AL+GRKF TV
Sbjct: 487 VGEECSKFGKMIGIKIPRRGH------GAGKIFIKYDTAESATNALKALAGRKFSDRTVV 540
Query: 506 AFYYPEDKY 514
A Y+ + +
Sbjct: 541 ASYFSVENF 549
>gi|312078073|ref|XP_003141580.1| U2af splicing factor protein 1 [Loa loa]
Length = 460
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 103/280 (36%), Positives = 155/280 (55%), Gaps = 31/280 (11%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V V+ T +RR+YVG +P +E A+ FF+Q M + G + PG+ V+ +N +K
Sbjct: 178 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQMH-LCGLAQAPGNPVLACQMNLDK 236
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
FAF+E R+++E + MA DGI F G +++RRP DY P S + +L +
Sbjct: 237 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPM----------STSYDLGNMM 286
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+++ + P ++F+GGLP Y Q+KELL SFG L F+LV ++ TG SKGY F
Sbjct: 287 VSNIV---PDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAE 343
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
Y DP++TD A A LNG+++GDK L V+ + A+ ++ Q + +Q +
Sbjct: 344 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVA 391
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
G++ G G +VLCL +T D L DDEEYE I
Sbjct: 392 GIDLSHGA----GPP-TEVLCLMNMVTEDELKDDEEYEGI 426
>gi|342873171|gb|EGU75391.1| hypothetical protein FOXB_14096 [Fusarium oxysporum Fo5176]
Length = 661
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 129/430 (30%), Positives = 205/430 (47%), Gaps = 67/430 (15%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 177 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMNQP 228
Query: 163 VQVMTQ-----QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+T +R ++R+ V +P +E+A+ +FF+ + G N D V
Sbjct: 229 GGQVTSAGLKANNSRQSKRLLVSRIPSGTSEEALMSFFNLQLN--GLNVIDTTDPCVLCQ 286
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGV-----------AVRVRRPTDYNPTLAAA 266
++++ FA +E + EA+ A+A+DGI E + +RRP DY + A
Sbjct: 287 FSNDRSFAVIEFKDAPEATVALAMDGISMEASDASNGTDGGHRGLEIRRPRDY---VVPA 343
Query: 267 LGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
+ V S + + +++ + +P + TE QI ELL SFG F
Sbjct: 344 V----------TEEVSYDSEVVSNIVPDTVNKLSITNIPTFLTEEQIIELLASFGKPKAF 393
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTD-IACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
LVKDR T S+G F YQDPA ++ A LNG+++G K L V +A+
Sbjct: 394 VLVKDRGTEESRGIAFAEYQDPAASNPTALDTLNGMEIGGKKLKVSKAS----------- 442
Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL--AKVLCLTEAITADALADDEEYEE 441
I ++A G+ + G S + ++VL L +TA+ L D+++YEE
Sbjct: 443 --------IGPTQVANFDVGITAISGLASQTANEVESSRVLQLLNMVTAEELLDNDDYEE 494
Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
I ED++EEC K+G +++V +PRP ++ GVGK+F++Y A A A AL+GRKF
Sbjct: 495 ICEDVKEECSKFGKIIDVKVPRPTGGSRQSAGVGKIFVKYEKAEDTAKALQALAGRKFAD 554
Query: 502 NTVNAFYYPE 511
TV Y+PE
Sbjct: 555 RTVVTTYFPE 564
>gi|408397958|gb|EKJ77095.1| hypothetical protein FPSE_02739 [Fusarium pseudograminearum CS3096]
Length = 554
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 129/431 (29%), Positives = 201/431 (46%), Gaps = 63/431 (14%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 159 RQRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMNQP 210
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+T +R ++R+ V +PP +E + FF+ + G N D V
Sbjct: 211 GGQVTSAGLKASNSRQSKRLLVSRIPPGTSEDTLIAFFNLQLN--GLNVIDTTDPCVLCQ 268
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-----------VAVRVRRPTDYNPTLAAA 266
++++ FA +E + E + A+ALDGI E + +RRP DY + A
Sbjct: 269 FSNDRSFAVIEFKDAPETTVALALDGISMEANDASNGADGGHRGLEIRRPRDY--VVPAV 326
Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
P + V + +++ + +P + TE QI ELL SFG F L
Sbjct: 327 TEDVAYDPEVVSNVV---------PDTVNKLSITNIPPFLTEEQIIELLASFGKPKAFVL 377
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTD-IACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
VKDR T S+G F YQDPAV++ A LNG+ +G K + V +A+
Sbjct: 378 VKDRGTEESRGIAFAEYQDPAVSNPTALDTLNGMDIGGKQIKVSKAS------------- 424
Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL--AKVLCLTEAITADALADDEEYEEIL 443
I ++A G+ + G S + ++VL L +TA+ L D+++YEEI
Sbjct: 425 ------IGPTQVANFDVGITAISGLASQTANEVESSRVLQLLNMVTAEELLDNDDYEEIC 478
Query: 444 EDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNT 503
ED+REEC KYG +++V +PRP ++ GVGK+F++Y A AL+GRKF T
Sbjct: 479 EDVREECSKYGKILDVKVPRPTGGSRQSAGVGKIFVKYEHTEDTTKALQALAGRKFADRT 538
Query: 504 VNAFYYPEDKY 514
V Y+PE+ +
Sbjct: 539 VVTTYFPEENF 549
>gi|403331270|gb|EJY64574.1| RNA-binding proteins (RRM domain) [Oxytricha trifallax]
Length = 565
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/391 (31%), Positives = 187/391 (47%), Gaps = 62/391 (15%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
H+RR+YVGG+P ++ + F +Q + GG GD V+ N EK++ F+E+R+V
Sbjct: 188 HSRRLYVGGVPTSQSDVQVVQFLTQTLRKAGG-ILEEGDPVIKSQNNPEKRYTFLELRSV 246
Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
EEAS + LDGI F +R+RRP DY+ P + AA+G+ S + E
Sbjct: 247 EEASTMIQLDGIKFMDSTLRIRRPEDYDKYPQIPPRRPIPQIDT--AALGIISTKV--EE 302
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
P ++FVGGLP F E QIK LL +G L F LVK + S+G+ FC Y D A
Sbjct: 303 TPLKIFVGGLPKEFNEEQIKNLLLRYGQLKSFHLVKHTNIDQSRGFAFCEYTDEKGVQNA 362
Query: 353 CAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ------------------------- 387
LNGLK+G +++ VRR AS S +Q I +
Sbjct: 363 IQFLNGLKIGSRSINVRRTGAST-STVQQNQISEEDKKKFEDKLDDFINETGKFMQNANM 421
Query: 388 -------AQQHIAI-------QKMA---LQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
QQH Q+MA Q+ G T+ M + K+ L++ I
Sbjct: 422 YNIDKQSVQQHNPFDELERHNQQMASYIQQSYGYTTISTVMEI------KIPALSQQI-- 473
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRP---DQNGGETP-GVGKVFLEYYDAVG 486
L +D+E++E+++D+ +E K+G + +++PR Q P +GK F+E+ +
Sbjct: 474 --LDNDQEHDELVKDLTQELQKFGKIRTLLLPRSLDMMQTSTVKPSAIGKAFVEFEEVTS 531
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
N L+GR+F G V +Y +D + K
Sbjct: 532 GFACYNLLNGRQFMGMPVEINFYNKDLFVTK 562
>gi|340780291|pdb|2YH0|A Chain A, Solution Structure Of The Closed Conformation Of Human
U2af65 Tandem Rrm1 And Rrm2 Domains
gi|340780292|pdb|2YH1|A Chain A, Model Of Human U2af65 Tandem Rrm1 And Rrm2 Domains With
Eight-Site Uridine Binding
Length = 198
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 131/204 (64%), Gaps = 10/204 (4%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+
Sbjct: 4 ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 62
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
E + AMA DGIIF+G ++++RRP DY P PG S N ++ G+ S + +
Sbjct: 63 ETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDS 113
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D VTD A
Sbjct: 114 AHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAI 173
Query: 354 AALNGLKMGDKTLTVRRATASGQS 377
A LNG+++GDK L V+RA+ ++
Sbjct: 174 AGLNGMQLGDKKLLVQRASVGAKN 197
>gi|402073699|gb|EJT69251.1| splicing factor U2AF 50 kDa subunit [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 623
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/365 (31%), Positives = 177/365 (48%), Gaps = 59/365 (16%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R ++R+ V P E+A+ F + + + N D + + + FA +E R
Sbjct: 292 SRQSKRLIVTNFAPGTTEEALVAFMNLQLNGL--NVIESTDPCLLCQMAPDSSFAILEFR 349
Query: 231 TVEEASNAMALDGIIFE-------GVA------VRVRRPTDYNPTLAAALGPGQPSPNLN 277
+ E + A+ALDGI E G A + +RRP DY + A+
Sbjct: 350 SPAETTVALALDGITMEAEDTPMEGAANGTPQGLELRRPKDY---IVPAV---------- 396
Query: 278 LAAVGLASGAIGG--AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNS 335
+ G G + + P ++ + L Y TE Q+ ELL SFG L LVKD T S
Sbjct: 397 VEDTGYERGVVSSRVVDTPHKIGITNLAPYLTEEQVTELLVSFGELKALVLVKDSGTEES 456
Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQ 395
+G FC Y DP TD+A LN +++G+K L V++A+ I I
Sbjct: 457 RGIAFCEYVDPVATDVAIHGLNNMELGEKRLRVKKAS-------------------IGIT 497
Query: 396 KMALQTSGMNTLGGGMSLFGET------LAKVLCLTEAITADALADDEEYEEILEDMREE 449
+++ G+N MS+ T L++VL L +TAD L D+++YEEI +D+REE
Sbjct: 498 QVSGIEMGIN----AMSMLAGTVAQDPDLSRVLQLLNMVTADELLDNDDYEEICDDVREE 553
Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
C K+GT++ + IPRP + GVGK+++++ A AL+GRKF TV Y+
Sbjct: 554 CSKFGTILELKIPRPSGGARQLAGVGKIYVKFDTIESSTEALKALAGRKFADRTVVTTYF 613
Query: 510 PEDKY 514
PE+ +
Sbjct: 614 PEENF 618
>gi|429854658|gb|ELA29655.1| splicing factor u2af large subunit [Colletotrichum gloeosporioides
Nara gc5]
Length = 559
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 159/526 (30%), Positives = 243/526 (46%), Gaps = 67/526 (12%)
Query: 16 RHKSSWVSGRSRTGER--GRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYN 73
R + SGR R G+R RDR + DD + + + R R DR +
Sbjct: 69 REREDRYSGRDRRGDREWDRDRGSSRRDARRDDDDHRRRDRDPYDDRRRGGRGDRQQQQQ 128
Query: 74 RDKERRHRHRSRSHSSD---RFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA--------- 120
D R R S+ + R + L+ S + KRR + +D+ PP
Sbjct: 129 HDDGGMGRRGDRQRSATPPPKKREPTPDLTNVTSVLERKRRLTQWDIKPPGYENVTAEQA 188
Query: 121 --AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVY 178
+ M P P Q P PS + Q ++ Q+ + L P +R A+R+
Sbjct: 189 KLSGMFPLPGAPRQQPMDPSKL----QAIMNQPGGQVNSAALKPSN------SRQAKRLL 238
Query: 179 VGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNA 238
+ LPP A E +I FF+ + G N D + ++ + FA VE R EA+ A
Sbjct: 239 INNLPPSATEDSIVGFFNLQLN--GLNVIESTDPCTSCQLSKDHSFAVVEFRNASEATVA 296
Query: 239 MALDGIIFE------GVA----VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ALDGI E G A + +RRP DY P +P G+ S +
Sbjct: 297 LALDGITMEADDATNGAAGSNGLVIRRPKDYIVPAVVDDVPYEP---------GVVSNIV 347
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+ P+++ + +P Y ++ Q+ ELL SFG L F LV+D+ T S+G FC Y +P+
Sbjct: 348 --IDTPNKISIANMPPYLSDEQVTELLVSFGELKAFVLVRDKSTEESRGIAFCEYVEPSA 405
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
TD+A LNG+ +GDK L V++A+ +A +M + + M+ L
Sbjct: 406 TDVAIQGLNGMDLGDKKLRVQKASV--------------GVTQVAGVEMGV--AAMSMLA 449
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
G S E +VL L +T + L D+++YEEI ED+ EEC K+G +++V IPRP
Sbjct: 450 GTTSTDSEE-TRVLQLLNMVTPEELMDNDDYEEIKEDVEEECTKFGKVLDVKIPRPVGGS 508
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ GVGK+F+ + A AL+GRKF TV Y+PE+ +
Sbjct: 509 RQSAGVGKIFVRFESKEVAKKALQALAGRKFADRTVVTTYFPEENF 554
>gi|407929464|gb|EKG22293.1| hypothetical protein MPH_00360 [Macrophomina phaseolina MS6]
Length = 824
Score = 174 bits (442), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 178/366 (48%), Gaps = 56/366 (15%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
+R A+R++V P + + +I FF+ + + N D ++V I+ ++ FA E +
Sbjct: 18 SRQAKRLFVYNFPAASTDDSIQDFFNLQLNHL--NVISSSDPCISVQISKDRTFALCEFK 75
Query: 231 TVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
T E+ + A+ALDG E +++ RP DY + P Q S + +
Sbjct: 76 TPEDTTMALALDGQSMEAEDASNGASNGGHSGIKISRPKDY-------IVPAQ-SDDADY 127
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
G+ S + +GP ++ V +P Y TE Q+ +LL +FG L F LVKD T SKG
Sbjct: 128 QE-GVVSNKV--KDGPHKICVAQIPVYLTEEQVMDLLSAFGGLKAFTLVKDTGTDQSKGI 184
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
FC Y DP TD A L+G+++ L V++A + IQ
Sbjct: 185 AFCEYVDPDTTDPAVEGLDGMEIAQDHLKVKKAC-------------------VGIQ--- 222
Query: 399 LQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILEDMREECGK 452
Q SG+ MS+ T + +VL L +T + L D +EYEEI ED+ EEC K
Sbjct: 223 -QASGLEMGVNAMSMLAGTSSGDVEQGRVLMLLNMVTPEELMDPQEYEEIQEDVHEECSK 281
Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
YG + + IPRP Q E GVGK+F++Y A AL+GRKF TV ++ E+
Sbjct: 282 YGKVEELKIPRP-QPPKENKGVGKIFVKYDTPESAQKALRALAGRKFADRTVVVTFFGEE 340
Query: 513 KYFNKD 518
YF+ D
Sbjct: 341 -YFDVD 345
>gi|322693990|gb|EFY85833.1| splicing factor, putative [Metarhizium acridum CQMa 102]
Length = 584
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 160/521 (30%), Positives = 232/521 (44%), Gaps = 66/521 (12%)
Query: 16 RHKSSWVSGRSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRD 75
R + SGR R GER DR + D+ + DREG D R R D D
Sbjct: 86 REREDRYSGRDRRGERDWDRDRGSSRRDARRDEDDRPNRRDREGFDDRRRGGRGGDRRDD 145
Query: 76 KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--AAVPGQ 132
+ RS S + R + L+ + KRR + +D+ PP ++ A + G
Sbjct: 146 GGFARQESRRSPSPAKPREPTPDLTDIIPVLERKRRMTQWDIKPPGYELVTAEQAKLSGM 205
Query: 133 --LPGVPSAVPEMAQNMLPFGATQLGAFPLMP-----VQVMTQQATRHARRVYVGGLPPL 185
LPG P P T+L AF P + +R A+R+ V +P
Sbjct: 206 FPLPGAP--------RQQPMDPTKLQAFMKEPNGGVSSAGLKASNSRQAKRLIVSNIPQG 257
Query: 186 ANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGII 245
+E+++ +FF+ + G N D + ++ FA +E R +A+ A+ALDGI
Sbjct: 258 NSEESLISFFNLQLN--GLNVIESSDPCNLCQFSTDRSFAVLEFRNAGDATVALALDGIN 315
Query: 246 FEG-----------VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
E + +RRP DY + A PN+ V P
Sbjct: 316 MEADDTMNGDGGEKQGLSIRRPKDY--VMPAIPEEMAYDPNVVSNVV------------P 361
Query: 295 DRVF---VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP-AVTD 350
D V + +P + TE QI ELL +FG F LVKDR T S+G F Y +P + +
Sbjct: 362 DTVHKLSITNIPTFLTEDQIIELLAAFGKPKAFVLVKDRSTEESRGIAFAEYLEPGSANE 421
Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
A +ALNG+ +G K L V +A+ G ++ + AI +A QTS G
Sbjct: 422 PALSALNGMDVGGKKLKVAKASI-GPTQVANFDV-----GITAISGLASQTSTDAEKG-- 473
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
+VL L +T + L D EEYEEI ED+REEC K+G ++ + IPRP +
Sbjct: 474 ---------RVLQLLNMVTPEELMDTEEYEEICEDVREECSKFGNILELKIPRPVGGSRQ 524
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
+ GVGK+F+++ C A AL+GRKF TV Y+PE
Sbjct: 525 SAGVGKIFVKFDTPDSCHKALTALAGRKFADRTVVTTYFPE 565
>gi|47217926|emb|CAG02209.1| unnamed protein product [Tetraodon nigroviridis]
Length = 600
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 123/212 (58%), Gaps = 17/212 (8%)
Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
++KELL SFG L F+LVKD T SKGY FC Y D TD A A LNG+++GDK L V
Sbjct: 405 VKVKELLTSFGPLKAFNLVKDGATSLSKGYAFCEYVDVGATDQAVAGLNGMQLGDKKLIV 464
Query: 369 RRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
+RA+ +K S A+A + + + LQTSG+ T +VLCL
Sbjct: 465 QRASVG--AKNANPSAAAEAPVTLQVPGLQRLQTSGVPT-------------EVLCLLNM 509
Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+ + L DDE+YEEILED+REEC KYG + ++ IPRP +G E PG GK+F+EY A C
Sbjct: 510 VVPEELVDDEDYEEILEDVREECCKYGGVRSIEIPRP-VDGVEVPGCGKIFVEYVSASDC 568
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A AL+GRKF V YY D Y ++
Sbjct: 569 QKAMQALTGRKFANRVVVTKYYDPDMYHRHEF 600
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 98/254 (38%), Gaps = 63/254 (24%)
Query: 100 SPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
+P R P + R + PPAA +P A +L A+
Sbjct: 112 APPRLPGPAPRLPSVSLWPPAAGQIPTMA------------------LLATAASAGVVAA 153
Query: 160 LMPVQVMTQQATRHARRVYVGGLP---------PLANEQAIATFFSQVMTAIGGNSAGPG 210
PV V Q TR ARR+YVG +P + ++++A FF+ M + G S P
Sbjct: 154 PTPVPVAGSQMTRQARRLYVGNIPFGLTEALRRLCSPQESMAEFFNAQMR-LAGLSQAPS 212
Query: 211 DAVVNVYINHEKKFAFVEMR-------------TVEEASNAMALDGIIFEGVAVRVRR-- 255
+ V+ V IN +K FAF+E+R + ++ L G G +R R
Sbjct: 213 NPVLAVQINQDKNFAFLEVRPGFSAAAALPAAAAAADVCVSVPLGGRDHAGHGLRRHRVP 272
Query: 256 ------PTDYN-PTLAAALG-PGQPSPNL----------NLAAVGLASGAIGGAEGPDRV 297
PT P A LG G P P G+ S + + P ++
Sbjct: 273 GSGSEDPTASRLPASARHLGAAGVPRPRFLRAAARHAARVGRRPGVVSTVV--PDSPHKL 330
Query: 298 FVGGLPYYFTETQI 311
F+GGLP Y + Q+
Sbjct: 331 FIGGLPNYLNDDQV 344
>gi|389585165|dbj|GAB67896.1| U2 snRNP auxiliary factor [Plasmodium cynomolgi strain B]
Length = 894
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 105/318 (33%), Positives = 163/318 (51%), Gaps = 17/318 (5%)
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY------NPTLAAALGPGQ 271
N E +F F+E R++E + LD I F +R+ RP D+ +P L
Sbjct: 575 FNVESRFCFLEFRSLEITWLCLRLDAISFNNYCLRIARPHDFVPPPGGDPALTVVFTDIN 634
Query: 272 PSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
+ V +A G + +++++ LP+ + QI++LL+ FG L GF+++KD +
Sbjct: 635 HEVFEMVKPVKIAPVRSTG-DDDNKLYIQNLPHDLRDDQIRDLLQQFGKLKGFNVIKDLN 693
Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQ 390
TG +KGYGF Y+D T IA ALNG G L V++AT Q+ T+ + + A
Sbjct: 694 TGLNKGYGFFEYEDSNCTPIAMHALNGFVCGQNILNVKKATFGKSQNSTQNANTTSLATG 753
Query: 391 HI---------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEE 441
+ +I + L S + GE ++V+ LT A+ + L D +YEE
Sbjct: 754 SVDLPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVVQLTNAVFQEDLLIDSQYEE 813
Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
IL+D++EE KYG L N+VIP+P+++ T GVGK+FL Y D A+ L+GR F
Sbjct: 814 ILKDIKEEAEKYGPLQNIVIPKPNKDLSYTEGVGKIFLHYADETTARKAQYMLNGRLFEK 873
Query: 502 NTVNAFYYPEDKYFNKDY 519
V A +Y E+K+ Y
Sbjct: 874 RVVCAAFYSEEKFLAGKY 891
>gi|344251408|gb|EGW07512.1| Splicing factor U2AF 65 kDa subunit [Cricetulus griseus]
Length = 422
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 115/285 (40%), Positives = 160/285 (56%), Gaps = 30/285 (10%)
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++ G+ S +
Sbjct: 26 RSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV- 77
Query: 290 GAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
PD ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D
Sbjct: 78 ----PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDI 133
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
VTD A A LNG+++GDK L V+RA+ ++ T + Q + +Q L +S +
Sbjct: 134 NVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMSSQVQ- 188
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+GG + +VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IPRP
Sbjct: 189 MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-V 240
Query: 467 NGGETPGVGK--VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
+G E PG GK +EY G + T+ F Y
Sbjct: 241 DGVEVPGCGKAMTLMEYLIKTGSERVSQQCKENMYAVQTLKDFQY 285
>gi|167515386|ref|XP_001742034.1| RNA-binding region RNP-1 containing protein [Monosiga brevicollis
MX1]
gi|163778658|gb|EDQ92272.1| RNA-binding region RNP-1 containing protein [Monosiga brevicollis
MX1]
Length = 431
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 138/464 (29%), Positives = 211/464 (45%), Gaps = 69/464 (14%)
Query: 75 DKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPP------AAAMLPGAA 128
D++RR R R R S+ +P S + G++ PP G
Sbjct: 16 DRDRRSRSLERDS-----RRSSERDAPEPSEAWDVPPPGYENMPPKVYKDYVCTYFAGLP 70
Query: 129 VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANE 188
+ +LPG+ S++P PL TR ARR+YVGG+P AN+
Sbjct: 71 ISAELPGLRSSMPN----------------PL----------TRGARRLYVGGIPNGAND 104
Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG 248
+A FF+ +T G + GPG VV+ IN EK FAF+E+R+ EEA++ +A D I+F G
Sbjct: 105 MELAEFFNMQLTQ-QGLTIGPGAPVVSAQINEEKSFAFLELRSPEEATSCIAFDNIMFMG 163
Query: 249 VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD--------RVFVG 300
+R+RRP DY A G P +++ R+ V
Sbjct: 164 NQLRIRRPKDYQ----APAGGTSEVPKVDMPMPRPMPMPTPMPMPTPMPMLVPSGRLNVT 219
Query: 301 GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLK 360
+P E Q++EL FGT+ +L K+ +T G + D D LN +K
Sbjct: 220 NIPLAMDEEQLRELFSVFGTIASLELRKEPETDKFAGDAIVEF-DTRAPDF----LNQVK 274
Query: 361 MGDKTLTVRRATASGQS-KTEQESILAQAQQHIAIQKMALQTSGMNT---LGGGM--SLF 414
G + + GQ K EQ + + + ++ + S +N +GG ++
Sbjct: 275 AGLEDIDF-----EGQKLKVEQ---VVRWWSYCGLRASYIAPSLVNASPFVGGAAAPAVP 326
Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
+VL L +T + L DDEEY++I+ED+REECGK+G + ++ IPRP G + G+
Sbjct: 327 DVEATEVLVLMNMVTKEELQDDEEYKDIMEDIREECGKFGNITDLKIPRPVAEGEQPIGL 386
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
K+F+ Y A+ ALSGR+F TV Y K+ N +
Sbjct: 387 EKIFIRYATVDEARNAQRALSGRRFANRTVVVSYLDVAKFENDE 430
>gi|328865493|gb|EGG13879.1| RapGAP/RanGAP domain-containing protein [Dictyostelium fasciculatum]
Length = 3032
Score = 171 bits (433), Expect = 9e-40, Method: Composition-based stats.
Identities = 114/373 (30%), Positives = 186/373 (49%), Gaps = 38/373 (10%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN-SAGPGDAVVNVYINHEKKFAFV 227
Q + +RR+Y+G +PP + + FF+ +TA + S+ G V++ IN K FAF+
Sbjct: 2570 QQNKQSRRLYIGNIPPNITDNTLIDFFNTAITAANLHLSSKTGPVVLSCQINSAKNFAFL 2629
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
E R+ EEA+NAM LDGI ++++RRPTDY P + P P ++ + S
Sbjct: 2630 EFRSAEEATNAMGLDGISLFTFSLKIRRPTDYQPPANESSMPSAP------VSMSIVSTN 2683
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+ +E +++F+GG+P E QIK +L +FG L F+LVKD TG+SKGY FC Y +
Sbjct: 2684 VPDSE--NKIFIGGIPTTLNEEQIKSMLLAFGRLKAFNLVKDPKTGSSKGYAFCEYYETE 2741
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ------------------ 389
T+ LNG K G+K+L V+R++ + + +
Sbjct: 2742 ETNDCINGLNGTKFGEKSLVVQRSSVGTKDPSTTSTNNNNNNNNNNNNNNANMNSNRKSV 2801
Query: 390 ---QHIAIQKMALQTSGMNTLGGGMSLF----GETLAKVLCLTEAITADALADDEEYEEI 442
Q + L +S LG S + V+ L + + + DD +YE +
Sbjct: 2802 TTFDQSVTQMLNLASSIPQVLGTIRSNIPSDSNTKSSTVIQLFNLVDREDIQDDSDYENL 2861
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETP-GVGKVFLEYYDAVGCATAKNALSGRKFGG 501
L D++EEC ++G + ++ I RP + E P + KVF+++ A ++ GR++
Sbjct: 2862 LIDVKEECEEFGEVESIFISRPKE---ENPLDIVKVFVKFVSLESAQRAWMSIGGRRYNY 2918
Query: 502 NTVNAFYYPEDKY 514
T+ +YPED Y
Sbjct: 2919 RTIITAFYPEDFY 2931
>gi|384500209|gb|EIE90700.1| hypothetical protein RO3G_15411 [Rhizopus delemar RA 99-880]
Length = 490
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 163/350 (46%), Gaps = 73/350 (20%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
AT+ ARR+YVG +PP E+ +A FF+ M + P V V INHEK +AFVE
Sbjct: 214 ATKQARRLYVGQIPPGLEEKPLADFFNATMHQLQMQDRTP---VAAVQINHEKSYAFVEF 270
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
+T E+A+ MA DGI+F+G +++RRP DY P ++++ GL S +
Sbjct: 271 QTAEQATACMAFDGIMFQGQQLKIRRPKDYQPPAEG---------DVSMQLPGLVSTNV- 320
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
+ P+++F+GGLP Y + Q+ ELL+SF
Sbjct: 321 -PDTPNKIFIGGLPVYLNDDQVIELLKSF------------------------------- 348
Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
GD+ L V+RA+ +HI M+ N +
Sbjct: 349 ------------GDRKLIVQRASVGA--------------KHIPPDYMSGPMLPANYV-P 381
Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
S + +VL L +T + L DDEEY++I ED+ EEC K+G ++++ IP+P Q
Sbjct: 382 VTSAKEDDATRVLQLMNMVTPEELEDDEEYQDIWEDIAEECAKFGNVLDMKIPKP-QKDQ 440
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
E PG G +F+ + A AL+GRKF TV A + E Y +
Sbjct: 441 EVPGCGLIFVRFETKDQTLDALRALAGRKFADRTVVATFIDEQNYLTDSF 490
>gi|425773483|gb|EKV11835.1| Splicing factor u2af large subunit [Penicillium digitatum Pd1]
Length = 585
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 125/424 (29%), Positives = 191/424 (45%), Gaps = 72/424 (16%)
Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
++ +R + +DM PP + M P P Q P PS + + P G +
Sbjct: 204 TRKRRLTQWDMKPPGYENVTAEQAKISGMFPLPGAPRQQPMDPSRMKDFLNP--PTGDSD 261
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
A L P +R ++R++V +P + A+ FF+ + G N D +
Sbjct: 262 NAA--LKPSN------SRQSKRLFVYNIPSGVSGDAVIAFFNLQLN--GLNVVHSVDPCI 311
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFE---GVAVRVRRPTDYNPTLAAALGPGQ 271
+ ++ +K FA +E + +A+ A+A DGI + VRRP DY +A P Q
Sbjct: 312 SAQVSEDKTFALLEFKDPNDATVALAFDGITMAESGDKGLEVRRPKDYIVPDGSASQPVQ 371
Query: 272 PSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
LN + P+++ + +P Y E I LL+SFG L F LVKD
Sbjct: 372 AGVVLNEVP-----------DSPNKICISNIPTYINEEAIIMLLKSFGDLKSFVLVKDAA 420
Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
T S+G F Y DP T +A LNG+++ D+ L RA+
Sbjct: 421 TEESRGIAFYEYVDPNNTALAVEGLNGMELVDRHLKFVRASIG----------------- 463
Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILED 445
Q SG++ M +F +T + +VL L +T D L +DE+YEEI+ED
Sbjct: 464 ------TTQASGLDMGVNAMQMFAKTTSQDLETTQVLQLLNMVTLDELLNDEDYEEIMED 517
Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
+ +EC K+GT++ + IPR G GK+F++Y A A AL+GRKF TV
Sbjct: 518 VSDECSKFGTILGIKIPRRGH------GAGKIFIKYDAAESATNALKALAGRKFSDRTVV 571
Query: 506 AFYY 509
A Y+
Sbjct: 572 ASYF 575
>gi|425775779|gb|EKV14031.1| Splicing factor u2af large subunit [Penicillium digitatum PHI26]
Length = 585
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 123/424 (29%), Positives = 190/424 (44%), Gaps = 72/424 (16%)
Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
++ +R + +DM PP + M P P Q P PS + + P G +
Sbjct: 204 TRKRRLTQWDMKPPGYENVTAEQAKISGMFPLPGAPRQQPMDPSRMKDFLNP--PTGDSD 261
Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
A + +R ++R++V +P + A+ FF+ + G N D +
Sbjct: 262 NAA--------LKPSNSRQSKRLFVYNIPSGVSGDAVIAFFNLQLN--GLNVVHSVDPCI 311
Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFE---GVAVRVRRPTDYNPTLAAALGPGQ 271
+ ++ +K FA +E + +A+ A+A DGI + VRRP DY +A P Q
Sbjct: 312 SAQVSEDKTFALLEFKDPNDATVALAFDGITMAESGDKGLEVRRPKDYIVPDGSASQPVQ 371
Query: 272 PSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
LN + P+++ + +P Y E I LL+SFG L F LVKD
Sbjct: 372 AGVVLNEVP-----------DSPNKICISNIPTYINEEAIIMLLKSFGDLKSFVLVKDAA 420
Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
T S+G F Y DP T +A LNG+++ D+ L RA+
Sbjct: 421 TEESRGIAFYEYVDPNNTALAVEGLNGMELVDRHLKFVRASIG----------------- 463
Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILED 445
Q SG++ M +F +T + +VL L +T D L +DE+YEEI+ED
Sbjct: 464 ------TTQASGLDMGVNAMQMFAKTTSQDLETTQVLQLLNMVTLDELLNDEDYEEIMED 517
Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
+ +EC K+GT++ + IPR G GK+F++Y A A AL+GRKF TV
Sbjct: 518 VSDECSKFGTILGIKIPRRGH------GAGKIFIKYDAAESATNALKALAGRKFSDRTVV 571
Query: 506 AFYY 509
A Y+
Sbjct: 572 ASYF 575
>gi|76154831|gb|AAX26240.2| SJCHGC03157 protein [Schistosoma japonicum]
Length = 258
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/273 (41%), Positives = 158/273 (57%), Gaps = 20/273 (7%)
Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
A++ RRP + P L S ++ G+ S + + P ++FVGGLPYY E
Sbjct: 3 ALKFRRPRVFAPLLGV-------SEQQSVIVPGVVSTVV--QDSPHKIFVGGLPYYLNED 53
Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
Q+KELL SFG L GF+LVKD TG SKGY FC Y D VTD ACA LNG+++GDK L V+
Sbjct: 54 QVKELLLSFGPLKGFNLVKDGSTGLSKGYAFCEYVDSNVTDHACAGLNGMQLGDKKLIVQ 113
Query: 370 RATASGQSKTEQESILAQAQQHIA-IQKMALQTSGMNTLGGGMSLF--GETLAKVLCLTE 426
RA+ + T +L Q ++ +++ A+Q NT G G G +VLCL
Sbjct: 114 RASVGAKHTT---GVLPQCLLQMSGLEEGAVQ----NTTGSGNLTVRSGGPPTEVLCLMN 166
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
I L DDEEYE+I+ED+R EC KYG + ++ IPRP + G + PGVGK+++E+ +
Sbjct: 167 MIETSELEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIR-GIDVPGVGKIYVEFASLID 225
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
C A AL+GRKF V ++ + Y +++
Sbjct: 226 CQKAATALTGRKFNQRLVVTSFFSPNSYHRREF 258
>gi|112490659|pdb|2G4B|A Chain A, Structure Of U2af65 Variant With Polyuridine Tract
Length = 172
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/199 (45%), Positives = 121/199 (60%), Gaps = 30/199 (15%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+
Sbjct: 4 ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 62
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
E + AMA DGIIF+G ++++RRP DY QP P G
Sbjct: 63 ETTQAMAFDGIIFQGQSLKIRRPHDY-----------QPLP------------------G 93
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D VTD A
Sbjct: 94 AHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAI 153
Query: 354 AALNGLKMGDKTLTVRRAT 372
A LNG+++GDK L V+RA+
Sbjct: 154 AGLNGMQLGDKKLLVQRAS 172
>gi|449802099|pdb|3VAF|A Chain A, Structure Of U2af65 Variant With Bru3 Dna
gi|449802100|pdb|3VAF|B Chain B, Structure Of U2af65 Variant With Bru3 Dna
gi|449802101|pdb|3VAG|A Chain A, Structure Of U2af65 Variant With Bru3c2 Dna
gi|449802102|pdb|3VAG|B Chain B, Structure Of U2af65 Variant With Bru3c2 Dna
gi|449802103|pdb|3VAH|A Chain A, Structure Of U2af65 Variant With Bru3c4 Dna
gi|449802104|pdb|3VAH|B Chain B, Structure Of U2af65 Variant With Bru3c4 Dna
gi|449802105|pdb|3VAI|A Chain A, Structure Of U2af65 Variant With Bru3c5 Dna
gi|449802106|pdb|3VAI|B Chain B, Structure Of U2af65 Variant With Bru3c5 Dna
gi|449802107|pdb|3VAJ|A Chain A, Structure Of U2af65 Variant With Bru5c6 Dna
gi|449802108|pdb|3VAJ|B Chain B, Structure Of U2af65 Variant With Bru5c6 Dna
gi|449802109|pdb|3VAK|A Chain A, Structure Of U2af65 Variant With Bru5 Dna
gi|449802110|pdb|3VAK|B Chain B, Structure Of U2af65 Variant With Bru5 Dna
gi|449802113|pdb|3VAL|A Chain A, Structure Of U2af65 Variant With Bru5c1 Dna
gi|449802114|pdb|3VAL|B Chain B, Structure Of U2af65 Variant With Bru5c1 Dna
gi|449802115|pdb|3VAL|D Chain D, Structure Of U2af65 Variant With Bru5c1 Dna
gi|449802116|pdb|3VAL|I Chain I, Structure Of U2af65 Variant With Bru5c1 Dna
gi|449802117|pdb|3VAM|A Chain A, Structure Of U2af65 Variant With Bru5c2 Dna
gi|449802118|pdb|3VAM|B Chain B, Structure Of U2af65 Variant With Bru5c2 Dna
Length = 174
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 91/199 (45%), Positives = 121/199 (60%), Gaps = 30/199 (15%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+
Sbjct: 6 ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 64
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
E + AMA DGIIF+G ++++RRP DY QP P G
Sbjct: 65 ETTQAMAFDGIIFQGQSLKIRRPHDY-----------QPLP------------------G 95
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D VTD A
Sbjct: 96 AHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAI 155
Query: 354 AALNGLKMGDKTLTVRRAT 372
A LNG+++GDK L V+RA+
Sbjct: 156 AGLNGMQLGDKKLLVQRAS 174
>gi|358378060|gb|EHK15743.1| hypothetical protein TRIVIDRAFT_79964 [Trichoderma virens Gv29-8]
Length = 503
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 152/533 (28%), Positives = 233/533 (43%), Gaps = 96/533 (18%)
Query: 33 RDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKER-------------- 78
RDR D S G DRR D+ + DR R R D NR +ER
Sbjct: 11 RDREREDRYSSGRDRRGDREWDRDRGSYRRDARRDDDERPNR-REREPYDDRRRGGGRER 69
Query: 79 --------RHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--A 127
R RS S + R + L+ + KRR + +D+ PP ++ A
Sbjct: 70 ERRDDGFARQEQPRRSPSPPKKREPTPDLTDIVPVLERKRRLTQWDIKPPGYDLVTAEQA 129
Query: 128 AVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA-----TRHARRVYVG 180
+ G LPG P P T+L AF P +T +R A+R+ V
Sbjct: 130 KLSGMFPLPGAP--------RQQPMDPTKLQAFMTQPGGQVTSAGLKASNSRQAKRLLVS 181
Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMA 240
+P + A+ +FF+ + G N D V + +K FA +E R +A+ A+A
Sbjct: 182 NVPSSVTDDALISFFNLQLN--GLNVIDSSDPCVLSQFSQDKAFAVLEFRNASDATVALA 239
Query: 241 LDGIIFE------GVA------VRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGA 287
LDGI E G A + +RRP DY P L + P P N+
Sbjct: 240 LDGITMEADDAQNGTANGGNHGLVIRRPKDYVMPALPDEM-PYDPEVISNVV-------- 290
Query: 288 IGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
PD ++ + +P + E Q+ ELL +FG F LVKDR T S+G F Y
Sbjct: 291 ------PDTVHKLCITNIPSFLNEDQVIELLAAFGKPKAFVLVKDRSTEESRGIAFTEYL 344
Query: 345 DPAV-TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
+P+ + A +LNG+ +G K L V +A+ I ++A G
Sbjct: 345 EPSTANEPALNSLNGMDVGGKKLKVTKAS-------------------IGPTQVANFDVG 385
Query: 404 MNTLGGGMSLFGETLAK--VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
+ + G S + + V+ L +T + L D+++YEEI ED+++EC K+G +V + +
Sbjct: 386 ITAISGLASQTSNDIERSSVIQLLNMVTPEELMDNDDYEEICEDVQDECSKFGKVVELKV 445
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
PRP ++ GVGK+++++ A AL+GRKF TV + Y+PE+ +
Sbjct: 446 PRPSGGSRQSTGVGKIYVKFDSEESATKALTALAGRKFADRTVVSTYFPEENF 498
>gi|170053756|ref|XP_001862821.1| splicing factor U2AF 50 kDa subunit [Culex quinquefasciatus]
gi|167874130|gb|EDS37513.1| splicing factor U2AF 50 kDa subunit [Culex quinquefasciatus]
Length = 382
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 92/221 (41%), Positives = 129/221 (58%), Gaps = 18/221 (8%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 115 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQMH-LSGLAQAAGNPVLACQI 173
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY QP P +
Sbjct: 174 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMTD 222
Query: 279 AAVGLASGAIGGA------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
+AV G + P ++F+GGLP Y E Q+KELL SFG L F+LVKD T
Sbjct: 223 SAVAPVQEKFSGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAAT 282
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
G SKGY F Y + ++TD A A LNG+++GDK L V+RA+
Sbjct: 283 GLSKGYAFAEYVEYSITDQAIAGLNGMQLGDKKLIVQRASV 323
>gi|302916595|ref|XP_003052108.1| hypothetical protein NECHADRAFT_38412 [Nectria haematococca mpVI
77-13-4]
gi|256733047|gb|EEU46395.1| hypothetical protein NECHADRAFT_38412 [Nectria haematococca mpVI
77-13-4]
Length = 564
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 134/455 (29%), Positives = 214/455 (47%), Gaps = 60/455 (13%)
Query: 79 RHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--AAVPGQ--L 133
RH +R + + R + L+ + KRR + +D+ PP + A + G L
Sbjct: 127 RHENRRSASPPPKKREPTPDLTNIVPILERKRRLTQWDIKPPGYDNVTAEQAKLSGMFPL 186
Query: 134 PGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ-----QATRHARRVYVGGLPPLANE 188
PG P P ++L AF P +T +R ++R+ V +P +E
Sbjct: 187 PGAP--------RQQPMDPSKLQAFMNQPGGQVTSAGLKANNSRQSKRLLVSKIPSGTSE 238
Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE- 247
+A+ +FF+ + G N D + ++++ FA +E R EA+ A+ALDG E
Sbjct: 239 EALISFFNLQLN--GLNVIDATDPCILCQFSNDRSFAVLEFREASEATVALALDGTSMEP 296
Query: 248 ----------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRV 297
+ +RRP DY + A +P++ + I +++
Sbjct: 297 DDANGASNGESRGLEIRRPRDY--VVPAVTEEVSYNPDV---VSNIVPDTI------NKL 345
Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP-AVTDIACAAL 356
+ +P + E Q+ ELL +FG F LVKDR T S+G F YQDP A A L
Sbjct: 346 CITNIPPFLAEDQVIELLAAFGKPKAFVLVKDRGTEESRGIAFAEYQDPNAANPTALDTL 405
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGE 416
NG+ +G K L V +A+ G ++ + AI +A QT+ N + G
Sbjct: 406 NGMDVGGKKLKVTKASI-GPTQVANFDV-----GITAISGLASQTA--NDVEG------- 450
Query: 417 TLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
++VL L +TA+ L D+++YEEI ED++EEC K+G ++++ IPRP ++ GVGK
Sbjct: 451 --SRVLQLLNMVTAEELLDNDDYEEICEDVKEECSKFGKIIDMKIPRPTGGSRQSAGVGK 508
Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
+F++Y A AL+GRKF TV Y+PE
Sbjct: 509 IFVKYETIEDTTKALKALAGRKFADRTVVTTYFPE 543
>gi|348681357|gb|EGZ21173.1| hypothetical protein PHYSODRAFT_488481 [Phytophthora sojae]
Length = 640
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/404 (28%), Positives = 181/404 (44%), Gaps = 88/404 (21%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
A + AR +YVG LPP + F S ++ +G + PG+ ++N +I+ + FAF EM
Sbjct: 272 AQKPARELYVGNLPPNVTGPQLQEFLSTIIQQVGLTTQ-PGNPIINTWISTDGHFAFCEM 330
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPS---------------- 273
R+VEE + A+ L+ + G ++ RP + +GP QP
Sbjct: 331 RSVEECNLALLLNQLSLLGQPLKFGRPRSF-------MGPPQPMPQISARTQTALTNLGC 383
Query: 274 ------------PNLNLAAVG-------------------------LASGAIGGAEGPDR 296
P+L+ AA +AS + + R
Sbjct: 384 TPNPAWFAQPAVPSLDEAAAAPVGDSSTLAGATAAAVAAAQPAVPAVASTTVDASLSAHR 443
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+ + +P TE Q+KEL+E FG L F LVKD TG S G Y+D +VT A L
Sbjct: 444 LIMSNIPVVLTEDQVKELVEPFGALKSFTLVKDTATGASMGSALFEYEDDSVTAQAVEGL 503
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGE 416
NGL +G L+V+ ASG + F +
Sbjct: 504 NGLSIGGILLSVQCQPASGAALPAAPGATPN--------------------------FED 537
Query: 417 TLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
+ VL + ++ D L DD+EY ++ ED+ EEC ++G + + IPRP ++G E PG+G
Sbjct: 538 QPSAVLKMANMVSIDELRDDDEYADLAEDVEEECKRFGNVTGLEIPRP-KDGEEVPGLGC 596
Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
+++ + + A AL+GRKFGGN V Y+P DK+ +++S
Sbjct: 597 IYVRFEEEKNAVDALKALNGRKFGGNIVKVTYFPLDKFDKQEFS 640
>gi|347968829|ref|XP_003436304.1| AGAP002908-PC [Anopheles gambiae str. PEST]
gi|333467822|gb|EGK96709.1| AGAP002908-PC [Anopheles gambiae str. PEST]
Length = 250
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 93/212 (43%), Positives = 125/212 (58%), Gaps = 21/212 (9%)
Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
Q+KELL SFG L F+LVKD TG KGY F Y + VTD A A LNG+++GDK L V+
Sbjct: 58 QVKELLLSFGQLKAFNLVKDAATGLGKGYAFAEYVEYTVTDQAIAGLNGMQLGDKKLIVQ 117
Query: 370 RATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGET--LAKVLCLTEA 427
RA+ +K +++A Q + G+SL G + +VLCL
Sbjct: 118 RASVG--AKNSNAAVVAPVQIQVP----------------GLSLVGSSGPPTEVLCLLNM 159
Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+T D L D+EEYE+ILED+REEC KYG + +V IPRP + G + PG GKVF+E+ V C
Sbjct: 160 VTPDELKDEEEYEDILEDIREECNKYGVVRSVEIPRPIE-GVDVPGCGKVFVEFNSIVDC 218
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
A+ AL+GRKF V Y+ DKY +++
Sbjct: 219 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 250
>gi|355727237|gb|AES09128.1| U2 small nuclear RNA auxiliary factor 2 [Mustela putorius furo]
Length = 301
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 104/251 (41%), Positives = 141/251 (56%), Gaps = 17/251 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 65 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 120
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 121 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 179
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 180 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 232
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 233 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 290
Query: 340 FCVYQDPAVTD 350
FC Y D VTD
Sbjct: 291 FCEYVDINVTD 301
>gi|395529346|ref|XP_003766777.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Sarcophilus
harrisii]
Length = 462
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 104/251 (41%), Positives = 141/251 (56%), Gaps = 17/251 (6%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 126 RSPRHEKKKKIRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 181
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 182 PAPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 240
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P PG S N ++
Sbjct: 241 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 293
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 294 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 351
Query: 340 FCVYQDPAVTD 350
FC Y D VTD
Sbjct: 352 FCEYVDINVTD 362
>gi|301121478|ref|XP_002908466.1| splicing factor U2af large subunit, putative [Phytophthora
infestans T30-4]
gi|262103497|gb|EEY61549.1| splicing factor U2af large subunit, putative [Phytophthora
infestans T30-4]
Length = 597
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 115/386 (29%), Positives = 174/386 (45%), Gaps = 77/386 (19%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
A + AR +YVG LPP + F S ++ +G + PG+ ++N + + + FAF EM
Sbjct: 242 AQKPARELYVGNLPPNVTGPQLQEFLSTIIQQVGLTTQ-PGNPIINTWTSTDGHFAFCEM 300
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL---------NLAA 280
R+VEE + A+ L+ + G ++ RP + +GP QP P + NL
Sbjct: 301 RSVEECNLALLLNQLSLLGQPLKFGRPRSF-------MGPPQPMPQVSARTQTALTNLGC 353
Query: 281 V-----------------------------GLASGAIGGAEGP---DRVFVGGLPYYFTE 308
+A+ G+E +R+ + +P E
Sbjct: 354 TPNPAWFAQHTVSSTETTTTETTLAEATLSAIAAAQPAGSEAVSSGNRLIMSNIPVVLAE 413
Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
Q+KEL+E FG L F LVKD TG S G Y+D V A LNGL +G L+V
Sbjct: 414 EQVKELVEPFGKLKSFTLVKDSATGASLGSALFEYEDSDVAAQAVEGLNGLSIGGILLSV 473
Query: 369 RRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
+R AS + + Q A+ KMA +
Sbjct: 474 QRQPASSAAALPSAAAANPEDQPSAVLKMA---------------------------NMV 506
Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA 488
+ D L DDEEY ++ ED+ EEC ++G + + IPRP ++G E PG+G +++ +
Sbjct: 507 SIDELRDDEEYADLAEDVEEECKRFGGVTGMEIPRP-KDGEEVPGLGCIYVRFGKEEDAV 565
Query: 489 TAKNALSGRKFGGNTVNAFYYPEDKY 514
+A AL+GRKFGGN V Y+P DK+
Sbjct: 566 SALKALNGRKFGGNIVKVTYFPVDKF 591
>gi|320590609|gb|EFX03052.1| splicing factor u2af large subunit [Grosmannia clavigera kw1407]
Length = 420
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 152/297 (51%), Gaps = 35/297 (11%)
Query: 226 FVEMRTVEEASNAMALDGIIFEG--------VAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
VE + +A+ A+AL+GI E + ++RP DY P
Sbjct: 1 MVEFKEPIDATVALALNGISMEAEDASGSGQSGLSIQRPKDYIVPAVVDYSVYHP----- 55
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
G+ S + + P ++ + +P Y ++ Q+ ELL SFG L F L+KDR T S+G
Sbjct: 56 ----GVVSNVV--IDTPFKIAITNIPSYLSDEQVTELLVSFGELRAFVLLKDRSTEESRG 109
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
FC Y +P TD+A LNG+ +GD+ L V++A+ T E
Sbjct: 110 VAFCEYTEPQSTDVAIQGLNGMDLGDRKLRVQKASIGITQVTSVE--------------- 154
Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
+ + M+ L G +S +++V+ L +TA+ L ++++YE+I ED+ EEC K+G ++
Sbjct: 155 -MGVNAMSLLAGTISQEASDVSRVVQLLNMVTAEELVNNDDYEDICEDVTEECAKFGPVM 213
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ +PRP G +PGVGK+F+++ A AL+GRKF TV A Y+PE+ +
Sbjct: 214 GLKVPRPASGGRHSPGVGKIFVKFDSRDSATKALKALAGRKFSDRTVVATYFPEENF 270
>gi|346324367|gb|EGX93964.1| splicing factor u2af large subunit [Cordyceps militaris CM01]
Length = 583
Score = 164 bits (416), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 126/424 (29%), Positives = 203/424 (47%), Gaps = 58/424 (13%)
Query: 107 KSKRRSGFDMAPPA--AAMLPGAAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP A A + G LPG P Q + P T+L A P
Sbjct: 197 RKRRMTQWDIKPPGYEAVTSEQAKMSGMFPLPGAPRQ-----QQVDP---TKLQALMNQP 248
Query: 163 V--QV----MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
QV + +R ARR+ V +P E A+ FF+ + + N D
Sbjct: 249 AGGQVSSAGLKANNSRQARRLLVSDIPSGTTEDALVAFFNLQLNGL--NVIEATDPCALC 306
Query: 217 YINHEKKFAFVEMRTVEEASNAMALDG--IIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
++++K FA +E + +A+ A+ALDG ++ + + +RRP DY + P P
Sbjct: 307 QLSNDKSFAVLEFKNTGDATVALALDGSSMVADTPGLSIRRPKDY-------VMPAVPDE 359
Query: 275 NLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
+ V S ++ PD ++ + +P + TE Q+ ELL +FG F LVK+R
Sbjct: 360 IIFNPEV--VSNSV-----PDTIHKLCITNIPPFLTEDQVLELLAAFGKPKAFVLVKERS 412
Query: 332 TGNSKGYGFCVYQDPA-VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQ 390
T S+G F Y +P + A LNG+ +G K L R+A G ++ +
Sbjct: 413 TEESRGIAFAEYVEPTNANEPALNTLNGMDVGGKKLKARKACVGGTQVANFDAGIN---- 468
Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC 450
AI +A Q +G + +VL L +TA+ L D+++YEEI ED+R+EC
Sbjct: 469 --AISNLAGQGNGGDA------------TRVLQLLNMVTAEELLDNDDYEEICEDVRDEC 514
Query: 451 GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYP 510
KYG +++V +PRP ++ GVG++F+++ A AL+GRKF TV Y+P
Sbjct: 515 SKYGKVLDVKVPRPAGGSRQSAGVGRIFVKFESVDSTTGALKALAGRKFADRTVVTTYFP 574
Query: 511 EDKY 514
E+ +
Sbjct: 575 EENF 578
>gi|66808005|ref|XP_637725.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
discoideum AX4]
gi|60466159|gb|EAL64222.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
discoideum AX4]
Length = 671
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 83/201 (41%), Positives = 120/201 (59%), Gaps = 10/201 (4%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
+ +RR+YVG +PP ++ + FF+ + A N+ PG VV IN K FAF+E R+
Sbjct: 265 KQSRRIYVGNIPPGISDSELMEFFNAAVLAANLNTK-PGPPVVFCQINAPKCFAFIEFRS 323
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
EEA+NAM DGI + +++RRP DY T G P++ V
Sbjct: 324 PEEATNAMRFDGISLKNFTLKIRRPKDYQSTSDNTGGNASLLPSIVPTNV---------P 374
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ ++++VGGLP +E Q+K LL ++G L F+LVKD +TG SKG+ FC YQD VTD+
Sbjct: 375 DSENKIYVGGLPSNLSEEQVKSLLSAYGKLKAFNLVKDTNTGVSKGFAFCEYQDSEVTDV 434
Query: 352 ACAALNGLKMGDKTLTVRRAT 372
AC+ LNG+ + DKTL V+RA+
Sbjct: 435 ACSKLNGIPLADKTLVVQRAS 455
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 56/95 (58%), Gaps = 3/95 (3%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
V+ + + + + DD+EY+ IL D++EEC ++G + ++ +P P +N E V +V++E
Sbjct: 570 VIQILNLVDREDIFDDKEYDNILIDVKEECEQFGEVQSIWLPLPSKNPLE---VTRVYVE 626
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ A AL GRK+ G + + YYPED +F
Sbjct: 627 FSQVEFAQKACLALGGRKYNGRVLFSAYYPEDLFF 661
>gi|325179530|emb|CCA13927.1| splicing factor U2af large subunit putative [Albugo laibachii Nc14]
Length = 833
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/402 (29%), Positives = 179/402 (44%), Gaps = 86/402 (21%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
A + AR +YVG LP + F ++ +G S PG+ +++V+I+ + FAF EM
Sbjct: 467 AMKPARELYVGNLPATITGPQLQEFLGTIIQQVGL-STQPGNPILSVWISTDGHFAFCEM 525
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP-----------NLNL 278
R+VEE + A+ L+ + G ++ RP + +GP QP P NL
Sbjct: 526 RSVEECNLALLLNQLPLLGQPLKFGRPRSF-------MGPPQPMPIVSARTQTALVNLGC 578
Query: 279 A--------------------------------------AVGLASGAIGGAEGPD--RVF 298
A LA P+ ++
Sbjct: 579 TPNPVWFASPDVTSFGSDPMGFGNGLNGFLSSSSSSLMSATALADSLASLPSDPNATQLL 638
Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
+ +P E Q+KEL++ FG L F L+KD TG S G F YQ+ VT A L+G
Sbjct: 639 MSNIPGVLAEEQVKELVQPFGELRFFKLIKDPITGQSTGTAFFEYQENQVTTEALNGLDG 698
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
L +G L+VRRA + +K Q ++L + G GE
Sbjct: 699 LDIGGVKLSVRRAPDA--TKYPQIAVL---------------------MPGAA---GEEP 732
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
VL + ++ D L +DEE+ ++ ED+ EEC ++GT++ + IPR Q+G E G G +F
Sbjct: 733 GPVLRMANMVSEDELKNDEEFADLKEDVEEECKRFGTIIALDIPR-SQDGEEIAGTGNIF 791
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
+ Y D A+ AL GRKFGGN V Y+ K+ K+YS
Sbjct: 792 VRYSDTKEATAAQKALCGRKFGGNVVKVTYFSLSKFEAKEYS 833
>gi|71028054|ref|XP_763670.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large subunit
[Theileria parva strain Muguga]
gi|68350624|gb|EAN31387.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large
subunit, putative [Theileria parva]
Length = 380
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 175/362 (48%), Gaps = 41/362 (11%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFS-QVMTAIGGNSAGPGDAVVN---VYINHEKK 223
++A + +R+YVG LP Q + FF+ +M + GN+ P D +V +Y N ++
Sbjct: 49 EEAKKRQKRLYVGNLPSGTKLQDVVDFFNGALMAMVPGNTIDPRDPLVTKTEIY-NPDQG 107
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
+ F+E +T E A A LDGI G ++++RRP D+N
Sbjct: 108 YCFLEFKTPELADLAFKLDGITCNGYSLKLRRPLDFN----------------------- 144
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
+G +VFV +P TE Q+KELLE G L +L+KD TG SKGYGF +
Sbjct: 145 ----LGTNSDDTKVFVQNIPLDVTEDQMKELLEKHGKLKLANLLKDPATGVSKGYGFFEF 200
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT----ASGQSKTEQESILAQAQQHIAIQKMAL 399
+D + +A LNG +G L+V+ A ASG + ++ + I ++
Sbjct: 201 EDARSSKLAVLHLNGSVLGKNVLSVKHAAFGYFASGGKPIDCKA--SNLPNSITQSILSN 258
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
G+ G G +KV+ L + + L D Y EI+ ++EE KYG L V
Sbjct: 259 PLLGLQLQNG--RRIGSNPSKVIQLLNMVFHEDLISDYNYNEIVRLVKEEAQKYGPLQEV 316
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKYFNKD 518
VIPRPD++ GVGKVF+ Y D + A+ +GR F N V + ++PED +
Sbjct: 317 VIPRPDKDLTFKEGVGKVFIRYEDLLSARKAQYMFNGRVFDKNRIVCSAFFPEDLFITGK 376
Query: 519 YS 520
Y+
Sbjct: 377 YT 378
>gi|46125343|ref|XP_387225.1| hypothetical protein FG07049.1 [Gibberella zeae PH-1]
Length = 564
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 123/414 (29%), Positives = 193/414 (46%), Gaps = 63/414 (15%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP + A + G LPG P P ++L AF P
Sbjct: 159 RQRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMNQP 210
Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
+T + +R ++R+ V +PP +E A+ FF+ + G N D V
Sbjct: 211 GGQVTSASLKASNSRQSKRLLVSRIPPGTSEDALIAFFNLQLN--GLNVIDTTDPCVLCQ 268
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-----------VAVRVRRPTDYNPTLAAA 266
++++ FA +E + E + A+ALDGI E + +RRP DY + A
Sbjct: 269 FSNDRSFAVIEFKDAPETTVALALDGISMEANDASNGADGGHRGLEIRRPRDY--VVPAV 326
Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
P + V + +++ + +P + TE QI ELL SFG F L
Sbjct: 327 TEDVAYDPEVVSNVV---------PDTVNKLSITNIPPFLTEEQIIELLASFGKPKAFVL 377
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTD-IACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
VKDR T S+G F YQDPAV++ A LNG+ +G K + V +A+
Sbjct: 378 VKDRGTEESRGIAFAEYQDPAVSNPTALDTLNGMDIGGKQIKVSKAS------------- 424
Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL--AKVLCLTEAITADALADDEEYEEIL 443
I ++A G+ + G S + ++VL L +TA+ L D+++YEEI
Sbjct: 425 ------IGPTQVANFDVGITAISGLASQTANEVESSRVLQLLNMVTAEELLDNDDYEEIC 478
Query: 444 EDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGR 497
ED+REEC KYG +++V +PRP ++ GVGK+F++Y A AL+GR
Sbjct: 479 EDVREECSKYGKILDVKVPRPTGGSRQSAGVGKIFVKYEHTEDTTKALQALAGR 532
>gi|406699650|gb|EKD02849.1| splicing factor (U2 snRNP auxiliary factor large subunit)
[Trichosporon asahii var. asahii CBS 8904]
Length = 487
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 172/366 (46%), Gaps = 62/366 (16%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
+R+Y G+ NE + F++V+ +G + G+AV V IN EK + +VE +
Sbjct: 135 QKKRIYFAGVTDAMNENRLRKLFNKVLRDVGYD----GEAVSGVEINKEKDYVWVEFVSS 190
Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
+ A + F+G + +RP D+ +P L G P+
Sbjct: 191 DLAQVVFNKKDLDFDGAPIEPKRPKDFVGIDPALGFMGVSGDPN---------------- 234
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
+++FVGGLP ++KELL FG L F+LVK+ + SKG+ F + DPAVT
Sbjct: 235 -----NKLFVGGLPTTLGSDEVKELLTPFGELRTFNLVKEGNGSVSKGFAFVEFLDPAVT 289
Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQ-ESILAQAQQHIAIQKMALQTSGMNTLG 408
DIA LNG ++GD+ L V+RA +G+S + S AQ +I + A + +
Sbjct: 290 DIAIQGLNGFQLGDRALVVQRAATTGRSASSTGVSGTAQFLAQSSILEKADEPA------ 343
Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP---- 464
+V+ + + AD L DD++Y +ILED+R+EC K+G + V IPRP
Sbjct: 344 --------PATRVILMLNMVGADELYDDQDYADILEDIRDECSKFGEVEGVRIPRPVPKS 395
Query: 465 ---------------DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
++ + GVG+V++ Y D A AL GR+F G T+
Sbjct: 396 TKWEPSDSAAQTAEKNRRIDQENGVGRVYVMYADTESAVKAMRALGGRQFAGRTILVASC 455
Query: 510 PEDKYF 515
E+ +
Sbjct: 456 SEEDFL 461
>gi|440290938|gb|ELP84237.1| splicing factor u2af large subunit, putative [Entamoeba invadens
IP1]
Length = 623
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 127/472 (26%), Positives = 208/472 (44%), Gaps = 67/472 (14%)
Query: 74 RDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQL 133
RD RHR R SH +R R+R S P R + RS PP A P
Sbjct: 38 RDYHERHRDRYESHYDNRPRDRYDS--PKRRYYDKEDRS-----PPRHDERRKARSPS-- 88
Query: 134 PGVPSAVPEMAQNMLPFGATQLGAFPLMPV---QVMTQQATRH----ARRVYVGGLPPLA 186
+ P G + PV ++ QQ H +RRVYVG +
Sbjct: 89 -------------LSPLGKKIKSRWDEQPVADASLLQQQLNVHQEKGSRRVYVGNINTTT 135
Query: 187 NEQAIATFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDG 243
EQ I F+ M + N D +V+ +N+EK +AF+E RT ++A A++LDG
Sbjct: 136 TEQDIVEAFNDAMRRGDYVDKNDKS--DIIVSTEVNYEKSYAFIEFRTFDQAVKALSLDG 193
Query: 244 IIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLP 303
+ +G +V+VRRP D+NP L Q L VG G +++G +P
Sbjct: 194 LTIKGASVKVRRPKDFNPVLPFISSLSQ------LMEVGTTKPRDGV------MYMGNIP 241
Query: 304 YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV--YQDPAVTDIACAALNGLKM 361
++ QI++ LE+ L + +V+D G +G +C+ YQ+P D A NG+ +
Sbjct: 242 LQMSDEQIQKKLENLNPLKKYVVVRDPSLGAPQGKCYCLFEYQNPEYKD-KVLAFNGIIL 300
Query: 362 GDKTLTVRRATASGQS--KTEQESILAQAQQHIAIQKMALQTSG-MNTLGGGMSLFGETL 418
G + V SG K + L + + Q+ + T+ +N+ G +F L
Sbjct: 301 GGDKIEV----CSGLEGFKHFPTAALNELCMKMFPQRTDIITATLLNSSVGYSDVFERVL 356
Query: 419 -----------AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
+V+ L + L +++ Y E+++D+RE C YG ++++ IPRP +
Sbjct: 357 HNSEDLSQYECTRVIVLFNMFFPEDLNNEQRYIELVDDIREACIAYGEVISISIPRPTET 416
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
G+G+ F+E+ D T + R++ + A +Y E KY ++ +
Sbjct: 417 NKRPSGIGRAFVEFKDVEMAKTCWREIVKRRYDNRQIVAGFYSESKYNSRSF 468
>gi|167395950|ref|XP_001741817.1| hexokinase [Entamoeba dispar SAW760]
gi|165893477|gb|EDR21726.1| hexokinase, putative [Entamoeba dispar SAW760]
Length = 974
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 124/494 (25%), Positives = 219/494 (44%), Gaps = 103/494 (20%)
Query: 54 KYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG 113
K RE R DR DRHR R E H +R+ +DR SPS SP K S
Sbjct: 286 KTRREYSRSEDREDRHR---RVAEEEHYNRNIRRRADR--------SPSLSPLGDKLHSR 334
Query: 114 FDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRH 173
+D P A + Q+ Q + R
Sbjct: 335 WDEQPKA-----------------------------IDSVQIS-------QQLNVHQERA 358
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
A+R+YVG + +E+ I F++ M + N P D + ++ +N+E+ +AF+E R
Sbjct: 359 AKRIYVGNINSSTSEKDIVDAFNEAMRRGDYVDKND--PRDIITHIEVNYERSYAFLEFR 416
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPT------LAAALGPGQPSPNLNLAAVGLA 284
T+EEA A++LDG+ +G +V+VRRP DYNP L+ + PG +P ++
Sbjct: 417 TLEEAVKALSLDGLTIKGASVKVRRPKDYNPVLPFISGLSQLMEPGTTNPRESI------ 470
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV-- 342
+++G +P T+ QI++ LE+ L F +++D D G +G +C+
Sbjct: 471 ------------LYMGNIPLQMTDEQIRKKLENLNPLKKFFVIRDPDLGAPQGKCYCLFE 518
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
YQ+P + +G+ +G + V SG + L +A + KM T+
Sbjct: 519 YQNPEYKE-KILTFDGINLGGNKIEV----CSGVDGFKH---LPKASLNELFSKMFPHTT 570
Query: 403 G------MNTLGGGMSLFGETL-----------AKVLCLTEAITADALADDEEYEEILED 445
+N+ G ++F + L ++++ + + + L D + Y E+++D
Sbjct: 571 DLVIGTLLNSSVGYSTVFEKILKPSEKIEDQHVSRIIVIFNMVYPEDLIDQQRYIELIDD 630
Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
+R C +YG + ++ IPRP + + G+G+VF+E+ G + +++ ++
Sbjct: 631 IRFVCQEYGEVESISIPRPTEENKKPSGLGRVFIEFKTIDGAIRCWKEIVKKRYDNRSLL 690
Query: 506 AFYYPEDKYFNKDY 519
+Y E KY N+ +
Sbjct: 691 VGFYSEKKYANRMF 704
>gi|401402634|ref|XP_003881297.1| rna recognition motif (RRM)-containing protein,related [Neospora
caninum Liverpool]
gi|325115709|emb|CBZ51264.1| rna recognition motif (RRM)-containing protein,related [Neospora
caninum Liverpool]
Length = 555
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 127/221 (57%), Gaps = 23/221 (10%)
Query: 180 GGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAM 239
G LP + + +F++++ ++ PGD +V+VY+N ++FAF+E R++EEA+ +
Sbjct: 16 GNLPVPVTQGEVQQYFNELLNSLLPQKV-PGDTIVHVYVNPARRFAFLEHRSIEEANFTL 74
Query: 240 ALDGIIFEGVAVRVRRPTDYNPTLA--------AALG---------PGQPSPNLNLAAVG 282
LDG+ + A+ +RRP DYNPTLA A LG P Q + A
Sbjct: 75 GLDGVSWRNCALSLRRPQDYNPTLADQQYREERARLGSMTGFAVPPPSQAATPAAPAESS 134
Query: 283 LASGAIGGA-----EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
L +GA+G + P ++F+GGLP+ TE K+LLE+FG L +VKD+ G+ KG
Sbjct: 135 LIAGALGIVSTTVPDSPHKIFIGGLPHSITEQGCKQLLEAFGQLRALHVVKDQQRGDCKG 194
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
+ FC Y DP VTD+A A LN +++ D+ L VRRA GQ K
Sbjct: 195 FAFCEYLDPNVTDVAVAGLNNMRIADRVLQVRRAMPHGQMK 235
>gi|397633851|gb|EJK71162.1| hypothetical protein THAOC_07424, partial [Thalassiosira oceanica]
Length = 449
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/226 (40%), Positives = 133/226 (58%), Gaps = 21/226 (9%)
Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFF------SQVMTAIGGNSAG------PGDAV 213
M TRHARR+Y+G +P ++ E I +FF S +M N A D +
Sbjct: 230 MNPNQTRHARRLYIGNIPDIS-ETEIHSFFRKTIEKSLIMNRDDKNYAQLQEEYIANDPI 288
Query: 214 VNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA-VRVRRPTDYNPTLAAALGPGQP 272
V+VYIN E++FAF+E RT++ + ++LDGI EG V+V+RP DYN +LA G
Sbjct: 289 VSVYINRERRFAFIEFRTMDITTACLSLDGIDVEGRGKVKVKRPNDYNASLAPQTSNGV- 347
Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
+L+ A +GL S + +GP+++F+GGLPY+ TE+Q+ ELL +FG++ F LVK +
Sbjct: 348 --SLDTAKLGLVSSTV--PDGPNKIFIGGLPYHLTESQVLELLGAFGSVRAFHLVKSEPS 403
Query: 333 G-NSKGYGFCVYQDPAVTDIACAALNGLKM-GDKTLTVRRATASGQ 376
SKGY F Y DP +T +AC LNG+ + G K L+ R A Q
Sbjct: 404 ATTSKGYCFVEYADPNITQVACMGLNGMDLGGGKQLSCRMAVQGLQ 449
>gi|357017085|gb|AET50571.1| hypothetical protein [Eimeria tenella]
Length = 527
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 184/385 (47%), Gaps = 43/385 (11%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS-------AGPGDAVVNVY 217
V ++ R ARR+++ +PP E I FF+ + A+ + A ++ V
Sbjct: 155 VTHSESDRIARRLFISNIPPGTTEADICGFFNGALLAVNAQTGYTDLSLASDKPQLLPVE 214
Query: 218 ----INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP--GQ 271
+ + F+++R+ E + LDGI F +++V RP +Y + P G
Sbjct: 215 RCEGLQENSRHCFLDLRSHEWVVLCLKLDGITFNNNSLKVLRPKEY-------VQPPGGD 267
Query: 272 PSPNLNLAAVGLASGAIGG---AEGPDR-----VFVGGLPYYFTETQIKELLESFGTLHG 323
P+ +++ + + A P R +++ LP E Q+++LLE FG L
Sbjct: 268 PAKTVHIPELERGTKPQQNEVRATAPPRSADCKLYIQNLPPEMGEDQVRDLLEQFGKLRV 327
Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-----ASGQSK 378
+L+K+R TG +GYGF Y+DP VTD A ALNG G L+V+R+ +
Sbjct: 328 LNLIKNRQTGKHRGYGFFEYEDPEVTDQAIEALNGFVCGASVLSVQRSNFMPDLLPTKQH 387
Query: 379 TEQESILAQAQQHIAIQK--MALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADD 436
T + + L + + + +A+Q T+ GE ++V+ L I + + D
Sbjct: 388 TTEVTALPSSTSYAVLSDPVVAIQVRAGRTI-------GEKPSRVVQLLNTIYPEDIMTD 440
Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
+E ++D R E KYG L V+IPRP+++ PGVGKVFL Y D A+ L+G
Sbjct: 441 SSHEAAVKDTRSEAEKYGPLEEVLIPRPNEDLSYKPGVGKVFLVYGDVTSARRAQYMLNG 500
Query: 497 RKFGGN-TVNAFYYPEDKYFNKDYS 520
R+F V A ++PE K+ + Y+
Sbjct: 501 RRFDQTRVVCAAFFPEQKFKDGQYT 525
>gi|84996015|ref|XP_952729.1| splicing factor [Theileria annulata strain Ankara]
gi|65303726|emb|CAI76103.1| splicing factor, putative [Theileria annulata]
Length = 380
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 174/362 (48%), Gaps = 41/362 (11%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFS-QVMTAIGGNSAGPGDAVVN---VYINHEKK 223
++A + +R+YVG LP Q + FF+ +M + GN+ P D +V +Y N ++
Sbjct: 49 EEARKRQKRLYVGNLPSGTKLQDVVDFFNGALMAMVPGNTMDPRDPLVTKTEIY-NPDQG 107
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
+ F+E +T E A LDGI G ++++RRP D+N
Sbjct: 108 YCFLEFKTPELADLGFKLDGITCNGYSLKIRRPLDFN----------------------- 144
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
+G +VFV +P TE ++K LLE G L +L+KD TG SKGYGF +
Sbjct: 145 ----LGANSDDTKVFVQNIPLDVTEDEMKALLEKHGKLKMANLLKDPATGVSKGYGFFEF 200
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT----ASGQSKTEQESILAQAQQHIAIQKMAL 399
+D + +A LNG +G L+V+ A ASG + ++ + I ++
Sbjct: 201 EDARSSKLAVLHLNGSVLGKNVLSVKHAAFGYFASGGKPIDCKA--SNLPNSITQSILSN 258
Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
G+ G G +KV+ L + + L D Y EI+ ++EE KYG L V
Sbjct: 259 PLLGLQLQNG--RRIGSNPSKVIQLLNMVFHEDLISDYNYNEIVRLVKEEAQKYGPLQEV 316
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKYFNKD 518
VIPRPD++ GVGKVF+ Y + + A+ +GR F N + + ++PED + +
Sbjct: 317 VIPRPDKDLTFKEGVGKVFIRYENLLSARKAQYMFNGRVFDKNRIICSAFFPEDLFISGK 376
Query: 519 YS 520
Y+
Sbjct: 377 YT 378
>gi|407043289|gb|EKE41863.1| U2 snRNP auxiliary factor large subunit, putative [Entamoeba
nuttalli P19]
Length = 628
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 116/467 (24%), Positives = 205/467 (43%), Gaps = 80/467 (17%)
Query: 73 NRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQ 132
N D+E RHR +R R SPS SP K S +D P A
Sbjct: 76 NEDREDRHRRVPEEERYNRSIRRRADRSPSLSPLGDKLPSRWDEQPKA------------ 123
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
+ Q+ Q + R A+R+YVG + +E+ I
Sbjct: 124 -----------------IDSVQIS-------QQLNVHQERAAKRIYVGNINSSTSEKDIV 159
Query: 193 TFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
F++ M + N D + ++ +N+E+ +AF+E RT+EEA A++LDG+ +G
Sbjct: 160 DAFNEAMRRGDYVDKNDTR--DIITHIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGA 217
Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
+V+VRRP DYNP L G Q + G E +++G +P T+
Sbjct: 218 SVKVRRPKDYNPVLPFISGLSQ----------LMEPGTTNPRESI--LYMGNIPLQMTDE 265
Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV--YQDPAVTDIACAALNGLKMGDKTLT 367
QI++ LE+ L F +++D D G +G +C+ YQ+P + +G+ +G +
Sbjct: 266 QIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLFEYQNPEYKE-KILTFDGINLGGNKIE 324
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG------MNTLGGGMSLFGETL--- 418
V SG + L +A + KM T+ +N+ G ++F + L
Sbjct: 325 V----CSGVDGFKH---LPKASLNELFSKMFPHTTDLVIGTLLNSSVGYSTVFEKILKPS 377
Query: 419 --------AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
++++ + + + L D + Y E+++D+R C +YG + ++ IPRP + +
Sbjct: 378 EKIEDQHVSRIIIIFNMVYPEDLTDQQRYIELIDDIRFVCQEYGEVESISIPRPTEENKK 437
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
G+G+VF+E+ G + +++ ++ +Y E KY N+
Sbjct: 438 PSGLGRVFIEFKTIEGAIKCWKEIIKKRYDNRSLLVGFYSEKKYANR 484
>gi|449707077|gb|EMD46798.1| RNA recognition motif domain containing protein [Entamoeba
histolytica KU27]
Length = 712
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 116/467 (24%), Positives = 205/467 (43%), Gaps = 80/467 (17%)
Query: 73 NRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQ 132
N D+E RHR +R R SPS SP K S +D P A
Sbjct: 76 NEDREDRHRRVPEEERYNRSIRRRADRSPSLSPLGDKLPSRWDEQPKA------------ 123
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
+ Q+ Q + R A+R+YVG + +E+ I
Sbjct: 124 -----------------IDSVQIS-------QQLNVHQERAAKRIYVGNINSSTSEKDIV 159
Query: 193 TFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
F++ M + N D + ++ +N+E+ +AF+E RT+EEA A++LDG+ +G
Sbjct: 160 DAFNEAMRRGDYVDKNDTR--DIITHIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGA 217
Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
+V+VRRP DYNP L G Q + G E +++G +P T+
Sbjct: 218 SVKVRRPKDYNPVLPFISGLSQ----------LMEPGTTNPRESI--LYMGNIPLQMTDE 265
Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV--YQDPAVTDIACAALNGLKMGDKTLT 367
QI++ LE+ L F +++D D G +G +C+ YQ+P + +G+ +G +
Sbjct: 266 QIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLFEYQNPEYKE-KILTFDGINLGGNKIE 324
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG------MNTLGGGMSLFGETL--- 418
V SG + L +A + KM T+ +N+ G ++F + L
Sbjct: 325 V----CSGVDGFKH---LPKASLNELFSKMFPHTTDLVIGTLLNSSVGYSTVFEKILKPS 377
Query: 419 --------AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
++++ + + + L D + Y E+++D+R C +YG + ++ IPRP + +
Sbjct: 378 EKIEDQHVSRIIIIFNMVYPEDLTDQQRYIELIDDIRFVCQEYGEVESISIPRPTEENKK 437
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
G+G+VF+E+ G + +++ ++ +Y E KY N+
Sbjct: 438 PSGLGRVFIEFKTIEGAIKCWKEIIKKRYDNRSLLVGFYSEKKYANR 484
>gi|67475980|ref|XP_653619.1| U2 snRNP auxiliary factor large subunit [Entamoeba histolytica
HM-1:IMSS]
gi|56470591|gb|EAL48233.1| U2 snRNP auxiliary factor large subunit, putative [Entamoeba
histolytica HM-1:IMSS]
Length = 712
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 116/467 (24%), Positives = 205/467 (43%), Gaps = 80/467 (17%)
Query: 73 NRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQ 132
N D+E RHR +R R SPS SP K S +D P A
Sbjct: 76 NEDREDRHRRVPEEERYNRSIRRRADRSPSLSPLGDKLPSRWDEQPKA------------ 123
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
+ Q+ Q + R A+R+YVG + +E+ I
Sbjct: 124 -----------------IDSVQIS-------QQLNVHQERAAKRIYVGNINSSTSEKDIV 159
Query: 193 TFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
F++ M + N D + ++ +N+E+ +AF+E RT+EEA A++LDG+ +G
Sbjct: 160 DAFNEAMRRGDYVDKNDTR--DIITHIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGA 217
Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
+V+VRRP DYNP L G Q + G E +++G +P T+
Sbjct: 218 SVKVRRPKDYNPVLPFISGLSQ----------LMEPGTTNPRESI--LYMGNIPLQMTDE 265
Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV--YQDPAVTDIACAALNGLKMGDKTLT 367
QI++ LE+ L F +++D D G +G +C+ YQ+P + +G+ +G +
Sbjct: 266 QIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLFEYQNPEYKE-KILTFDGINLGGNKIE 324
Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG------MNTLGGGMSLFGETL--- 418
V SG + L +A + KM T+ +N+ G ++F + L
Sbjct: 325 V----CSGVDGFKH---LPKASLNELFSKMFPHTTDLVIGTLLNSSVGYSTVFEKILKPS 377
Query: 419 --------AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
++++ + + + L D + Y E+++D+R C +YG + ++ IPRP + +
Sbjct: 378 EKIEDQHVSRIIIIFNMVYPEDLTDQQRYIELIDDIRFVCQEYGEVESISIPRPTEENKK 437
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
G+G+VF+E+ G + +++ ++ +Y E KY N+
Sbjct: 438 PSGLGRVFIEFKTIEGAIKCWKEIIKKRYDNRSLLVGFYSEKKYANR 484
>gi|358391563|gb|EHK40967.1| hypothetical protein TRIATDRAFT_135674 [Trichoderma atroviride IMI
206040]
Length = 558
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 150/532 (28%), Positives = 234/532 (43%), Gaps = 94/532 (17%)
Query: 33 RDRHHRDFKSGG-------DDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSR 85
RDR D S D R +Y+ D D + R RD D+ R R R
Sbjct: 66 RDREREDRYSSARDRRGDRDWDRDRGSYRRDARRDDDERPSRRERDPYDDRRRGGRDRRD 125
Query: 86 SHSSDRFRNRSKSLSPS----RSPS-----------KSKRRSGFDMAPPAAAMLPG--AA 128
+ + + SPS R P+ + +R + +D+ PP ++ A
Sbjct: 126 DGFARQQEQQQPRRSPSPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYDLVTAEQAK 185
Query: 129 VPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA-----TRHARRVYVGG 181
+ G LPG P P T+L AF P +T +R A+R+ V
Sbjct: 186 LSGMFPLPGAP--------RQQPMDPTKLQAFITQPGGQVTSAGLKASNSRQAKRLLVSN 237
Query: 182 LPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMAL 241
+P A E A+ +FF+ + G N D V + ++ FA +E R +A+ A+AL
Sbjct: 238 VPSGAGEDALISFFNLQLN--GLNVIESSDPCVLCQFSADRAFAVLEFRNASDATVALAL 295
Query: 242 DGIIFE----------GVA--VRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAI 288
DGI E GV+ + +RRP DY P L + P P N+
Sbjct: 296 DGISMEADDAMNGTADGVSSGLNIRRPKDYVMPALPDEM-PFDPEVISNVV--------- 345
Query: 289 GGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
PD ++ + +P + TE Q+ ELL +FG F LVKD+ T S+G F Y +
Sbjct: 346 -----PDTVHKLCITNIPSFLTEEQVIELLAAFGKPKAFVLVKDQSTEESRGIAFTEYLE 400
Query: 346 P-AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM 404
P + + A +LNG+ +G K L V +A+ I ++A G+
Sbjct: 401 PSSANEPALNSLNGMDVGGKKLKVTKAS-------------------IGPTQVANFDVGI 441
Query: 405 NTLGGGMSLFGETLAK--VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
+ G S + + V+ L +T + L D+++YEEI ED+++EC K+G +V + +P
Sbjct: 442 TAISGLASQTSNDIERSSVIQLLNMVTPEELIDNDDYEEICEDVQDECAKFGKVVELKVP 501
Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
RP ++ GVGK++++Y A AL+GRKF TV A Y+PE+ +
Sbjct: 502 RPSGGSRQSAGVGKIYVKYDSEESATKALTALAGRKFADRTVVATYFPEENF 553
>gi|330931856|ref|XP_003303563.1| hypothetical protein PTT_15819 [Pyrenophora teres f. teres 0-1]
gi|311320368|gb|EFQ88342.1| hypothetical protein PTT_15819 [Pyrenophora teres f. teres 0-1]
Length = 578
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 126/396 (31%), Positives = 189/396 (47%), Gaps = 54/396 (13%)
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMP------VQVMTQQATRHARRVYVGGLPPLA 186
LPG P A P M P ++L AF + P + A + ++R+YV LP
Sbjct: 218 LPGAPRAAP-----MDP---SKLAAF-ISPSAGTATAAALATSAAKQSKRLYVHNLPSGC 268
Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
Q I FF+ + G N D ++ +I K++A +E + E+A+ A+A++GI
Sbjct: 269 TSQEIMEFFNNQLN--GLNVVSGNDPCLSAHIATSKEYAALEFKAPEDATLALAMNGISM 326
Query: 247 --EGVA-----VRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVF 298
EG A + +RRP DY PT P P +++V + P+++
Sbjct: 327 RDEGGAPDRSGLSIRRPKDYITPTADENAYP----PGDEVSSVV--------KDSPNKLS 374
Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
+ +P Y E QI+EL+E+ G L F LVKD T +G FC Y D + D LN
Sbjct: 375 IVNIPTYIEEEQIRELVETMGKLKAFILVKDTGTDQHRGIAFCEYADNEIIDAVIEGLND 434
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
+ +GD L V RAT Q T + + ++ L G +
Sbjct: 435 IPLGDGNLKVSRATVGLQQSTGLDGGVG----------------AISMLAGASAAENHEH 478
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
++V+CL +T+D L +D+EYEEI ED+ EECGKYG +V IPRP GVGK++
Sbjct: 479 SRVVCLMNMVTSDELLNDDEYEEIKEDIEEECGKYGPIVETKIPRP-AGARVNLGVGKIY 537
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++Y D A AL+GR+F TV A + E+ +
Sbjct: 538 IKYQDTESAQKAIKALAGRQFSRRTVVATEFSEEGF 573
>gi|340520531|gb|EGR50767.1| predicted protein [Trichoderma reesei QM6a]
Length = 539
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 128/422 (30%), Positives = 196/422 (46%), Gaps = 72/422 (17%)
Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
+ +R + +D+ PP ++ A + G LPG P P T+L AF P
Sbjct: 160 RKRRLTQWDIKPPGYDLVTAEQAKLSGMFPLPGAP--------RQQPMDPTKLQAFMTQP 211
Query: 163 V-QV----MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
QV + +R A+R+ V +P E+A+ FF+ + G N D V
Sbjct: 212 GGQVSSAGLKASNSRQAKRLLVYNVPSGVTEEALIAFFNLQLN--GLNVIETPDPCVLCQ 269
Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE------GVA------VRVRRPTDYNPTLAA 265
+ +K FA VE R +A+ A+ALDGI E G A + +RRP DY
Sbjct: 270 FSSDKTFAVVEFRNASDATVALALDGITMEADDAQNGTANGGSHGLDIRRPKDY------ 323
Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLH 322
+ PG P + I PD ++ + +P + E QI ELL +FG
Sbjct: 324 -VMPGIPD------DIPYDPEVISNVV-PDTVHKLCITNIPTFLNEEQIIELLAAFGKPK 375
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPA-VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
F LVKDR T S+G F Y DP+ + A +LNG+ + K L V +A+
Sbjct: 376 SFVLVKDRSTEESRGIAFTEYLDPSSANEPALNSLNGMDVAGKKLKVTKAS--------- 426
Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAK--VLCLTEAITADALADDEEY 439
I ++A G+ + G S + + V+ L +T + L D+++Y
Sbjct: 427 ----------IGPTQVANFDVGITAISGLASQTSNDIERSSVIQLLNMVTPEELLDNDDY 476
Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALSGRK 498
EEI ED+++EC K+G +V + +PRP ++ GVGK+F++ +D+V AT A AL+GRK
Sbjct: 477 EEICEDVQDECSKFGKVVELKVPRPTGGSRQSAGVGKIFVK-FDSVESATKALTALAGRK 535
Query: 499 FG 500
F
Sbjct: 536 FA 537
>gi|403367221|gb|EJY83425.1| RNA-binding proteins (RRM domain) [Oxytricha trifallax]
Length = 543
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 129/530 (24%), Positives = 233/530 (43%), Gaps = 55/530 (10%)
Query: 13 EGSRHKSSWVSGRSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDY 72
+ +H+SS ++R R R + RRRDK DR+G RD T + Y
Sbjct: 45 DDKKHRSSRDIDKARDESSKRKDDKRRTSDSREGRRRDK----DRKGKRD---TSEEKKY 97
Query: 73 NRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQ 132
+R +++ R++ + SSD ++ S S + + + + L + Q
Sbjct: 98 SRSEKKDRRNKDGNMSSDSQKSSPLVSSSSSNTPEKYKGDDILITEGYKLYLERKKLRKQ 157
Query: 133 LP-GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
+A+ + + L L FP + R++ + LPP E+ +
Sbjct: 158 EEEKKAAALQDGMEGGLKLRKVDLKDFP------------NYKRKLVIQNLPPDITEEDV 205
Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE-GVA 250
+F V+++ + +++V + F +E R +E + LDG + G
Sbjct: 206 MNYFFTVISSFS-KVEYQKNPIMSVIKYKDLGFVTLEFRKRDEGEICLTLDGTEYRTGYK 264
Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE-----GPD---------- 295
+R+ R + A + G+ G++ + G + PD
Sbjct: 265 MRIMRVKRFIDDWNADIDKGKNPIEAMTRGKGVSLFSTGNNQFKEPAKPDQKAGKKEKVE 324
Query: 296 ----RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD-RDTGNSKGYGFCVYQDPAVTD 350
R+++G +P + +K++ ESFG L F+LVKD + +KGY F Y D D
Sbjct: 325 EVDNRLYMGNIPNSMKDEDVKKMCESFGRLKAFNLVKDPMNPDLNKGYAFFEYVDERSID 384
Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
A +LNGL +K L V++A+A ++ +Q Q I + K +
Sbjct: 385 KAIKSLNGLDFKEKKLKVQKASAHQKT--------SQTQIQIGMYKNVPDEKRL-----P 431
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
+ LF T ++V+ I+ + L +++E + +D+ +EC YG ++++ IP+PD+ G
Sbjct: 432 IPLFAMTPSRVVQFINMISVEDLFEEDEIIHVKDDLLQECKNYGEIISIEIPKPDEQGHA 491
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
T GVGK+F+++ V A+ LSGRK+ G TV +YPE + K++S
Sbjct: 492 TYGVGKIFVKFNHIVAAKQARYKLSGRKYNGRTVVVSFYPEHYFDIKEFS 541
>gi|403224363|dbj|BAM42493.1| splicing factor [Theileria orientalis strain Shintoku]
Length = 377
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 173/366 (47%), Gaps = 50/366 (13%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN---VYINHEKKF 224
++ + +R+Y+G LP + FF+ + A+ ++ D +V+ +Y N E+ +
Sbjct: 47 EENKKRQKRLYIGNLPAGMKLGDVVEFFNGALLAMVPSNQTTKDPLVSKTEIY-NPEQGY 105
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
F+E +T E A LDGI G ++++RRP D+
Sbjct: 106 CFLEFKTPELTDLAFKLDGITCNGYSLKIRRPIDFTQ----------------------- 142
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
G ++F+ + TE +++ELLE G L F+L+KD TG SKGYGF Y+
Sbjct: 143 ----GNQLEDTKIFIQNVATDVTEAELRELLEKHGKLKLFNLIKDPITGASKGYGFFEYE 198
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT----ASGQSKTEQESI-----LAQAQQHIAIQ 395
D +A LNG + L+V+ A ASG + ++ + Q+ + +
Sbjct: 199 DSRSAKMAVLHLNGQALKQNVLSVKHAAFGYFASGGKPIDCKASNLPNSITQSILNNPLL 258
Query: 396 KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
+ LQ S + G +V+ L + ++ L D Y EI+ +EE GKYG
Sbjct: 259 GLQLQNS---------KIVGAKPTRVVQLLNMVFSEDLLSDYNYNEIVRLTKEEAGKYGA 309
Query: 456 LVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKY 514
L +V+PRP ++ GVGKVFL+Y + + A++ +GR F N V A +YPEDKY
Sbjct: 310 LDEIVVPRPSKDLTFKSGVGKVFLKYKEVLHARKAQHMFNGRIFDKNRVVCAAFYPEDKY 369
Query: 515 FNKDYS 520
+Y+
Sbjct: 370 SRGEYT 375
>gi|451849636|gb|EMD62939.1| hypothetical protein COCSADRAFT_200575 [Cochliobolus sativus
ND90Pr]
Length = 576
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 126/396 (31%), Positives = 191/396 (48%), Gaps = 54/396 (13%)
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMP------VQVMTQQATRHARRVYVGGLPPLA 186
LPG P A P M P ++L AF + P + A + ++R+YV LP
Sbjct: 216 LPGAPRAAP-----MDP---SKLAAF-ISPSTGTATAAALATSAAKQSKRLYVHNLPSGC 266
Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
Q I FF+ + + S GP D V+ +I K++A +E + E+A+ A+A++GI
Sbjct: 267 TSQEIMEFFNTQLNGLNVVS-GP-DPCVSAHIATSKEYAALEFKAPEDATLALAMNGISM 324
Query: 247 -------EGVAVRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVF 298
+ + +RRP DY PT P P +++V + P+++
Sbjct: 325 RDDGGAPDRAGLSIRRPKDYITPTADENAYP----PGDEVSSVV--------KDSPNKLS 372
Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
+ +P Y E QI+EL+E+ G L F LVKD T +G FC Y D + D LN
Sbjct: 373 IVNIPTYIEEEQIRELVETMGKLKAFILVKDTSTDQHRGIAFCEYADNEIIDAVIEGLND 432
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
+ +GD L V RAT Q T + + ++ L G ++
Sbjct: 433 IPLGDGNLKVSRATVGLQQTTGLDGGVG----------------AISMLAGASAVENREH 476
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
++V+CL +T+D L +DEEYEEI ED+ EECGKYGT++ IPRP GVGK++
Sbjct: 477 SRVVCLMNMVTSDELLNDEEYEEIKEDIEEECGKYGTILETKIPRP-AGARVNLGVGKIY 535
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++Y D A AL+GR+F TV A + E+ +
Sbjct: 536 IKYQDIESAQKAIKALAGRQFSRRTVVATEFSEEGF 571
>gi|156085070|ref|XP_001610018.1| RNA recognition motif (RRM)-containing protein [Babesia bovis]
gi|154797270|gb|EDO06450.1| RNA recognition motif (RRM)-containing protein [Babesia bovis]
Length = 383
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 170/361 (47%), Gaps = 48/361 (13%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG----PGDAVVNVYINHEKKFAFV 227
RH RR+Y+G LP +A+ F S + +S P + ++ N ++ + F+
Sbjct: 57 RH-RRLYIGNLPSGTTYKALVEFLSAALRLPNDDSGQTVQVPHISKTEIF-NEDQGYCFL 114
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
E T E A LDGI F+G +++RRP DY T ++
Sbjct: 115 EFSTPELADACFKLDGINFKGKLLKIRRPIDYGTTSSSE--------------------- 153
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+VFV +P +E ++KELLE G + +LVKD TG +KGYGF + D
Sbjct: 154 ------DTKVFVQNIPPTMSEAEVKELLEKHGKIKSSNLVKDLKTGQNKGYGFFEFDDSR 207
Query: 348 VTDIACAALNGLKMGDKTLTVRRAT----ASGQSKTEQESIL---AQAQQHIAIQKMALQ 400
+A LNG +G L+V+ A A+G T+ ++ + Q ++ + LQ
Sbjct: 208 AAKMAVCHLNGHIIGKNVLSVKHAAFSYFAAGGKLTDCKATNLPNSVTQSILSNPLLGLQ 267
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
+G S +++ L + + L D+ Y E+ + + EE KYG L ++V
Sbjct: 268 MQSGRRIGSKPS-------RIVQLINIVFHEDLIQDKRYHEVKDAIMEEAKKYGHLEDIV 320
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKYFNKDY 519
IPRP+ + GVGKVFL++ D + A+ L+GR F GN V A ++P D++ Y
Sbjct: 321 IPRPNDDLSYKEGVGKVFLKFGDEISSRRAQYMLNGRVFDGNRIVCAAFFPLDRFLKGKY 380
Query: 520 S 520
+
Sbjct: 381 T 381
>gi|189204129|ref|XP_001938400.1| splicing factor U2AF 65 kDa subunit [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985499|gb|EDU50987.1| splicing factor U2AF 65 kDa subunit [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 572
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 127/396 (32%), Positives = 188/396 (47%), Gaps = 54/396 (13%)
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMP------VQVMTQQATRHARRVYVGGLPPLA 186
LPG P A P M P ++L AF + P + A + ++R+YV LP
Sbjct: 212 LPGAPRAAP-----MDP---SKLAAF-ISPSAGTATAAALATSAAKQSKRLYVHNLPSGC 262
Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
Q I FF+ + G N D ++ +I K++A +E + E+A+ A+A+ GI
Sbjct: 263 TSQEIMEFFNNQLN--GLNVVSGNDPCLSAHIATSKEYAALEFKAPEDATLALAMTGISM 320
Query: 247 --EGVA-----VRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVF 298
EG A + +RRP DY PT P P +++V + P+++
Sbjct: 321 RDEGGAPDRSGLSIRRPKDYITPTADENAYP----PGDEVSSVV--------KDSPNKLS 368
Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
+ +P Y E QI+EL+E+ G L F LVKD T +G FC Y D + D LN
Sbjct: 369 IVNIPTYIEEEQIRELVETMGKLKAFILVKDTGTDQHRGIAFCEYADNEIIDAVIEGLND 428
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
+ +GD L V RAT Q T + + ++ L G +
Sbjct: 429 IPLGDGNLKVSRATVGLQQSTGLDGGVG----------------AISMLAGASAAENHEH 472
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
++V+CL +T+D L +DEEYEEI ED+ EECGKYG +V IPRP GVGK++
Sbjct: 473 SRVVCLMNMVTSDELLNDEEYEEIKEDIEEECGKYGPIVETKIPRP-AGARVNLGVGKIY 531
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++Y D A AL+GR+F TV A + E+ +
Sbjct: 532 IKYQDTESAQKAIKALAGRQFSRRTVVATEFSEEGF 567
>gi|296083697|emb|CBI23686.3| unnamed protein product [Vitis vinifera]
Length = 882
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 150/538 (27%), Positives = 230/538 (42%), Gaps = 113/538 (21%)
Query: 25 RSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRD-KERRHRHR 83
RSR E+ R HR +G D++ R++N ++ HD RH D KERR
Sbjct: 222 RSRKSEKESKRKHR---TGEDEKNRERN------SMKKHDPGKRHESEFLDRKERRESPP 272
Query: 84 SRSHSSDRFRNRS---------------------------------KSLSPS-RSPSKSK 109
SR SD RNR K+ SP+ RSP K
Sbjct: 273 SRRQHSDADRNRISNNGSSSHFRRHGGSASGLGGYSPRKRRTEAAIKTPSPTNRSPEK-- 330
Query: 110 RRSGFDMAPPAA-AMLPGAAVPGQLP-GVPSAVPEMAQNMLP-------FGATQLGAFPL 160
+ +G+D+ P M G+ + +LP VP AVP A P ++ +
Sbjct: 331 KSAGWDLPPSRTDGMNAGSVLSNELPSAVPVAVPVTATTAKPPLPRIYSDAVSKNKNVSI 390
Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
+Q+ QATR RR+YV LP ++E+A+ + + + G N ++ I+
Sbjct: 391 DSIQLT--QATRPMRRLYVENLPVSSSEKALMECLNNFLLSSGINHVQGTPPCISCIIHK 448
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
EK A VE T E+AS A++ DGI F G +++RRP D+ L A
Sbjct: 449 EKGQALVEFLTPEDASAALSFDGISFSGSILKIRRPKDFVDMTGV---------QEKLVA 499
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
A I + P ++F+GG+ + + E+ +FG L + + D G + F
Sbjct: 500 APDAISDI-VKDSPHKIFIGGISRALSSDMLMEIAAAFGPLKAYRFQVNEDLG--EPCAF 556
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
Y D +VT ACA LNG+K+G + LTV +A + +A++
Sbjct: 557 LEYVDQSVTLKACAGLNGMKLGGQVLTVVQAIPNA---------------------LAME 595
Query: 401 TSGMNTLGGGMSLFGETL----AKVLCLTEAITADALA--DDEEYEEILEDMREECGKYG 454
+G N G+ + L +VL L + D L+ + E EEILED+R EC ++G
Sbjct: 596 NTG-NLPFYGIPEHAKPLLERPTQVLKLKNVVNPDDLSSLSEAELEEILEDIRLECTRFG 654
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDA-------VGCATAKNALSGRKFGGNTVN 505
T+ +V I + + + T LE Y+A +GC N++ GG T N
Sbjct: 655 TVKSVNIVKYNNSHVST-------LEVYEAADNTGSNLGCDG--NSMKAETLGGGTDN 703
>gi|320170643|gb|EFW47542.1| splicing factor u2af large subunit [Capsaspora owczarzaki ATCC
30864]
Length = 393
Score = 144 bits (364), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 106/355 (29%), Positives = 165/355 (46%), Gaps = 66/355 (18%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
R +RR+Y+GG+ P + I F ++ M G S+ PG+ V+ + + +K FAF++MRT
Sbjct: 98 RQSRRLYIGGIVPGTPDVLIVDFLNREMNQRGMTSS-PGNPVLAIQMTPDKNFAFLDMRT 156
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYN-------PTLAAALGPGQPSPNLNLAAVGLA 284
EEA+ +ALDGI FEG R++RP +Y P+L + A G +
Sbjct: 157 SEEATMCIALDGIPFEGTVFRIKRPKEYEGREANDPPSLFGMPSSSGGGFSSQGGAQGGS 216
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
G G + P+++++GGLP+ E QI+EL L F ++++ F + +
Sbjct: 217 FGGSMGNDNPNKIYIGGLPFSLDEQQIREL------LQTFGVIRN----------FSLVR 260
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM 404
+ +GQSK +Q G
Sbjct: 261 E---------------------------GNGQSKGQQPP--------------PSMPYGA 279
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
+ G + V+ L +T + L D EEY++I++D+REEC KYG +V+V IPRP
Sbjct: 280 PSSFGAAPITPMQATPVVQLLNMVTPEELMDPEEYQDIVDDIREECSKYGEVVSVAIPRP 339
Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
G E GVGKV++E+ + A ALSGRKF V +Y D Y ++
Sbjct: 340 -VPGREVSGVGKVYVEFSNVDHAYQALQALSGRKFASRIVVTSFYGLDAYRRSEF 393
>gi|452001453|gb|EMD93912.1| hypothetical protein COCHEDRAFT_1020092 [Cochliobolus
heterostrophus C5]
Length = 352
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 175/358 (48%), Gaps = 53/358 (14%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
+ ++R+YV LP Q I FF+ + + S GP D V+ +I K++A +E +
Sbjct: 28 KQSKRLYVHNLPSGCTSQEIMEFFNTQLNGLNVVS-GP-DPCVSAHIATSKEYAALEFKA 85
Query: 232 VEEASNAMALDGIIF-------EGVAVRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGL 283
E+A+ A+A++GI + + +RRP DY PT P P +++V
Sbjct: 86 PEDATLALAMNGISMRDDGGAPDRAGLSIRRPKDYITPTADENAYP----PGDEVSSVV- 140
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
+ P+++ + +P Y E QI+EL+E+ G L F LVKD T +G FC Y
Sbjct: 141 -------KDSPNKLSIVNIPTYIEEEQIRELVETMGKLKAFILVKDTSTDQHRGIAFCEY 193
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D + D LN + +GD L V RAT Q QT+G
Sbjct: 194 ADNEIIDAVIEGLNDIPLGDGNLKVSRATVGLQ-----------------------QTTG 230
Query: 404 MNTLGGGMSLFGETLA-------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
++ G +S+ A +V+CL +T+D L +DEEYEEI ED+ EECGKYGT+
Sbjct: 231 LDGGVGAISMLAGASAAENREHSRVVCLMNMVTSDELLNDEEYEEIKEDIEEECGKYGTI 290
Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ IPRP GVGK++++Y D A AL+GR+F TV A + E+ +
Sbjct: 291 LETKIPRP-AGARVNLGVGKIYIKYQDTESAQKAIKALAGRQFSRRTVVATEFSEEGF 347
>gi|399216439|emb|CCF73127.1| unnamed protein product [Babesia microti strain RI]
Length = 424
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 114/404 (28%), Positives = 189/404 (46%), Gaps = 50/404 (12%)
Query: 150 FGATQLGAFPL-MPVQVMTQQATRHARRVYVGGLPP------LANEQA-IATFFSQVMTA 201
FG G+ L +P + +A R RR+Y+G +P L + Q+ I F + +
Sbjct: 34 FGFDSSGSSALAIPAADLDPEAERRHRRLYIGNVPAGNHNTNLGSSQSDIVAFLNGALLT 93
Query: 202 IGGNS---AGPGDAVVNVY--INHEKKFAFVEMRTVEEASNAMALDGII-------FEGV 249
+ N+ A P D + N E +F F+E+R V+ + +DGI + G
Sbjct: 94 VLSNTGMPATPADTPITKCESFNSENRFCFIELRNVDVTLVCLKMDGISLVDSGINYNGN 153
Query: 250 AVRVRRPTDYNP----TLAAALGPG--QPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLP 303
A+++ RP+DY P LA + P QP +A ++ + +P
Sbjct: 154 ALKISRPSDYVPPSNNELATQMQPTIQQPPRGFTMALQVF------------KLHIQNIP 201
Query: 304 YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGD 363
E + EL++ FG + ++KD TG K F ++D + A AL G ++
Sbjct: 202 TTMAEDGVLELVKEFGDVKYVYIIKDT-TGQHKNTAFVEFKDSVSLEPASKALTGKEVEG 260
Query: 364 KTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL-------FGE 416
++LT + T S Q+ T LA + ++ ++ S +S+ G
Sbjct: 261 QSLTAKIVT-SNQADTLAS--LAAGKYNLGATHLSTSISRKILSDPLLSIGVQSGRKIGA 317
Query: 417 TLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
T++ V+ L + + L DD+ Y+ +LED+R+E KYGTL ++VIPRP+ + GVGK
Sbjct: 318 TVSTVVQLLNIVFHEDLIDDDSYQSLLEDIRKEAKKYGTLEDIVIPRPNLDKTFNEGVGK 377
Query: 477 VFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKYFNKDY 519
VFL++ D + A+ L+GR+F V A +YP DK+ K Y
Sbjct: 378 VFLQFADELSSRKAQYMLNGRRFDAKRVVCAAFYPLDKFLEKTY 421
>gi|428671645|gb|EKX72563.1| conserved hypothetical protein [Babesia equi]
Length = 455
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 166/369 (44%), Gaps = 52/369 (14%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
ATR R +YVG +PP+++ + F ++ + AI G S PG+ + +I+ + +AF+E+
Sbjct: 72 ATRPYREIYVGNIPPVSDVSTLLDFLNEALIAINGTSM-PGNPCLKGWISSDSHYAFIEL 130
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGL----- 283
RT+EEASN M L G+ G +RV RP Y P LA A P P+ + +L A+GL
Sbjct: 131 RTMEEASNCMQLTGLNCMGYNIRVNRPKTYTPEMLALAPSPTVPTLDPSLLAMGLKALKN 190
Query: 284 ------ASGAIGGAEG----PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
A+ I E DR+ + +P ++ +K +E+ G + + D
Sbjct: 191 AREQIVAASDILATEKAKAMTDRLCIIDIPSETQDSDLKSAIEAIGQVKYIHFIND---D 247
Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
SK YQ I AL L K + A G S + Q +
Sbjct: 248 PSKRVCLFEYQHIEQQKI---ALEQLPANHKVIMAIDAVTQG---IINPSYIRQQLEKCE 301
Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
I + E +VL L+ ++ + L DD EY +I++D+R EC Y
Sbjct: 302 IMR------------------PEVPTRVLWLSNLVSKEELDDDAEYFDIIDDVRTECEDY 343
Query: 454 GTLVNVVIPR-------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
G ++ + +PR + +T VG F+ + GC A+ L GR+FG V+A
Sbjct: 344 GQVIRLELPRVPKGLTEEEMKTVDTSSVGCAFVLFTTIDGCTKARKILGGRRFGPRIVDA 403
Query: 507 FYYPEDKYF 515
Y+ E YF
Sbjct: 404 HYFSE-LYF 411
>gi|396472864|ref|XP_003839217.1| hypothetical protein LEMA_P028900.1 [Leptosphaeria maculans JN3]
gi|312215786|emb|CBX95738.1| hypothetical protein LEMA_P028900.1 [Leptosphaeria maculans JN3]
Length = 587
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 119/402 (29%), Positives = 186/402 (46%), Gaps = 67/402 (16%)
Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQAT------RHARRVYVGGLPPLA 186
LPG P A P M P ++L AF + P A + ++R+YV LP
Sbjct: 228 LPGAPRAAP-----MDP---SKLAAF-MTPSAGSASSAALAPSAAKQSKRLYVHNLPSGV 278
Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
+ + + FF+ + G N D ++ I K++A +E +T E+A+ A+A++GI
Sbjct: 279 SSEELMEFFNLQLN--GLNVVSGQDPCLSAQIATSKEYAALEFKTPEDATVALAMNGISM 336
Query: 247 -------EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFV 299
+ + +RRP DY P+ + N S + + P+++ +
Sbjct: 337 REESGGPDRSGLSIRRPKDYI----------TPTADDNAYTGDEVSSVV--KDSPNKLSI 384
Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
+P Y E Q++EL+ + G L F LVKD T +G FC Y D + D LN +
Sbjct: 385 VNIPTYIEEEQVRELVGTMGKLKAFVLVKDESTDQHRGIAFCEYADNEIVDAVIEGLNDI 444
Query: 360 KMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
+GD L V RAT Q QT+G++ G +S+ A
Sbjct: 445 PLGDGNLKVTRATVGLQ-----------------------QTAGLDGGVGAISMLAGASA 481
Query: 420 -------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
+V+CL +T++ L +DEEYEEI ED+ EECGK+GT++ IPRP
Sbjct: 482 AENREHSRVICLMNMVTSEELINDEEYEEIKEDIEEECGKFGTILETKIPRP-AGARVNL 540
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
GVGK++++Y D A AL+GR+F TV + E+ +
Sbjct: 541 GVGKIYIKYQDTESAQKAIKALAGRQFSRRTVVVTEFSEEGF 582
>gi|169602913|ref|XP_001794878.1| hypothetical protein SNOG_04461 [Phaeosphaeria nodorum SN15]
gi|160706286|gb|EAT88221.2| hypothetical protein SNOG_04461 [Phaeosphaeria nodorum SN15]
Length = 594
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 128/442 (28%), Positives = 204/442 (46%), Gaps = 60/442 (13%)
Query: 94 NRSKSLSPSRSP-----SKSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMA 144
N+++ +P +P + +R + +D+ P + A + G LPG P A P
Sbjct: 187 NKAREPTPDLAPFTNILKRERRMTQWDIKPAGYENITAEQAKLSGMFPLPGAPRAAP--- 243
Query: 145 QNMLPFGATQLGAF-----PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVM 199
M P ++L AF + A++ ++R+YV LP + + FF+ +
Sbjct: 244 --MDP---SKLAAFMSPSAGTASAAALAPGASKQSKRLYVHNLPSGTTSEELLEFFNLQL 298
Query: 200 TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF-------EGVAVR 252
G N D ++ I K +A +E +T E+A+ A+A+ GI + +
Sbjct: 299 N--GLNVVSGQDPCLSAQIASSKTYAALEFKTPEDATVALAMSGISMRDDGGGPDRSGLS 356
Query: 253 VRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIK 312
+RRP DY PS + N S + + P+++ + +P + E QI+
Sbjct: 357 IRRPKDYI----------TPSADENAYPGDEVSSVV--KDSPNKLSIVNIPTFIEEEQIR 404
Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
EL+E+ G L+ F LVKD + +G FC Y D V + LN + +G+ L V RAT
Sbjct: 405 ELVETMGKLNAFVLVKDISSEQHRGIAFCEYADNEVVNAVIEGLNDITLGEGNLKVSRAT 464
Query: 373 ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
Q Q + L I++ A TS + +V+CL +T+D
Sbjct: 465 VGMQ----QNAGLDGGVNAISMLASAEPTSNLEH------------GRVVCLMNMVTSDE 508
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
L +DEEYEEI ED+ EEC KYG +V IPRP + GVGK++++Y D A
Sbjct: 509 LINDEEYEEIKEDIEEECQKYGPIVETKIPRP-AGARSSLGVGKIYIKYQDTESAQRAIK 567
Query: 493 ALSGRKFGGNTVNAFYYPEDKY 514
AL+GR+F TV A + E+ +
Sbjct: 568 ALAGRQFSRRTVVATQFSEEGF 589
>gi|145536694|ref|XP_001454069.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421813|emb|CAK86672.1| unnamed protein product [Paramecium tetraurelia]
Length = 426
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/311 (30%), Positives = 149/311 (47%), Gaps = 42/311 (13%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIF-EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
K + +E + E + D + F ++V +P + L L P LN
Sbjct: 141 KSWVVLECSSKEAKRALVTQDQVQFVNNCKIKVEKPRKF---LERILNPQAKEAELN--- 194
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD--RDTGNSKGY 338
A E R+++GGLP Y + + +L++SFGT F+LVKD +T SKGY
Sbjct: 195 ------ADQKQEDNTRLYLGGLPTYLRDEDVMKLIQSFGTTKYFNLVKDTTSNTEISKGY 248
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTV-RRATASGQSKTEQESILA--------QAQ 389
F Y+ A T A ALN L++GDK L + ++ Q S LA Q Q
Sbjct: 249 CFFEYEKTASTAKALKALNNLQIGDKKLKICKKINGRDQPSNYAGSFLASCDLLRIPQVQ 308
Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
Q + I + AL S KV+ + + L +D+ YEE++ED+R E
Sbjct: 309 QMLTIPQSALIPS-----------------KVVQFLNMCSIEDLYEDDIYEELMEDIRSE 351
Query: 450 CGKYGTLVNVVIPRPDQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
C ++G + + IPRPD++ G P VGK+F+++Y + AK L+GR + T+ +
Sbjct: 352 CIRFGQIEKIEIPRPDKDSGFCNPAVGKIFVKFYYQIPAKKAKFHLAGRTYNKRTIITSF 411
Query: 509 YPEDKYFNKDY 519
YPE+++ KDY
Sbjct: 412 YPEEQFDYKDY 422
>gi|213408691|ref|XP_002175116.1| splicing factor U2AF subunit [Schizosaccharomyces japonicus yFS275]
gi|212003163|gb|EEB08823.1| splicing factor U2AF subunit [Schizosaccharomyces japonicus yFS275]
Length = 511
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 105/414 (25%), Positives = 178/414 (42%), Gaps = 35/414 (8%)
Query: 109 KRRSGFDMAPPAAAMLPG--AAVPG--QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
++RS +DM PP + A + G LPG P + + + F + G+
Sbjct: 120 RKRSMWDMKPPGYENVTADQAKMSGLFPLPGAPRSATADPEKLAAFARSTAGSIIAP-PP 178
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
+ A+R ARR+ V LP + + F + ++ + V +Y +++
Sbjct: 179 PIQPGASRQARRLKVKELPAEFEVEDLKNVFEESISTSSFHKDRDTKHVTAIYPCKTERY 238
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGL 283
A +E+ T E+A+ + F+ V + R Y P +++ + +P +LN +
Sbjct: 239 AIIELATPEDATFIWGARKLKFKNETVLIDRLEGYIVPQISSEVAQKRPKNDLNQKVLDS 298
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
A D+V++G LP Y E QI ELL+ FG L L K+ S+GY FC Y
Sbjct: 299 A----------DKVYIGSLPLYLNEDQISELLKPFGELQSLFLAKNSADMTSRGYAFCEY 348
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
A LN ++ GD L V+ A Q + A I + K + + +
Sbjct: 349 ISSESATAAVQGLNNMEFGDTRLMVQFACVGIQQPVPSPRSVGMAAL-IELSKSSTEAAP 407
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
+VL + + AD D E+YE+I + ++ +C +YG ++++ +PR
Sbjct: 408 ---------------TRVLQIHNLLDADETLDTEDYEDIRKSVQNKCNEYGQVLDLKLPR 452
Query: 464 PDQNGGET---PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ T PGVG F+ + A A +SG +F ++ YYPED Y
Sbjct: 453 ETSSSDNTSAPPGVGVTFVRFGSIKDAANALQHMSGLRFDDRSIVIAYYPEDCY 506
>gi|118390069|ref|XP_001028025.1| U2 snRNP auxilliary factor, splicing factor [Tetrahymena
thermophila]
gi|89309795|gb|EAS07783.1| U2 snRNP auxilliary factor, splicing factor [Tetrahymena
thermophila SB210]
Length = 480
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 177/364 (48%), Gaps = 39/364 (10%)
Query: 177 VYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEAS 236
+ V +P + + I FF+ +++ + A P VV V + +FA + M S
Sbjct: 130 LIVSDIPRMITDIEIKEFFNILISKLRPELAEPS-PVVKVDVMTNGQFATMHMSCKLAKS 188
Query: 237 NAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDR 296
A+ L G+ F+ + + +P Y + Q V + GA+ + ++
Sbjct: 189 FALTLRGVEFQKCKLMIEKPKQY--FFRMYMEKQQND----DVMVDVDDGALQQMQM-NK 241
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN---SKGYGFCVYQDPAVTDIAC 353
+++GGLP Y + +++L E+FG L F++ K ++ SKGY F Y+DP +T+ A
Sbjct: 242 IYMGGLPTYLKDIDVRKLCETFGKLKYFNVAKQQNENKEQVSKGYCFFEYEDPNITEKAI 301
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQ----KMALQTSGMNTLGG 409
ALNGL GD+ L V R T Q+K LA QQ IQ K+A + + LGG
Sbjct: 302 KALNGLPCGDRKLKVSRVT-KDQNK------LANTQQ---IQSEKNKLAPSNNSGSFLGG 351
Query: 410 GMSLFGETLAKVLCLTE-------------AITADALADDEEYEEILEDMREECGKYGTL 456
+ + K+L + E ++ + L +D+ +++ +D+ EC K G +
Sbjct: 352 SDLIRKDEFQKLLTIPEFTSLPSRVIQLLNMVSIEDLFEDDIVDDLYQDVMTECEKIGPV 411
Query: 457 VNVVIPRPDQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
+ IP+P + G P +GKVF+++ + A+ +L+GR + TV A +YPEDK+
Sbjct: 412 EKIEIPKPCKTTGICPPCIGKVFVKFKYMLKAKKARYSLNGRTYNRRTVIASFYPEDKFD 471
Query: 516 NKDY 519
KD+
Sbjct: 472 RKDF 475
>gi|294954867|ref|XP_002788334.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large
subunit, putative [Perkinsus marinus ATCC 50983]
gi|239903646|gb|EER20130.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large
subunit, putative [Perkinsus marinus ATCC 50983]
Length = 543
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 171/357 (47%), Gaps = 24/357 (6%)
Query: 186 ANEQAIATFFSQVMTAIGGN--SAGPGDAVVNVYI---NHEKKFAFVEMRTVEEASNAMA 240
+++Q++ FF + A+ GN P VV+V+ + + A VE RT A+ AM
Sbjct: 113 SSQQSVMDFFKGALFAVTGNGGKTTPLHPVVSVFFLISDGHSRTALVEFRTPIAATVAMR 172
Query: 241 LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD-RVFV 299
L+GI +G + + RP YN + + + + + S A G + ++ +
Sbjct: 173 LNGIDLDGRKLAITRPHGYNKEDPSKSITAEDIQKVTIEELCGGSSTKKTAPGSNLQLGI 232
Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
LP TET +++LLE FG L L++D+ TG SKGYGFC ++DP D AL+
Sbjct: 233 YHLPPVMTETYLRDLLEQFGALTMVSLIRDKTTGLSKGYGFCQFEDPNDADRCLYALDQF 292
Query: 360 KMGDKTLTVRRAT---------ASGQSKTEQESILAQAQQHIA-IQKMALQTSGMNTLGG 409
+G+ +L+V R G + + LA +A +Q M + L
Sbjct: 293 VLGNYSLSVTRLVPDAQQGGAAGIGGAGVGPATNLADGSSGVAVVQSMTARVLANPALAA 352
Query: 410 GMSL---FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
+ G T + V+ L A+ + L + E + I +++REE ++GT++ V +PRP
Sbjct: 353 QLKAGREIGSTPSTVVQLLNAVYIEDLMSETEVKSIEDEIREEAQRHGTVLEVRVPRP-- 410
Query: 467 NGGETP---GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
+ TP GVGK+F+++ D + +GRKF + A +YP D+Y Y+
Sbjct: 411 SASLTPYANGVGKIFVQFADITAARKFQATNNGRKFDDRVMCAAFYPTDRYKMGKYT 467
>gi|18406905|ref|NP_564764.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
gi|12323801|gb|AAG51869.1|AC079675_4 U2 snRNP auxiliary factor, large subunit, putative; 15147-15692
[Arabidopsis thaliana]
gi|332195616|gb|AEE33737.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
Length = 111
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 63/105 (60%), Positives = 78/105 (74%)
Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
G T K++CLT+ +TAD L DD EY +I+EDM +E GK+G LVNVVIPRP+ + TPGV
Sbjct: 5 GGTPTKIVCLTQVVTADDLRDDAEYADIMEDMSQEGGKFGNLVNVVIPRPNPDHDPTPGV 64
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
GKVFLEY D G + A++ ++GRKFGGN V A YYPEDKY DY
Sbjct: 65 GKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDY 109
>gi|19112188|ref|NP_595396.1| U2AF large subunit (U2AF-59) [Schizosaccharomyces pombe 972h-]
gi|549144|sp|P36629.1|U2AF2_SCHPO RecName: Full=Splicing factor U2AF 59 kDa subunit; AltName: Full=U2
auxiliary factor 59 kDa subunit; Short=U2AF59; AltName:
Full=U2 snRNP auxiliary factor large subunit
gi|410322|gb|AAA03578.1| splicing factor U2AF large subunit [Schizosaccharomyces pombe]
gi|5441489|emb|CAB46760.1| U2AF large subunit (U2AF-59) [Schizosaccharomyces pombe]
Length = 517
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 110/413 (26%), Positives = 188/413 (45%), Gaps = 37/413 (8%)
Query: 109 KRRSGFDMAPPAAAMLPG--AAVPG--QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
++RS +D+ PP ++ A + G LPG P A + +L F + G+ + P
Sbjct: 130 RKRSLWDIKPPGYELVTADQAKMSGVFPLPGAPRAAVTDPEKLLEFARSAEGSI-IAPPP 188
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
+ A+R ARR+ V G+P E A +F + + + +V + E+ F
Sbjct: 189 PLQPGASRQARRLVVTGIPNEFVEDAFVSFIEDLFISTTYHKPE-TKHFSSVNVCKEENF 247
Query: 225 AFVEMRTVEEASNAMALDGIIFEG-VAVRVRRPTDYNPTLAAALGPGQPSPNLNLA-AVG 282
A +E+ T E+A+ L + V ++ +R +Y + P Q +P ++ +
Sbjct: 248 AILEVATPEDATFLWGLQSESYSNDVFLKFQRIQNY-------IVP-QITPEVSQKRSDD 299
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
A + + D++++ LP E Q+ ELL+ FG L F L+K+ G+SKG+ FC
Sbjct: 300 YAKNDV--LDSKDKIYISNLPLNLGEDQVVELLKPFGDLLSFQLIKNIADGSSKGFCFCE 357
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
+++P+ ++A + L+G L + A L QA M +++
Sbjct: 358 FKNPSDAEVAISGLDGKDTYGNKLHAQFACVG----------LNQA--------MIDKSN 399
Query: 403 GMNTLGGGMSLFGETL-AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
GM L +++ +VL L IT D + D +EYE+I E ++ + YG L+++ I
Sbjct: 400 GMAILTELAKASSQSIPTRVLQLHNLITGDEIMDVQEYEDIYESVKTQFSNYGPLIDIKI 459
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
PR G GKVF+ Y D A + G KF T+ +Y ED Y
Sbjct: 460 PRSIGTRNSGLGTGKVFVRYSDIRSAEVAMEEMKGCKFNDRTIVIAFYGEDCY 512
>gi|145544238|ref|XP_001457804.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124425622|emb|CAK90407.1| unnamed protein product [Paramecium tetraurelia]
Length = 435
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 147/320 (45%), Gaps = 51/320 (15%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIF-EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
K + +E + E + D + F ++V RP + L L P L
Sbjct: 141 KSWVVLECSSKEAKRALVTQDQVQFVNNCKIKVERPRKF---LERILNPQAREGEL---- 193
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD--RDTGNSKGY 338
A E R+++GGLP Y + + +L++SFGT F+LVKD +T SKGY
Sbjct: 194 -----SAEQKQEDNTRLYLGGLPTYLRDEDVMKLIQSFGTTKYFNLVKDTTSNTEISKGY 248
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-----EQESI--------- 384
F Y++ T A ALN L++GDK L + + Q EQ S
Sbjct: 249 CFFEYENTGSTAKALKALNNLQIGDKKLKICKVQGEPQQNKKINGREQPSNYAGSFLASC 308
Query: 385 ----LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
L Q QQ + I + AL S KV+ + + L +D+ YE
Sbjct: 309 DLLRLPQIQQMLTIPQSALIPS-----------------KVVQFLNMCSVEDLYEDDLYE 351
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGG-ETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
E++ED+R EC ++G + + IPRPD+ G P VGK+F+++Y + AK L+GR +
Sbjct: 352 ELMEDIRSECIRFGQIEKIEIPRPDKESGFCNPAVGKIFVKFYYQIPAKKAKFHLAGRTY 411
Query: 500 GGNTVNAFYYPEDKYFNKDY 519
TV +YPE+++ KDY
Sbjct: 412 NKRTVVTSFYPEEQFDYKDY 431
>gi|156086444|ref|XP_001610631.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797884|gb|EDO07063.1| conserved hypothetical protein [Babesia bovis]
Length = 400
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 172/374 (45%), Gaps = 53/374 (14%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
AT+ R +Y+G +PP A+ + F + +TA+ G S PG+ +I+ + +AFVEM
Sbjct: 20 ATKPYREIYIGNIPPQADVNNLLEFLNDALTAVNGTSI-PGNPCQKGWISADSHYAFVEM 78
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGL----- 283
RT+EEASN + L GI + ++R+ RP YNP L A P P+ + +L A+G+
Sbjct: 79 RTMEEASNCIQLSGINYMNYSLRINRPKTYNPEILTEAPSPTIPTLDPSLLALGIAGLKC 138
Query: 284 ASGAIGGAEG----------PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
AS I A DR+ V + E +K LE+ G L + + + +
Sbjct: 139 ASEQISAAADMLATERAKAMTDRLCVLNVT---DEPALKRELEAQGNLKYYQYITEDNKP 195
Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
C+++ + ++ AL GLK D + V A + + E + Q +
Sbjct: 196 -----PLCIFEYEHI-EMQNIALEGLKKRD--VKVELAVDALERGAMSEDFMKQQIESCD 247
Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
I K + T +VL L ++ + L DD EY +I++D+R EC +Y
Sbjct: 248 IMKSQIPT------------------RVLLLANLVSKEDLEDDAEYYDIIDDVRCECEEY 289
Query: 454 GTLVNVVIPR-------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
G +V V +PR + + VG F+ + + G + A+ L GRKFG V
Sbjct: 290 GPVVRVEMPRVPKGLTLDEIRNMDFSAVGCAFVLFSNIEGASKARKVLDGRKFGHRIVEC 349
Query: 507 FYYPEDKYFNKDYS 520
++ E + ++S
Sbjct: 350 HFFSELLFHVGEFS 363
>gi|146170296|ref|XP_001470832.1| hypothetical protein TTHERM_00484731 [Tetrahymena thermophila]
gi|146145092|gb|EDK31651.1| hypothetical protein TTHERM_00484731 [Tetrahymena thermophila
SB210]
Length = 471
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 118/220 (53%), Gaps = 7/220 (3%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG--NSAGPGDAVVNVYINH 220
+++ Q RHARR+Y+G +P N++ ++ + + + A GG S + +V I+
Sbjct: 22 IKLDNQSGYRHARRLYIGNIPETINQEYLSEWLYRSLEAAGGLQPSLPSENPIVKCEIDP 81
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN--PTLAAALGPGQPSPNLNL 278
+ +FAF E+R++EE + + LDGII +R+RRPT+Y P + P N +L
Sbjct: 82 KGRFAFTELRSIEETTALLQLDGIILWHRQLRIRRPTEYEKFPKVQGQFEANIPKLNFDL 141
Query: 279 -AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI-KEL-LESFGTLHGFDLVKDRDTGNS 335
VG+ +GP+++F+ LP E I EL L G + F LVKD T S
Sbjct: 142 FKTVGIVIIPTIVDDGPNKIFLANLPTKMDELMILDELKLRDMGEIKAFHLVKDNQTNQS 201
Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
KGY F ++DP++TD L+G++ +TLT +R+ G
Sbjct: 202 KGYAFFEFKDPSLTDNCIETLHGMQYAGRTLTCKRSQIGG 241
>gi|298713809|emb|CBJ27181.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1141
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 161/342 (47%), Gaps = 47/342 (13%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE-KKFAFVEMRTVE 233
R ++VGGLP + + F + M + ++ G+ V+ + + + FAF+E+RT E
Sbjct: 749 RELHVGGLPHGVSGVQLQDFLNAAMQYLKIATSA-GNPVIRIAMGPDGTNFAFIELRTEE 807
Query: 234 EASNAMA-LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
E + + + GI ++ RP + A +P ++V
Sbjct: 808 ETNATLGRMSGIQCGTGHLKFGRPKAHAAGATAV------APKKEESSV----------- 850
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
+ V LP T+ ++ELL FG L F+L+KD +G SKG Y D +A
Sbjct: 851 ----LMVMNLPDSLTDDHVRELLSPFGELKKFNLLKD-SSGKSKGTAVFEYTDMENGQLA 905
Query: 353 CAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMS 412
+ L+GL +G L V+R A + A + + ++++ + +
Sbjct: 906 LSGLSGLPVGKGKLMVQRVPAM---------MAATLLKPVKVKEVEDEQDNVEPTC---- 952
Query: 413 LFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
V+ L+ + + LADD EY EI D+ EEC +YG + + +PRP ++G E
Sbjct: 953 --------VVRLSNMVEVEELADDTEYAEIKGDVVEECEQYGKVKSAEVPRP-EDGKEVL 1003
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
G+G++F+E+ D G +NAL+GRKFGG V A YYP D +
Sbjct: 1004 GLGEIFVEFEDVAGATKGRNALAGRKFGGKAVKATYYPLDLF 1045
>gi|307106441|gb|EFN54687.1| hypothetical protein CHLNCDRAFT_53018 [Chlorella variabilis]
Length = 247
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 81/211 (38%), Positives = 109/211 (51%), Gaps = 36/211 (17%)
Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN-VYINHEK 222
Q+ A R A+RVYVG LP +E + +++M G GD + N ++ +K
Sbjct: 68 QLFNPDAARPAKRVYVGNLPAAVSEAELRQAVNELM--------GNGDLLFNGMHQVQDK 119
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
+AFVE R+VEEASNAMALDG+ F +++ VG
Sbjct: 120 GYAFVEFRSVEEASNAMALDGVKFHDSYLKL---------------------------VG 152
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
L + P ++F+GGLP ++E Q+KE+L FG L F+LV DR TGNSKGY F
Sbjct: 153 LEVVKTVVQDSPHKLFIGGLPCDWSEDQVKEMLMPFGQLKAFNLVMDRGTGNSKGYAFAE 212
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
+ D VTDI LNG K LTV+RA A
Sbjct: 213 FMDVHVTDIVIQNLNGKPCNTKFLTVKRALA 243
>gi|145542929|ref|XP_001457151.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424966|emb|CAK89754.1| unnamed protein product [Paramecium tetraurelia]
Length = 429
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 145/320 (45%), Gaps = 51/320 (15%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIF-EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
K + +E + E + D + F ++V RP + L L P LN
Sbjct: 135 KSWVVLECSSKEAKRALVTQDQVQFVNNCKIKVERPRKF---LERILNPQTKDGELN--- 188
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD--RDTGNSKGY 338
E R+++GGLP Y + + +L++SFG F+LVKD +T SKGY
Sbjct: 189 ------PDQKQEDNTRLYLGGLPTYLRDEDVMKLIQSFGITKYFNLVKDTTSNTEISKGY 242
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE----------SILA-- 386
F Y++ T A ALN L++GDK L + + Q + S LA
Sbjct: 243 CFFEYENAQSTAKALKALNNLQIGDKKLKICKVQGETQQNKKINGKDQPSNYAGSFLASC 302
Query: 387 ------QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
Q QQ + I + AL S KV+ + L +D+ +E
Sbjct: 303 DLLRIPQVQQMLTIPQSALIPS-----------------KVVQFLNMCSIQDLYEDDIFE 345
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGG-ETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
E++ED+R EC +YG + + IPRPD+ G P VGK+F+++Y + AK L+GR +
Sbjct: 346 ELMEDIRSECMRYGQIEKIEIPRPDKESGFCNPAVGKIFVKFYYQIPAKKAKFHLAGRTY 405
Query: 500 GGNTVNAFYYPEDKYFNKDY 519
T+ +YPE+++ KDY
Sbjct: 406 NKRTIITSFYPEEQFDYKDY 425
>gi|71027151|ref|XP_763219.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350172|gb|EAN30936.1| hypothetical protein TP03_0201 [Theileria parva]
Length = 509
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 172/369 (46%), Gaps = 39/369 (10%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
R +Y+G +PP+ + + + +Q + ++ G S PG+ + +I+ + +AF+E+RT+EE
Sbjct: 104 REIYIGNIPPVGDIEILMDIINQALISVNGTSM-PGNPCLKGWISSDGHYAFIELRTMEE 162
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGL---------- 283
ASN M L G+ G ++V RP ++ + A P P+ + +L A+G+
Sbjct: 163 ASNCMQLTGLNIMGHNIKVNRPKTFDADVFSKAPSPTVPTLDPSLLAMGVQALKSAKEQI 222
Query: 284 -ASGAIGGAEG----PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
A+ I AE DR+ + G+P + + + L GT+ + + + N
Sbjct: 223 AAASDILAAEKAKPITDRLCLVGIPKDTDQQTVVDTLRLHGTIKFTNFIMGIENFNYITV 282
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI----AI 394
+ +Y +T + + G+ + Q KT ES+ Q + I A+
Sbjct: 283 IYVIYIYQLMTIV--------EKGEMVVLFEYENLEDQ-KTALESLPKQGYRVILAIDAV 333
Query: 395 QKMALQTSGMNTLGGGMSLF-GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
+ + + T SL E +VL L+ ++ D L DDEEY +I++D+R EC Y
Sbjct: 334 TQGIISPQQIKTQLANCSLMRAEIPTRVLLLSNLVSKDELEDDEEYVDIIDDVRCECELY 393
Query: 454 GTLVNVVIPR-------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
G ++ V +PR + + VG F+ + + A+ L GRKFG TV+A
Sbjct: 394 GVVLRVELPRVPKGLTEEEMKAFDPTSVGSAFVLFSTVESASKARKVLDGRKFGQRTVHA 453
Query: 507 FYYPEDKYF 515
++ E YF
Sbjct: 454 HFFSE-LYF 461
>gi|399216014|emb|CCF72702.1| unnamed protein product [Babesia microti strain RI]
Length = 487
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 116/460 (25%), Positives = 197/460 (42%), Gaps = 71/460 (15%)
Query: 81 RHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAV 140
R R +SDR R R S S ++R FD +PP A P G + G +
Sbjct: 57 RGRDIDRASDRSRFR-------HSDSYDRKRFKFD-SPPKQA--PKEGFGGGVLGYVDGI 106
Query: 141 PEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMT 200
P + + G F ++TR +R++ + PP + I +F+ M
Sbjct: 107 PVQGKRHIIMQTCLFGIF------YSEAESTRFSRQLEISNTPPNIEVEVIIEYFNMAML 160
Query: 201 AIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN 260
A+GGN A+ + +++K +EMRT+EE SNA+ L+G+ G ++ + R +
Sbjct: 161 AVGGNCLPGNPAIRGKHNSNDKTSITIEMRTLEETSNALQLNGLNLMGKSLSITRVGNCP 220
Query: 261 PT-LAAALGPGQPSPNLNLAAVG-----------LASGAI----GGAEGPDRVFVGGLPY 304
P + A P P+ + ++ A+G L S AI GGA DR+ + LP
Sbjct: 221 PEYINKAPPPTVPTISPSILALGVNGLQSADIKPLLSNAITSLVGGAPKTDRLLILDLPI 280
Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLK---- 360
+E QIK ++E FG L L K+ D ++ G C+ + T++ AL ++
Sbjct: 281 TQSEDQIKSMVEEFGKLKYIQLFKNADDTSA---GMCLIEF-VDTNVQVEALQKMRLQYN 336
Query: 361 --MGDKTLTVR---RATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
+ + LT R R Q + + E + Q I + + T+ +++
Sbjct: 337 IILAEDALTKRIIDRNLLRLQMRNQSELMKTQIPTRCIIIRNLVTTASVSS--------- 387
Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
+ L +D EY+E++ED+R EC G + V +PR +
Sbjct: 388 ------------VQFMILQNDREYQEVIEDIRAECDLMGQVERVEVPR-----NPPSEMA 430
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
F+ + G A A+ +L GR+F N V +Y E+++
Sbjct: 431 YAFVLFESIQGAAMARKSLGGRRFASNVVQVDFYNEEEFM 470
>gi|145538137|ref|XP_001454774.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422551|emb|CAK87377.1| unnamed protein product [Paramecium tetraurelia]
Length = 426
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 144/320 (45%), Gaps = 51/320 (15%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIF-EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
K + +E + E + D + F ++V RP + L L P LN
Sbjct: 132 KSWVVLECSSKEAKRALVTQDQVQFVNNCKIKVERPRKF---LERILNPQARDGELNPEQ 188
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD--RDTGNSKGY 338
E R+++GGLP Y + + +L++SFG F+LVKD +T SKGY
Sbjct: 189 ---------KQEDNTRLYLGGLPTYLRDEDVMKLIQSFGITKYFNLVKDTTSNTEISKGY 239
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE----------SILA-- 386
F Y+ T A ALN L++GD+ L + + Q + S LA
Sbjct: 240 CFFEYESAQSTAKALKALNNLQIGDRKLKICKVQGETQQNKKINGKDQPSNYAGSFLASC 299
Query: 387 ------QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
Q QQ + I + AL S KV+ + L +D+ +E
Sbjct: 300 DLLRIPQVQQMLTIPQSALIPS-----------------KVVQFLNMCSIQDLYEDDIFE 342
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRKF 499
E++ED+R EC +YG + + IPRPD+ G P VGK+F+++Y + AK L+GR +
Sbjct: 343 ELMEDIRSECVRYGQIEKIEIPRPDKESGFCNPAVGKIFVKFYYQIPAKKAKFHLAGRTY 402
Query: 500 GGNTVNAFYYPEDKYFNKDY 519
TV +YPE+++ KDY
Sbjct: 403 NKRTVITSFYPEEQFDYKDY 422
>gi|85000357|ref|XP_954897.1| snrnp splicing factor (U2AF) [Theileria annulata strain Ankara]
gi|65303043|emb|CAI75421.1| snrnp splicing factor (U2AF), putative [Theileria annulata]
Length = 486
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 174/368 (47%), Gaps = 52/368 (14%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
++ R +Y+G +PP+ + + + +Q + ++ G S PG+ + +I+ + +AF+E+R
Sbjct: 100 SKAFREIYIGNIPPVGDIEILMDIINQALISVNGTSM-PGNPCLKGWISSDGHYAFIELR 158
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGL------ 283
T+EEASN M L G+ G ++V RP Y+ + A P P+ + +L A+G+
Sbjct: 159 TMEEASNCMQLTGLNIMGHNIKVNRPKTYDADVFSKAPSPTVPTLDPSLLAMGVQALKSA 218
Query: 284 -----ASGAIGGAEG----PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
A+ I AE DR+ + G+P + + +LL+S GT+ + +
Sbjct: 219 KEQIAAASDILAAEKAKSITDRLCLVGIPKDMEQQTVVDLLQSQGTIKFTHFIME----- 273
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
KG +++ + D A + K G + + +I A Q I+
Sbjct: 274 -KGEMVVLFEYENLEDQKSALESLPKQGYRVIM---------------AIDAVTQGIISP 317
Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
Q++ Q + + + E + L L+ ++ + L DDEEY +I++D+R EC YG
Sbjct: 318 QQIKTQLANCSLMK------AEIPTRALLLSNLVSKEELDDDEEYVDIIDDIRCECELYG 371
Query: 455 TLVNVVIPR-------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAF 507
++ V +PR + N + VG F+ + + A+ L GRKFG TV+A
Sbjct: 372 VVLRVELPRVPKGLSEEEMNSFDPTSVGSGFVLFSTVDSASKARKVLDGRKFGQRTVHAH 431
Query: 508 YYPEDKYF 515
++ E YF
Sbjct: 432 FFSE-LYF 438
>gi|340503018|gb|EGR29650.1| splicing factor u2af large subunit, putative [Ichthyophthirius
multifiliis]
Length = 438
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/337 (26%), Positives = 159/337 (47%), Gaps = 25/337 (7%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
Q+ HA R+Y+G +P + + + F + + GG PG+ +++ + KKF F+
Sbjct: 109 QKNYIHALRIYIGNIPDPIDTEDVCHFVYKSLLESGG-LLEPGNPIISKKNDPIKKFIFL 167
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
++R++EE S M LDGI+++G ++R RRP DY T+ G + P L+ + +
Sbjct: 168 QLRSIEETSACMQLDGILYKGKSLRFRRPKDYT-TMPQVEG-TRKIPILDRNKLRIVQTQ 225
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+ +++ V +P +E + ++L+++G L F L D TG SKG+ FC Y
Sbjct: 226 VENTY--NKLQVMNIPETISEEHVMQILQNYGELRSFHLAVDIYTGESKGFAFCEYLTDK 283
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE-----------------SILAQAQQ 390
T L+G ++ +K + V+R + E++ SIL +
Sbjct: 284 ATMDCLNQLSGQQILNKIINVKRCNPNLAPPVEEQMQPIEVLVKNLCDFINKSILESGFK 343
Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC 450
IQ+ +Q N L E VL + I + +D EYE I D++++
Sbjct: 344 D--IQEEYIQKVISNEGQKYSGLNQEEATSVLKIKNVIDKQVIEEDPEYEFIYNDLKQQL 401
Query: 451 GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
K+G L ++IPR + + VG VF+E+ + C
Sbjct: 402 VKFGRLKQMIIPRLKEK-YQPDSVGLVFVEFENEKIC 437
>gi|397623851|gb|EJK67169.1| hypothetical protein THAOC_11833 [Thalassiosira oceanica]
Length = 436
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 174/385 (45%), Gaps = 50/385 (12%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
T+ +R ++VG PP +E + F S M+ + + P D V KF F+E+
Sbjct: 66 TKLSRELFVGNTPPGTSEALLMQFLSGAMSRV---NLCPPDVTPIVTCRKNDKFCFIELA 122
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYN----PTLAAALGPGQPSPNLNLAAVGLASG 286
TV+ A+ A+ L+GI F G ++RV RP+ Y+ P+ GQP P +AAV +G
Sbjct: 123 TVDLANKALNLNGIPFLGSSLRVARPSKYSGPHVPSQTWQQLTGQPLPP-GMAAVPENTG 181
Query: 287 AIGGAEGPDRV----FVGGLPYYFTETQIKELL----ESFG--TLHGFDLVKDRDTGNSK 336
G D++ F+G T +++ L E G T+ G +V R +
Sbjct: 182 VTMALSGEDKLSRELFIGNTTPEMTAEMLRDFLGRAMEQVGLSTMPGNPIVTVRPSAK-- 239
Query: 337 GYGFCVYQDPAVTDIACAA-LNGLKMGDKTLTVRRATASGQSKTEQ---ESILAQ----- 387
F + ++ + A A LN + L V R + +T E ILA+
Sbjct: 240 ---FAFIEVRSMQEAANALNLNNIPYLGAQLRVGRPSKYSGPETPHGNWEDILAKFMSGE 296
Query: 388 -------AQQHIAIQKMALQTSGMNTLGGGMSLFGETLAK-----VLCLTEAITADALAD 435
Q + +Q+ + ++L ++ +K V+ L +T L D
Sbjct: 297 LHLKNNATQANPLVQQAHAVAAAASSLAPSLASVPPLASKASPSPVVELRHMLTQQDLDD 356
Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
D EY +ILED R+EC +GTL N+VIPR + PG K+FLEY A A L+
Sbjct: 357 DNEYNDILEDTRDECSSFGTLKNIVIPR------KGPGATKIFLEYMTAEDAGKAIAGLA 410
Query: 496 GRKFGGNTVNAFYYPEDKYFNKDYS 520
GR F G V A Y+ K+ N+DYS
Sbjct: 411 GRTFDGRKVTAVYFDTVKFANEDYS 435
>gi|255082091|ref|XP_002508264.1| RNA binding protein [Micromonas sp. RCC299]
gi|226523540|gb|ACO69522.1| RNA binding protein [Micromonas sp. RCC299]
Length = 493
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/246 (32%), Positives = 121/246 (49%), Gaps = 35/246 (14%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN---SAGPGDAVVNVYINHEKKFA 225
Q TR +RR+YVG LP N++A+ FF+ M G S GP +VVN I HEK FA
Sbjct: 112 QHTRQSRRLYVGSLPKPVNDEALHAFFNNAMVNSGAAIDPSGGP--SVVNTTITHEKGFA 169
Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY----NPTLAAALGPGQPSPNLNLAAV 281
F+E R +E+A +A+ DGI+F G + ++RP DY NP + A G P + L
Sbjct: 170 FIEFRRLEDAESALMFDGIVFNGSKLIIKRPKDYDAARNP-IWAMRGQAPPQDEVKLIGE 228
Query: 282 GLASGAI--GGAE----------------------GPDRVFVGGLPYYFTETQIKELLES 317
L G I G E GP +++ GG T+ Q++++L+S
Sbjct: 229 ELPIGTIIVDGKEVKIPLPPPLPSEWPRLPRRTPNGPHKMYCGGFHPLHTDLQVRQVLQS 288
Query: 318 FGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQS 377
G L F ++ D + G G+ F Y+DP ++ +A L G+++ ++ L RR
Sbjct: 289 VGELKSFAVMPD-ENGRPTGHAFFEYKDPRLSAVAETVLTGIRVRNRRLVCRRMNPDAAP 347
Query: 378 KTEQES 383
+ ES
Sbjct: 348 EKPGES 353
>gi|403223258|dbj|BAM41389.1| snRNP splicing factor U2AF [Theileria orientalis strain Shintoku]
Length = 534
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 170/371 (45%), Gaps = 51/371 (13%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
ATR R +Y+G +PP+ + + +Q + ++ G S PG+ + +I+ + +AFVE+
Sbjct: 119 ATRPYREIYIGNIPPVGDIAILLDIINQALISVNGTSM-PGNPCLKGWISSDGHYAFVEL 177
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA-ALGPGQPSPNLNLAAVGL----- 283
RT+EEASN M L G+ G ++V RP Y+P L + A P P+ + +L A+GL
Sbjct: 178 RTMEEASNCMQLTGLNIMGHNIKVNRPKTYDPDLMSKAPSPTVPTLDPSLLAMGLQALKS 237
Query: 284 ------ASGAIGGAEG----PDRVFVGGLPYYFTETQIKELLESFGTL-HGFDLVKDRDT 332
A+ + AE DR+ + +P + + L+ S G + + + + + ++
Sbjct: 238 AREQIVAASDVLAAEKAKVMTDRLCIVDIPPEADKQTVINLVHSMGEVKYTYFVDEPAES 297
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGL-KMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
G +K Y + D A+ + KM + + A G E + + +
Sbjct: 298 GTNKRVFLFEYMN---MDHQKKAMEEIPKMNYRLILAIDAVTQGMIAPE---YIKKQLES 351
Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG 451
AI K + T + L L ++ + L DD EY +I++D++ EC
Sbjct: 352 CAIMKPEVPT------------------RALLLGNLVSKEELDDDAEYVDIIDDVKTECE 393
Query: 452 KYGTLVNVVIPRPDQNGGE-------TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
YG ++ + +PR + E VG F+ + G + A+ L GRKFG TV
Sbjct: 394 DYGVVLRLELPRVPKGLSEEEMRSFDESSVGSAFVLFSTVDGASKARKVLDGRKFGNRTV 453
Query: 505 NAFYYPEDKYF 515
A ++ E YF
Sbjct: 454 KAHFFSE-LYF 463
>gi|218192051|gb|EEC74478.1| hypothetical protein OsI_09930 [Oryza sativa Indica Group]
Length = 1128
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 157/342 (45%), Gaps = 42/342 (12%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
V QATR RR+++ LP LA E + ++ ++++ + ++ IN +K+
Sbjct: 628 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 687
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AFVE T E+A+ A++ DG F G ++++RRP +Y A + P +PS + L + +
Sbjct: 688 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 745
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
A+ P ++F+ G+ + + E++ SFG L + + + D G + F Y
Sbjct: 746 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEDLGGA--CAFLEY 796
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D ++T ACA LNG+K+G LT A + TEQ +A I A
Sbjct: 797 IDHSITSKACAGLNGMKLGGGILT---AVNVFPNSTEQ--AFNEASPFYGIPDSA----- 846
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTLVNV-V 460
SL E KVL L + L E EEILED+R EC ++G + ++ V
Sbjct: 847 -------KSLLEEP-TKVLQLKNVFDQEEYLLLSKSELEEILEDVRVECARFGAVKSINV 898
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+ P + T G E C + +++GGN
Sbjct: 899 VEYPASSDNTT---GDTITE------CEDGSTKIEPKEYGGN 931
>gi|340506650|gb|EGR32741.1| u2 snrnp auxilliary splicing factor, putative [Ichthyophthirius
multifiliis]
Length = 276
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 133/261 (50%), Gaps = 34/261 (13%)
Query: 278 LAAVGLASGAIGGAEG--PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN- 334
+ + + + EG +++++GGLP Y + +IK+L E+FG L F+L K ++
Sbjct: 26 MDKIQVEDAILDSEEGIQENKIYMGGLPTYLKDPEIKKLCETFGKLKYFNLAKQQNENKE 85
Query: 335 --SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
SKGY F Y+D VTD A ALNGL GD+ L V + T Q+K LA+ QQ
Sbjct: 86 WVSKGYCFFEYEDKEVTDRAIKALNGLPCGDRKLKVSKVT-RDQNK------LAKTQQ-- 136
Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTE-------------AITADALADDEEY 439
+Q + LG + E + K+L + E + + L +D+ Y
Sbjct: 137 ------IQNDSGSYLGDCHLIKNEFVRKMLSIPEYTYQPSRVIQLLNMCSPEDLFEDDIY 190
Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRK 498
EI +D++ EC K G + V I RP + G P VGK+F+++ + A++ L+GR
Sbjct: 191 NEIYQDVQSECEKIGPIEKVEIVRPCKMTGICPPSVGKIFVKFKYLLKAKRARHVLNGRT 250
Query: 499 FGGNTVNAFYYPEDKYFNKDY 519
+ TV A +YPE+K+ K++
Sbjct: 251 YNKRTVVASFYPEEKFDCKEF 271
>gi|118376950|ref|XP_001021657.1| hypothetical protein TTHERM_00151210 [Tetrahymena thermophila]
gi|89303423|gb|EAS01411.1| hypothetical protein TTHERM_00151210 [Tetrahymena thermophila
SB210]
Length = 554
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 161/365 (44%), Gaps = 43/365 (11%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
Q+ HA RVY+G +P + + + F + M GG PG+
Sbjct: 210 QKQYIHALRVYIGNIPDPVDVEDVCKFVFEQMANAGG-LLEPGNP--------------- 253
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
+R++EE S M LDGII++G ++R RRP D+ L G +P P L+ + +
Sbjct: 254 -LRSIEETSACMELDGIIYKGKSLRFRRPKDFG-VLQKVEG-TRPVPTLDKTKLKIVQTQ 310
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+ +++ + LP F+E + +LL ++G L F L D+ T SKG+ FC +
Sbjct: 311 VENTY--NKLQIMNLPENFSEEHVMQLLLTYGDLKSFHLAVDKITSESKGFAFCEFITDR 368
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT---SGM 404
T L+G ++ +K + V+R + E E IL+ Q + + + +T SG
Sbjct: 369 STVECLNKLSGQQILNKVINVKRCNPQLAPQHE-EPILSLDQLYKNLVENVNKTIIESGQ 427
Query: 405 -----NTLGGGMS--------LFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG 451
+ L +S L E VL L + + +D EY I D++ +
Sbjct: 428 KDIQEDYLKKMLSINAPKYDGLITEDATNVLKLHNIVNKQLIEEDAEYHFIFNDLKTQLD 487
Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA-FYYP 510
+ G ++IPR E G+G VF+E+ + A L K+ G V A FY P
Sbjct: 488 RIGRTKQIIIPRKKDKFLE--GIGFVFVEFDNERTSQIASFLLQKIKYDGKDVKAEFYSP 545
Query: 511 EDKYF 515
+ YF
Sbjct: 546 Q--YF 548
>gi|428172624|gb|EKX41532.1| hypothetical protein GUITHDRAFT_112506 [Guillardia theta CCMP2712]
Length = 514
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 164/371 (44%), Gaps = 39/371 (10%)
Query: 169 QATRHARRVYVGGLP---PLANEQAIATFFSQVMTAI-------------GGNSAGPGDA 212
Q T ARRVYVG LP P +E A+ FF Q M + G + PG
Sbjct: 133 QLTLKARRVYVGNLPQLDPPISEPALKEFFDQAMHQVQDQGAYFKAEFAQAGLTQSPGCC 192
Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP 272
V +V+I+ EK FAF+E+RTV+EA++AM LDGI F G +RV RP DY P A+
Sbjct: 193 VCDVWISSEKHFAFIEVRTVQEATSAMTLDGITFYGTPLRVNRPHDYVPPAPDAMIMTMA 252
Query: 273 SPNLNLAAVGLA---SGAIGGAEGPDRVFVGGLPY-YFTETQIKELLESFGTLHGFDLVK 328
L + G+A S + + R+ VG L T +K+ + ++ LV
Sbjct: 253 QAGLMGSGGGIAANLSALMQQTKKARRIHVGNLLVGSMTSASLKQFISQ--SMQQLSLVV 310
Query: 329 -------DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
D +GF + A + A AL+G++ + + V R E
Sbjct: 311 KPGDPCIDSFLSGDGNFGFVEMRTVAEANNA-MALSGIECNGRPIRVGRPADYVPLNAE- 368
Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAK---VLCLTEAITADALADDEE 438
++AQ Q I +G GM L G +K V+ + ++ D LA+D+E
Sbjct: 369 --LIAQC-QGTGILGTPGDAGVTEAVGAGM-LNGPDESKATEVVVIRNMMSDDDLANDDE 424
Query: 439 YEEILEDMREEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGR 497
++I ED +C +YG +V VI RP + G +G V +++ A N L+
Sbjct: 425 CKDIAEDTISKCEEEYGKVVRFVIVRPGREGAPADLIGNVLVQFETKESAIKAANDLNHV 484
Query: 498 KFGGNTVNAFY 508
KF V Y
Sbjct: 485 KFDERVVETDY 495
>gi|145547916|ref|XP_001459639.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427465|emb|CAK92242.1| unnamed protein product [Paramecium tetraurelia]
Length = 402
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 157/356 (44%), Gaps = 48/356 (13%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
HA R+Y+G LP ++ + + Q M + G PGD V+ V + +K+ FV+ R++
Sbjct: 86 HAVRLYLGNLPDNVDKDHLHNYIRQQMESHGA-VLDPGDPVIQVQLQPGQKYCFVQFRSI 144
Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
EE A+ +D I ++G ++ +R DY ++ + + P I E
Sbjct: 145 EETEAALQIDTINYQGKPLKFKRVKDYE--ISPRIEGEREVP------------KIQPKE 190
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN-SKGYGFCVYQDPAVTDI 351
++FV GL + +L +G L ++V RD N KG+ FC ++ T
Sbjct: 191 PAQKLFVCGLAPDTDNDALANILSEYGNLKSLNVV--RDIKNVCKGFAFCEFETDLETQN 248
Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGM 411
LN +G + L V++ AQ Q + T + G
Sbjct: 249 CVNGLNNKVIGGRLLQVKK----------------NAQLPTPTQDYIIDTITL----GEQ 288
Query: 412 SLFGETLAK--------VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
S F L + V+ + A+ + DD EY I++D+++E K G L+++V+PR
Sbjct: 289 SAFEAKLQQINQMKVSSVVVINNAVRIKNIEDDYEYNFIVKDLKKEIEKIGRLISMVVPR 348
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+ G + G+GKVF+E+ + A L +K+ G ++ +Y Y +K Y
Sbjct: 349 --KKDGYSEGIGKVFVEFENEQFAKIAIILLQNKKYDGREIDIAFYDPRLYADKQY 402
>gi|209878476|ref|XP_002140679.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
gi|209556285|gb|EEA06330.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
Length = 577
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/364 (23%), Positives = 164/364 (45%), Gaps = 21/364 (5%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
R VYVG LP + F +Q + N PG+ V+ +I+ + K+AF E R++EE
Sbjct: 183 REVYVGNLPSGIGTTTLLEFMNQFLIK-NCNITTPGNPFVSAWISSDGKYAFCECRSMEE 241
Query: 235 ASNAMALDGII-FEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE- 292
A+ A+ L+ I G +R+ RP + + ++ + + + +
Sbjct: 242 ANMALQLNNTINLNGNILRIGRPKTIENSSNINSSNEPNNSVVSSISTQSNTTFLSNIQP 301
Query: 293 ---GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV-------KDR-DTGNSKGYGFC 341
DR+ + G PY +++ I++++ L+ K R ++ N C
Sbjct: 302 IIKKADRIVISGFPYSYSDEDIEDIIREVNGNQAIKLLYVPPNSNKGRIESSNCLKIAIC 361
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
++D +T+ +N + + L R+ + Q+K ++L+ I ++ +
Sbjct: 362 EFEDVVITERVIRRVNTQNVCNLKLNAFRSHEALQNKYIL-NVLSDEIHKIYDYEVKQLS 420
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
S + + + L G+ + + ++ IT + L D Y EI++++++E KYG + ++VI
Sbjct: 421 SDYSEISTFL-LRGQIPCRCIKISNIITPEELVVDNIYNEIMDEIKQEVCKYGNIKHIVI 479
Query: 462 PRP-----DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
PRP ++ G G VF+ Y D AK L KF G +V YYPE+ +
Sbjct: 480 PRPASAFKSEDSGYFSIYGSVFVLYNDVQNAIDAKINLYKMKFSGRSVCISYYPENYFIQ 539
Query: 517 KDYS 520
++S
Sbjct: 540 NNFS 543
>gi|50552688|ref|XP_503754.1| YALI0E09889p [Yarrowia lipolytica]
gi|49649623|emb|CAG79345.1| YALI0E09889p [Yarrowia lipolytica CLIB122]
Length = 601
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 161/365 (44%), Gaps = 51/365 (13%)
Query: 171 TRHARRVYVGGLPP-LANEQAIATFFSQVMTAIGGNSAGPGDAVVN-VYINHEKKFAFVE 228
+R ARR+ + G+P + AI +FF+ + G G + +V+ VY + VE
Sbjct: 262 SRVARRLILSGIPADQIDTVAIKSFFTDFIE--GLELQGSKERIVDGVYKHPRLPEVLVE 319
Query: 229 MRTVEEASNAMALDG--IIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
+ E A+ A+AL G I + G + +RRP++Y + P ++ ++
Sbjct: 320 FFSAEMATLALALSGLGINYSGPPISIRRPSNY-------ICPTPERSEVSRRSLDEEKE 372
Query: 287 AIGGAEGPD-RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
E + ++ V +P+ E Q+++L SFG L F L++ + S G Y+D
Sbjct: 373 VASVVEDSNTKIIVWDIPFNVEEDQVRQLTASFGELSAFQLIRQLPSRESAGIALVDYKD 432
Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
P V A + L+G +G K L V A G ++ L S N
Sbjct: 433 PEVVKDAVSGLSGQVIGGKNLKVMLA-CEGPTQ--------------------LSCSSNN 471
Query: 406 TLGGGMSLFGETLAK----VLCLTEAITADALADDEEYEEILEDMREECGKY--GTLVNV 459
L G +++ + ++ V+ L +T D L DD Y EI E + EC KY G V +
Sbjct: 472 GLKGIVTVMNDVKSRPESSVIVLFNLVTLDELLDDVAYREITEQVESECLKYGGGEEVQI 531
Query: 460 VIPRPDQNGGET----------PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
IPRPD + PGVGKV++++ A L+G +F +V A YY
Sbjct: 532 KIPRPDPEAMKASYRRLIFETRPGVGKVYVKFASVETSRVAMQKLTGLRFSRRSVIASYY 591
Query: 510 PEDKY 514
E+ +
Sbjct: 592 SEECF 596
>gi|108706080|gb|ABF93875.1| RNA recognition motif family protein, expressed [Oryza sativa
Japonica Group]
Length = 964
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 156/342 (45%), Gaps = 42/342 (12%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
V QATR RR+++ LP LA E + ++ ++++ + ++ IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AFVE T E+A+ A++ DG F G ++++RRP +Y A + P +PS + L + +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
A+ P ++F+ G+ + + E++ SFG L + + + G + F Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGA--CAFLEY 632
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D ++T ACA LNG+K+G LT A + TEQ +A I A
Sbjct: 633 IDHSITSKACAGLNGMKLGGGILT---AVNVFPNSTEQ--AFNEASPFYGIPDSA----- 682
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTLVNV-V 460
SL E KVL L + L E EEILED+R EC ++G + ++ V
Sbjct: 683 -------KSLLEEP-TKVLQLKNVFDQEEYLLLSKSELEEILEDVRVECARFGAVKSINV 734
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+ P + T G E C + +++GGN
Sbjct: 735 VKYPASSDNTT---GDTITE------CEDGSTKIEPKEYGGN 767
>gi|118489922|gb|ABK96758.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 787
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 141/303 (46%), Gaps = 39/303 (12%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
+ QAT RR+Y+ +P A+E+A+ + + + G + ++ EK
Sbjct: 285 IQLTQATHPIRRLYMENIPASASEKAVMDCLNNFLISSGVHHIQGTQPCISCIRQKEKGQ 344
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
A VE T E+AS A++ DG F G ++VRRP D+ + A G L A
Sbjct: 345 ALVEFLTPEDASAALSFDGRSFSGSIIKVRRPKDF---IEVATG--------ELEKSAAA 393
Query: 285 SGAIGG--AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
AIG + P ++F+GG+ + + E+ +FG L + +D + + F
Sbjct: 394 IDAIGDIVKDSPHKIFIGGISKVLSSKMLMEIASAFGPLKAYQFENRKDP--DEPFAFLE 451
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK-TEQESILAQAQQHIAIQKMALQT 401
Y D +VT ACA LNG+K+G + +T +A + S ++ S Q QH
Sbjct: 452 YADESVTFKACAGLNGMKLGGQVITAIQAVPNASSSGSDGNSQFGQISQH---------- 501
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALA--DDEEYEEILEDMREECGKYGTL--V 457
+L E +VL L +++L+ + E EE+LED+R EC ++G++ +
Sbjct: 502 --------AKALL-EKPTEVLKLKNVFDSESLSSLSNTEVEEVLEDVRLECARFGSVKSI 552
Query: 458 NVV 460
NV+
Sbjct: 553 NVI 555
>gi|312372039|gb|EFR20089.1| hypothetical protein AND_20681 [Anopheles darlingi]
Length = 384
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 128/284 (45%), Gaps = 24/284 (8%)
Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
P Y AA P + AAV + I R++VG +P+ TE ++ E
Sbjct: 105 PLQYKAMQAAGQIPANIVADTPQAAVPVVGSTI--TRQARRLYVGNIPFGVTEEEMMEFF 162
Query: 316 ESFGTLHGF-----DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
L G + V K + F ++ T A A + + ++L +RR
Sbjct: 163 NQQMHLSGLAQAAGNPVLACQINLDKNFAFLEFRSIDETTQA-MAFDSINFKGQSLKIRR 221
Query: 371 ATASGQSKTEQESILAQAQQHIA--IQKMALQTSGMNTLGG-----------GMSLFGET 417
+S + + I + ++ +GG G+SL G +
Sbjct: 222 PHDYQPMPGMTDSATVNVPEKFSGVISTVVPDSAHKIFIGGLPNYLNEDQVPGLSLVGSS 281
Query: 418 --LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
+VLCL +T D L D+EEYE+ILED+REEC KYG + +V IPRP + G + PG G
Sbjct: 282 GPPTEVLCLLNMVTPDELKDEEEYEDILEDIREECNKYGVVRSVEIPRPIE-GVDVPGCG 340
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
KVF+E+ V C A+ AL+GRKF V Y+ DKY +++
Sbjct: 341 KVFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 384
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 86/159 (54%), Gaps = 12/159 (7%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P V V+ TR ARR+YVG +P E+ + FF+Q M + G + G+ V+ I
Sbjct: 126 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 184
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP---SPN 275
N +K FAF+E R+++E + AMA D I F+G ++++RRP DY P PG + N
Sbjct: 185 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPM------PGMTDSATVN 238
Query: 276 LNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
+ G+ S + + ++F+GGLP Y E Q+ L
Sbjct: 239 VPEKFSGVISTVV--PDSAHKIFIGGLPNYLNEDQVPGL 275
>gi|347968831|ref|XP_003436305.1| AGAP002908-PB [Anopheles gambiae str. PEST]
gi|333467821|gb|EGK96708.1| AGAP002908-PB [Anopheles gambiae str. PEST]
Length = 144
Score = 114 bits (285), Expect = 1e-22, Method: Composition-based stats.
Identities = 64/163 (39%), Positives = 93/163 (57%), Gaps = 21/163 (12%)
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGET- 417
+++GDK L V+RA+ +K +++A Q + G+SL G +
Sbjct: 1 MQLGDKKLIVQRASVG--AKNSNAAVVAPVQIQVP----------------GLSLVGSSG 42
Query: 418 -LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
+VLCL +T D L D+EEYE+ILED+REEC KYG + +V IPRP + G + PG GK
Sbjct: 43 PPTEVLCLLNMVTPDELKDEEEYEDILEDIREECNKYGVVRSVEIPRPIE-GVDVPGCGK 101
Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
VF+E+ V C A+ AL+GRKF V Y+ DKY +++
Sbjct: 102 VFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 144
>gi|403345499|gb|EJY72120.1| Splicing factor U2af large subunit, putative [Oxytricha trifallax]
Length = 437
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 159/377 (42%), Gaps = 47/377 (12%)
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA--GPGDAVVNVYIN 219
P Q +T + R++YVG +PP I + + +G ++ GD +V +I+
Sbjct: 83 PSQGLTNH-NKAERQLYVGNIPPGLAVPQIMELLNTALKELGKDAGIFQEGDPIVGAWIS 141
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY-----NPTLAAALGP----G 270
+ +AFV+ RT EEA+ AL + G ++V RP + NP+ A P G
Sbjct: 142 GDGHYAFVDFRTAEEATQGFALQQVSIHGNNLKVGRPKNATGPIPNPSQLLAGNPNLMSG 201
Query: 271 QP--SPNLNLAAVGLASGAIGGAEGP------DRVFVGGLPYYFTETQIKELLESFGTLH 322
Q S N GL + +G +V V P ++ I ++ E FG +
Sbjct: 202 QNVISNNKKKTNQGLKNLQLGDQGNQIIQALNTKVMVSNFPVNHSKESIHKICEVFGKVK 261
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
DL+KD TG KG ++D L G K+ +K L V+R T T+ E
Sbjct: 262 NVDLLKDITTGEFKGQVNVEFEDELEAKKGYTGLMGFKIDEKVLFVKRLTTISAPTTQIE 321
Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
+ + +L + + L L I + + + ++Y+++
Sbjct: 322 GEVFK------------------------NLIEDKPTECLMLKNCIILEEMTERDDYKDL 357
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGG--ETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
+ EE +YG +V V PRP G PGVGKV++ + AK+ + R+
Sbjct: 358 EIAVEEEMSRYGKVVKVHCPRPPIFGDPYSVPGVGKVYVRFQTEEDSEKAKHGIYKRRLN 417
Query: 501 GNTVNAFYYPEDKYFNK 517
G V+ YY +K FNK
Sbjct: 418 GRAVDPVYYSVEK-FNK 433
>gi|383125816|gb|AFG43490.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125818|gb|AFG43491.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125820|gb|AFG43492.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125822|gb|AFG43493.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125824|gb|AFG43494.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125826|gb|AFG43495.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125828|gb|AFG43496.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125830|gb|AFG43497.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125832|gb|AFG43498.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125834|gb|AFG43499.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
gi|383125836|gb|AFG43500.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
Length = 93
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 50/88 (56%), Positives = 64/88 (72%)
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
A+ D L DD+E+E+I +DM+EECGK+G + +VIPRP G E PGVGKVF+EY +
Sbjct: 1 AVNPDELLDDQEFEDIYDDMKEECGKHGEITKLVIPRPKSTGEEVPGVGKVFVEYANTQS 60
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKY 514
A A+ +L GRKFGGN V A YYPEDK+
Sbjct: 61 SAKARASLHGRKFGGNVVVAVYYPEDKF 88
>gi|294948294|ref|XP_002785691.1| splicing factor u2af large subunit, putative [Perkinsus marinus
ATCC 50983]
gi|239899714|gb|EER17487.1| splicing factor u2af large subunit, putative [Perkinsus marinus
ATCC 50983]
Length = 370
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 138/296 (46%), Gaps = 16/296 (5%)
Query: 239 MALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD-RV 297
M L+GI +G + + RP YN + + + + + S A G + ++
Sbjct: 1 MRLNGIDLDGRKLAITRPHGYNKEDPSKSITAEDIQKVTIEELCGGSSTKKTAPGSNLQL 60
Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALN 357
+ LP TET +++LLE FG L L++D+ TG SKGYGFC ++DP D AL+
Sbjct: 61 GIYHLPPVMTETYLRDLLEQFGALTMVSLIRDKTTGLSKGYGFCQFEDPNDADRCLYALD 120
Query: 358 GLKMGDKTLTVRRATASGQSKTEQESILAQ-------AQQHIAIQKMALQTSGMNTLGGG 410
+G+ +L+V R Q A A +Q M + L
Sbjct: 121 QFVLGNYSLSVTRLVPDAQQGGAAGIGGAGVGPATNLADGSSGVQSMTARVLANPALAAQ 180
Query: 411 MSL---FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
+ G T + V+ L A+ + L + E + I +++REE ++GT++ V +PRP +
Sbjct: 181 LKAGREIGSTPSTVVQLLNAVYIEDLMSETEVKSIEDEIREEAQRHGTVLEVRVPRP--S 238
Query: 468 GGETP---GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
TP GVGK+F+++ D + +GRKF + A +YP D+Y Y+
Sbjct: 239 ASLTPYANGVGKIFVQFADITAARKFQATNNGRKFDDRVMCAAFYPTDRYKMGKYT 294
>gi|432095994|gb|ELK26905.1| Splicing factor U2AF 65 kDa subunit [Myotis davidii]
Length = 171
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 94/177 (53%), Gaps = 36/177 (20%)
Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
MPV V TR ARR+ VG +P E+A+ +N
Sbjct: 31 MPVPVAVSNMTRQARRLCVGNIPFGITEEAM--------------------------VNR 64
Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
+K FAF+E R+V+E + AMALDGIIF+G ++++RRP DY P + P P
Sbjct: 65 DKNFAFLEFRSVDETTQAMALDGIIFQGQSLKIRRPHDYQPLPDMSENPSVYLP------ 118
Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
G+ S + + ++F+GGLPYY + Q+KELL SFG L F+LVKD TG S+G
Sbjct: 119 -GVVSTVV--PDSAHKLFMGGLPYYLKD-QVKELLTSFGPLKAFNLVKDGATGLSRG 171
>gi|414591751|tpg|DAA42322.1| TPA: hypothetical protein ZEAMMB73_939656 [Zea mays]
Length = 704
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 59/106 (55%), Positives = 70/106 (66%), Gaps = 5/106 (4%)
Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD AP A ++ +PGQLPGV + +P + N+ A Q + P Q MT
Sbjct: 184 SGFDQAPTQQAVPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPHVIQP-QAMT 242
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAV 213
QQATRHAR VYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV
Sbjct: 243 QQATRHARPVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAV 288
>gi|108706079|gb|ABF93874.1| RNA recognition motif family protein, expressed [Oryza sativa
Japonica Group]
Length = 964
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 155/342 (45%), Gaps = 42/342 (12%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
V QATR RR+++ LP LA E + ++ ++++ + ++ IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AFVE T E+A+ A++ DG F G ++++RRP +Y A + P +PS + L + +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
A+ P ++F+ G+ + + E++ SFG L + + + G + F Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGA--CAFLEY 632
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D ++T ACA LNG+K+G LT A + TEQ +A I A
Sbjct: 633 IDHSITSKACAGLNGMKLGGGILT---AVNVFPNSTEQ--AFNEASPFYGIPDSA----- 682
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTLVNV-V 460
SL E KVL L + L E EEILED+R E ++G + ++ V
Sbjct: 683 -------KSLLEEP-TKVLQLKNVFDQEEYLLLSKSELEEILEDVRVEYDRFGAVKSINV 734
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+ P + T G E C + +++GGN
Sbjct: 735 VKYPASSDNTT---GDTITE------CEDGSTKIEPKEYGGN 767
>gi|224077134|ref|XP_002305147.1| predicted protein [Populus trichocarpa]
gi|222848111|gb|EEE85658.1| predicted protein [Populus trichocarpa]
Length = 191
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/173 (50%), Positives = 107/173 (61%), Gaps = 23/173 (13%)
Query: 15 SRHK--SSWVSGRSRTGERGRDRHHRDFKSG------GDDRRRDKNYKYDREGIRDHDRT 66
SRHK SS S R+ RGR + H ++ G +D RRDK +DR H+R+
Sbjct: 23 SRHKTYSSRESEHDRSRTRGRGKDHDRYRGGYKDGSVRNDGRRDKFGDFDR-----HERS 77
Query: 67 DR------HRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
R HRDY+ D+ RR+ +RS S+S RF+NRS+S S SRSPSKSKR+SGFDMAP
Sbjct: 78 SRGRNYHRHRDYDGDRGRRNGNRSSSYSQGRFQNRSRSRSRSRSPSKSKRKSGFDMAPSE 137
Query: 121 AAMLPGAAV----PGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
MLPGAAV GQLP +P +P + QN L FG TQ G FPLMP Q MTQQ
Sbjct: 138 VGMLPGAAVAVNDAGQLPSLPQTMPGVVQNALQFGTTQFGVFPLMPAQAMTQQ 190
>gi|429328959|gb|AFZ80718.1| hypothetical protein BEWA_001250 [Babesia equi]
Length = 711
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 157/381 (41%), Gaps = 83/381 (21%)
Query: 189 QAIATFFSQVM----TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
Q + FF+ + T+I N GP + N E+ + F+E T E A LDGI
Sbjct: 363 QDVVDFFNGALMTMSTSIDIN--GPMPVMKTEIFNQEQGYCFLEFTTAEYADLCYKLDGI 420
Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
G ++++RRP D++ ++++ ++FV +P
Sbjct: 421 QCNGYSLKLRRPIDFSSSMSSE---------------------------DTKIFVQNIPE 453
Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
F+E I++LLE+ G L +LV D T +KGYGF Y+ + A LNG + +
Sbjct: 454 SFSEEDIRKLLEAHGKLKTCNLVIDPFTRLNKGYGFFEYESSSSAKEAVIHLNGHVIQNN 513
Query: 365 TLTVRRA-----TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
L+V+ A A G+ + S + + H L G+ G G +
Sbjct: 514 VLSVKHAAFSSFAAGGKPADCRASSIITSVSHCVFSNPLL---GLQMQNGRKK--GSEPS 568
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR---------------- 463
+V+ L + + + DD+ Y E+L++++EE KYG L + IPR
Sbjct: 569 RVVQLLNVVYPEDILDDKNYREMLKEIKEEAQKYGPLEEIYIPRIHKREEPASIEDVKTE 628
Query: 464 -----------------------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
D+N GVGKVFL+Y + A+ L+GR F
Sbjct: 629 GNDKVAVKSEETSVKTDVRQQTIEDRNKEYQLGVGKVFLKYSNETAGRKAQYMLNGRIFD 688
Query: 501 GN-TVNAFYYPEDKYFNKDYS 520
N V A ++P D Y Y+
Sbjct: 689 KNRVVCAAFFPCDLYQQGKYT 709
>gi|358417046|ref|XP_001256277.3| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Bos taurus]
Length = 330
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 95/177 (53%), Gaps = 8/177 (4%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
+K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P + P P L
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGL 249
>gi|242037001|ref|XP_002465895.1| hypothetical protein SORBIDRAFT_01g047730 [Sorghum bicolor]
gi|241919749|gb|EER92893.1| hypothetical protein SORBIDRAFT_01g047730 [Sorghum bicolor]
Length = 969
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 163/373 (43%), Gaps = 45/373 (12%)
Query: 102 SRSPSK-----SKRRSGFDMAPPAA--AMLPGAAVP--GQLPGVPSAVPEMAQNMLPFGA 152
+++PSK K+ + +D P A + P +P GQ+ +P + + ++
Sbjct: 391 TKTPSKVIQSPEKKSATWDQPPVKANQSNFPTTFLPTVGQMAPIPFSFSTIKKDPSTTVE 450
Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVM--TAIGGNSAGPG 210
T L L V QATR RR+++ LP A E + + + T I + P
Sbjct: 451 TMLVGNSLTADSVQLTQATRPLRRLHIENLPDSATEDKLIDCLNDFLLPTGIKPQRSKP- 509
Query: 211 DAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPG 270
++ IN EK+ AFVE T E+A+ A++ DG G +R+RRP +Y T+ + P
Sbjct: 510 --CLSCTINREKRQAFVEFLTPEDATAALSFDGRSLNGSTLRIRRPKEYVETV--NVTPK 565
Query: 271 QPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDR 330
+P+ L S + A+ P ++F+ G+ + + E++ +FG L + + +
Sbjct: 566 KPA-----EETALISDVV--ADSPHKIFIAGIAGVISSEMLMEIVSAFGPLAAYRFLFNS 618
Query: 331 DTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQ 390
+ G F Y D ++T ACA LNG+ +G LT + + E A
Sbjct: 619 ELGGP--CAFLEYADRSITSKACAGLNGMMLGGCVLTAVHVFPNPPVEAANE-----ASP 671
Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMRE 448
I + A SL E KVL L + L E EE LED+R
Sbjct: 672 FYGIPENA------------KSLLKEP-TKVLQLKNTFEREEYMLLSKSELEETLEDVRV 718
Query: 449 ECGKYGTLVNVVI 461
EC ++G + +V +
Sbjct: 719 ECTRFGAVKSVHV 731
>gi|356536627|ref|XP_003536838.1| PREDICTED: uncharacterized protein LOC100810537 [Glycine max]
Length = 735
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 114/445 (25%), Positives = 177/445 (39%), Gaps = 115/445 (25%)
Query: 56 DREGIRDHDRTDRHRDYNRDKERR---------HRHRSRSHSS-----DRFRNRSKSLSP 101
+R+ + H D R N D +R H HR +S + +S++ +
Sbjct: 134 ERKELSMHSLKDSSRTKNPDIDRNRVSTNGSSGHHHRHGVSTSGLGGYSPRKRKSEAAAK 193
Query: 102 SRSPSK---SKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
+ SPSK K+R+G+D+ PPA P A V P AV +++ + L
Sbjct: 194 TPSPSKHSLEKKRAGWDL-PPAGTNNPSAVVSSSFPVSNCAVLSNMHDVVSTSSLDLALV 252
Query: 159 PLMPV---------------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIG 203
+PV V QATR RR+Y+ LP A+E+A+ F+ ++ +
Sbjct: 253 KPLPVSFPSDVSTGKNTNIDSVQLTQATRPIRRLYLENLPASASEKAVMDCFNNLLLSAR 312
Query: 204 GNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTL 263
N + ++ +K A VE T ++AS A++ DG + G V++RRP DY +
Sbjct: 313 VNHIQQAQPCICCILHKDKGQALVEFLTADDASAALSFDGSMLFGSIVKIRRPKDYIELM 372
Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHG 323
I G VF G L Y ET++
Sbjct: 373 -----------------------EIAG------VF-GSLKAYHFETKV------------ 390
Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
N+ F Y D +VT ACA LNG+K+G + LTV +A
Sbjct: 391 ----------NNGPCAFLEYVDHSVTIKACAGLNGMKLGGEVLTVLQAMPDAS------- 433
Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL----AKVLCLTEAITADAL--ADDE 437
L+ +G +L G+ + L +VL + AD + D
Sbjct: 434 --------------PLENAG-ESLSYGVPEHAKPLLRKPTQVLEINNVFAADTILSLSDM 478
Query: 438 EYEEILEDMREECGKYGTL--VNVV 460
EEIL+D+R EC ++GT+ +NVV
Sbjct: 479 AIEEILDDVRLECARFGTIKSINVV 503
>gi|357114131|ref|XP_003558854.1| PREDICTED: uncharacterized protein LOC100840355 [Brachypodium
distachyon]
Length = 840
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 133/301 (44%), Gaps = 36/301 (11%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPG-DAVVNVYINHEKK 223
V QATR RR+++ LP A+E + + + N ++ IN EK
Sbjct: 327 VQLTQATRPLRRLHIENLPSSASEDMLIGCLNDFFLSSDVNHIQKSKQPCLSCTINKEKH 386
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AFVE T E+A+ A++ DG F G A+++RRP +Y + P + + L
Sbjct: 387 QAFVEFLTPEDATAALSFDGRSFNGSALKIRRPKEY-------IEMANVVPKKTVEEIKL 439
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
AS A+ P ++FV G+ + + E++ SFG L + +D + + + F Y
Sbjct: 440 ASDV---ADSPHKIFVAGISGVISSEMLMEIVSSFGQLAAYRF-QDHEALSGR-CAFLEY 494
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D ++TD ACA LNG+K+G IL Q + + S
Sbjct: 495 IDHSITDKACAGLNGMKLGG-------------------CILTAVQVFPNPLEACNEASP 535
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTL--VNV 459
++ + E +VL L + L E EEI+ED+R EC ++G + +N+
Sbjct: 536 FYSIPDSAKMLLEAPTEVLQLKNVFDREEYLLLSKSELEEIMEDIRMECARFGAVKSINI 595
Query: 460 V 460
V
Sbjct: 596 V 596
>gi|297823139|ref|XP_002879452.1| hypothetical protein ARALYDRAFT_321074 [Arabidopsis lyrata subsp.
lyrata]
gi|297325291|gb|EFH55711.1| hypothetical protein ARALYDRAFT_321074 [Arabidopsis lyrata subsp.
lyrata]
Length = 497
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/430 (24%), Positives = 176/430 (40%), Gaps = 82/430 (19%)
Query: 96 SKSLSPSRSPSKSKRRSGFDMAPPAAA-MLPGAAVPGQLPGVPSAVPEMAQ-NMLPFGAT 153
+K++SP + S K+ + +D+AP + M G G +A P +++ +++
Sbjct: 132 TKAVSPP-NLSSEKKSAKWDLAPTVTSGMFSGPVFSGLQAATQTAYPTISEASLMLLKPL 190
Query: 154 QLGAFPLMPVQVMTQ-------QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
G F P + +T ++TR RR+Y +P A+E+++ F+ M + G N
Sbjct: 191 MEGTFRTPPPRQITSFDSVQLTESTRPMRRLYAENVPDSASEKSLIECFNGYMLSSGSNH 250
Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
+ ++ IN EK A VE T ++AS A++LDG F G +++RRP DY
Sbjct: 251 IKGSEPCISCIINKEKSQALVEFLTPQDASAALSLDGCSFAGSNLKIRRPKDY------- 303
Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
VG T + E++ FG L +
Sbjct: 304 --------------VG--------------------------TTLMEIVSVFGPLKAYRF 323
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
V + D Y Y D +VT ACA LNG+K+G +T A S
Sbjct: 324 VSNNDLNQQCAY--LEYTDGSVTLKACAGLNGMKLGGSVITAVCAFPDASS--------- 372
Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITAD--ALADDEEYEEILE 444
+A+ + G L G+ +L L + + L ++E +EIL+
Sbjct: 373 -----VAVNE---NPPFYGIPGHAKPLLGKP-KHILKLKNVVDPEDFTLLSEQEVKEILD 423
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D+R EC ++ T + R D + PG +F+EY A ++L GR + V
Sbjct: 424 DVRLECARWDTDDKMEEER-DPDDLFEPGC--IFIEYGRPEATCDAAHSLHGRLYDNRIV 480
Query: 505 NAFYYPEDKY 514
A Y ++ Y
Sbjct: 481 KAEYVSKELY 490
>gi|71996481|ref|NP_497326.2| Protein UAF-1, isoform b [Caenorhabditis elegans]
gi|351018335|emb|CCD62279.1| Protein UAF-1, isoform b [Caenorhabditis elegans]
Length = 143
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 84/161 (52%), Gaps = 18/161 (11%)
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
+++GDK L V+ A A+ Q L S G +S
Sbjct: 1 MQLGDKQLVVQLACANQQR-----------------HNTNLPNSASAIAGIDLSQGAGRA 43
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
++LCL +T D L D+EYEEILED+R+EC KYG + ++ IPRP ++ PGVGKVF
Sbjct: 44 TEILCLMNMVTEDELKADDEYEEILEDVRDECSKYGIVRSLEIPRPYEDHP-VPGVGKVF 102
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+E+ C A+ AL+GRKF TV YY DKY N+ +
Sbjct: 103 VEFASTSDCQRAQAALTGRKFANRTVVTSYYDVDKYHNRQF 143
>gi|296088195|emb|CBI35711.3| unnamed protein product [Vitis vinifera]
Length = 116
Score = 105 bits (262), Expect = 6e-20, Method: Composition-based stats.
Identities = 47/69 (68%), Positives = 55/69 (79%)
Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
+G LV+VVIPRP NG PGVGKVFLEY D G ++A+NALSGRKFGGN V+A YYPED
Sbjct: 48 HGALVHVVIPRPSPNGDLIPGVGKVFLEYSDTAGSSSARNALSGRKFGGNVVSAVYYPED 107
Query: 513 KYFNKDYSA 521
KY++ DY A
Sbjct: 108 KYYDGDYGA 116
>gi|359477752|ref|XP_002281833.2| PREDICTED: uncharacterized protein LOC100266510 [Vitis vinifera]
Length = 895
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 133/498 (26%), Positives = 197/498 (39%), Gaps = 141/498 (28%)
Query: 56 DREGIRDHDRTDRHRDYNRDKERRHRHRSRSHS-----SDRFRNRS---KSLSPS-RSPS 106
DR G R H DR+R N R S S S R R K+ SP+ RSP
Sbjct: 295 DRSG-RQHSDADRNRISNNGSSSHFRRHGGSASGLGGYSPRKRRTEAAIKTPSPTNRSPE 353
Query: 107 KSKRRSGFDMAPPAAAMLPGAAVPGQL----PGV-------PSAVPEMAQNMLPFGATQL 155
K + +G+D+ P + +V L P V PSAVP +P AT
Sbjct: 354 K--KSAGWDLPPSRTDGMNAGSVLSSLQVLKPTVSSNADELPSAVPVA----VPVTATT- 406
Query: 156 GAFPLMPV---------------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMT 200
A P +P + QATR RR+YV LP ++E+A+ + +
Sbjct: 407 -AKPPLPRIYSDAVSKNKNVSIDSIQLTQATRPMRRLYVENLPVSSSEKALMECLNNFLL 465
Query: 201 AIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN 260
+ G N ++ I+ EK A VE T E+AS A++ DGI F G +++RRP D+
Sbjct: 466 SSGINHVQGTPPCISCIIHKEKGQALVEFLTPEDASAALSFDGISFSGSILKIRRPKDF- 524
Query: 261 PTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGT 320
+ E+ +FG
Sbjct: 525 --------------------------------------------------LMEIAAAFGP 534
Query: 321 LHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
L + + D G + F Y D +VT ACA LNG+K+G + LTV +A +
Sbjct: 535 LKAYRFQVNEDLG--EPCAFLEYVDQSVTLKACAGLNGMKLGGQVLTVVQAIPNA----- 587
Query: 381 QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL----AKVLCLTEAITADALA-- 434
+A++ +G N G+ + L +VL L + D L+
Sbjct: 588 ----------------LAMENTG-NLPFYGIPEHAKPLLERPTQVLKLKNVVNPDDLSSL 630
Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA-------VGC 487
+ E EEILED+R EC ++GT+ +V I + + + T LE Y+A +GC
Sbjct: 631 SEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVST-------LEVYEAADNTGSNLGC 683
Query: 488 ATAKNALSGRKFGGNTVN 505
N++ GG T N
Sbjct: 684 DG--NSMKAETLGGGTDN 699
>gi|303279322|ref|XP_003058954.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460114|gb|EEH57409.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 559
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 73/248 (29%), Positives = 113/248 (45%), Gaps = 33/248 (13%)
Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG--NSAGPGDAVVNVYINHE 221
Q Q TR ARR+Y+GG+PP A + F + +M G N A G VV+V I HE
Sbjct: 149 QTAYAQHTRQARRLYIGGIPPGAINSDVQRFLNDLMLNSGAAINPAA-GPPVVDVKIQHE 207
Query: 222 KKFAFVEMRTVEEASNAMALDGIIF--EGVAVRVRRPTDYNPTLAAAL--------GP-- 269
K F F E ++A +A+ DG+++ G +RV RP DY+P+ + GP
Sbjct: 208 KGFGFAEFTNCDDAQSALMFDGVVYGDTGRKIRVNRPRDYDPSKNPVVIRDGLQIEGPKG 267
Query: 270 ---------GQPSPNLNLAAVGLASGAIG--------GAEGPDRVFVGGLPYYFTETQIK 312
P P A+ + +GP++++VGG TE Q +
Sbjct: 268 IGLLGEKQANAPPPWPEDLAIPAPPPLVSEWPKLPKRTPDGPNKLYVGGFDPLHTEGQTR 327
Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
++L++ G L F ++ D G + G+ FC + DP +T +A AL G + + +RA
Sbjct: 328 QVLQAIGELKSFCVMPDA-RGRNTGHVFCEFADPRLTVVAEEALTGAWCFRQPIVCKRAM 386
Query: 373 ASGQSKTE 380
E
Sbjct: 387 PDAAPAKE 394
>gi|189308116|gb|ACD86942.1| UAF-1 [Caenorhabditis brenneri]
Length = 108
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/101 (50%), Positives = 65/101 (64%), Gaps = 1/101 (0%)
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+VLCL +T D L DE+YEEILED+REEC KYG + ++ IPRP + PGVGKVF
Sbjct: 9 TEVLCLMNMVTEDELKSDEDYEEILEDVREECSKYGIVRSLEIPRP-YDEHPVPGVGKVF 67
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+E+ C A+ AL+GRKF TV YY DKY N+ +
Sbjct: 68 VEFASTSDCQRAQAALTGRKFANRTVVTSYYDVDKYHNRQF 108
>gi|413956976|gb|AFW89625.1| hypothetical protein ZEAMMB73_282398 [Zea mays]
Length = 635
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 160/372 (43%), Gaps = 45/372 (12%)
Query: 102 SRSPSK-----SKRRSGFDMAPPAA--AMLPGAAVP--GQLPGVPSAVPEMAQNMLPFGA 152
+++PSK K+ + +D P A + P +P GQ+ P + + ++
Sbjct: 53 TKTPSKVIQSPEKKSATWDQPPVKANQSNFPTTFLPTVGQMAPTPFSF-SVIKDPSTTAV 111
Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA 212
T L L V QATR RR+++ LP A E + + + + G
Sbjct: 112 TMLAGNSLTADSVQLTQATRPLRRLHIENLPDSATEDKLIDCLNDFLLSTGSKLQR-SKP 170
Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP 272
++ IN EK+ AFVE T E+A+ A++ DG G +R+RRP +Y T+ +
Sbjct: 171 CLSCTINREKRQAFVEFLTPEDATAAISFDGRSLNGSVLRIRRPKEYVETVNVTPKKAEE 230
Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
+ L S + A+ P ++F+ G+ + + E++ +FG L + + + +
Sbjct: 231 T--------ALISDVV--ADSPYKIFIAGIAGVISSKMLMEIVSAFGPLAAYRFLFNNEL 280
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
G F Y D +VT ACA LNG+ +G + LT ++ H+
Sbjct: 281 GGP--CAFLEYADRSVTSKACAGLNGMMLGGRVLT---------------AVHVFPNPHV 323
Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITAD--ALADDEEYEEILEDMREEC 450
+ A + S + L + KVL L + L E EE LED+R EC
Sbjct: 324 ---EAANEASPFYGIPDNAKLLLKEPTKVLQLKNVFEREEYMLLSKSELEETLEDVRVEC 380
Query: 451 GKYGTL--VNVV 460
++G + VNVV
Sbjct: 381 TRFGAVKSVNVV 392
>gi|389582230|dbj|GAB64785.1| RNA binding domain [Plasmodium cynomolgi strain B]
Length = 1046
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 110/495 (22%), Positives = 188/495 (37%), Gaps = 131/495 (26%)
Query: 118 PPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRV 177
PP M+ + GQL + ++ ++ L G +L L P A + AR +
Sbjct: 557 PPQLNMI----LDGQLKLDSQTIQQLCKSAL--GINELCLSSLDPT------AEKTAREL 604
Query: 178 YVGGLPPLANEQAIATFFSQVMTAIGGNSAGPG--------------------------- 210
YVG +P + Q I F ++ + + +G
Sbjct: 605 YVGNIPQHIDVQEIVKFLNKCLLILYNKESGSEAENGQGDEQEREKKQEQGNEQEEGSLS 664
Query: 211 -----------------------DAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE 247
D + I + +AFVE RT+++ SN M L+GI F
Sbjct: 665 QSQNQSQNQSLSQSQNQGQGQCEDICLKACIRGDTHYAFVEFRTLQDTSNCMLLNGINFY 724
Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA------EGPDRVFVGG 301
G +R+ RP + P +L P P ++ + L+ G IG + +++
Sbjct: 725 GNNLRIGRPKTF-PAELTSLIPAPTIPTID--SYYLSQGIIGLQAFAVFFQNEEKMKNAY 781
Query: 302 LPYYFTETQ---------------IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
LP + Q IKELLE+FG + F+ + D+ +
Sbjct: 782 LPMSMIKLQKLCVSNISKNNERGKIKELLEAFGEIRNFECFEGDDSTD------------ 829
Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
T IA + + + S + + E E + + +K +G+
Sbjct: 830 --TYIALVEYTTSENAIQAQKILNQNTSYKIQFEYEILNDPLINRLIKRKYMSSENGI-- 885
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP---- 462
L + + + L+ T D L D EY +I+ED++ EC K+G++V VV+P
Sbjct: 886 ------LSQQIPTRTVVLSRIATFDELCDPSEYRDIVEDIKIECEKFGSVVEVVLPVFSR 939
Query: 463 ---------------RPDQN--GGETPGVGKVFLEYYDAVGCAT-AKNALSGRKFGGNTV 504
+ D+ + +G F+ Y++ + AT A+ LSGRKFG N +
Sbjct: 940 ETFDFLLREAAKCAAKEDRTHPNYDLTSIGCAFI-YFETIEAATKARKELSGRKFGANII 998
Query: 505 NAFYYPEDKYFNKDY 519
A YY E K+ K++
Sbjct: 999 EANYYSEKKFLLKNF 1013
>gi|395334381|gb|EJF66757.1| splicing factor CC1-like protein [Dichomitus squalens LYAD-421 SS1]
Length = 624
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 152/350 (43%), Gaps = 42/350 (12%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
AR V+V L + + FF +G N+ V + K +VE R+VE
Sbjct: 292 ARSVFVSQLAARLTARDLGYFFED---KLGENTVMDSRIVTDRISRRSKGIGYVEFRSVE 348
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
A+AL G + G+ ++++ + L PG NLNL + G
Sbjct: 349 LVDKAIALSGTVVMGLPIQIQ----HTEAERNRLHPG--DGNLNLPP------GVSAPHG 396
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
+++VG L + +E+ IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 397 GMQLYVGSLHFNLSESDIKQVFEPFGELEFVDLHRDPVTGRSKGYAFVQYKRAEDAKMAL 456
Query: 354 AALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGMN 405
++G ++ +TL V G ++ T+Q+S+ L A + +QK+A ++
Sbjct: 457 EQMDGFELAGRTLRVNTVHEKGSARYTQQDSLDEAGGGNLNAASRQALMQKLAR----ID 512
Query: 406 TLGGGMSLFGE-TLAKVLCLTEAITADALADDEEYE-----EILEDMREEC-GKYGTLVN 458
M + + + + + +EE E ++ ED++ EC KYG ++
Sbjct: 513 PTPAKMEPIARPNIPQTMQSRSVLMKNMFNPEEETERDWDKDLAEDVKGECESKYGRVLA 572
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + + Q G++++++ A L+GR FGG + A +
Sbjct: 573 IKVEKESQ--------GEIYVKFETVDAAKNAIEGLNGRWFGGRQITAAF 614
>gi|390604396|gb|EIN13787.1| splicing factor CC1-like protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 433
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 91/350 (26%), Positives = 158/350 (45%), Gaps = 35/350 (10%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
AR V+V L + + FF +G + V + K +VE++T+
Sbjct: 101 EARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDARIVTDRLSRRSKGIGYVELKTI 157
Query: 233 EEASNAMALDGIIFEGVAVRVRR-PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
E A+ L G + G+ ++V+ + N T A G G +LNL + G
Sbjct: 158 ELVDQAINLSGTVVMGLPIKVQHTEAERNRTHA---GDG----SLNLPP------GVSGT 204
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
GP +++VG L + TE+ IK++ E FG L DL +D TG SKGY F Y+ +
Sbjct: 205 HGPRQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYCFIQYKRAEDAKM 264
Query: 352 ACAALNGLKMGDKTLTVRRATASGQSK-TEQESI------LAQAQQHIAIQKMA-LQTSG 403
A + G ++ +TL V G K T+QES+ L A + +QK+A ++ +
Sbjct: 265 ALEQMEGFELAGRTLRVNTVHEKGTVKYTQQESLEENGGNLNAASRQALMQKLARIEPAR 324
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEIL-EDMREEC-GKYGTLVNVVI 461
+ +TL L + + A ++++++ L +D++ EC KYG + + +
Sbjct: 325 APVETVSKPVITQTLQSKSVLLKNMFDPAEETEKDWDKDLADDVKVECENKYGMVNFIKV 384
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
+ Q G++++++ A L+GR FGG V A + P+
Sbjct: 385 DKESQ--------GEIYVKFDTVDSAKKAIEGLNGRYFGGRQVTATFIPD 426
>gi|393218616|gb|EJD04104.1| splicing factor, CC1-like protein [Fomitiporia mediterranea MF3/22]
Length = 464
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 154/351 (43%), Gaps = 44/351 (12%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
AR V+V L + + FF +G NS V + K A+VE ++
Sbjct: 133 EARSVFVSQLAARMTARDLGYFFED---KLGDNSVLDVRIVTDRISRRSKGIAYVEFGSI 189
Query: 233 EEASNAMALDGIIFEGVAVRVRR-PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
E A++L G I G+ + ++ + N T A G S NL A G GA
Sbjct: 190 ELVDKAISLTGTIVMGLPIMIQHTEAERNKTHA-----GDGSINLPPGASG--RGAT--- 239
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
++VG L + TE+ IK++ E FG L DL KD TG SKGY F Y+ +
Sbjct: 240 -----LYVGSLHFNLTESDIKQVFEPFGELDFVDLHKDSATGRSKGYAFIHYKRAEDAKM 294
Query: 352 ACAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSG 403
A + G ++ +TL V GQ++ + Q+S+ L A + +QK+A S
Sbjct: 295 ALEQMEGFELAGRTLRVNTVHEKGQTRISTQDSLDESGGGNLNAASRQALMQKLARIDSA 354
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREEC-GKYGTLV 457
T + T+A+ + + + +EE E ++ ED++ EC KYG +
Sbjct: 355 PVT---QQPIMKPTVAQPMTSKSVLMRNMFDPEEETEPAWDKDLAEDVKTECQAKYGRVQ 411
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
++ + E G++++++ A N L+GR FGG ++A +
Sbjct: 412 HIKV--------EKDSEGEIYVQFDTVDAAKAAINGLNGRWFGGKQISATF 454
>gi|108706081|gb|ABF93876.1| RNA recognition motif family protein, expressed [Oryza sativa
Japonica Group]
Length = 704
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 106/204 (51%), Gaps = 12/204 (5%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
V QATR RR+++ LP LA E + ++ ++++ + ++ IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AFVE T E+A+ A++ DG F G ++++RRP +Y A + P +PS + L + +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
A+ P ++F+ G+ + + E++ SFG L + + + G + F Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGA--CAFLEY 632
Query: 344 QDPAVTDIACAALNGLKMGDKTLT 367
D ++T ACA LNG+K+G LT
Sbjct: 633 IDHSITSKACAGLNGMKLGGGILT 656
>gi|312085420|ref|XP_003144672.1| U2af splicing factor protein 1 [Loa loa]
Length = 143
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 88/162 (54%), Gaps = 20/162 (12%)
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN-TLGGGMSLFGET 417
+++GDK L V+ + A+ ++ Q + +Q +G++ + G G
Sbjct: 1 MQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVAGIDLSHGAGPP----- 43
Query: 418 LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKV 477
+VLCL +T D L DDEEYE+ILED+REEC KYG + ++ IPR G + GVGKV
Sbjct: 44 -TEVLCLMNMVTEDELKDDEEYEDILEDIREECAKYGIVKSLEIPR-SVPGVDVTGVGKV 101
Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
F+E+ C A+ AL+GRKF TV YY D Y + +
Sbjct: 102 FVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYHRRQF 143
>gi|108706082|gb|ABF93877.1| RNA recognition motif family protein, expressed [Oryza sativa
Japonica Group]
Length = 720
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 106/204 (51%), Gaps = 12/204 (5%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
V QATR RR+++ LP LA E + ++ ++++ + ++ IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AFVE T E+A+ A++ DG F G ++++RRP +Y A + P +PS + L + +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
A+ P ++F+ G+ + + E++ SFG L + + + G + F Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGA--CAFLEY 632
Query: 344 QDPAVTDIACAALNGLKMGDKTLT 367
D ++T ACA LNG+K+G LT
Sbjct: 633 IDHSITSKACAGLNGMKLGGGILT 656
>gi|82794077|ref|XP_728296.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23484570|gb|EAA19861.1| KED [Plasmodium yoelii yoelii]
Length = 858
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/334 (24%), Positives = 145/334 (43%), Gaps = 50/334 (14%)
Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
FG +LG + + A + AR +YVG +P + Q I F + + +
Sbjct: 396 FGIPELG------LSTIDANAEKTARELYVGNIPQNIDIQEIVKFLNTCLLILYNKENEN 449
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR----PTDYNPTLAA 265
+ + I + ++AFVE R++++ SN M L+GI F +R+ R P +Y +
Sbjct: 450 ENICLKACIRGDTRYAFVEFRSLQDTSNCMLLNGIYFYTNNLRIGRPKTFPIEYTKLIPP 509
Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPD----------------RVFVGGLPYYFTET 309
A P + L+ VG+ + AI + ++ V +
Sbjct: 510 ATIPTIDTYYLSQGLVGIKAFAIFHQNKDENKNEYHHLPVDMIKLQKLCVSNISKNNETN 569
Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
+IKELLE+FG + F+ + + NS Y I N ++ + +
Sbjct: 570 KIKELLEAFGDIQTFEFFEGEE--NSDTY------------ICLVEYNNIENAIQAHKIL 615
Query: 370 RATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL-AKVLCLTEAI 428
S + + E E IL + ++K +Q S+ + + KV+ L++
Sbjct: 616 NQNTSYKIQFEYE-ILNDPTINQLVKKKYMQNKN--------SILSQQIPTKVIVLSKIA 666
Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
T D L++ E+Y+EI ED++ EC KYG+++ VV+P
Sbjct: 667 TFDELSNPEDYKEISEDIKIECEKYGSVIEVVLP 700
Score = 39.3 bits (90), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 25/47 (53%)
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G F+ + G + LSGRKFG N + A YY E K+ K++
Sbjct: 779 SIGCAFIYFETIEGATKTRKELSGRKFGANIIEANYYSEKKFIMKNF 825
>gi|170083917|ref|XP_001873182.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164650734|gb|EDR14974.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 448
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/350 (26%), Positives = 152/350 (43%), Gaps = 41/350 (11%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
AR V+V L + + FF +G + V + K +VE RT+
Sbjct: 116 EARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDSRIVTDRLSRRSKGIGYVEFRTI 172
Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
+ A+AL G + G+ + V+ + L PG S NL V + GAI
Sbjct: 173 DHVEKALALSGTVVMGLPIMVQ----LTESERNKLHPGDGSLNLP-PGVTASHGAI---- 223
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
++VG L + TE+ IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 224 ----LYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRSEDARMA 279
Query: 353 CAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGM 404
+ G ++ +TL V G ++ T+Q+S+ L A + +QK+A +T
Sbjct: 280 LEQMEGFELAGRTLRVNTVHEKGTARYTQQDSLDEAGGGNLNAASRQALMQKLA-RTEAP 338
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREEC-GKYGTLVN 458
T ++ + + + + + +EE E E+ +D++ EC KYG +
Sbjct: 339 PTFTEPVAR--PNIPQAMQSRSVLLKNMFDPEEETERDWDKELADDVKVECENKYGKVEA 396
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + R Q G+++L++ A L+GR FGG V+A +
Sbjct: 397 IKVERETQ--------GEIYLKFDSIESAKQAIQGLNGRWFGGRQVSAAF 438
>gi|209881578|ref|XP_002142227.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
gi|209557833|gb|EEA07878.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
Length = 533
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 127/500 (25%), Positives = 203/500 (40%), Gaps = 93/500 (18%)
Query: 76 KERRH--RHRSRSHSSDRF----RNRSKSLSPSRS--PSKSKRRSGFDMAPPAAAMLPGA 127
KE H R RSRS S+D N +S+SP RS S+ KRR G G
Sbjct: 69 KEIYHIPRRRSRSPSTDNNDKEESNTGRSISPLRSDDSSRKKRRRGSS----------GW 118
Query: 128 AVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLAN 187
P P V +P M Q + +G+ + P QV TQ A R+YVG L +
Sbjct: 119 DAPFN-PDVSQLMPNMKQANV---GQMMGSSNIAPRQV-TQGA-----RIYVGSLDYSLS 168
Query: 188 EQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMA-LDGIIF 246
E + F T + + G N K F F+E T E A A+A ++ +
Sbjct: 169 EADLRQVFGSFGTIVNIDMPREG--------NRSKGFCFIEYTTQESAEMALATMNRFVL 220
Query: 247 EGVAVRVRRPTD----YNPTLAAALG--------PGQPSPNLNLAAVGLASGAIGGAEGP 294
+G ++V RPT+ N ++G P P N N + I
Sbjct: 221 KGRPIKVGRPTNAIVSNNQNNNNSMGNHTGMVGMPVLPPENTN---ANIPPHQIPQNPPQ 277
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT-GNSKGYGFCVYQDPAVTDIAC 353
+R+++G +PY FT ++ + ++FG + L+ + G +GYGF + P +A
Sbjct: 278 NRIYIGSVPYSFTPDDLRHIFKAFGVILSCQLIPSVEKPGTHRGYGFIEFGTPDQAKLAI 337
Query: 354 AALNGLKMGDKTLTVRRATA---SGQSKTEQESILAQAQQHIAIQKM--ALQTSGMNTLG 408
+NG ++G K L V ATA S + Q I++ Q++ Q++ L +
Sbjct: 338 ETMNGFEVGGKQLKVNVATALKPSNSISSNQIPIVSPTLQNVMSQQIPPTLAIPPTMAIP 397
Query: 409 GGMSLFGETL-----------------------------AKVLCLTEAITADALADDEEY 439
+S+ T + V+ LT I + + D
Sbjct: 398 PVLSMPNVTPLPPNLYQPPNIPVPYPANSYPIIPNSTSNSNVILLTNMIGPEEVDD---- 453
Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
E+ E+++ EC KYG + +V I D + V ++F+ + A AL+ R F
Sbjct: 454 -ELKEEVKIECSKYGKVYDVRIHISDHVSKPSDRV-RIFVVFETNTMAQIAVPALNNRWF 511
Query: 500 GGNTVNAFYYPEDKYFNKDY 519
GGN V Y +++++ Y
Sbjct: 512 GGNQVYCRLYNTERFYSSFY 531
>gi|124511860|ref|XP_001349063.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
gi|23498831|emb|CAD50908.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
Length = 1125
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 146/343 (42%), Gaps = 50/343 (14%)
Query: 139 AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQV 198
++ + +N+L L P + V + + AR +YVG +P + Q I + +
Sbjct: 623 SLQNLYKNVLNINDLNL---PTIDVNI-----EKTARELYVGNIPQHIDIQEIVKYLNSC 674
Query: 199 MTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP-- 256
+ + + + I + +AFVE R +++ SN M L+GI F G +R+ RP
Sbjct: 675 LLILYNKENENENICLKACIRGDTHYAFVEFRNIQDTSNCMLLNGINFYGNNLRIGRPKT 734
Query: 257 --TDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ---- 310
+Y+ + A P + L+ +GL S I + +++ GLP + Q
Sbjct: 735 FPIEYHSLIPQATIPAIDNYYLSQGLIGLRSFIIF-CKNEEKMKNDGLPVNMIKLQKLCV 793
Query: 311 -----------IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
IKELLE+FG + F+ YG + + T I+
Sbjct: 794 SNISKNNDTSKIKELLEAFGEIKNFEFF----------YG----DETSDTYISLVEYVNT 839
Query: 360 KMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
+ + + S + + E E I HI ++ M T +SL +
Sbjct: 840 ENAIQAHKILNQNTSYKIQFEHEII---NDPHIN---NIIKNKYMKTENSILSL--QVPT 891
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
KV+ L + T + L+D EY++I+ED++ EC KYG + VV+P
Sbjct: 892 KVIVLNKIATFEELSDSSEYKDIVEDIKIECDKYGKTLEVVLP 934
Score = 39.3 bits (90), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 26/47 (55%)
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G F+ + + A+ LSGRKFG N + A Y+ E K+ K++
Sbjct: 1046 SIGCAFIHFENIESATKARKELSGRKFGANIIEANYFSEKKFLMKNF 1092
>gi|403412344|emb|CCL99044.1| predicted protein [Fibroporia radiculosa]
Length = 599
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 145/349 (41%), Gaps = 40/349 (11%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
AR V+V L + + FF +G S V + K +VE R+VE
Sbjct: 267 ARSVFVSQLAARLTARDLGYFFED---KLGEGSVMDSRIVTDRISRRSKGIGYVEFRSVE 323
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
A+ L G + G+ ++++ + L PG NLNL + G
Sbjct: 324 LVDKALGLSGTVVMGLPIQIQ----HTEAERNRLHPG--DGNLNLPP------GVSAPHG 371
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
+++VG L + TE+ IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 372 GMQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRAEDARMAL 431
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQ--------AQQHIAIQKMALQTSGMN 405
+ G ++ +TL V G +K Q+ L + A + +QK+A
Sbjct: 432 EQMEGFELAGRTLRVNTVHEKGTTKYAQQDSLDEAGGGNLNAASRQALMQKLARTDQPAV 491
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREEC-GKYGTLVNV 459
L + + + + + + +EE E ++ ED++ EC KYG + +
Sbjct: 492 KLP---PVTKPNIPQSMQSRSVLLKNMFNPEEETERDWDKDLAEDVKGECEDKYGKVEFI 548
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ R Q G++++++ A L GR FGGN V+A +
Sbjct: 549 KVERESQ--------GEIYVKFDSIESAKNAIQGLHGRWFGGNQVSAAF 589
>gi|449551106|gb|EMD42070.1| hypothetical protein CERSUDRAFT_90674 [Ceriporiopsis subvermispora
B]
Length = 623
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 166/379 (43%), Gaps = 49/379 (12%)
Query: 159 PLMPVQVMTQQATRH----ARRVYVGGLPPLANEQAIATFF-----------SQVMT-AI 202
P++ V M + R AR V+V L + + FF S+++T I
Sbjct: 255 PIVDVDPMNPEEPREDDSEARSVFVSQLAARLTARDLGYFFEDKLGEGSVMDSRIVTDRI 314
Query: 203 GGNSAGPGDAV--VNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN 260
S G + +N + + +VE RTVE A+AL G + G+ ++++ +
Sbjct: 315 SRRSKGLLLIISRINTSLTFCLRIGYVEFRTVELVDKAIALSGTVVMGLPIQIQ----HT 370
Query: 261 PTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGT 320
L PG NLNL + + G +++VG L + TE+ IK++ E FG
Sbjct: 371 EAERNRLHPG--DGNLNLPP------GVSASHGGMQLYVGSLHFNLTESDIKQVFEPFGE 422
Query: 321 LHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK-T 379
L DL +D TG SKGY F Y+ +A + G ++ +TL V G + T
Sbjct: 423 LEFVDLHRDPMTGRSKGYAFVQYKRSEDARMALEQMEGFELAGRTLRVNTVHEKGTIRYT 482
Query: 380 EQESI-------LAQAQQHIAIQKMALQTSGMNTLGGGM--SLFGETLAKVLCLTEAITA 430
+Q+S+ L A + +QK+A + T + ++ ++ + L
Sbjct: 483 QQDSLDEAGGGNLNAASRQALMQKLARTDQTVITPPPVVRPNIPQTMQSRSVLLKNMFNP 542
Query: 431 DALADDEEYEEILEDMREEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
+ + + +++ ED++ EC KYG + + + + Q G++++++
Sbjct: 543 ENETERDWDKDLAEDVKYECEDKYGKVEFIKVEKDSQ--------GEIYVKFDSVESAKN 594
Query: 490 AKNALSGRKFGGNTVNAFY 508
A L+GR FGGN V+A +
Sbjct: 595 AIQGLNGRWFGGNQVSAGF 613
>gi|432090458|gb|ELK23883.1| Splicing factor U2AF 65 kDa subunit [Myotis davidii]
Length = 423
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 46/84 (54%), Positives = 59/84 (70%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D VTD A A
Sbjct: 263 KLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAG 322
Query: 356 LNGLKMGDKTLTVRRATASGQSKT 379
LNG+++GDK L V+RA+ ++ T
Sbjct: 323 LNGMQLGDKKLLVQRASVGAKNAT 346
Score = 58.5 bits (140), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 60/108 (55%), Gaps = 17/108 (15%)
Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+E + AMA G ++++
Sbjct: 4 FFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMA-------GQSLKI 55
Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGG 301
RRP DY P PG S N ++ G+ S + + ++F+GG
Sbjct: 56 RRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGG 94
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
C KYG + ++ IPRP +G E PG GK+F+E+ C A L+GRKF V
Sbjct: 362 CSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAVKGLTGRKFANRVV 415
>gi|68076889|ref|XP_680364.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56501286|emb|CAI04220.1| conserved hypothetical protein [Plasmodium berghei]
Length = 652
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 146/332 (43%), Gaps = 48/332 (14%)
Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
FG ++LG + + A + AR +YVG +P + Q I F + + +
Sbjct: 195 FGISELG------LSTIDANAEKTARELYVGNIPQNIDIQEIVKFLNTCLLILYNKENEN 248
Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP----TDYNPTLAA 265
+ + I + ++AFVE R++++ SN M L+GI F +R+ RP +Y +
Sbjct: 249 ENICLKACIRGDTRYAFVEFRSLQDTSNCMLLNGIYFYTNNLRIGRPKTFPIEYTKLIPP 308
Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPD--------------RVFVGGLPYYFTETQI 311
A P + L+ +G+ + AI + ++ V + +I
Sbjct: 309 ATIPTIDTYYLSQGLIGIKAFAIFHQNKDETKNEYIPVDMIKLQKLCVSNISKNNETNKI 368
Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
KELLE+FG + GF+ + + NS Y I N ++ + +
Sbjct: 369 KELLEAFGEIQGFEFFEGEE--NSDTY------------ICLVEYNNVENAIQAHKILNQ 414
Query: 372 TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL-AKVLCLTEAITA 430
S + + E E IL + ++K +Q S+ + + KV+ L++ T
Sbjct: 415 NTSYKIQFEYE-ILNDPIINQLVKKKYMQNKN--------SILSQQIPTKVIVLSKIATF 465
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIP 462
+ L++ E+Y+EI ED++ EC KYG ++ VV+P
Sbjct: 466 EELSNPEDYKEISEDIKIECEKYGPVLEVVLP 497
>gi|313244755|emb|CBY15469.1| unnamed protein product [Oikopleura dioica]
Length = 2588
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 110/412 (26%), Positives = 169/412 (41%), Gaps = 85/412 (20%)
Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY---INH 220
Q+ Q+A ++YVG + E I F + GP ++ Y N
Sbjct: 2208 QLQRQRAVAIMCKIYVGSIYYEIGEATIRQSFE---------TFGPVRSIDMSYDQGTNR 2258
Query: 221 EKKFAFVEMRTVEEASNAMA-LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
K F F+E E A A+ + I G AV+V R ++ GQ + +A
Sbjct: 2259 HKGFCFLEFECPEAAFLALEHMQSITIGGRAVKVGRLSNI----------GQVAAQHFIA 2308
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G A RV++ + +T IK + ESFG + LVK+ DTG K YG
Sbjct: 2309 QFG------NEAAKYHRVYIANIHVNIVDTDIKAVFESFGRVLSCQLVKNVDTGRHKNYG 2362
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV------------------RRATASGQSKTEQ 381
F Y + A +A+NG +G + + V +TA +K Q
Sbjct: 2363 FVEYDNSQSMKEAISAMNGFDLGGQCIRVGPCVVPPSMHNIPTVAPGNASTALSGAKAVQ 2422
Query: 382 ESILAQAQQHIAIQK-----------MALQT--------------SGMNTLGGG------ 410
E +L + ++ + + +A++ SG + GG
Sbjct: 2423 E-MLKKKKKDSGVNRPSPPKDSMADVLAIREELNKKARNPTCDDDSGQMKITGGAQRNMI 2481
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-DQNGG 469
M + V+ L ++AD L DEE E ++ +EC +YG ++ VVI + D+
Sbjct: 2482 MRKLMTRRSNVVVLKNMLSADDL--DEEVES---EVTQECSQYGNVLRVVIYQEVDRLAP 2536
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
+ KVF+++ DA G TAK LSGR F G +NA Y E + KDYSA
Sbjct: 2537 GCEPIVKVFVQFTDADGAETAKKELSGRFFAGRKINAQSYDETAFEMKDYSA 2588
>gi|392597434|gb|EIW86756.1| splicing factor CC1-like protein [Coniophora puteana RWD-64-598
SS2]
Length = 360
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 151/350 (43%), Gaps = 41/350 (11%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
AR V+V L + + FF +G + V + K +VE R++
Sbjct: 28 EARSVFVSQLAARLTARDLGYFFED---KLGEGTVLDSRIVTDRISRRSKGIGYVEFRSI 84
Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
+ A+ L G + G+ + V+ + L PG NLNL +
Sbjct: 85 DLVEKALGLSGTVVMGLPIMVQ----LTESERNRLHPG--DGNLNLPP------GVHAPH 132
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
G +++VG L + TE IK++ E FG L DL +D TG SKGY F Y+ P +A
Sbjct: 133 GAMQLYVGSLHFNLTEADIKQVFEPFGDLEFVDLHRDSTTGRSKGYAFVQYKRPEDAKMA 192
Query: 353 CAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGM 404
++G ++ +TL V G ++ T+Q+S+ L A + +QK+A +
Sbjct: 193 LEQMDGFELAGRTLRVNTVHEKGTARYTQQDSLEETGGGNLNAASRQALMQKLA----RI 248
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREECG-KYGTLVN 458
T + ++ + + + + +EE E E+ ED++ EC KYG +
Sbjct: 249 ETPTPAEPVSRPSIPQAMQSRSVLMKNMFDPEEETERDWDKELAEDVKGECQEKYGKVEA 308
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + + Q G++++++ A AL+GR FGG ++A +
Sbjct: 309 IKVEKETQ--------GEIYVKFATIDSAKEAVQALNGRWFGGRQISAVF 350
>gi|336389603|gb|EGO30746.1| hypothetical protein SERLADRAFT_455043 [Serpula lacrymans var.
lacrymans S7.9]
Length = 583
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 143/351 (40%), Gaps = 45/351 (12%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
AR V+V L + + FF +G + V + K +VE R+++
Sbjct: 252 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDSRIVTDRLSRRSKGIGYVEFRSID 308
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
A++L G + G+ V V+ + L PG NLNL + G
Sbjct: 309 MVEKAISLSGTVVMGLPVMVQ----LTESERNKLHPG--DGNLNLPP------GVSAPHG 356
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
+++VG L + TE+ IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 357 AMQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRAEDARMAL 416
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQ--------AQQHIAIQKMAL------ 399
+ G ++ +TL V G ++ Q+ L + A + +QK+A
Sbjct: 417 EQMEGFELAGRTLRVNTVHEKGTARYAQQDTLDEAGGGNLNAASRQALMQKLARIEPIPK 476
Query: 400 -QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLV 457
T+ T+ M L + E D D + ED++ EC KYG +
Sbjct: 477 PPTNNKPTIPQAMQSRSVLLKNMFDPEEETERDWDKD------LAEDVKGECEDKYGQVD 530
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + + Q GK+++++ A L+GR FGG V+A +
Sbjct: 531 AIKVEQETQ--------GKIYVKFNSIDSAKNAIQGLNGRWFGGRQVSAGF 573
>gi|85001331|ref|XP_955384.1| RNA splicing factor [Theileria annulata strain Ankara]
gi|65303530|emb|CAI75908.1| RNA splicing factor, putative [Theileria annulata]
Length = 643
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 57/314 (18%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE T E A+++ G+ +G +RV + AA AA
Sbjct: 359 KGIAYVEFYTQESVIKALSMTGMSMKGQGIRVH-SSQAEKNRAAK------------AAK 405
Query: 282 GLASGAIGGAEGPDRVFVG---GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
L A+ ++ P + V G+ Y E ++ +L FG + L + D G SKGY
Sbjct: 406 QLQDNALKESDNPTTIVVSNLLGVLSYLNEIELNQLFSPFGNIIDVALAR-TDNGESKGY 464
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI-------------- 384
+ ++ A +NG + + + V A K+ S+
Sbjct: 465 AYIRFKRWNEAKEALNVMNGFDINGQQIKVAYANTRKDPKSRLHSLGDLDMERLDDDDAG 524
Query: 385 -LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEY-EEI 442
++ + IA+ K Q +N+ L L+ T+ AD+ E+ +EI
Sbjct: 525 LISGSNVKIALMKKLQQRQPLNSSN-------------LVLSNMYTSADYADNHEFFDEI 571
Query: 443 LEDMREECGKYGTLVNVVIPR--PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
ED++EECGKYGT+V V + R PD GKV++++ + +A +L GR F
Sbjct: 572 EEDVKEECGKYGTVVQVFVNRRNPD---------GKVYVKFKNNDDAQSANKSLQGRYFA 622
Query: 501 GNTVNAFYYPEDKY 514
GNT+ Y +D+Y
Sbjct: 623 GNTIQVSYISDDQY 636
>gi|5822501|pdb|2U2F|A Chain A, Solution Structure Of The Second Rna-Binding Domain Of
Hu2af65
Length = 85
Score = 95.5 bits (236), Expect = 5e-17, Method: Composition-based stats.
Identities = 45/82 (54%), Positives = 58/82 (70%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY FC Y D VTD A A
Sbjct: 3 KLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAG 62
Query: 356 LNGLKMGDKTLTVRRATASGQS 377
LNG+++GDK L V+RA+ ++
Sbjct: 63 LNGMQLGDKKLLVQRASVGAKN 84
>gi|402217675|gb|EJT97754.1| splicing factor CC1-like protein [Dacryopinax sp. DJM-731 SS1]
Length = 640
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 151/350 (43%), Gaps = 38/350 (10%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
R V+V L + + FF + +G S V + K A+VE+ +++
Sbjct: 303 RSVFVSQLAARLTARDLGYFFEE---KLGEGSVRDVRIVTDRVSRRSKGIAYVELSSIDM 359
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
S A+AL G I G+ + V+ T+ AA G +++ L G G
Sbjct: 360 VSRAIALTGTIVMGLPIMVQL-TESERNKVAASG----------SSMHLPPGVTAPPPGS 408
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
+++VG L + TE+ +K++ E FG L DL +D TG SKG+ F Y+ +A
Sbjct: 409 MQLYVGSLHFNLTESDVKQVFEPFGELEFVDLHRDPLTGRSKGFAFVQYKRSEDARMALQ 468
Query: 355 ALNGLKMGDKTLTVRRATASG-------QSKTEQES---ILAQAQQHIAIQKMALQTSGM 404
+++G + + L V G QS + ES L A + +QK+A
Sbjct: 469 SMDGFDLAGRQLKVNTVHEKGGAIRYQSQSDSLDESGGGNLNAASRQALMQKLARIEPPK 528
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEE-----YEEILEDMREEC-GKYGTLVN 458
+ SL + L + + +++E +E+ +D+++EC KYG LV+
Sbjct: 529 PAISPMASLPKAAMQSRSVLLRNMFKEPELEEKENGPNWAKELTDDVKQECEDKYG-LVD 587
Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ PD G+++L++ + A L+GR FGG + A +
Sbjct: 588 FIKLEPDSQ-------GEMYLKFKSIEAASKAIEGLNGRYFGGQPIQATF 630
>gi|336376609|gb|EGO04944.1| hypothetical protein SERLA73DRAFT_174031 [Serpula lacrymans var.
lacrymans S7.3]
Length = 583
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 143/351 (40%), Gaps = 45/351 (12%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
AR V+V L + + FF +G + V + K +VE R+++
Sbjct: 252 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDSRIVTDRLSRRSKGIGYVEFRSID 308
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
A++L G + G+ V V+ + L PG NLNL + G
Sbjct: 309 MVEKAISLSGTVVMGLPVMVQ----LTESERNKLHPG--DGNLNLPP------GVSAPHG 356
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
+++VG L + TE+ IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 357 AMQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRAEDARMAL 416
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQ--------AQQHIAIQKMAL------ 399
+ G ++ +TL V G ++ Q+ L + A + +QK+A
Sbjct: 417 EQMEGFELAGRTLRVNTVHEKGTARYAQQDTLDEAGGGNLNAASRQALMQKLARIEPIPK 476
Query: 400 -QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLV 457
T+ T+ M L + E D D + ED++ EC KYG +
Sbjct: 477 PPTNNKPTIPQAMQSRSVLLKNMFDPEEETERDWDKD------LAEDVKGECEDKYGQVD 530
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + + Q G++++++ A L+GR FGG V+A +
Sbjct: 531 AIKVEQETQ--------GEIYVKFNSIDSAKNAIQGLNGRWFGGRQVSAGF 573
>gi|409083550|gb|EKM83907.1| hypothetical protein AGABI1DRAFT_110515 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 563
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 149/347 (42%), Gaps = 37/347 (10%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
AR V+V L + + FF +G + V + K +VE RT+E
Sbjct: 232 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDARIVTDRLSRRSKGIGYVEFRTIE 288
Query: 234 EASNAMALDGIIFEGVAVRVR-RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
A+ L G + G+ + V+ + N T A G S NL V GAI
Sbjct: 289 LVEKAIGLSGTVVMGLPIMVQLTEAERNKTHA-----GDGSINLP-PGVSAPHGAI---- 338
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
++VG L + TE+ IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 339 ----LYVGSLHFNLTESDIKQVFEVFGELEFVDLHRDAMTGRSKGYAFVQYKRAEDARMA 394
Query: 353 CAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMAL--QTS 402
+ G ++ +TL V G +K T+Q+S+ L A + +QK+A Q +
Sbjct: 395 LQQMEGFELAGRTLRVNTVHEKGTTKYTQQDSLDESGGGNLNAASRQALMQKLARTDQPA 454
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLVNVVI 461
++ ++ + L D + + E+ +D++ EC KYG ++ + +
Sbjct: 455 PRPEPVQRPNIPQAMQSRSVLLKNMFDPDEETEKDWDRELAQDVKGECESKYGKVLAIKV 514
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ Q G++++++ A L+GR FGG V+A +
Sbjct: 515 EKDSQ--------GEIYVKFDSIDYAQKAIQGLNGRWFGGRQVSAVF 553
>gi|47223170|emb|CAG11305.1| unnamed protein product [Tetraodon nigroviridis]
Length = 515
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 72/267 (26%), Positives = 113/267 (42%), Gaps = 47/267 (17%)
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G+ GP R++VG L + TE ++ + E FG + G L+ D +TG SKGYGF + D
Sbjct: 239 GSSGPMRLYVGSLHFNITEEMLRGIFEPFGKIEGIQLMMDSETGRSKGYGFISFADAECA 298
Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQE--------------------SILAQAQ 389
A LNG ++ + + V T S T ++A+
Sbjct: 299 KKALEQLNGFELAGRPMKVGHVTERSDSSTASSILDNDELERTGIDLGTTGRLQLMARLA 358
Query: 390 QHIAIQ-----KMALQTSG---MNTLGG-----------GMSLFGETLAK-VLCLTEAIT 429
+ ++ + ALQ +G T+GG ++L + LA L L+
Sbjct: 359 EGTGLKIPPAAQQALQMTGSMSFPTIGGPPAVPTPSPSQALNLPAQPLATHCLQLSNLFN 418
Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
A D EI +D+ EEC K+G +V++ + D+N + G V+++
Sbjct: 419 PQAENDPSWAVEIQDDVIEECNKHGGVVHIYV---DKNSAQ----GNVYVKCPSIPAAMA 471
Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFN 516
NAL GR F G + A Y P Y N
Sbjct: 472 TVNALHGRWFAGKMITAAYVPLPTYHN 498
>gi|392571432|gb|EIW64604.1| splicing factor CC1-like protein [Trametes versicolor FP-101664
SS1]
Length = 344
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/349 (26%), Positives = 149/349 (42%), Gaps = 40/349 (11%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
AR V+V L + + FF +G S V + K +VE RTVE
Sbjct: 12 ARSVFVSQLAARLTARDLGYFFED---KLGEGSVMDSRIVTDRISRRSKGIGYVEFRTVE 68
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
A+ L G + G+ ++++ + L PG NLNL + G
Sbjct: 69 LVDRAIGLSGTVVMGLPIQIQ----HTEAERNRLHPG--DGNLNLPP------GVSAPHG 116
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
+++VG L + TE+ IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 117 GMQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRAEDAKMAL 176
Query: 354 AALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGMN 405
+ G ++ +TL V G ++ T+Q+++ L A + +QK+A S
Sbjct: 177 EQMEGFELAGRTLRVNTVHEKGSTRYTQQDTLDEAGGGNLNAASRQALMQKLARTDSAPV 236
Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREECG-KYGTLVNV 459
L + + + + + + +EE E ++ +D++ EC KYG + +
Sbjct: 237 KLE---PVARPHIPQTMQSRSVLLKNMFNPEEETERDWDKDLADDVKSECATKYGPVQAI 293
Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + ET G V E DA G A L+GR FGG ++A +
Sbjct: 294 KVEK------ETQGEIYVLFETVDAAGQAI--EGLNGRWFGGRQISAAF 334
>gi|71026268|ref|XP_762815.1| splicing factor [Theileria parva strain Muguga]
gi|68349767|gb|EAN30532.1| splicing factor, putative [Theileria parva]
Length = 644
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 128/302 (42%), Gaps = 33/302 (10%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE T E A+++ G+ +G +RV A A
Sbjct: 360 KGIAYVEFYTQESVIKALSMTGMSMKGQGIRVHSSQAEKNRAAKAQKQ------------ 407
Query: 282 GLASGAIGGAEGPDRVFVG---GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
L A+ ++ P + V G+ Y E ++ +L FG + L + D GNSKGY
Sbjct: 408 -LQDNALKESDNPTTIVVSNLLGVLSYLNEIELNQLFSPFGNIIDVALAR-TDDGNSKGY 465
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
+ ++ A +NG + + + V A SK+ S+ + +
Sbjct: 466 AYIRFKRWNEAKEALNVMNGFDINGQQIKVAYANTRKDSKSRLHSLGDVDMERLDDDDAG 525
Query: 399 LQTSGMNTLGGGMSLFGETL---AKVLCLTEAITADALADDEEY-EEILEDMREECGKYG 454
L SG N M + + L L+ T+ D+ E+ +EI ED++EECGKYG
Sbjct: 526 L-ISGSNIKIALMKKLQQRQPLNSSNLVLSNMYTSADYEDNREFFDEIEEDVKEECGKYG 584
Query: 455 TLVNVVIPR--PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
T++ V + + PD GKV++++ + A +L GR F GNT+ Y +D
Sbjct: 585 TVIQVFVNKRNPD---------GKVYVKFKNNDDAQAANKSLQGRYFAGNTIQVSYISDD 635
Query: 513 KY 514
+Y
Sbjct: 636 QY 637
>gi|426201409|gb|EKV51332.1| hypothetical protein AGABI2DRAFT_189584 [Agaricus bisporus var.
bisporus H97]
Length = 563
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 149/347 (42%), Gaps = 37/347 (10%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
AR V+V L + + FF +G + V + K +VE RT+E
Sbjct: 232 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDARIVTDRLSRRSKGIGYVEFRTIE 288
Query: 234 EASNAMALDGIIFEGVAVRVR-RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
A+ L G + G+ + V+ + N T A G S NL V GAI
Sbjct: 289 LVEKAIGLSGTVVMGLPIMVQLTEAERNKTHA-----GDGSINLP-PGVSAPHGAI---- 338
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
++VG L + TE+ IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 339 ----LYVGSLHFNLTESDIKQVFEVFGELEFVDLHRDAMTGRSKGYAFVQYKRAEDARMA 394
Query: 353 CAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMAL--QTS 402
+ G ++ +TL V G +K T+Q+S+ L A + +QK+A Q +
Sbjct: 395 LQQMEGFELAGRTLRVNTVHEKGTTKYTQQDSLDESGGGNLNAASRQALMQKLARTDQPA 454
Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLVNVVI 461
++ ++ + L D + + E+ +D++ EC KYG ++ + +
Sbjct: 455 PRPEPVQRPNIPQAMQSRSVLLKNMFDPDEETEKDWDRELAQDVKGECESKYGKVLAIKV 514
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ Q G++++++ A L+GR FGG V+A +
Sbjct: 515 EKDSQ--------GEIYVKFDSIDYAQKAIQGLNGRWFGGRQVSAVF 553
>gi|443921112|gb|ELU40879.1| splicing factor, CC1-like family protein [Rhizoctonia solani AG-1
IA]
Length = 399
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 153/363 (42%), Gaps = 56/363 (15%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
AR V+V L + + FF +G + V + K +VE + +
Sbjct: 63 EARSVFVSQLAARLTARDLGYFFED---KLGEGAVRDARIVTDRLSRRSKGIGYVEFKNI 119
Query: 233 EEASNAMALDGIIFEGVAVRVR----RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ + A+AL G I G+ + ++ P+ + L PG P+
Sbjct: 120 DLVNKAIALSGTIVMGLPIMIQLTESERNKIGPSSSLHLPPGVSHPH------------- 166
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G +++VG L + TE+ I+++ E FG L DL +D TG SKGY F Y+ P
Sbjct: 167 ---AGSMQLYVGSLHFNLTESDIRQVFEPFGELDFVDLHRDPATGKSKGYCFIQYKRPED 223
Query: 349 TDIACAALNGLKMGDKTL-----------TVRRATASGQSKTEQESILAQA-QQHIAIQK 396
+A + G ++ + L TVR +TA S + +L + +H +QK
Sbjct: 224 ARMALEQMEGFELAGRQLRVNTVHDKGQGTVRISTAPQDSLEDTGGVLNNSTSRHQLMQK 283
Query: 397 MAL--QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREE 449
+A Q S NT+ L + + L + + DEE E ++ +D+R E
Sbjct: 284 LARTEQPSKNNTM-----LMKSNIPQTLSSRCVLLRNMFDPDEETERDWDKDLADDVRGE 338
Query: 450 C-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
C KYG ++++ + + + G++++++ A L+GR FGG V A
Sbjct: 339 CEEKYGKVLDLKVEKESE--------GEIYIKFESVESAEKAIKGLNGRWFGGKQVTASP 390
Query: 509 YPE 511
P+
Sbjct: 391 IPD 393
>gi|348503003|ref|XP_003439056.1| PREDICTED: RNA-binding protein 39-like [Oreochromis niloticus]
Length = 498
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 111/267 (41%), Gaps = 47/267 (17%)
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G+ GP R++VG L + TE ++ + E FG + G L+ D +TG SKGYGF + D
Sbjct: 222 GSSGPMRLYVGSLHFNITEEMLRGIFEPFGKIEGIQLMMDSETGRSKGYGFISFADAECA 281
Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQE--------------------SILAQAQ 389
A LNG ++ + + V T S T ++A+
Sbjct: 282 KKALEQLNGFELAGRPMKVGHVTERSDSSTASSFLDNDELERTGIDLGTTGRLQLMARLA 341
Query: 390 QHIAIQ-----KMALQTSGMNTLGG--------------GMSLFGETLAK-VLCLTEAIT 429
+ ++ + ALQ +G GG ++L + LA L L+
Sbjct: 342 EGTGLKIPPAAQQALQMTGSIPFGGIGAPAAVPTPAPSQALNLPSQPLATHCLQLSNLFN 401
Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
A D EI +D+ EEC K+G +V++ + D+N + G V+++
Sbjct: 402 PQAENDPSWAAEIQDDVIEECNKHGGIVHIYV---DKNSPQ----GNVYVKCPSIPAAMA 454
Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFN 516
NAL GR F G + A Y P Y N
Sbjct: 455 TVNALHGRWFAGKMITAAYVPLPTYHN 481
>gi|403362995|gb|EJY81233.1| hypothetical protein OXYTRI_21372 [Oxytricha trifallax]
Length = 411
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/385 (21%), Positives = 150/385 (38%), Gaps = 85/385 (22%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
+H R++YVG LPP Q + + + + N PG +++ +I+ + +AFVE RT
Sbjct: 70 KHERQLYVGNLPPTITHQKLVELLNIAVCVMKLN-VKPGQPILSAWISQDGHYAFVEFRT 128
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
+EE N L+ I +G ++V + T + + P N V + S A+ +
Sbjct: 129 IEECMNGHQLNQIAIQGHPLKVGK-TRIQNQINSQNPHNFPCQNSANQQVLMLSQALSNS 187
Query: 292 EGPDRVFVGGLPYYF---TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+ + +P ++ TE +K LL+ FG + + ++ +C Y+
Sbjct: 188 -----IEISNIPKFYENDTEALVK-LLKMFGVYRQYQM----KALQNQIICYCEYESDEQ 237
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
T A N L + + L VRR G
Sbjct: 238 TKKALNGFNDLLVKESKLQVRRLPN----------------------------------G 263
Query: 409 GGMSLFGETLAKVL-CL------------------TEAITADALADDEEYEEILEDMREE 449
+FG T AK + C+ + + + +E+ E+ +D+REE
Sbjct: 264 HASQIFGSTTAKSIDCIQGNQKSSDEPRSSRVVVLNNLLVLENMKTKQEFYEVEDDIREE 323
Query: 450 CGKYGTLVNVVIPRPD--------------QNGGE---TPGVGKVFLEYYDAVGCATAKN 492
C KYG + V+IP+P Q G G GK+++++ +
Sbjct: 324 CEKYGKIRQVMIPKPSHLSHRQKLPFCIQIQRYGSYLVNEGAGKIYIKFDKSEQAKKCVE 383
Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNK 517
++ R + V A Y EDK++++
Sbjct: 384 QMNKRLYNQREVIASLYSEDKWYDR 408
>gi|299755304|ref|XP_002912089.1| hypothetical protein CC1G_13622 [Coprinopsis cinerea okayama7#130]
gi|298411164|gb|EFI28595.1| hypothetical protein CC1G_13622 [Coprinopsis cinerea okayama7#130]
Length = 580
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 148/345 (42%), Gaps = 35/345 (10%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
AR V+V L + + FF +G + V + K +VE R+++
Sbjct: 251 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDARIVTDRLSRRSKGIGYVEFRSID 307
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
A+AL G I G+ + V+ A G +L+L ASGAI
Sbjct: 308 LVEKAIALSGTIVMGLPINVQLTESERNKSHAGDG------SLHLPPGVTASGAI----- 356
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
++VG L + TE+ IK++ E FG L DL KD TG SKGY F Y+ +A
Sbjct: 357 ---LYVGSLHFNLTESDIKQVFEPFGELEFVDLHKDPMTGRSKGYAFVQYKRAEDARMAL 413
Query: 354 AALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGMN 405
+ G ++ +TL V G + T+ +S+ L A + +QK+A +
Sbjct: 414 EQMEGFELAGRTLRVNTVHEKGSVRYTQTDSLDDSGGANLNAASRQALMQKLARTEQPVV 473
Query: 406 TLGGGMSLFGETL-AKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLVNVVIPR 463
+ + + ++ + L + + +++ +D++ EC KYG ++ + + +
Sbjct: 474 PAEPVKPIIPQAMQSRSVLLKNMFNPEEETEQNWDKDLADDVKGECENKYGKVLAIKVEK 533
Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
Q G++++++ +A L+GR FGG ++A +
Sbjct: 534 DSQ--------GEIYVKFDTVDTAKSAVQGLNGRWFGGRQISANF 570
>gi|70954273|ref|XP_746191.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526725|emb|CAH88205.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 686
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 131/314 (41%), Gaps = 46/314 (14%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
A + AR +YVG +P + Q I F + + + + I + ++AFVE
Sbjct: 249 AEKTARELYVGNIPQNIDIQEIVKFLNTCLLILYNKENENESICLKACIRGDTRYAFVEF 308
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
R++++ SN M L+GI F +R+ RP + + P P + L+ G IG
Sbjct: 309 RSLQDTSNCMLLNGIYFYSNNLRIGRPKTFPAEYTKLIPPATIPP---IDTYYLSQGLIG 365
Query: 290 GA------EGPDRVFVGGLPYYFTETQ---------------IKELLESFGTLHGFDLVK 328
+ D LP + Q IKELLE+FG + F+ +
Sbjct: 366 IKAFVIFHQNRDETKNEYLPVDMIKLQKLCVSNISKNNETNKIKELLEAFGEIQSFEFFE 425
Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
+ NS Y I N ++ + + S + + E E I+
Sbjct: 426 GEE--NSDTY------------ICLVEYNNVENAIQAHKILNQNTSYRIQFEYE-IVNDP 470
Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMRE 448
+ ++K +QT L + KV+ L++ T D L++ E+Y+EI ED++
Sbjct: 471 TINQLVKKKYMQTK-------NAILSQQIPTKVVVLSKIATFDELSNPEDYKEISEDIKI 523
Query: 449 ECGKYGTLVNVVIP 462
EC KYG ++ VV+P
Sbjct: 524 ECEKYGPVLEVVLP 537
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 25/47 (53%)
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
+G F+ + G + LSGRKFG N + A YY E K+ K++
Sbjct: 607 SIGCAFIYFETIEGATKTRKELSGRKFGANIIEANYYSEKKFLMKNF 653
>gi|410899827|ref|XP_003963398.1| PREDICTED: RNA-binding protein 39-like [Takifugu rubripes]
Length = 500
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 112/267 (41%), Gaps = 47/267 (17%)
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G+ GP R++VG L + TE ++ + E FG + G L+ D +TG SKGYGF + D
Sbjct: 224 GSSGPMRLYVGSLHFNITEEMLRGIFEPFGKIEGIQLMMDSETGRSKGYGFISFADAECA 283
Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQE--------------------SILAQAQ 389
A LNG ++ + + V T S T ++A+
Sbjct: 284 KKALEQLNGFELAGRPMKVGHVTERSDSSTASSILDNDELERTGIDLGTTGRLQLMARLA 343
Query: 390 QHIAIQ-----KMALQTSG---MNTLGG-----------GMSLFGETLAK-VLCLTEAIT 429
+ ++ + ALQ +G T+ G ++L + LA L L+
Sbjct: 344 EGTGLKIPPAAQQALQMTGSMSFPTISGPPAVPTPSPSQALNLPAQPLATHCLQLSNLFN 403
Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
A D EI +D+ EEC K+G +V++ + D+N + G V+++
Sbjct: 404 PQAENDPSWAVEIQDDVIEECNKHGGVVHIYV---DKNSTQ----GNVYVKCPSIPAAMA 456
Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFN 516
NAL GR F G + A Y P Y N
Sbjct: 457 TVNALHGRWFAGKMITAAYVPLPTYHN 483
>gi|255575831|ref|XP_002528813.1| splicing factor u2af large subunit, putative [Ricinus communis]
gi|223531725|gb|EEF33547.1| splicing factor u2af large subunit, putative [Ricinus communis]
Length = 844
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 162/392 (41%), Gaps = 97/392 (24%)
Query: 93 RNRSKSLSPSRSPSK---SKRRSGFDMAPPAAAMLPGAAVP------GQLPGVP-----S 138
+ RS++ + + SP+K K+++ +D+AP A +VP Q+ + S
Sbjct: 303 KRRSEAAARTPSPTKHSPEKKKAKWDLAPEGADSTFSVSVPPIFKLSNQIASLNARATVS 362
Query: 139 AVPEMAQNMLPFGATQLGAF------PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
AVP + + P + VQ+ QATR RR+YV +P A+E+A+
Sbjct: 363 AVPVASIPVKPLSGVSSNILLTNKNDTIDSVQLT--QATRPMRRLYVENIPAEASEKAVL 420
Query: 193 TFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVR 252
+ ++ + G N ++ I+ EK A VE T E+AS A++ DG F G ++
Sbjct: 421 ERLNNLLISSGVNHIQGTQPCISCIIHKEKGQALVEFLTPEDASAALSFDGSYFSGSTIK 480
Query: 253 VRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIK 312
+RRP D+ +A+ GP L A Y+F
Sbjct: 481 IRRPKDFIMEIASTFGP--------LKA-----------------------YHF------ 503
Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA- 371
E+ ++G F Y D +VT ACA LNG+K+G + ++ +
Sbjct: 504 ---ENIDDVNG-------------PCAFVEYADQSVTFRACAGLNGMKLGGQVISAVQVI 547
Query: 372 -TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
AS ++ +Q + Q + L +L +L+++
Sbjct: 548 PNASTLEIDGKQPFYGVPEQAKPLLDKPTQVLKLKNLFDPETL--PSLSRI--------- 596
Query: 431 DALADDEEYEEILEDMREECGKYGTL--VNVV 460
E EE+LED+R EC ++GT+ VNVV
Sbjct: 597 -------EIEEVLEDVRLECARFGTVKSVNVV 621
>gi|159163083|pdb|1U2F|A Chain A, Solution Structure Of The First Rna-Binding Domain Of
Hu2af65
Length = 90
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 62/88 (70%), Gaps = 1/88 (1%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+
Sbjct: 1 ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 59
Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNP 261
E + AMA DGIIF+G ++++RRP DY P
Sbjct: 60 ETTQAMAFDGIIFQGQSLKIRRPHDYQP 87
>gi|428672327|gb|EKX73241.1| RNA recognition motif domain containing protein [Babesia equi]
Length = 511
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 157/369 (42%), Gaps = 48/369 (13%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
++A R V V L A+E+ I FS+ V ++ K A+V
Sbjct: 164 EEAQRADLTVLVINLSLSADERDIYELFSE-----HAGKVRDIQCVRDLRSGKSKGIAYV 218
Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPN-LNLAAVGLASG 286
E T E A+++ G+ +G ++++ Q N AA L
Sbjct: 219 EFYTQESVIKALSMTGLDLKGQRIKIQ--------------SSQAEKNRAAKAAKMLQQT 264
Query: 287 AIGGAEGPDRVFVGGLP---YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
A+ ++ P ++VGGL E ++K+L FGT+ ++ +D +TG SKGY F +
Sbjct: 265 AMDASDSPFTIYVGGLIGALSALNEVELKQLFSPFGTIIDVEIFRDPETGESKGYAFLKF 324
Query: 344 QDPAVTDIACAALNGLKMGDKTLTV-----------RRATASGQSKTEQES-----ILAQ 387
+ + A +NG +G + + V R ++ G E+ +++
Sbjct: 325 RRSSEAKEAMNTMNGFDIGGQQIKVGYANLNTTDSKSRLSSLGDVDIERLDDDGGGLISG 384
Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEE--YEEILED 445
A IA+ + +T+ S + L+ TA+ DE + EI ED
Sbjct: 385 ATNKIALMEKLQRTTAAPISATFSSGKASGPTSNIILSNMFTANDPGADEPNFFVEIEED 444
Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
++EEC KYG +V V + + + GKV++++ ++ +TA L+GR F GNT+
Sbjct: 445 VKEECEKYGKVVAVYLNKKTID-------GKVWVKFQNSTDASTAYKGLNGRYFAGNTIK 497
Query: 506 AFYYPEDKY 514
Y +D +
Sbjct: 498 VEYVTDDFW 506
>gi|195996811|ref|XP_002108274.1| hypothetical protein TRIADDRAFT_37071 [Trichoplax adhaerens]
gi|190589050|gb|EDV29072.1| hypothetical protein TRIADDRAFT_37071 [Trichoplax adhaerens]
Length = 351
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 134/321 (41%), Gaps = 53/321 (16%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE R V+ A+ L+G EG+ + ++R Q N +
Sbjct: 37 KGIAYVEFRLVDSVDKALKLNGTKVEGIPIMIQRT--------------QSEKN----KI 78
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+GP R+++G L Y E ++ + E FG + ++++D DT SKGYGF
Sbjct: 79 AALQAQQKAQQGPTRLYIGSLHYNINEDMLRAIFEPFGLVENVNIIRDSDTNVSKGYGFI 138
Query: 342 VYQDPAVTDIACAALNGLKMGDK-----TLTVRRATASGQS-----KTEQESILAQAQQH 391
Y++P A LNGL++ + T+T R A S S TE+ I +
Sbjct: 139 QYKEPDSARRALEQLNGLEVAGRPIKVGTVTDRSADLSAMSALDDDDTERGGIEMNSLSR 198
Query: 392 IAIQKMALQTSGMNTL----------------GGGMSLFGETLAKVLCLTEAITAD-ALA 434
+A+ QT T+ G+ T+ C + D A
Sbjct: 199 VALMAKLSQTHNATTVPVSVPVPVPVPGPTLPATGLIPAANTVQASPCFLISNMFDPAKE 258
Query: 435 DDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
D++++ +I +D+ EEC K+G + +V + + T G V+++ A A +
Sbjct: 259 TDQDWDLDIRDDIIEECNKHGNVYHVYVDK-------TSPKGIVYVKCQTIDVAARAVKS 311
Query: 494 LSGRKFGGNTVNAFYYPEDKY 514
L+GR F GN + A + Y
Sbjct: 312 LNGRWFAGNMITAQFLSLASY 332
>gi|294876942|ref|XP_002767845.1| splicing factor u2af large subunit, putative [Perkinsus marinus
ATCC 50983]
gi|239869760|gb|EER00563.1| splicing factor u2af large subunit, putative [Perkinsus marinus
ATCC 50983]
Length = 220
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 99/197 (50%), Gaps = 6/197 (3%)
Query: 186 ANEQAIATFFSQVMTAIGGN--SAGPGDAVVNVYI---NHEKKFAFVEMRTVEEASNAMA 240
+++Q++ FF + A+ GN P VV+V+ + + A VE RT A+ AM
Sbjct: 16 SSQQSVMDFFKGALFAVTGNGGKTTPLHPVVSVFFLISDGHSRTALVEFRTPIAATVAMR 75
Query: 241 LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD-RVFV 299
L+GI +G + + RP YN + + + + + S A G + ++ +
Sbjct: 76 LNGIDLDGRKLAITRPHGYNKEDPSKSITAEDIQKVTIEELCGGSSTKKTAPGSNLQLGI 135
Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
LP TET +++LLE FG L L++D+ TG SKGYGFC ++DP D AL+
Sbjct: 136 YHLPPVMTETYLRDLLEQFGALTMVSLIRDKTTGLSKGYGFCQFEDPNDADRCLYALDQF 195
Query: 360 KMGDKTLTVRRATASGQ 376
+G+ +L+V R Q
Sbjct: 196 VLGNYSLSVTRLVPDAQ 212
>gi|118788821|ref|XP_317010.3| AGAP008433-PA [Anopheles gambiae str. PEST]
gi|116122929|gb|EAA12873.4| AGAP008433-PA [Anopheles gambiae str. PEST]
Length = 526
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 150/384 (39%), Gaps = 63/384 (16%)
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
P++ M+Q+ R AR V+ L + + + FFS S G V + N
Sbjct: 159 PLEEMSQE-DRDARTVFCMQLSQRIHARDLEEFFS---------SVGKVRDVRLITCNKT 208
Query: 222 KKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
K+F A++E + E + A+ L G G+ + V+ T A+ P P N
Sbjct: 209 KRFKGIAYIEFKDPESVALALGLSGQKLLGIPISVQH-TQAEKNRMASQPPVAPPKN--- 264
Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
GP R++VG L + TE + + E FG + L+ D DTG SKGY
Sbjct: 265 ------------PSGPMRLYVGSLHFNITEDMLNGIFEPFGKIDNIQLIMDADTGRSKGY 312
Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
GF + + A LNG ++ + + V T T S+ I+ A
Sbjct: 313 GFITFHNADDAKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEMDRSGIELGA 371
Query: 399 ---LQTSGMNTLGGGMS----------------------LFGETLAKVLCLTEAITADAL 433
LQ G G++ + +A L + A
Sbjct: 372 TGRLQLMFKLAEGAGLAVPRAAADALLATAPQPVPQQPIMQSPPIATQCFLLSNMFDPAT 431
Query: 434 ADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
+ ++ EI +D+ EEC K+G + +V + + ++P G V+++ A N
Sbjct: 432 ETNPNWDLEIQDDVIEECNKHGGVQHVYVDK------QSPS-GNVYVKCPSIATAVLAVN 484
Query: 493 ALSGRKFGGNTVNAFYYPEDKYFN 516
AL GR F G + A Y P Y+N
Sbjct: 485 ALHGRWFAGRVIGAAYVPLINYYN 508
>gi|209879137|ref|XP_002141009.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
gi|209556615|gb|EEA06660.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
Length = 442
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/424 (23%), Positives = 163/424 (38%), Gaps = 95/424 (22%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV-------------------- 214
R V V +P L N Q ++TF + A+ G A + +
Sbjct: 30 RTVIVEKVPMLFNSQTLSTFLCGAICALQGKPADSLNTFISSIKEITCEDAQLTVNDNNS 89
Query: 215 ---NVY------------INHEKKFAF-VEMRTVEEASNAMALDGIIF--EGVAVRVRRP 256
N+Y NH F VE++T+ + L+GI + RRP
Sbjct: 90 TIHNIYSSNAGSKSGGSSFNHGISRTFRVELQTIIYTLLCLKLNGIPIGTSSSKLICRRP 149
Query: 257 TDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLE 316
+Y P G P + S I ++ + LP +E +++ LE
Sbjct: 150 KEYIPP-----PEGDPINTFQIVLDKPESKQIST----EKCILKDLPIDISEESLRKQLE 200
Query: 317 SFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ 376
+ G + ++ D TG KG G ++D + A +G + T +G
Sbjct: 201 TIGPIKTLVVIYDPITGVPKGVGSFEFEDSLNCNKAVEKFHGRPI--------EGTKNGV 252
Query: 377 SKTEQES-ILAQAQQHIAIQKMALQTSG--------------------MNTLGGGMSL-- 413
+ S ILA++ ++++ +L S + +L MS
Sbjct: 253 WNIQLSSGILAKSNNNVSLNSTSLPISSTTPSNQASVISFITPREYKPVTSLTKSMSYKL 312
Query: 414 ---------------FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
GET +KV+ L I + L DD+EY IL+ ++ E K+GT++
Sbjct: 313 LSSPIIGLILCASKKVGETPSKVVQLLNIIQPEELLDDQEYHSILDSVKTEAEKFGTILE 372
Query: 459 VVIPRP-DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG-NTVNAFYYPEDKYFN 516
+ PRP + G GK+F+ + D A+ L+GR F TV A ++P +KY N
Sbjct: 373 IFSPRPKSRENLYCNGAGKIFIYFADITSARRAQYQLNGRIFDHVKTVCASFFPLEKYLN 432
Query: 517 KDYS 520
++YS
Sbjct: 433 REYS 436
>gi|327271618|ref|XP_003220584.1| PREDICTED: RNA-binding protein 39-like [Anolis carolinensis]
Length = 578
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 135/336 (40%), Gaps = 65/336 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 250 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 292
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 293 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 352
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 353 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 412
Query: 382 ESILAQAQQHIAIQ-----KMALQTSG------MNTLGGGMSLFGETLAKVLCL----TE 426
++A+ + +Q + ALQ SG + L +S E LA + T+
Sbjct: 413 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFSAVADLQTRLSQQSEVLAAAASVQPLATQ 472
Query: 427 AITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
+ + + E EI +D+ EEC K+G +V++ + D+N + G V+++
Sbjct: 473 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVVHIYV---DKNSAQ----GNVYVK 525
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
A NAL GR F G + A Y P Y N
Sbjct: 526 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 561
>gi|393247915|gb|EJD55422.1| splicing factor, CC1-like protein [Auricularia delicata TFB-10046
SS5]
Length = 581
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 145/348 (41%), Gaps = 35/348 (10%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
AR V+V L + + FF +G + V + K +VE+R++
Sbjct: 247 EARSVFVSQLAARLTARDLGYFFED---KLGEGAVRDARIVTDRISRRSKGIGYVELRSI 303
Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
+ + A+ L G I G+ + V+ + A NLNL S GGA
Sbjct: 304 DLVTKALDLSGTIVMGLPIMVQLTEAERNRVHAG-------ENLNLPPG--VSAPQGGAM 354
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
+++VG L + TE IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 355 ---QLYVGSLHFNLTEQDIKQVFEPFGELDFVDLHRDPGTGRSKGYAFVQYKRAEDAKMA 411
Query: 353 CAALNGLKMGDKTLTVRRATASGQSKTEQESI----------LAQAQQHIAIQKMA-LQT 401
++G ++ +TL V G + +I L A + +QK+A +
Sbjct: 412 LEQMDGFELAGRTLRVNSVNEKGVAVRNTTTIDSLEDSGGGNLNAASRQALMQKLARIDP 471
Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG-KYGTLVNVV 460
+ + + + L D + + +++ +D++ EC KYG + +
Sbjct: 472 PKSSQPEARKHIPQNQSTRSVLLLNMFDPDEETEPDWDKDLADDVKGECASKYGPVTALK 531
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
I + Q G++++++ A ++L+GR FGG VNA +
Sbjct: 532 IEKDSQ--------GEIYVQFESVDSAKKAVDSLNGRWFGGRQVNARF 571
>gi|409051610|gb|EKM61086.1| hypothetical protein PHACADRAFT_247456 [Phanerochaete carnosa
HHB-10118-sp]
Length = 584
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 151/363 (41%), Gaps = 45/363 (12%)
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
P+ ++ AR V+V L + + FF + G A +V I+
Sbjct: 241 PLAEEPREDDSEARSVFVSQLAARLTARDLGYFFEDKL----GEGAVMDSRIVTDRISRR 296
Query: 222 KK-FAFVEMRTVEEASNAMALDGIIFEGVAVRVRR-PTDYNPTLAA---ALGPGQPSPNL 276
K +VE RT+E A+ L G I G+ ++V+ + N T A L PG
Sbjct: 297 SKGIGYVEFRTIELVEKAIGLSGTIVMGLPIQVQHTEAERNRTHAGDSLHLPPG------ 350
Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
S GG + ++VG L + TE+ I+++ E FG L DL +D TG SK
Sbjct: 351 -------VSSHHGGMQ----LYVGSLHFNLTESDIRQVFEPFGELEFVDLHRDPMTGRSK 399
Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQA 388
GY F Y+ +A + G ++ +TL V G + T QES+ L A
Sbjct: 400 GYAFVQYKRGEDAKMALEQMEGFELAGRTLRVNTVHEKGNVRYTPQESLDDTGGGNLNAA 459
Query: 389 QQHIAIQKMAL--QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
+ +QK+A Q + ++ +K + L + + + +E+ +D+
Sbjct: 460 SRQALMQKLARTDQPAARPQPIMKPNIPQSMQSKSVLLKNMFNPEEETERDWDKELADDV 519
Query: 447 REEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
+ E KYG + + + R Q G++++++ A L GR FGG V+
Sbjct: 520 KNEVEDKYGDVNFIKVERESQ--------GEIYVKFDSIESAKKAIEGLHGRWFGGRQVS 571
Query: 506 AFY 508
A +
Sbjct: 572 AAF 574
>gi|256082940|ref|XP_002577709.1| splicing factor [Schistosoma mansoni]
gi|360043602|emb|CCD81148.1| putative splicing factor [Schistosoma mansoni]
Length = 463
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 147/380 (38%), Gaps = 72/380 (18%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK---FAFVE 228
R AR V+V L ++ + FF+ S G V + N K+ A+VE
Sbjct: 101 RDARTVFVWQLSARIRQRDLEDFFT---------SVGKIRDVRLIMDNKTKRSKGIAYVE 151
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R VE A A+ L G GV +++++ ++A P P P+
Sbjct: 152 FREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRVSAT--PSLPRPSQQ----------- 198
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP ++++G L Y TE +K + E FG + L+KD T S+GYGF Y +
Sbjct: 199 --NRGPMKLYIGSLHYNITEEMLKGIFEPFGKIEDIKLIKDPTTNRSQGYGFVTYVNSDD 256
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL- 407
A LNG ++ + + V T ++E + A + L T+G L
Sbjct: 257 AKKALDQLNGFELAGRPMKVNHVT----ERSEYACLSALDNDEADRSGVDLGTTGRLALM 312
Query: 408 -----GGGMSLFGETLAKV------------------------LCLTEAITADA----LA 434
G G+ + LA++ +C + ++ +A
Sbjct: 313 AKLAEGTGLEIPKAALAQLHIGQNNPILGSAGSVSSSSAIAPPVCTQCFMLSNMFDPHVA 372
Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
+EEI +D+ EEC K G +++ + R T G V+++ N L
Sbjct: 373 THSVFEEIRDDVIEECTKAGGCLHIFVDR-------TSAQGNVYVKCPSIAVATQCVNML 425
Query: 495 SGRKFGGNTVNAFYYPEDKY 514
GR F G + A Y P Y
Sbjct: 426 HGRYFSGRLITAAYVPLINY 445
>gi|62122939|ref|NP_001014392.1| RNA binding motif protein 39b [Danio rerio]
gi|61402832|gb|AAH91794.1| RNA binding motif protein 39b [Danio rerio]
Length = 539
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 119/281 (42%), Gaps = 51/281 (18%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +AS G GP R++VG L + TE ++ + E FG + G L+ D +TG SKGYG
Sbjct: 249 AAAMASMLQRGGAGPMRLYVGSLHFNITEDMLRGIFEPFGKIEGIQLMMDSETGRSKGYG 308
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R+ AS S T
Sbjct: 309 FISFADAECAKKALEQLNGFELAGRPMKVGHVTERSDASSASSFLDNDELERTGIDLGTT 368
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLGG------------------GMSLFGE 416
+ ++A+ + +Q K ALQ SG + G M+L +
Sbjct: 369 GRLQLMARLAEGTGLQIPAAAKQALQMSGSVSFGNMPNASATPPLIPNPGMNQAMNLPTQ 428
Query: 417 TLAKVLCLTEAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
LA + + ++ ++ EI +D+ EEC K+G ++++ + D+N + G
Sbjct: 429 PLATHCLQLSNMFNPQMENEPGWDIEIRDDVIEECRKHGGVIHIYV---DKNSAQ----G 481
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
V+++ ++L GR F G + A Y P Y N
Sbjct: 482 NVYVKCPTIPVAMAVVSSLHGRWFAGKMITAAYVPLPTYHN 522
>gi|432865706|ref|XP_004070573.1| PREDICTED: RNA-binding protein 39-like [Oryzias latipes]
Length = 516
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/266 (25%), Positives = 109/266 (40%), Gaps = 46/266 (17%)
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G+ GP R++VG L + TE ++ + E FG + G L+ D +TG SKGYGF + D
Sbjct: 241 GSAGPMRLYVGSLHFNITEEMLRGIFEPFGKIEGIQLMMDSETGRSKGYGFISFADAECA 300
Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQE--------------------SILAQAQ 389
A LNG ++ + + V T S T ++A+
Sbjct: 301 KKALEQLNGFELAGRPMKVGHVTERSDSSTASSFLDNDELERTGIDLGTTGRLQLMARLA 360
Query: 390 QHIAIQ-----KMALQTSGMNTLGG-------------GMSLFGETLA-KVLCLTEAITA 430
+ ++ + ALQ +G G ++L + LA L L+
Sbjct: 361 EGTGLKIPPAAQQALQMTGSIPFGNMAAPAIPTPAPSQALNLPSQPLATHCLQLSNLFDP 420
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
A D EI +D+ EEC K+G +V++ + D+N + G V+++
Sbjct: 421 QAENDPAWASEIQDDVIEECNKHGGVVHIYV---DKNSPQ----GNVYVKCPSIPAAMAT 473
Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFN 516
NAL GR F + A Y P Y N
Sbjct: 474 VNALHGRWFARKMITAAYVPLPTYHN 499
>gi|326931688|ref|XP_003211958.1| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein 39-like
[Meleagris gallopavo]
Length = 571
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/334 (26%), Positives = 131/334 (39%), Gaps = 65/334 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 243 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 285
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 286 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 345
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 346 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 405
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA-- 434
++A+ + +Q + ALQ SG G L + L A + LA
Sbjct: 406 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQNEVLAAAASVQPLATQ 465
Query: 435 -----------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
+EE EI +D+ EEC K+G ++++ + D+N + G V+++
Sbjct: 466 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVK 518
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A NAL GR F G + A Y P Y
Sbjct: 519 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTY 552
>gi|226469236|emb|CAX70097.1| RNA-binding protein 39 [Schistosoma japonicum]
Length = 463
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 148/380 (38%), Gaps = 72/380 (18%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK---FAFVE 228
R AR V+V L ++ + FF+ S G V + N K+ A+VE
Sbjct: 101 RDARTVFVWQLSARIRQRDLEDFFT---------SVGKIRDVRLIMDNKTKRSKGIAYVE 151
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R VE A A+ L G GV +++++ ++A P P P+
Sbjct: 152 FREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRVSAT--PSLPRPSQQ----------- 198
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+GP ++++G L Y TE +K + E FG + L+KD T S+GYGF Y +
Sbjct: 199 --NKGPMKLYIGSLHYNITEEMLKGIFEPFGKIEDIKLIKDPATNRSQGYGFVTYVNSDD 256
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL- 407
A LNG ++ + + V T ++E + A + L T+G L
Sbjct: 257 AKKALDQLNGFELAGRPMKVNHVT----ERSEYACLSALDNDEADRSGVDLGTTGRLALM 312
Query: 408 -----GGGMSLFGETLAKV------------------------LCLTEAITADA----LA 434
G G+ + LA++ +C + ++ +A
Sbjct: 313 AKLAEGTGLEIPKAALAQLHIGQNNPILGSAGSVSSSSAIAPPVCTQCFMLSNMFDPHVA 372
Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
+EEI +D+ EEC K G +++ + R T G V+++ N L
Sbjct: 373 THSVFEEIRDDVIEECTKAGGCLHIFVDR-------TSAQGNVYVKCPSIAVATQCVNML 425
Query: 495 SGRKFGGNTVNAFYYPEDKY 514
GR F G + A Y P Y
Sbjct: 426 HGRYFSGRLITAAYVPLINY 445
>gi|126291195|ref|XP_001371651.1| PREDICTED: RNA-binding protein 39 isoform 1 [Monodelphis domestica]
Length = 524
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 135/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFG--ETLAKVLCLTEAITADALA 434
++A+ + +Q + ALQ SG G L ++V L A + LA
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQSEVTALAAAASVQPLA 416
Query: 435 ---------------DDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
D+ ++ EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 417 TQCFQLSNMFNPQTEDELGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 469
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 470 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 507
>gi|340500276|gb|EGR27170.1| splicing factor u2af large subunit, putative [Ichthyophthirius
multifiliis]
Length = 201
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 82/149 (55%), Gaps = 12/149 (8%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGG--NSAGPGDAVVNVYINHEKKFAFVEM 229
RHARR+Y+G +P N++ ++ + + + A GG +S + ++ I+ + KFAF+E+
Sbjct: 22 RHARRLYIGNIPDSINQEYLSEWLYRSLEAAGGLVDSLPNENPIIKCEIDSKGKFAFIEI 81
Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL---AAVGLASG 286
RT+EE + + LDGII +R+RRPT+Y + P LNL +G+
Sbjct: 82 RTIEETTTLLQLDGIILWHRQLRIRRPTEYEK--FPQIYPNYNVKKLNLDLFKTIGIVII 139
Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELL 315
+GP+++F+ LP TQ+ EL+
Sbjct: 140 PTVVDDGPNKIFLANLP-----TQMDELM 163
>gi|47212427|emb|CAF93583.1| unnamed protein product [Tetraodon nigroviridis]
Length = 500
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 112/265 (42%), Gaps = 46/265 (17%)
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G GP R++VG L + TE ++ + E FG + L+ D DTG SKGYGF + D
Sbjct: 226 GMMGPLRLYVGSLHFNITEEMLRGIFEPFGRIENIQLMVDSDTGRSKGYGFITFADAECA 285
Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKT-------EQE------------SILAQ--- 387
A LNG ++ + + V T + EQE ++AQ
Sbjct: 286 KKALEQLNGFELAGRPMKVGHVTDRSDAVAPPFPDGEEQERAGADLGSTGRLQLMAQLSE 345
Query: 388 ---------AQQ------HIAIQKMALQTSGMNTLGGGMSLFGETLA-KVLCLTEAITAD 431
AQQ IA+ MA ++ MN G MS+ + LA L+ + +
Sbjct: 346 GTGLPMPPSAQQALQMSGAIALGAMAAVSAAMNP-GLNMSIPSQPLATHCFQLSNMFSPN 404
Query: 432 ALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAK 491
+ +I ++ EEC K+G +V++ + D++ E G V+++ A
Sbjct: 405 SELPPGWELDIQHNVIEECNKHGGVVHIYV---DKDSAE----GNVYIKCPTIPAAMAAV 457
Query: 492 NALSGRKFGGNTVNAFYYPEDKYFN 516
N L GR F G + A Y P Y N
Sbjct: 458 NVLHGRFFNGKLITAAYVPLPTYHN 482
>gi|145502691|ref|XP_001437323.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404473|emb|CAK69926.1| unnamed protein product [Paramecium tetraurelia]
Length = 438
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 66/110 (60%), Gaps = 1/110 (0%)
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
M+ + + +L + +T + + DEE+ +I+ED+REEC K+GT+ NV+IPRP+ G
Sbjct: 294 MARYVQIPTNILVIKNVLTLEDVTIDEEFNDIMEDIREECSKFGTVKNVIIPRPE-FGKI 352
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
GVGK+F+EY TA+ L+GR +G TV Y +K+ + ++
Sbjct: 353 IVGVGKIFVEYEKTQEARTARRYLAGRMYGDKTVECEYLSREKWAKRQFT 402
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 56/223 (25%), Positives = 98/223 (43%), Gaps = 41/223 (18%)
Query: 81 RHRSRSHSSDRFRNR---SKSLSPSRSPSKSKRRSGFDMAPPAAAM-----------LPG 126
+ RSR+ S ++ + R K +P+++ ++ R FD +PP + L
Sbjct: 41 KKRSRNVSKEKDKKREEFQKPKAPTKANAEQSRGFRFD-SPPKDPLQNTPFSNFKSKLID 99
Query: 127 AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ---QATRHA-RRVYVGGL 182
G+ + A P QN L PL+ +Q + Q QA A R++YVG L
Sbjct: 100 QVSLGEFETILPANP--LQNPLASLEALQAMTPLIQMQRLQQLRAQADVKADRKLYVGNL 157
Query: 183 PPLANEQAIAT----------FFSQVMTAIGGNSAGPGDAVVNVYINHEKK--------- 223
PP + + + F +Q + +G +S G ++ N +I+ +
Sbjct: 158 PPNSQPKEVEMVMDILNQLQDFLNQTLLKMGVSSEHAG-SICNCWIDSNGQILRLIYLGH 216
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
F F+E R+ EEA+ L +IF+G +++ RP + +LAA
Sbjct: 217 FGFIEFRSPEEATQGFILKDVIFKGHQLKIGRPKSFLTSLAAV 259
>gi|395505312|ref|XP_003756986.1| PREDICTED: RNA-binding protein 39 [Sarcophilus harrisii]
Length = 557
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 135/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 227 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 269
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 270 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 329
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 330 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 389
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFG--ETLAKVLCLTEAITADALA 434
++A+ + +Q + ALQ SG G L ++V L A + LA
Sbjct: 390 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQSEVTALAAAASVQPLA 449
Query: 435 ---------------DDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
D+ ++ EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 450 TQCFQLSNMFNPQTEDELGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 502
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 503 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 540
>gi|224077247|ref|XP_002192236.1| PREDICTED: RNA-binding protein 39 [Taeniopygia guttata]
Length = 522
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/334 (26%), Positives = 131/334 (39%), Gaps = 65/334 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA-- 434
++A+ + +Q + ALQ SG G L + L A + LA
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVTDLQTRLSQQNEVLAAAASVQPLATQ 416
Query: 435 -----------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
+EE EI +D+ EEC K+G ++++ + D+N + G V+++
Sbjct: 417 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVK 469
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A NAL GR F G + A Y P Y
Sbjct: 470 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTY 503
>gi|384252245|gb|EIE25721.1| hypothetical protein COCSUDRAFT_64802 [Coccomyxa subellipsoidea
C-169]
Length = 581
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 55/92 (59%), Gaps = 5/92 (5%)
Query: 422 LCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE-----TPGVGK 476
L +T +T D L DDEEY E+++D++EEC KYG ++ V++PRP + GK
Sbjct: 480 LQVTGMVTPDVLVDDEEYSEVIQDLQEECSKYGQVLRVLVPRPPNPAASNELFGSNNYGK 539
Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
F E+ D GC+ AK A+ GR F G TV A Y
Sbjct: 540 AFAEFADVSGCSAAKAAIHGRLFAGETVQATY 571
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 50/88 (56%), Gaps = 3/88 (3%)
Query: 175 RRVYVGGLPP-LANEQAIATFFSQVMTAIGGNSAGPG-DAVVNVYINHEKKFAFVEMRTV 232
R VYVG L L E A+ F+ M A N G +AVV+V ++ E ++AFVE+RT
Sbjct: 294 REVYVGNLVAGLVTEDALRQLFNSTMAAAFPNLLAQGLEAVVSVSMHSEGRYAFVELRTP 353
Query: 233 EEASNAMAL-DGIIFEGVAVRVRRPTDY 259
E AS A+ L + + G ++ V RP+ Y
Sbjct: 354 EMASAALQLSNQVQLLGQSISVGRPSGY 381
>gi|403222792|dbj|BAM40923.1| RNA splicing factor [Theileria orientalis strain Shintoku]
Length = 649
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/305 (24%), Positives = 138/305 (45%), Gaps = 40/305 (13%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL-AA 280
K A+VE T E A++++G+ +G +RV+ Q N AA
Sbjct: 367 KGIAYVEFYTQESVIKALSMNGMSLKGQGIRVQ--------------SSQAEKNRAARAA 412
Query: 281 VGLASGAIGGAEGPDRVFVG---GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
L A+ A+ P V V G+ +E +++L FG + + ++ D G SKG
Sbjct: 413 KQLQENALKEADNPTTVMVSNLVGVLSNLSEGDLQQLFAPFGNVAEVAVARN-DLGLSKG 471
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
Y + ++ A +NG + + + V T++ + + + + L + I+++
Sbjct: 472 YAYVRFKRWTEAREALNVMNGFDISGQPIKVSYVTSNKRGRGSRLNELG----DLDIERL 527
Query: 398 ALQTSGM-----NTLGGGMSLFGETLAKVLCLTEAITADALADDEEY-EEILEDMREECG 451
+ +G+ N + L A + L+ T++ AD+ ++ +EI +D+REEC
Sbjct: 528 DDEEAGLISGSSNKIALMKKLQQRVNAANIVLSNMYTSEDYADNNDFFDEIEDDVREECK 587
Query: 452 KYGTLVNVVIPR--PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
KYG +V V + R PD GKV++++ TA +L GR F GNT+ Y
Sbjct: 588 KYGEVVKVYLNRRKPD---------GKVYVKFRSNTDAQTAHKSLQGRYFAGNTIQVGYL 638
Query: 510 PEDKY 514
+D++
Sbjct: 639 SDDQF 643
>gi|302695543|ref|XP_003037450.1| hypothetical protein SCHCODRAFT_80935 [Schizophyllum commune H4-8]
gi|300111147|gb|EFJ02548.1| hypothetical protein SCHCODRAFT_80935 [Schizophyllum commune H4-8]
Length = 409
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 151/357 (42%), Gaps = 52/357 (14%)
Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
AR V+V L + + FF +G N+ V + K +VE RTV
Sbjct: 74 EARSVFVSQLAARLTARDLGYFFED---KLGENTVMDARIVTDRISRRSKGIGYVEFRTV 130
Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
+ A+ L G + G+ + V+ L A G NLNL S GGA
Sbjct: 131 DLVDKALDLSGTVVMGLPIMVQLTEAERNRLHAGDG------NLNLPPG--VSAPHGGAM 182
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
+++VG L + TE IK++ E FG L DL +D TG SKGY F Y+ +A
Sbjct: 183 ---QLYVGSLHFNLTEADIKQVFEPFGELEFVDLHRDPTTGRSKGYAFVQYKRAEDARMA 239
Query: 353 CAALNGLKMGDKTLTVRRATASG------QSKTEQES---ILAQAQQHIAIQKMALQTSG 403
+ G ++ + L V G Q+++ +S L A + +QK+A S
Sbjct: 240 MEQMEGFELAGRQLKVNTVHDKGGVVRYAQTESLDDSGGGNLNAASRQALMQKLARTDSA 299
Query: 404 MNTLGGGMSLFGETLA----------KVLCLTEAITADALADDEEYEEILEDMREEC-GK 452
L E +A + + L + + D+ +E+ +D++ EC K
Sbjct: 300 --------PLLPEPVARPNIPQTMESRSVLLKNMFDPEEESGDDWDKELADDVKGECESK 351
Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA-TAKNALSGRKFGGNTVNAFY 508
YG + + + + Q G+++++ +DAV A A L+GR FGG V+A +
Sbjct: 352 YGKVSAIKVEKETQ--------GEIYVK-FDAVDAARKAVQGLNGRWFGGKQVSAAF 399
>gi|118100450|ref|XP_425690.2| PREDICTED: RNA-binding protein 39 [Gallus gallus]
gi|363741409|ref|XP_003642487.1| PREDICTED: RNA-binding protein 39-like [Gallus gallus]
Length = 522
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/334 (26%), Positives = 131/334 (39%), Gaps = 65/334 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA-- 434
++A+ + +Q + ALQ SG G L + L A + LA
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQNEVLAAAASVQPLATQ 416
Query: 435 -----------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
+EE EI +D+ EEC K+G ++++ + D+N + G V+++
Sbjct: 417 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVK 469
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A NAL GR F G + A Y P Y
Sbjct: 470 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTY 503
>gi|325191168|emb|CCA25956.1| Poly(U)bindingsplicing factor PUF60 putative [Albugo laibachii
Nc14]
Length = 454
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 145/359 (40%), Gaps = 56/359 (15%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE------KKFAFV 227
A+R+YVG L E I F+ P A+ ++ ++ E K F F+
Sbjct: 135 AKRLYVGNLYYELKEDDIRNVFA------------PFGAIHSIDLSMEPGTGRSKGFCFL 182
Query: 228 EMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
E V A +A+ L+G A++V RP G +P + AAV +
Sbjct: 183 EFNDVLAAESAVQVLNGSTMANRAIKVGRPHR-----------GNQNPKDSEAAVNIGKE 231
Query: 287 AIGGAEGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDL--VKDRDTGNSKGYGFCVY 343
AI P + V++GG+ I+ + FG + + V ++G +GYGF +
Sbjct: 232 AIRNV--PTKCVYIGGVRTELNSRHIESIFAPFGEIKHCVMTAVSSSESGVHRGYGFIEF 289
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASG---QSKTEQESILAQAQQHIAIQKMALQ 400
D A +NG ++ +TL V +A+A K + ++ I ++
Sbjct: 290 GDEICAMNAIQHMNGFELAGQTLKVGKASAVALLVNLKISNDKVVD------GIHSLSDA 343
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
+ + L + LCL I + E + ++ EC KYG + VV
Sbjct: 344 KQRRKIIEPILELEEKEEQICLCLLNLIKPGDVD-----ENLRGEVASECSKYGDIAQVV 398
Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
I E ++F++Y D G AK AL GR FGGN V A +YP + K Y
Sbjct: 399 I-------HELSSHVRIFVQYEDEAGALRAKGALHGRYFGGNAVKAHFYPIQMFLEKKY 450
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 60/136 (44%), Gaps = 4/136 (2%)
Query: 257 TDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLE 316
T +P +A A Q NL + L S E R++VG L Y E I+ +
Sbjct: 102 TALDPEIAKARALAQA----NLLSQSLPSTLFNPIEFAKRLYVGNLYYELKEDDIRNVFA 157
Query: 317 SFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ 376
FG +H DL + TG SKG+ F + D + A LNG M ++ + V R Q
Sbjct: 158 PFGAIHSIDLSMEPGTGRSKGFCFLEFNDVLAAESAVQVLNGSTMANRAIKVGRPHRGNQ 217
Query: 377 SKTEQESILAQAQQHI 392
+ + E+ + ++ I
Sbjct: 218 NPKDSEAAVNIGKEAI 233
>gi|432101442|gb|ELK29624.1| RNA-binding protein 39 [Myotis davidii]
Length = 491
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 86/327 (26%), Positives = 132/327 (40%), Gaps = 56/327 (17%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 172 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 214
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 215 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 274
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 275 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 334
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCL-TEAITADALAD 435
++A+ + +Q + ALQ SG G + A V L T+ + +
Sbjct: 335 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAASALAAAASVQPLATQCFQLSNMFN 394
Query: 436 DEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
+ E EI +D+ EEC K+G ++++ + D+N + G V+++
Sbjct: 395 PQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVKCPSIAAAIA 447
Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFN 516
A NAL GR F G + A Y P Y N
Sbjct: 448 AVNALHGRWFAGKMITAAYVPLPTYHN 474
>gi|449265754|gb|EMC76900.1| RNA-binding protein 39, partial [Columba livia]
Length = 423
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 87/334 (26%), Positives = 131/334 (39%), Gaps = 65/334 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 95 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 137
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 138 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 197
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 198 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 257
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA-- 434
++A+ + +Q + ALQ SG G L + L A + LA
Sbjct: 258 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQNEVLAAAASVQPLATQ 317
Query: 435 -----------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
+EE EI +D+ EEC K+G ++++ + D+N + G V+++
Sbjct: 318 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVK 370
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A NAL GR F G + A Y P Y
Sbjct: 371 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTY 404
>gi|346465875|gb|AEO32782.1| hypothetical protein [Amblyomma maculatum]
Length = 558
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 87/327 (26%), Positives = 128/327 (39%), Gaps = 54/327 (16%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE VE AM L+G G+ + V+ PT AAA S L
Sbjct: 238 KGIAYVEFLDVESVPLAMGLNGQKLFGIPIVVQ-PTQAERNRAAAQNASTSSSTLQ---- 292
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G GP R++VG L + TE +K + E FG + +L+KD +T SKGYGF
Sbjct: 293 -------RGNVGPMRLYVGSLHFNITEDMLKGIFEPFGKIDKIELIKDMETNRSKGYGFI 345
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQE-------------- 382
+ D A LNG ++ + + V R + S + E
Sbjct: 346 TFHDSEDAKKALEQLNGFELAGRPMKVGHVTERTDVSQAPSFLDSEELDRSGIDLGATGR 405
Query: 383 -SILAQAQQHIAIQ--KMALQTSGMNTLG----------GGMSLFGETLAKVLCLTEAIT 429
++A+ + Q + A+ MNT G + T+A C +
Sbjct: 406 LQLMAKLAEGTGFQIPQAAVNALQMNTTGLPGQPQAAAVAAAAAAAPTIA-TQCFLLSNM 464
Query: 430 ADALADDEEY--EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
D L + EEI D+ EEC K+G ++V + R G V+++
Sbjct: 465 FDPLTETNPSWDEEIRRDVIEECRKHGGALHVYVDRASPE-------GHVYVKCPTIASA 517
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ NAL GR F G + A Y P Y
Sbjct: 518 VASVNALHGRWFAGRIITAAYVPVMSY 544
>gi|125584846|gb|EAZ25510.1| hypothetical protein OsJ_09334 [Oryza sativa Japonica Group]
Length = 942
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 142/350 (40%), Gaps = 80/350 (22%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
V QATR RR+++ LP LA E + ++ ++++ + ++ IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AFVE T E+A+ A++ DG F G ++++RRP +Y A + P +PS + L + +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
A+ P ++F+ G+ + +Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSE--------------------------------MY 602
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
D +T ACA LNG+K+G LT A + TEQ +A I A
Sbjct: 603 IDHPITSKACAGLNGMKLGGGILT---AVNVFPNSTEQ--AFNEASPFYGIPDSA----- 652
Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECG--------KY 453
SL E KVL L + L E EEILED+R EC ++
Sbjct: 653 -------KSLLEEP-TKVLQLKNVFDQEEYLLLSKSELEEILEDVRVECASLHYGQDDRF 704
Query: 454 GTLVNV-VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
G + ++ V+ P + T G E C + +++GGN
Sbjct: 705 GAVKSINVVKYPASSDNTT---GDTITE------CEDGSTKIEPKEYGGN 745
>gi|66358384|ref|XP_626370.1| splicing factor U2AF U2 SnRNP auxiliary factor large subunit, RRM
domain [Cryptosporidium parvum Iowa II]
gi|46227901|gb|EAK88821.1| splicing factor U2AF U2 SnRNP auxiliary factor large subunit, RRM
domain [Cryptosporidium parvum Iowa II]
Length = 438
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 133/316 (42%), Gaps = 47/316 (14%)
Query: 238 AMALDGIIFEGVAVRV--RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
+ LDG+I + +++ RRP Y+ NLN V L + I D
Sbjct: 127 CLKLDGLIIDSQNIKLFCRRPNKYS--------------NLNNEKV-LDTFIIPRISQHD 171
Query: 296 ------RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
+ + LP E +I++ LES G L ++ D TG KG G +++ ++
Sbjct: 172 NFKENEKCILKNLPTDINEEKIRQHLESIGKLKSLTIIYDPITGIPKGVGSFEFEESSLC 231
Query: 350 DIACAALNG------------LKMGDKTLTVRRATAS--GQSKTEQESILAQAQQHIAIQ 395
A A L+G + +G T+T ++ QS S + Q +++ I
Sbjct: 232 KKAIAILHGKPIESTKNGIWNIYLGSGTITNYKSNKGQFNQSNFSVNSNIIQNSEYLHIT 291
Query: 396 K----MALQTSGMNTLGGGMSL---FGETLAKVLCLTEAITADALADDEEYEEILEDMRE 448
+ M LG M GET ++++ L + L D+E Y L+ +R
Sbjct: 292 EIPTSMTYNIFSNPVLGLMMKYSKQVGETPSQIIQLLNIFLPEELVDNEIYNSTLDSVRS 351
Query: 449 ECGKYGTLVNVVIPRPD--QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG-NTVN 505
E YGT++ + PRP + G GKVF+ + D A+ +GR F TV+
Sbjct: 352 EAEVYGTILEIFCPRPKVIEEFHSCSGAGKVFIYFSDITAARRAQYQFNGRVFDNIKTVS 411
Query: 506 AFYYPEDKYFNKDYSA 521
A ++P +KY +YS
Sbjct: 412 ATFFPLEKYLKHEYSV 427
>gi|224003373|ref|XP_002291358.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973134|gb|EED91465.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
Length = 99
Score = 84.7 bits (208), Expect = 9e-14, Method: Composition-based stats.
Identities = 47/101 (46%), Positives = 59/101 (58%), Gaps = 6/101 (5%)
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
+V+ L +T L DD EY EILED R+EC +GTL N++IPR NG PG K+FL
Sbjct: 4 RVVELKHMLTQQDLEDDNEYNEILEDTRDECSSFGTLKNIIIPR---NG---PGATKIFL 57
Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
EY A A L+GR F G V A + E K+ N+DYS
Sbjct: 58 EYMTNEDAAKAIAGLAGRTFDGRQVTAVCFDEIKFANEDYS 98
>gi|294878000|ref|XP_002768233.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239870430|gb|EER00951.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 638
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 131/562 (23%), Positives = 216/562 (38%), Gaps = 141/562 (25%)
Query: 48 RRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSD----------------- 90
R D N + R+ RD + HR N D E + + + D
Sbjct: 48 REDSNARSQRDNPRD----EPHRSRNDDPEDSPTRKRKEDTGDDGDRKSRERSRRRSPIR 103
Query: 91 -RFRNRSKSLSPSRSPS-KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNML 148
R R + SRSPS K K+ FD P A QL G +++ + Q ++
Sbjct: 104 RDRRPRDRRKRWSRSPSEKQKKPFKFDSPPKELA--------AQLDGSGTSLLGLPQTVV 155
Query: 149 PFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG 208
+T AF + + + AR +Y+G +PP + + + + +G N A
Sbjct: 156 SSSSTIKEAFN----ATLAAERQKIARELYIGQIPPGISAAHLIDVLNDSLMNMGAN-AM 210
Query: 209 PGDAVVNVYINHEKKFAFVEMRTVEEASNAMA-LDGIIFE--GVAVRVRRPTDYNPTLAA 265
PG +V+ ++ + FAFVE RT EEAS A+ L+G + GV+++V RP Y
Sbjct: 211 PGRPIVHGWLGGDGLFAFVEFRTAEEASIALERLNGHQLKSYGVSIKVGRPKGY------ 264
Query: 266 ALGPGQPSPNLNLAAVG-----LASGAIGG-------AEGPDRVFVGGLPYYFTETQIKE 313
+GP P ++N G +S AI G A R+ + G P +E IK
Sbjct: 265 -MGPA-PEDSVNAYTAGGNTASSSSSAIPGGISAAEVASDTSRLCLIGFPLKASEHSIKR 322
Query: 314 LLESF--GTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
L S G + +++K T N + V++ + D
Sbjct: 323 ALRSAAKGEIRHLEILK--HTWNDEQIVLAVFECVNIED--------------------- 359
Query: 372 TASGQSKTEQESILAQAQQHIAIQKMALQTSGMN----TLGGGMSLFGETL-AKVLCLTE 426
+ K + E + + I K A+ MN + GM L E + ++VL +T
Sbjct: 360 --EHRLKKKGEVEIQGVKARIINPKDAIVKGYMNFDGDIMKKGMGL--EVVPSRVLVMTN 415
Query: 427 -AITADALADDEEYEEILEDMREECGKY---GTLVNVVIPRPDQNGG------------- 469
A + + L DD Y ++++D++ EC + +++IPRP+ N
Sbjct: 416 FAGSVEELLDDINYSDLMDDIKVECKSITGGADVRSIIIPRPETNTTIPTVNDVNTPNGD 475
Query: 470 -------------------------------ETPGVGKVFLEYYDAVGCATAKNALSGRK 498
+ PG+G F+E+ K L GR
Sbjct: 476 AHHHDSATMEDSHQTTVQGNTSTAAVPAVDMQVPGLGCCFIEFRSVEEAGQVKRILDGRI 535
Query: 499 FGGNTVNAFYYPEDKYFNKDYS 520
FGG+ V Y+ E ++ D++
Sbjct: 536 FGGHEVFVTYFSETRFQRGDFA 557
>gi|449689952|ref|XP_004212193.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial [Hydra
magnipapillata]
Length = 210
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 83/170 (48%), Gaps = 29/170 (17%)
Query: 102 SRSPSKSKRRSGFDMAPPA---------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGA 152
S S KSKR + +D+ P A+ V P SAVP ++ LP GA
Sbjct: 46 SHSVPKSKRNTLWDVPPKGYEDITPVQFKALRAAGKVEVANPVCGSAVPAVS---LPQGA 102
Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA 212
Q T ARR+Y+G +P +E + FF+ M + PG+
Sbjct: 103 ----------------QTTWQARRIYLGNIPFGISEDLMVDFFNAKMRE-SDIARQPGNP 145
Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT 262
V+ IN EK FAF+E R+VEE + AMA DGI+ +G A+++RRP DY P
Sbjct: 146 VLACQINLEKNFAFLEFRSVEETTLAMAFDGIMLQGQALKIRRPKDYQPI 195
>gi|215820610|ref|NP_001135964.1| RNA binding motif protein 39 [Nasonia vitripennis]
Length = 516
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 146/376 (38%), Gaps = 69/376 (18%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR ++ L + + FFS S G V + N ++F A+VE
Sbjct: 154 RDARTIFCMQLSQRIRARDLEEFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 204
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ E + A+ L G GV + V+ T A G PNL
Sbjct: 205 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 248
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G GP R++VG L + TE +K + E FG + L+ D +TG SKGYGF +++
Sbjct: 249 KGQTGPMRLYVGSLLFNITEEMLKGIFEPFGKIENIQLIMDPETGRSKGYGFLTFRNADD 308
Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
A LNG ++ + + V R G S + + + A ++ L
Sbjct: 309 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQL---- 364
Query: 404 MNTLGGGMSL-FGETLAKVLCLTEAITADALAD-----------------DEEYE----- 440
M L G L A L +T +TA + D + E
Sbjct: 365 MFKLAEGTGLEIPPAAANALNMTPVVTAPQINQQTAPPIATQCFMLSNMFDPQNENNSLW 424
Query: 441 --EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
EI +D+ EEC K+G +++V + DQ + G V+++ A N+L GR
Sbjct: 425 VKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRW 477
Query: 499 FGGNTVNAFYYPEDKY 514
F G + A Y P Y
Sbjct: 478 FAGRVITAAYVPVVNY 493
>gi|145523992|ref|XP_001447829.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415351|emb|CAK80432.1| unnamed protein product [Paramecium tetraurelia]
Length = 419
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 66/110 (60%), Gaps = 1/110 (0%)
Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
M+ + + +L + +T + + DEE+ +I++D++EEC K+GT+ N++IPRP+ G
Sbjct: 275 MARYVQIPTNILVIKNVLTLEDVTIDEEFNDIMDDIKEECSKFGTVKNIIIPRPE-FGKI 333
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
GVGK+F+EY TA+ L+GR +G TV Y +K+ + ++
Sbjct: 334 IIGVGKIFVEYEKTQEARTARRYLAGRMYGDKTVECEYLSREKWAKRQFT 383
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 96/204 (47%), Gaps = 22/204 (10%)
Query: 81 RHRSRSHSSDRFRNR---SKSLSPSRSPSKSKRRSGFDMAPPAAAM-----------LPG 126
+ RSR+ S ++ + R K +P++ ++ R FD +PP + L
Sbjct: 41 KKRSRNVSKEKEKKRDEFQKPKAPTKQNAEQSRGFRFD-SPPKDPLQNTPFSNFKSKLID 99
Query: 127 AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ---QATRHA-RRVYVGGL 182
G+ + A P QN L PL+ +Q + Q QA A R++YVG L
Sbjct: 100 QVSLGEFETILPANP--LQNPLASLEALQAMTPLIQMQRLQQLRAQADVKADRKLYVGNL 157
Query: 183 PPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALD 242
PP + + + F +Q + +G +S G ++ N +I+ F F+E R+ EEA+ L
Sbjct: 158 PPNSQPKELQDFLNQTLLKMGVSSEHAG-SICNCWIDSNGHFGFIEFRSPEEATQGFILK 216
Query: 243 GIIFEGVAVRVRRPTDYNPTLAAA 266
+IF+G +++ RP + +LAA
Sbjct: 217 DVIFKGHQLKIGRPKSFLTSLAAV 240
>gi|242015973|ref|XP_002428613.1| RNA-binding region-containing protein, putative [Pediculus humanus
corporis]
gi|212513276|gb|EEB15875.1| RNA-binding region-containing protein, putative [Pediculus humanus
corporis]
Length = 593
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 118/521 (22%), Positives = 195/521 (37%), Gaps = 80/521 (15%)
Query: 27 RTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKER-RHRHRSR 85
+ +R RD+H + ++ RDK+ RE RD DR R RD + +ER R+RH
Sbjct: 26 KRSKRDRDKHDKRSRNSRSRHSRDKDRHSSRE--RDRDRHSRERDRHSSRERDRNRH--- 80
Query: 86 SHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQ 145
S DR+ ++ + S+ R + + P L P +
Sbjct: 81 SRDRDRY-SKDRDRRSRDRDRHSRERDRYSRERDRYRSRRRSISPNNL------APHLLN 133
Query: 146 NMLPFGA----TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTA 201
N + + F +P+ +T + R R V+ L + + FFS
Sbjct: 134 NEYAYKKYASYRKSPTFSKLPIDDLTPEE-RDQRTVFCMQLSQRIRGRDLEEFFS----- 187
Query: 202 IGGNSAGPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTD 258
S G V + N ++F A+VE + E AM L G G+ + V+
Sbjct: 188 ----SVGKVRDVKLITCNKTRRFKGIAYVEFKDPESVPLAMGLTGQKLLGIPISVQ---- 239
Query: 259 YNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESF 318
PT A G + + + + GP R++VG L + TE ++ + E F
Sbjct: 240 --PTQAEKNRQGNSTAPMMMPS---------DMRGPMRLYVGSLHFNITEDMLRGIFEPF 288
Query: 319 GTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATA 373
G + L+ D +TG SKGYGF + A LNG ++ + + V R
Sbjct: 289 GKIDSIQLIMDPETGRSKGYGFITFHSADDAKKALEQLNGFELAGRPMKVGNVQERTDNI 348
Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAK------------- 420
+G S + + + A ++ L G GM +
Sbjct: 349 AGTSILDTDELDRSGIDLGATGRLQLMYKLAE--GTGMQIPPAAATALNLANALPQAVQP 406
Query: 421 -----VLCLTEAITAD-ALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
C A D A + ++ EI +D+ EEC K+G +++V + +
Sbjct: 407 APPIATQCFMLANMFDPATETNPTWDVEIRDDVIEECNKHGGVLHVYVDKTSN------- 459
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
G V+++ + N+L GR F G + A Y P Y
Sbjct: 460 -GNVYVKCPTIATAVASVNSLHGRWFAGRIITAAYVPLLNY 499
>gi|114794658|pdb|2HZC|A Chain A, Crystal Structure Of The N-terminal Rrm Of The U2af Large
Subunit
Length = 87
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/83 (48%), Positives = 59/83 (71%), Gaps = 1/83 (1%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
ARR+YVG +P E+A+ FF+ M +GG + PG+ V+ V IN +K FAF+E R+V+
Sbjct: 6 ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 64
Query: 234 EASNAMALDGIIFEGVAVRVRRP 256
E + AMA DGIIF+G ++++RRP
Sbjct: 65 ETTQAMAFDGIIFQGQSLKIRRP 87
>gi|59858555|ref|NP_001012304.1| RNA-binding protein 39 [Danio rerio]
gi|27882534|gb|AAH44487.1| RNA binding motif protein 39a [Danio rerio]
gi|182892014|gb|AAI65689.1| Rbm39a protein [Danio rerio]
Length = 523
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 118/283 (41%), Gaps = 53/283 (18%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A LA+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 231 AAALANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIDSIQLMMDSETGRSKGYG 290
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 291 FITFSDAECAKKALEQLNGFELAGRPMKVGHVTERTDASTASSFLDNDELERTGIDLGTT 350
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSG-------------------MNT-LGGGMSLF 414
+ ++A+ + +Q + ALQ SG +N ++L
Sbjct: 351 GRLQLMARLAEGTGLQIPPAAQQALQMSGSMVAMAAATAAMNPGLSFNINVPTNQALNLP 410
Query: 415 GETLA-KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
+ +A L+ ++ D EI +D+ EEC K+G ++++ + D+ E
Sbjct: 411 SQPIATHCFQLSNMFNPNSENDHGWEIEIQDDVIEECNKHGGVIHIYV---DKKSAE--- 464
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A +AL GR FGG + A Y P Y N
Sbjct: 465 -GNVYVKCPTIPAAMAAVSALHGRWFGGKMITAAYVPLPTYHN 506
>gi|126291198|ref|XP_001371677.1| PREDICTED: RNA-binding protein 39 isoform 2 [Monodelphis domestica]
Length = 533
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 120/289 (41%), Gaps = 59/289 (20%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 235 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 294
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 295 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 354
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLGG-GMSLFGETL----------AKVLC 423
+ ++A+ + +Q + ALQ SG G +F ++V
Sbjct: 355 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAAKIFFPFFIDLQTRLSQQSEVTA 414
Query: 424 LTEAITADALA---------------DDEEYE-EILEDMREECGKYGTLVNVVIPRPDQN 467
L A + LA D+ ++ EI +D+ EEC K+G ++++ + D+N
Sbjct: 415 LAAAASVQPLATQCFQLSNMFNPQTEDELGWDTEIKDDVIEECNKHGGVIHIYV---DKN 471
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ G V+++ A NAL GR F G + A Y P Y N
Sbjct: 472 SAQ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 516
>gi|321472566|gb|EFX83536.1| hypothetical protein DAPPUDRAFT_194972 [Daphnia pulex]
Length = 366
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/374 (22%), Positives = 141/374 (37%), Gaps = 60/374 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS V G V + N ++F +VE
Sbjct: 6 RDARTVFCMQLSQRIRARDLEEFFSAV---------GKVRDVRLITCNKTRRFKGLCYVE 56
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E A+AL G GV + V+ LA + P S N
Sbjct: 57 FAEPESVPLAIALTGQRLCGVPIVVQPTQAEKNRLAGSNMPAMSSFN------------- 103
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF +++
Sbjct: 104 KGPNGPMRLYVGSLHFNITEDMLRSIFEPFGKIEHMQLMIDTETGRSKGYGFITFRNAED 163
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSK------------------TEQESILAQAQQ 390
A LNG ++ + + + T T + ++A+ Q
Sbjct: 164 AKKAMEQLNGFELAGRPMKINHVTEHFTGNHTYLDSDEMDRAGIDLGATGRLQLMAKLAQ 223
Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETL----------AKVLCLTEAITADALADDEEYE 440
++ A S +N + + L + L+ + + ++
Sbjct: 224 GTGLEIPAAAQSALNLQASIQAAQQQALPVASVAPPIATQCFMLSNMFDSSSETHPLWHQ 283
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
EI +D+ +EC K+G ++++ + + G V+++ A NAL GR F
Sbjct: 284 EICDDVMDECNKHGGVLHIYVDKASPQ-------GNVYVKCPSVTVAVNAVNALHGRWFA 336
Query: 501 GNTVNAFYYPEDKY 514
G + A Y P Y
Sbjct: 337 GRIITAAYVPLINY 350
>gi|297801306|ref|XP_002868537.1| hypothetical protein ARALYDRAFT_355725 [Arabidopsis lyrata subsp.
lyrata]
gi|297314373|gb|EFH44796.1| hypothetical protein ARALYDRAFT_355725 [Arabidopsis lyrata subsp.
lyrata]
Length = 1370
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 121/289 (41%), Gaps = 40/289 (13%)
Query: 96 SKSLSPSRSPSKSKRRSGFDMAPPA-AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
+K++SP + S K+ + +D+AP AAM G+ G +A P ++ L
Sbjct: 815 AKAVSPP-NLSSEKKSAKWDLAPAVTAAMFSGSVFSGLQAAAQTAYPTNSEASLTLLKPL 873
Query: 155 LGAFPLMPV--------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
+ A P V ++TR RR+Y + A+E+++ F+ M + G N
Sbjct: 874 MEAPFRTPSAREITSVDSVQLTESTRRMRRLYAENVSDSASEKSLIECFNSYMLSSGSNH 933
Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
+ ++ IN EK A VE T +AS A++LDG F G+ +++RRP Y T
Sbjct: 934 IKGSEPCISCIINKEKSQALVEFLTPHDASAALSLDGCSFAGLNLKIRRPKGYVETTGVY 993
Query: 267 LG-------PGQPS----------PNLNLAAVGLASGAIGGAE------------GPDRV 297
+G G + A+ + SG + E +++
Sbjct: 994 VGYVIIHIQEGDEAVCYVMVTIHEAGFQTVAIFMQSGELAKKEPATNAISDNVKDSSNKI 1053
Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
F+GG P + + E++ FG L + V + D N + V ++P
Sbjct: 1054 FIGGFPKSISSEMLMEIVSVFGPLKAYRFVINNDL-NKRCAFLEVNENP 1101
>gi|209155056|gb|ACI33760.1| RNA-binding protein 39 [Salmo salar]
Length = 535
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 115/288 (39%), Gaps = 58/288 (20%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G GP R++VG L + TE ++ + E FG + L+ D +T SKGYG
Sbjct: 238 AAAMANNLQKGNAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETARSKGYG 297
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE----------------- 382
F + D A LNG ++ + + V T S T
Sbjct: 298 FISFADAECAKKALEQLNGFELAGRPMKVGNVTERTDSSTASSFLDNDELERTGIDLGTT 357
Query: 383 ---SILAQAQQHIAIQ-----KMALQTSG--------MNTLGGGMSL------FGETLAK 420
++A+ + +Q + ALQ SG + G ++ G ++ +
Sbjct: 358 GRLQLMARLAEGTGLQIPPAAQQALQMSGSMHSSSIHFGNMAAGTAIANPALNLGPSMNQ 417
Query: 421 VLCL-TEAITADALADDEEYE-----------EILEDMREECGKYGTLVNVVIPRPDQNG 468
+ L T+ + L + EI +D+ EEC K+G +V++ + D+N
Sbjct: 418 AMNLPTQPLATHCLQLSNMFSPQSENEPGWDIEIQDDVMEECNKHGGIVHIYV---DKNS 474
Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ G V+++ A NAL GR F G + A Y P Y N
Sbjct: 475 PQ----GNVYVKCPTIPTAMAAVNALHGRWFAGKMITAAYVPLPTYHN 518
>gi|417411155|gb|JAA52027.1| Putative transcriptional coactivator caper rrm superfamily, partial
[Desmodus rotundus]
Length = 491
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 161 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 203
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 204 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 263
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 264 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 323
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
++A+ + +Q + ALQ SG G L +
Sbjct: 324 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 383
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 384 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 436
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 437 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 474
>gi|67617167|ref|XP_667532.1| U2 snRNP auxiliary factor [Cryptosporidium hominis TU502]
gi|54658687|gb|EAL37312.1| U2 snRNP auxiliary factor [Cryptosporidium hominis]
Length = 438
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/401 (22%), Positives = 161/401 (40%), Gaps = 69/401 (17%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT--V 232
R V G P + N + + F + V+ I G ++ + + + E F+ + T +
Sbjct: 42 RTVEFGSDPHVFNSETVEIFLTGVILTILGKASNDSEKLKLIEEVVESDFSSLSCSTGLI 101
Query: 233 EEASNAMALDGI--------IFEGVAVRV--------------RRPTDYNPTLAAALGPG 270
N++ +D I +F + +++ RRP Y+
Sbjct: 102 ANLENSIKIDNIFCVTFTSSLFSLICLKLDGHIIDSQNIKLFCRRPNKYS---------- 151
Query: 271 QPSPNLNLAAVGLASGAI------GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
NLN V L + I + ++ + LP E +I++ LE+ G L
Sbjct: 152 ----NLNNEKV-LDTFIIPRISQYDNFKENEKCILKNLPTDINEEKIRQHLENIGKLKSL 206
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG------------LKMGDKTLTVRRAT 372
++ D TG KG G +++ ++ A A L+G + +G T+T ++
Sbjct: 207 TIIYDPITGIPKGVGSFEFEESSLCKKAIAILHGKPIESTKNGIWNIYLGSGTITNYKSN 266
Query: 373 AS--GQSKTEQESILAQAQQHIAIQK----MALQTSGMNTLGGGMSL---FGETLAKVLC 423
QS S + Q +++ I + M LG M GET ++++
Sbjct: 267 KGQFNQSNFSANSSIIQNSEYLHITEIPTSMTYNIFSNPVLGLMMKYSKQVGETPSQIIQ 326
Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD--QNGGETPGVGKVFLEY 481
L + L D+E Y L+ +R E YGT++ + PRP + G GKVF+ +
Sbjct: 327 LLNIFLPEELVDNEIYNSTLDSVRSEAEVYGTILEIFCPRPKVIEEFHSCSGAGKVFIYF 386
Query: 482 YDAVGCATAKNALSGRKFGG-NTVNAFYYPEDKYFNKDYSA 521
D A+ +GR F TV+A ++P +KY +YS
Sbjct: 387 SDITAARRAQYQFNGRVFDNIKTVSATFFPLEKYLKHEYSV 427
>gi|395863336|ref|XP_003803852.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Otolemur
garnettii]
gi|395863340|ref|XP_003803854.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Otolemur
garnettii]
Length = 514
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 149/403 (36%), Gaps = 79/403 (19%)
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
P+ +T + R AR V+ L + + FFS V + +
Sbjct: 126 PIDNLTPE-ERDARTVFCMQLAARIRPRDLEAFFSTV------GKVRDVRMISDRNARRS 178
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + VR Q N A
Sbjct: 179 KGIAYVEFVDVSSVPLAIGLTGQRVFGVPILVR--------------ASQAEKN---RAA 221
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L TE ++ + E FG L L+KD +TG SKGYGF
Sbjct: 222 AMANNLQKGRAGPMRLYVGSLHLNITEAMLRGIFEPFGRLESIQLMKDSETGRSKGYGFI 281
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI------- 394
+ D A LNGL++ + + V T + T + + + I
Sbjct: 282 TFSDSECAKKALEQLNGLELAGRPMKVGHVTEGTDASTASSFLNSDELERTGIDLGTAGG 341
Query: 395 -QKMALQTSG--------------MNT--LGGGMSLFGETLAKVLCLTEAITADALA--- 434
Q MA G MN+ G + F L + L++ A LA
Sbjct: 342 LQFMARLAEGTGLQIPPAAQQALQMNSPLAFGATAEFSFRLDLLTRLSQQTQASDLAAAA 401
Query: 435 ------------------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 402 SVQPLATQCFQLSNMFNPQTEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ--- 455
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A +AL GR F G + A Y P Y N
Sbjct: 456 -GNVYVKCPSIAAAVAAVSALHGRWFAGKMITAAYVPLPTYHN 497
>gi|328865553|gb|EGG13939.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
fasciculatum]
Length = 949
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 126/318 (39%), Gaps = 43/318 (13%)
Query: 224 FAFVEMRTVEEASNAMA-LDGIIFEGVAVRVRRPTD--YNPTLAAA------------LG 268
F F+E E A NA+ ++ G ++VR+P+ NP L
Sbjct: 646 FCFIEYTYPEAAINAIQNMNQKTISGRQIKVRQPSIPVINPAATGVSVGMGGGGMSEILQ 705
Query: 269 PGQPSPNLNLAAVG----------LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESF 318
P N L++ L + + + +RV+VG +P+ TE QIK + S
Sbjct: 706 PNIIPSNTFLSSTSVASSFSSQALLNNTPVKERDNDNRVYVGSVPWNATEDQIKTIFSSI 765
Query: 319 GTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
G + L + +TG GYGF Y +P + A + NG + + L VR+ +
Sbjct: 766 GNVVSCSLKPNLETGRHMGYGFIDYDNPKSAEDAISTFNGYDINGRQLKVRKPVRNAPKV 825
Query: 379 TEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEE 438
+ L + + ++ L T + L A C+ DE
Sbjct: 826 NNNDGNLLEDNISLNNEQKILLTQKL--------LAASEPATNRCMVMRNLGSPAELDEY 877
Query: 439 YEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
+E E+++ EC +G + VVI N G + K ++ + DA CA + +GR
Sbjct: 878 FE---EEIKNECSSFGAVEKVVI----TNEGTS---VKAYVLFRDAPSCAMCLSKQNGRY 927
Query: 499 FGGNTVNAFYYPEDKYFN 516
F G V A YY + + N
Sbjct: 928 FSGYLVKAEYYNVNLFLN 945
>gi|148674239|gb|EDL06186.1| RNA binding motif protein 39, isoform CRA_d [Mus musculus]
Length = 507
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 177 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 219
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 220 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 279
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 280 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 339
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 340 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 399
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 400 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 452
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 453 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 490
>gi|88682991|gb|AAI05542.1| RBM39 protein [Bos taurus]
Length = 528
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 198 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 240
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 241 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 300
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 301 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 360
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 361 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 420
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 421 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 473
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 474 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 511
>gi|226469234|emb|CAX70096.1| RNA-binding protein 39 [Schistosoma japonicum]
Length = 327
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/327 (24%), Positives = 129/327 (39%), Gaps = 60/327 (18%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE R VE A A+ L G GV +++++ ++A P P P+
Sbjct: 9 KGIAYVEFREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRVSAT--PSLPRPSQQ---- 62
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+GP ++++G L Y TE +K + E FG + L+KD T S+GYGF
Sbjct: 63 ---------NKGPMKLYIGSLHYNITEEMLKGIFEPFGKIEDIKLIKDPATNRSQGYGFV 113
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
Y + A LNG ++ + + V T ++E + A + L T
Sbjct: 114 TYVNSDDAKKALDQLNGFELAGRPMKVNHVT----ERSEYACLSALDNDEADRSGVDLGT 169
Query: 402 SGMNTL------GGGMSLFGETLAKV------------------------LCLTEAITAD 431
+G L G G+ + LA++ +C + ++
Sbjct: 170 TGRLALMAKLAEGTGLEIPKAALAQLHIGQNNPILGSAGSVSSSSAIAPPVCTQCFMLSN 229
Query: 432 A----LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+A +EEI +D+ EEC K G +++ + R T G V+++
Sbjct: 230 MFDPHVATHSVFEEIRDDVIEECTKAGGCLHIFVDR-------TSAQGNVYVKCPSIAVA 282
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKY 514
N L GR F G + A Y P Y
Sbjct: 283 TQCVNMLHGRYFSGRLITAAYVPLINY 309
>gi|74179655|dbj|BAE22477.1| unnamed protein product [Mus musculus]
Length = 521
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 191 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 233
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 234 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 294 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 353
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 354 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 413
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 414 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 466
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 467 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 504
>gi|395863338|ref|XP_003803853.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Otolemur
garnettii]
gi|395863342|ref|XP_003803855.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Otolemur
garnettii]
Length = 487
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 149/403 (36%), Gaps = 79/403 (19%)
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
P+ +T + R AR V+ L + + FFS V + +
Sbjct: 99 PIDNLTPEE-RDARTVFCMQLAARIRPRDLEAFFSTV------GKVRDVRMISDRNARRS 151
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + VR Q N A
Sbjct: 152 KGIAYVEFVDVSSVPLAIGLTGQRVFGVPILVR--------------ASQAEKN---RAA 194
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L TE ++ + E FG L L+KD +TG SKGYGF
Sbjct: 195 AMANNLQKGRAGPMRLYVGSLHLNITEAMLRGIFEPFGRLESIQLMKDSETGRSKGYGFI 254
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI------- 394
+ D A LNGL++ + + V T + T + + + I
Sbjct: 255 TFSDSECAKKALEQLNGLELAGRPMKVGHVTEGTDASTASSFLNSDELERTGIDLGTAGG 314
Query: 395 -QKMALQTSG--------------MNT--LGGGMSLFGETLAKVLCLTEAITADALA--- 434
Q MA G MN+ G + F L + L++ A LA
Sbjct: 315 LQFMARLAEGTGLQIPPAAQQALQMNSPLAFGATAEFSFRLDLLTRLSQQTQASDLAAAA 374
Query: 435 ------------------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 375 SVQPLATQCFQLSNMFNPQTEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ--- 428
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A +AL GR F G + A Y P Y N
Sbjct: 429 -GNVYVKCPSIAAAVAAVSALHGRWFAGKMITAAYVPLPTYHN 470
>gi|256082942|ref|XP_002577710.1| splicing factor [Schistosoma mansoni]
gi|360043601|emb|CCD81147.1| putative splicing factor [Schistosoma mansoni]
Length = 327
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/327 (24%), Positives = 128/327 (39%), Gaps = 60/327 (18%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE R VE A A+ L G GV +++++ ++A P P P+
Sbjct: 9 KGIAYVEFREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRVSAT--PSLPRPSQQ---- 62
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
GP ++++G L Y TE +K + E FG + L+KD T S+GYGF
Sbjct: 63 ---------NRGPMKLYIGSLHYNITEEMLKGIFEPFGKIEDIKLIKDPTTNRSQGYGFV 113
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
Y + A LNG ++ + + V T ++E + A + L T
Sbjct: 114 TYVNSDDAKKALDQLNGFELAGRPMKVNHVT----ERSEYACLSALDNDEADRSGVDLGT 169
Query: 402 SGMNTL------GGGMSLFGETLAKV------------------------LCLTEAITAD 431
+G L G G+ + LA++ +C + ++
Sbjct: 170 TGRLALMAKLAEGTGLEIPKAALAQLHIGQNNPILGSAGSVSSSSAIAPPVCTQCFMLSN 229
Query: 432 A----LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+A +EEI +D+ EEC K G +++ + R T G V+++
Sbjct: 230 MFDPHVATHSVFEEIRDDVIEECTKAGGCLHIFVDR-------TSAQGNVYVKCPSIAVA 282
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKY 514
N L GR F G + A Y P Y
Sbjct: 283 TQCVNMLHGRYFSGRLITAAYVPLINY 309
>gi|330688445|ref|NP_001193433.1| RNA-binding protein 39 [Bos taurus]
Length = 530
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 200 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 242
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 243 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 302
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 303 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 362
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 363 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 422
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 423 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 475
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 476 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513
>gi|157108428|ref|XP_001650224.1| splicing factor [Aedes aegypti]
gi|108879330|gb|EAT43555.1| AAEL005046-PA [Aedes aegypti]
Length = 544
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 140/372 (37%), Gaps = 62/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS V G V + N K+F A++E
Sbjct: 186 RDARTVFCMQLSQRIRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 236
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ E + A+ L G G+ + V+ +A+ P QP P +
Sbjct: 237 FKDPESVALALGLSGQRLLGIPISVQHTQAEKNRMAST--PPQPPPKV------------ 282
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D DTG SKGYGF + +
Sbjct: 283 --TSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDSDTGRSKGYGFITFHNADD 340
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
A LNG ++ + + V T T S+ I A LQ
Sbjct: 341 AKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEMDRSGIDLGATGRLQLMFKL 399
Query: 406 TLGGGMSL----------------------FGETLAKVLCLTEAITADALADDEEYEEIL 443
G G+++ +A L + + ++ +
Sbjct: 400 AEGAGLAVPRAAADALLATAPQPAPQQPVAPSPPIATQCFLLSNMFDPTTETNPTWDTEI 459
Query: 444 ED-MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
ED + EEC K+G +++V + + E P G V+++ A NAL GR F G
Sbjct: 460 EDDVIEECNKHGGVLHVYVDK------ENPA-GNVYVKCPSIATAVLAVNALHGRWFAGR 512
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 513 IITAAYVPLVNY 524
>gi|47215490|emb|CAG01598.1| unnamed protein product [Tetraodon nigroviridis]
Length = 463
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L I D L +DEEYE+I++DM+EEC KYG++V+++IPR E PG G+V++E
Sbjct: 369 VLRLINLIDDDHLNNDEEYEDIMDDMKEECQKYGSVVSLLIPR------ENPGKGQVYVE 422
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y ++ A+ L GR F G V A +YP Y
Sbjct: 423 YANSSDSKEAQRLLMGRTFDGKFVVATFYPLSAY 456
>gi|61557287|ref|NP_001013225.1| RNA-binding protein 39 [Rattus norvegicus]
gi|392346874|ref|XP_003749654.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Rattus
norvegicus]
gi|60552170|gb|AAH91394.1| RNA binding motif protein 39 [Rattus norvegicus]
gi|74196119|dbj|BAE32977.1| unnamed protein product [Mus musculus]
gi|149030834|gb|EDL85861.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_f [Rattus
norvegicus]
Length = 524
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 416
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 417 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 469
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 470 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 507
>gi|4757926|ref|NP_004893.1| RNA-binding protein 39 isoform b [Homo sapiens]
gi|197097940|ref|NP_001125339.1| RNA-binding protein 39 [Pongo abelii]
gi|149733223|ref|XP_001501876.1| PREDICTED: RNA-binding protein 39 isoform 2 [Equus caballus]
gi|194044529|ref|XP_001925282.1| PREDICTED: RNA-binding protein 39 isoform 2 [Sus scrofa]
gi|296199701|ref|XP_002747278.1| PREDICTED: RNA-binding protein 39 isoform 1 [Callithrix jacchus]
gi|301762104|ref|XP_002916459.1| PREDICTED: RNA-binding protein 39-like [Ailuropoda melanoleuca]
gi|332858226|ref|XP_514808.3| PREDICTED: uncharacterized protein LOC458443 isoform 5 [Pan
troglodytes]
gi|344279921|ref|XP_003411734.1| PREDICTED: RNA-binding protein 39 isoform 2 [Loxodonta africana]
gi|345789988|ref|XP_865124.2| PREDICTED: RNA-binding protein 39 isoform 12 [Canis lupus
familiaris]
gi|354477984|ref|XP_003501197.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Cricetulus
griseus]
gi|426391509|ref|XP_004062115.1| PREDICTED: RNA-binding protein 39 isoform 2 [Gorilla gorilla
gorilla]
gi|75070825|sp|Q5RC80.1|RBM39_PONAB RecName: Full=RNA-binding protein 39; AltName: Full=RNA-binding
motif protein 39
gi|405192|gb|AAA16346.1| splicing factor [Homo sapiens]
gi|55727753|emb|CAH90627.1| hypothetical protein [Pongo abelii]
gi|119596568|gb|EAW76162.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_c [Homo
sapiens]
gi|119596569|gb|EAW76163.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_c [Homo
sapiens]
gi|296480931|tpg|DAA23046.1| TPA: RNA binding motif protein 39 [Bos taurus]
gi|307686241|dbj|BAJ21051.1| RNA binding motif protein 39 [synthetic construct]
gi|344246681|gb|EGW02785.1| RNA-binding protein 39 [Cricetulus griseus]
gi|380783275|gb|AFE63513.1| RNA-binding protein 39 isoform b [Macaca mulatta]
gi|383408127|gb|AFH27277.1| RNA-binding protein 39 isoform b [Macaca mulatta]
gi|384939256|gb|AFI33233.1| RNA-binding protein 39 isoform b [Macaca mulatta]
gi|410218748|gb|JAA06593.1| RNA binding motif protein 39 [Pan troglodytes]
gi|410255438|gb|JAA15686.1| RNA binding motif protein 39 [Pan troglodytes]
gi|410292902|gb|JAA25051.1| RNA binding motif protein 39 [Pan troglodytes]
gi|410350855|gb|JAA42031.1| RNA binding motif protein 39 [Pan troglodytes]
gi|410350863|gb|JAA42035.1| RNA binding motif protein 39 [Pan troglodytes]
Length = 524
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 416
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 417 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 469
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 470 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 507
>gi|449458894|ref|XP_004147181.1| PREDICTED: uncharacterized protein LOC101213128 [Cucumis sativus]
Length = 910
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/302 (24%), Positives = 119/302 (39%), Gaps = 71/302 (23%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
V QATR RR+Y+ LP A+E+AI + + + G N ++ I+ ++
Sbjct: 453 VQLTQATRPMRRLYIENLPHSASEKAIIDCLNGFLMSSGVNHIEGTQPCISCIIHKDRGQ 512
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
A VE T E+AS A+ DG F G +++RRP DY TL
Sbjct: 513 ALVEFLTPEDASAALLFDGSDFSGSTLKIRRPKDYIETL--------------------- 551
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
++++ +FG L + + D F Y
Sbjct: 552 ---------------------------RDVVTAFGRLKAYHFEINDDLNGP--CAFLEYV 582
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM 404
D +V ACA LNG+K+G + L V A ++ +H+ K LQ +
Sbjct: 583 DESVVSKACAGLNGMKIGGQVLKVFPAVPFPLTERTGCQPCYGIPEHV---KPLLQRPSV 639
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADAL--ADDEEYEEILEDMREECGKYGTLVNVVIP 462
VL + AD L + + +E+LED+R EC ++GT+ +V
Sbjct: 640 ----------------VLKINNVFNADVLPVLSESDIDEVLEDIRFECARFGTVKSVNFV 683
Query: 463 RP 464
+P
Sbjct: 684 KP 685
>gi|410953912|ref|XP_003983612.1| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein 39 [Felis
catus]
Length = 523
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 74/280 (26%), Positives = 116/280 (41%), Gaps = 50/280 (17%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 234 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 293
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 294 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 353
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLC 423
+ ++A+ + +Q + ALQ SG G L A V
Sbjct: 354 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQP 413
Query: 424 L-TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
L T+ + + + E EI +D+ EEC K+G ++++ + D+N + G
Sbjct: 414 LATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GN 466
Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
V+++ A NAL GR F G + A Y P Y N
Sbjct: 467 VYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 506
>gi|449498643|ref|XP_004160593.1| PREDICTED: uncharacterized LOC101213128 [Cucumis sativus]
Length = 918
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 75/302 (24%), Positives = 119/302 (39%), Gaps = 71/302 (23%)
Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
V QATR RR+Y+ LP A+E+AI + + + G N ++ I+ ++
Sbjct: 461 VQLTQATRPMRRLYIENLPHSASEKAIIDCLNGFLMSSGVNHIEGTQPCISCIIHKDRGQ 520
Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
A VE T E+AS A+ DG F G +++RRP DY TL
Sbjct: 521 ALVEFLTPEDASAALLFDGSDFSGSTLKIRRPKDYIETL--------------------- 559
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
++++ +FG L + + D F Y
Sbjct: 560 ---------------------------RDVVTAFGRLKAYHFEINDDLNGP--CAFLEYV 590
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM 404
D +V ACA LNG+K+G + L V A ++ +H+ K LQ +
Sbjct: 591 DESVVSKACAGLNGMKIGGQVLKVFPAVPFPLTERTGCQPCYGIPEHV---KPLLQRPSV 647
Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADAL--ADDEEYEEILEDMREECGKYGTLVNVVIP 462
VL + AD L + + +E+LED+R EC ++GT+ +V
Sbjct: 648 ----------------VLKINNVFNADVLPVLSESDIDEVLEDIRFECARFGTVKSVNFV 691
Query: 463 RP 464
+P
Sbjct: 692 KP 693
>gi|432858814|ref|XP_004068952.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Oryzias latipes]
Length = 502
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 113/270 (41%), Gaps = 52/270 (19%)
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF + D
Sbjct: 220 GLTGPMRLYVGSLHFNITEDMLRGIFEPFGRIENIQLMMDSETGRSKGYGFITFSDAECA 279
Query: 350 DIACAALNGLKMGDKTLTVRRAT-----ASGQSKTEQE---------------SILAQAQ 389
A LNG ++ + + V T +S S + + ++A+
Sbjct: 280 KKALEQLNGFELAGRPMKVGHVTERTDPSSAPSILDNDELERSGIDLGTTGRLQLMARLA 339
Query: 390 QHIAIQ-----KMALQTS-------------------GMNTLGGGMSLFGETLAKVLCLT 425
+ +Q + ALQ S +N G ++L + LA
Sbjct: 340 EGTGLQIPPAAQQALQMSGAIAIGAMAAVSAAMNPSLNVNMNSGALNLPSQPLATHCFQL 399
Query: 426 EAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
+ + + +E +I D+ EEC K+G +V++ + D+N E G V+++
Sbjct: 400 SNMFNPSSENTFGWEVDIQRDVIEECNKHGGVVHIYV---DKNSAE----GNVYVKCPSI 452
Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+A NAL GR FGG + A Y P Y
Sbjct: 453 PAAMSAVNALHGRFFGGKMITAAYVPLPTY 482
>gi|426241410|ref|XP_004014584.1| PREDICTED: RNA-binding protein 39 isoform 5 [Ovis aries]
Length = 530
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 200 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 242
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 243 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 302
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 303 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 362
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
++A+ + +Q + ALQ SG G L +
Sbjct: 363 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 422
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 423 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 475
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 476 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513
>gi|426241404|ref|XP_004014581.1| PREDICTED: RNA-binding protein 39 isoform 2 [Ovis aries]
Length = 524
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 416
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 417 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 469
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 470 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 507
>gi|431894348|gb|ELK04148.1| RNA-binding protein 39 [Pteropus alecto]
Length = 601
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 271 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 313
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 314 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 373
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 374 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 433
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
++A+ + +Q + ALQ SG G L +
Sbjct: 434 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 493
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 494 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 546
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 547 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 584
>gi|332249061|ref|XP_003273679.1| PREDICTED: RNA-binding protein 39 [Nomascus leucogenys]
Length = 432
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 118/287 (41%), Gaps = 50/287 (17%)
Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
SP A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +T
Sbjct: 136 SPAEKNRAAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSET 195
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK---------- 378
G SKGYGF + D A LNG ++ + + V R AS S
Sbjct: 196 GRSKGYGFITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERT 255
Query: 379 ------TEQESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGE 416
T + ++A+ + +Q + ALQ SG G L
Sbjct: 256 GIDLGTTGRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALA 315
Query: 417 TLAKVLCL-TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGG 469
A V L T+ + + + E EI +D+ EEC K+G ++++ + D+N
Sbjct: 316 AAASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSA 372
Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ G V+++ A NAL GR F G + A Y P Y N
Sbjct: 373 Q----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 415
>gi|417411216|gb|JAA52053.1| Putative transcriptional coactivator caper rrm superfamily, partial
[Desmodus rotundus]
Length = 499
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 163 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 205
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 206 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 265
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 266 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 325
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 326 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 385
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 386 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 440
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 441 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 482
>gi|296199709|ref|XP_002747282.1| PREDICTED: RNA-binding protein 39 isoform 5 [Callithrix jacchus]
Length = 504
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 168 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 210
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 211 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 270
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 271 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 330
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 331 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 390
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 391 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 445
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 446 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 487
>gi|194386036|dbj|BAG59582.1| unnamed protein product [Homo sapiens]
Length = 503
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 167 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 209
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 210 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 269
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 270 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 329
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 330 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 389
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 390 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 444
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 445 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 486
>gi|194384132|dbj|BAG64839.1| unnamed protein product [Homo sapiens]
Length = 521
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 185 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 227
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 228 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 287
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 288 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 347
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 348 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 407
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 408 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 462
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 463 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 504
>gi|336176066|ref|NP_001229529.1| RNA-binding protein 39 isoform d [Homo sapiens]
gi|73991836|ref|XP_865202.1| PREDICTED: RNA-binding protein 39 isoform 16 [Canis lupus
familiaris]
gi|296199705|ref|XP_002747280.1| PREDICTED: RNA-binding protein 39 isoform 3 [Callithrix jacchus]
gi|332858230|ref|XP_003316933.1| PREDICTED: uncharacterized protein LOC458443 isoform 3 [Pan
troglodytes]
gi|335304749|ref|XP_003360015.1| PREDICTED: RNA-binding protein 39 [Sus scrofa]
gi|338719245|ref|XP_003363967.1| PREDICTED: RNA-binding protein 39 [Equus caballus]
gi|426391513|ref|XP_004062117.1| PREDICTED: RNA-binding protein 39 isoform 4 [Gorilla gorilla
gorilla]
Length = 502
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 172 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 214
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 215 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 274
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 275 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 334
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
++A+ + +Q + ALQ SG G L +
Sbjct: 335 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 394
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 395 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 447
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 448 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 485
>gi|194386804|dbj|BAG61212.1| unnamed protein product [Homo sapiens]
Length = 502
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 172 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 214
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 215 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 274
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 275 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 334
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
++A+ + +Q + ALQ SG G L +
Sbjct: 335 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 394
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 395 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 447
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 448 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 485
>gi|391336770|ref|XP_003742751.1| PREDICTED: RNA-binding protein 39-like [Metaseiulus occidentalis]
Length = 520
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 134/330 (40%), Gaps = 58/330 (17%)
Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP 272
+V+ K A+VE +E AM L+G GV + V+ P Q
Sbjct: 207 IVDNKTRKSKGIAYVEFFDLESVPLAMGLNGQKLFGVPIIVQ--------------PTQA 252
Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
N A+ +GP R++VG L + +E +KE+ E FG L +L+K+ DT
Sbjct: 253 ERNRQ------ANQTAASTKGPMRLYVGSLHFDISEQMLKEIFEPFGRLDRVELIKE-DT 305
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ-----SKTEQE----- 382
G SKGYGF + + A LNG ++ + + V T G S + E
Sbjct: 306 GKSKGYGFVTFHEADAAKKAMEQLNGFELAGRPMKVGNVTERGMDGSAPSILDNEELDRT 365
Query: 383 ----------SILAQAQQHIAIQ-----KMAL-QTSGMNTLGGGMSLFGETLA-KVLCLT 425
+++A+ + IQ K AL Q + G + E++A + L+
Sbjct: 366 GIELGAHGRLALMAKLAEGTGIQLPDAAKTALQQMQSAPSFGQTNNAQQESIATQCFLLS 425
Query: 426 EAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
A +++++ E+ ED+ +EC K+G V+ + + N V+++
Sbjct: 426 NMFDAAEAHQEKDWDLELREDVLQECRKHGGAVHCFVDKEAAN---------VYVKCPSI 476
Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A L GR F G + A Y P Y
Sbjct: 477 ATAVAAVGVLHGRFFAGRVITAAYVPVMTY 506
>gi|281346065|gb|EFB21649.1| hypothetical protein PANDA_004543 [Ailuropoda melanoleuca]
Length = 497
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 161 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 203
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 204 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 263
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 264 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 323
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 324 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 383
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 384 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 438
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 439 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 480
>gi|52545994|emb|CAH18281.2| hypothetical protein [Homo sapiens]
Length = 513
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 177 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 219
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 220 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 279
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 280 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 339
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 340 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 399
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 400 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 454
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 455 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 496
>gi|344298613|ref|XP_003420986.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Loxodonta
africana]
Length = 434
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 121/318 (38%), Gaps = 62/318 (19%)
Query: 61 RDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
RD DR + +R +ER+HRHRSRS R S SRS + + +PP
Sbjct: 66 RDRDRHRQRNSLSRSRERQHRHRSRSWDHQRS-------SESRSWDRRREDRVRYRSPPL 118
Query: 121 AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVG 180
A S V E N+ P R AR V+
Sbjct: 119 ATGRRYGHSKSPHFREKSPVREPVDNLSP--------------------EERDARTVFCM 158
Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI------NHEKKFAFVEMRTVEE 234
L + + FFS A+G V +V I K A+VE ++
Sbjct: 159 QLAARIRPRDLEDFFS----AVG--------KVRDVRIISDRNSRRSKGIAYVEFCEIQS 206
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
A+ L G GV + V+ LAA +A+ G+ GP
Sbjct: 207 VPLAIGLTGQWLLGVPIIVQASQAEKNRLAA-----------------MANNLQKGSGGP 249
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
R++VG L + TE ++ + E FG + L KD DTG+SKGYGF + D A
Sbjct: 250 MRLYVGSLHFNITEDMLRGIFEPFGKIDDILLTKDSDTGHSKGYGFITFSDSECARRALE 309
Query: 355 ALNGLKMGDKTLTVRRAT 372
LNG ++ + + V AT
Sbjct: 310 QLNGFELAGRPMRVGHAT 327
>gi|410930137|ref|XP_003978455.1| PREDICTED: serine/threonine-protein kinase Kist-like [Takifugu
rubripes]
Length = 433
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 58/94 (61%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L I L +DEEYE+I++DM+EEC KYG++V+++IPR E PG G+VF+E
Sbjct: 336 VLRLINLIDDSHLNNDEEYEDIMDDMKEECQKYGSVVSLLIPR------ENPGKGQVFVE 389
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y ++ A+ L GR F G V A +YP Y
Sbjct: 390 YANSGDSKEAQRLLMGRTFDGKFVVATFYPSSAY 423
>gi|301088364|ref|XP_002996880.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110854|gb|EEY68906.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 96
Score = 81.6 bits (200), Expect = 8e-13, Method: Composition-based stats.
Identities = 37/91 (40%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
+ ++ D L DDEEY ++ ED+ EEC ++G + + IPRP ++G E PG+G +++ +
Sbjct: 1 MANMVSIDELRDDEEYADLAEDVEEECKRFGGVTGMEIPRP-KDGEEVPGLGCIYVRFGK 59
Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+A AL+GRKFGGN V Y+P DK+
Sbjct: 60 EEDAVSALKALNGRKFGGNIVKVTYFPVDKF 90
>gi|17063213|gb|AAL32373.1| transcription coactivator CAPER [Mus musculus]
Length = 530
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 235 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 294
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 295 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 354
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
+ ++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 355 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 414
Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 415 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 471
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 472 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513
>gi|334349754|ref|XP_001379564.2| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Monodelphis
domestica]
Length = 348
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 104/251 (41%), Gaps = 52/251 (20%)
Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
RSP K K R +D+ PP + P Q + +A A +LP A
Sbjct: 78 RSPRHEKKKKIRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133
Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
PV V+ Q TR ARR+YVG +P E +A +++ A+ G A
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEGQVAISAARLPAALPGPPA------------ 181
Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
+ + + G +A +++ P + P
Sbjct: 182 ------------CRPRAGNLGVGGAGPRPLAAQLKWPFAFPPA----------------- 212
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
G+ S + + ++F+GGLP Y + Q+KELL SFG L F+LVKD TG SKGY
Sbjct: 213 --GVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 268
Query: 340 FCVYQDPAVTD 350
FC Y D VTD
Sbjct: 269 FCEYVDINVTD 279
>gi|344298617|ref|XP_003420988.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Loxodonta
africana]
Length = 416
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 120/318 (37%), Gaps = 80/318 (25%)
Query: 61 RDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
RD DR + +R +ER+HRHRSRS R S SRS + + +PP
Sbjct: 66 RDRDRHRQRNSLSRSRERQHRHRSRSWDHQRS-------SESRSWDRRREDRVRYRSPPL 118
Query: 121 AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVG 180
A P V + PE R AR V+
Sbjct: 119 ATGEP----------VDNLSPE----------------------------ERDARTVFCM 140
Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI------NHEKKFAFVEMRTVEE 234
L + + FFS A+G V +V I K A+VE ++
Sbjct: 141 QLAARIRPRDLEDFFS----AVG--------KVRDVRIISDRNSRRSKGIAYVEFCEIQS 188
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
A+ L G GV + V+ LAA +A+ G+ GP
Sbjct: 189 VPLAIGLTGQWLLGVPIIVQASQAEKNRLAA-----------------MANNLQKGSGGP 231
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
R++VG L + TE ++ + E FG + L KD DTG+SKGYGF + D A
Sbjct: 232 MRLYVGSLHFNITEDMLRGIFEPFGKIDDILLTKDSDTGHSKGYGFITFSDSECARRALE 291
Query: 355 ALNGLKMGDKTLTVRRAT 372
LNG ++ + + V AT
Sbjct: 292 QLNGFELAGRPMRVGHAT 309
>gi|344298615|ref|XP_003420987.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Loxodonta
africana]
Length = 450
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 121/318 (38%), Gaps = 62/318 (19%)
Query: 61 RDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
RD DR + +R +ER+HRHRSRS R S SRS + + +PP
Sbjct: 82 RDRDRHRQRNSLSRSRERQHRHRSRSWDHQRS-------SESRSWDRRREDRVRYRSPPL 134
Query: 121 AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVG 180
A S V E N+ P R AR V+
Sbjct: 135 ATGRRYGHSKSPHFREKSPVREPVDNLSP--------------------EERDARTVFCM 174
Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI------NHEKKFAFVEMRTVEE 234
L + + FFS A+G V +V I K A+VE ++
Sbjct: 175 QLAARIRPRDLEDFFS----AVG--------KVRDVRIISDRNSRRSKGIAYVEFCEIQS 222
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
A+ L G GV + V+ LAA +A+ G+ GP
Sbjct: 223 VPLAIGLTGQWLLGVPIIVQASQAEKNRLAA-----------------MANNLQKGSGGP 265
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
R++VG L + TE ++ + E FG + L KD DTG+SKGYGF + D A
Sbjct: 266 MRLYVGSLHFNITEDMLRGIFEPFGKIDDILLTKDSDTGHSKGYGFITFSDSECARRALE 325
Query: 355 ALNGLKMGDKTLTVRRAT 372
LNG ++ + + V AT
Sbjct: 326 QLNGFELAGRPMRVGHAT 343
>gi|432855875|ref|XP_004068316.1| PREDICTED: serine/threonine-protein kinase Kist-like [Oryzias
latipes]
Length = 435
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L I L ++EEYE+I+EDM+EEC KYG++V+++IP+ E PG G+VF+E
Sbjct: 338 VLRLLNLIDDSHLHNEEEYEDIMEDMKEECQKYGSVVSLLIPK------ENPGKGQVFVE 391
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y ++ A+ L+GR F G V A +YP Y
Sbjct: 392 YANSSDSKEAQRLLTGRTFDGKFVVATFYPLSAY 425
>gi|118403314|ref|NP_573505.2| RNA-binding protein 39 [Mus musculus]
gi|392346872|ref|XP_003749653.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Rattus
norvegicus]
gi|341941811|sp|Q8VH51.2|RBM39_MOUSE RecName: Full=RNA-binding protein 39; AltName: Full=Coactivator of
activating protein 1 and estrogen receptors;
Short=Coactivator of AP-1 and ERs; AltName:
Full=RNA-binding motif protein 39; AltName:
Full=RNA-binding region-containing protein 2; AltName:
Full=Transcription coactivator CAPER
gi|55991480|gb|AAH86645.1| RNA binding motif protein 39 [Mus musculus]
gi|74151058|dbj|BAE27657.1| unnamed protein product [Mus musculus]
gi|148674237|gb|EDL06184.1| RNA binding motif protein 39, isoform CRA_b [Mus musculus]
Length = 530
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 416
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 417 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 471
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 472 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513
>gi|35493811|ref|NP_909122.1| RNA-binding protein 39 isoform a [Homo sapiens]
gi|281182530|ref|NP_001162566.1| RNA-binding protein 39 [Papio anubis]
gi|284004921|ref|NP_001164806.1| RNA-binding protein 39 [Oryctolagus cuniculus]
gi|149733225|ref|XP_001501869.1| PREDICTED: RNA-binding protein 39 isoform 1 [Equus caballus]
gi|296199703|ref|XP_002747279.1| PREDICTED: RNA-binding protein 39 isoform 2 [Callithrix jacchus]
gi|332858224|ref|XP_003316931.1| PREDICTED: uncharacterized protein LOC458443 isoform 1 [Pan
troglodytes]
gi|335304742|ref|XP_003360012.1| PREDICTED: RNA-binding protein 39 [Sus scrofa]
gi|344279919|ref|XP_003411733.1| PREDICTED: RNA-binding protein 39 isoform 1 [Loxodonta africana]
gi|345789986|ref|XP_864959.2| PREDICTED: RNA-binding protein 39 isoform 3 [Canis lupus
familiaris]
gi|354477982|ref|XP_003501196.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Cricetulus
griseus]
gi|397523808|ref|XP_003831910.1| PREDICTED: RNA-binding protein 39 [Pan paniscus]
gi|426391507|ref|XP_004062114.1| PREDICTED: RNA-binding protein 39 isoform 1 [Gorilla gorilla
gorilla]
gi|28201880|sp|Q14498.2|RBM39_HUMAN RecName: Full=RNA-binding protein 39; AltName: Full=Hepatocellular
carcinoma protein 1; AltName: Full=RNA-binding motif
protein 39; AltName: Full=RNA-binding region-containing
protein 2; AltName: Full=Splicing factor HCC1
gi|405194|gb|AAA16347.1| splicing factor [Homo sapiens]
gi|119596565|gb|EAW76159.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_a [Homo
sapiens]
gi|119596567|gb|EAW76161.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_a [Homo
sapiens]
gi|146327034|gb|AAI41836.1| RNA binding motif protein 39 [Homo sapiens]
gi|164623752|gb|ABY64678.1| RNA binding motif protein 39, isoform 1 (predicted) [Papio anubis]
gi|165971473|gb|AAI58173.1| RNA binding motif protein 39 [Homo sapiens]
gi|166831598|gb|ABY90123.1| RNA binding motif protein 39 isoform a (predicted) [Callithrix
jacchus]
gi|169731519|gb|ACA64891.1| RNA binding motif protein 39 isoform a (predicted) [Callicebus
moloch]
gi|197215647|gb|ACH53039.1| RNA binding motif protein 39 isoform a (predicted) [Otolemur
garnettii]
gi|217038339|gb|ACJ76632.1| RNA binding motif protein 39 isoform a (predicted) [Oryctolagus
cuniculus]
gi|229368730|gb|ACQ63013.1| RNA binding motif protein 39 isoform a (predicted) [Dasypus
novemcinctus]
gi|351702535|gb|EHB05454.1| RNA-binding protein 39 [Heterocephalus glaber]
gi|380783277|gb|AFE63514.1| RNA-binding protein 39 isoform a [Macaca mulatta]
gi|383408125|gb|AFH27276.1| RNA-binding protein 39 isoform a [Macaca mulatta]
gi|384939254|gb|AFI33232.1| RNA-binding protein 39 isoform a [Macaca mulatta]
gi|410218746|gb|JAA06592.1| RNA binding motif protein 39 [Pan troglodytes]
gi|410255434|gb|JAA15684.1| RNA binding motif protein 39 [Pan troglodytes]
gi|410292900|gb|JAA25050.1| RNA binding motif protein 39 [Pan troglodytes]
gi|410292904|gb|JAA25052.1| RNA binding motif protein 39 [Pan troglodytes]
gi|410350859|gb|JAA42033.1| RNA binding motif protein 39 [Pan troglodytes]
gi|440902514|gb|ELR53299.1| RNA-binding protein 39 [Bos grunniens mutus]
Length = 530
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 416
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 417 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 471
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 472 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513
>gi|355784531|gb|EHH65382.1| RNA-binding motif protein 39 [Macaca fascicularis]
Length = 530
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 235 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 294
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 295 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 354
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
+ ++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 355 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 414
Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 415 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 471
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 472 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513
>gi|432858816|ref|XP_004068953.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Oryzias latipes]
Length = 529
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 113/270 (41%), Gaps = 52/270 (19%)
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF + D
Sbjct: 247 GLTGPMRLYVGSLHFNITEDMLRGIFEPFGRIENIQLMMDSETGRSKGYGFITFSDAECA 306
Query: 350 DIACAALNGLKMGDKTLTVRRAT-----ASGQSKTEQE---------------SILAQAQ 389
A LNG ++ + + V T +S S + + ++A+
Sbjct: 307 KKALEQLNGFELAGRPMKVGHVTERTDPSSAPSILDNDELERSGIDLGTTGRLQLMARLA 366
Query: 390 QHIAIQ-----KMALQTS-------------------GMNTLGGGMSLFGETLAKVLCLT 425
+ +Q + ALQ S +N G ++L + LA
Sbjct: 367 EGTGLQIPPAAQQALQMSGAIAIGAMAAVSAAMNPSLNVNMNSGALNLPSQPLATHCFQL 426
Query: 426 EAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
+ + + +E +I D+ EEC K+G +V++ + D+N E G V+++
Sbjct: 427 SNMFNPSSENTFGWEVDIQRDVIEECNKHGGVVHIYV---DKNSAE----GNVYVKCPSI 479
Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+A NAL GR FGG + A Y P Y
Sbjct: 480 PAAMSAVNALHGRFFGGKMITAAYVPLPTY 509
>gi|427792527|gb|JAA61715.1| Putative transcriptional coactivator caper rrm superfamily, partial
[Rhipicephalus pulchellus]
Length = 497
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 130/328 (39%), Gaps = 58/328 (17%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE + VE AM L+G G+ + V+ PT AAA + L V
Sbjct: 179 KGIAYVEFQDVESVPLAMGLNGQKLFGIPIVVQ-PTQAERNRAAAQNASTSNSTLQRGNV 237
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G P R++VG L + TE +K + E FG + +L+KD +T SKGYGF
Sbjct: 238 G-----------PMRLYVGSLHFNITEEMLKGIFEPFGKIDKIELIKDMETNRSKGYGFI 286
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQ 390
+ D A LNG ++ + + V R + S + E + L +
Sbjct: 287 TFHDSEDAKKALEQLNGFELAGRPMKVGHVTERTDVSQAPSFLDSEELDRSGIDLGATGR 346
Query: 391 HIAIQKMA------LQTSGMNTLGGGMSLF-------------GETLAKVLCLTEAITAD 431
+ K+A + + +N L ++ T+A C + D
Sbjct: 347 LQLMAKLAEGTGFQIPQAAVNALQMNPAVLPGQPQAAAVAAAAAPTIA-TQCFLLSNMFD 405
Query: 432 ALAD-----DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
L + DEE I D+ EEC K+G ++V + R G V+++
Sbjct: 406 PLTETNPSWDEE---IRRDVIEECRKHGGALHVYVDRASPE-------GHVYVKCPTIAS 455
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ NAL GR F G + A Y P Y
Sbjct: 456 AVASVNALHGRWFAGRIITAAYVPVMSY 483
>gi|427794973|gb|JAA62938.1| Putative transcriptional coactivator caper rrm superfamily, partial
[Rhipicephalus pulchellus]
Length = 509
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 130/328 (39%), Gaps = 58/328 (17%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE + VE AM L+G G+ + V+ PT AAA + L V
Sbjct: 191 KGIAYVEFQDVESVPLAMGLNGQKLFGIPIVVQ-PTQAERNRAAAQNASTSNSTLQRGNV 249
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G P R++VG L + TE +K + E FG + +L+KD +T SKGYGF
Sbjct: 250 G-----------PMRLYVGSLHFNITEEMLKGIFEPFGKIDKIELIKDMETNRSKGYGFI 298
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQ 390
+ D A LNG ++ + + V R + S + E + L +
Sbjct: 299 TFHDSEDAKKALEQLNGFELAGRPMKVGHVTERTDVSQAPSFLDSEELDRSGIDLGATGR 358
Query: 391 HIAIQKMA------LQTSGMNTLGGGMSLF-------------GETLAKVLCLTEAITAD 431
+ K+A + + +N L ++ T+A C + D
Sbjct: 359 LQLMAKLAEGTGFQIPQAAVNALQMNPAVLPGQPQAAAVAAAAAPTIA-TQCFLLSNMFD 417
Query: 432 ALAD-----DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
L + DEE I D+ EEC K+G ++V + R G V+++
Sbjct: 418 PLTETNPSWDEE---IRRDVIEECRKHGGALHVYVDRASPE-------GHVYVKCPTIAS 467
Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ NAL GR F G + A Y P Y
Sbjct: 468 AVASVNALHGRWFAGRIITAAYVPVMSY 495
>gi|326924922|ref|XP_003208671.1| PREDICTED: serine/threonine-protein kinase Kist-like [Meleagris
gallopavo]
Length = 550
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L ++ +L +EEYE+ILED+REEC KYG +V+++IP+ E PG G+VF+E
Sbjct: 453 VLRLLNVLSDASLQSEEEYEDILEDIREECQKYGPVVSLLIPK------ENPGKGQVFVE 506
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+G+ F G V A +YP Y
Sbjct: 507 YANAGDSKAAQKMLTGKIFDGKFVVATFYPLSAY 540
>gi|193785136|dbj|BAG54289.1| unnamed protein product [Homo sapiens]
Length = 506
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 170 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 212
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 213 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 272
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 273 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 332
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 333 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 392
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 393 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 447
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 448 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 489
>gi|194758325|ref|XP_001961412.1| GF14957 [Drosophila ananassae]
gi|190615109|gb|EDV30633.1| GF14957 [Drosophila ananassae]
Length = 594
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/370 (22%), Positives = 130/370 (35%), Gaps = 59/370 (15%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS V G V + N K+F A++E
Sbjct: 234 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLILCNKTKRFKGIAYIE 284
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L A QP +
Sbjct: 285 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 332
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 333 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 388
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTL 407
A LNG ++ + + V T T + I + LQ
Sbjct: 389 AKKALEQLNGFELAGRLMKVGNVTERLDMNTSSLDTDEMDRTGIDLGATGRLQLMFKLAE 448
Query: 408 GGGMSL-----------------------FGETLAKVLCLTEAITADALADDEEYEEILE 444
G G+++ + L+ + EI +
Sbjct: 449 GAGLAVPQAAANALLATAPQPAPVQQQQQAPSIATQCFILSNMFDPRTETNPTWDAEIRD 508
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D+ EEC K+G ++++ + G V+++ A NAL GR F G +
Sbjct: 509 DVLEECAKHGGVLHIHV-------DTVSPTGTVYVKCPSTTTAVLAVNALHGRWFAGRVI 561
Query: 505 NAFYYPEDKY 514
A Y P Y
Sbjct: 562 TAAYVPLVNY 571
>gi|215820612|ref|NP_001135965.1| RNA binding motif protein 39 [Acyrthosiphon pisum]
Length = 501
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 150/389 (38%), Gaps = 65/389 (16%)
Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
+P M R AR V+ L + + FFS V G V + N
Sbjct: 126 VPPASMLTPEERDARTVFCMQLSKTIRARDLEEFFSSV---------GKVRDVRMITCNK 176
Query: 221 EKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
++F A++E + E AM L+G GV + V+ PT A PN+
Sbjct: 177 TRRFKGIAYIEFKDPESVPLAMGLNGQKLLGVPIVVQ------PTQAEKNRMANSMPNM- 229
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
GP +++VG L Y TE ++ + E FG + L+ D +TG SKG
Sbjct: 230 ---------VQRTHYGPMKLYVGSLHYNITEEMLRGIFEPFGHVDNIQLMMDTETGRSKG 280
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA--SGQSKTEQE-SILAQAQQHIA- 393
YGF Y++ A LNG ++ + + V T S KT E L +A +
Sbjct: 281 YGFLTYRNAEDAKKALEHLNGFEIAGRPMKVGHVTENHSVYDKTAFEVDELDRAGYDLGA 340
Query: 394 ---IQKM-----------------ALQT-SGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
+Q M ALQ SG+ ++ C A D
Sbjct: 341 TGRLQLMYKLAEGTGFPIPQAAANALQVASGVQAAPAAPTVQVTPPIATQCFLLANMFDP 400
Query: 433 LADDEE----YE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
+D + +E EI +D+ EEC K+G +++V + + G V+++
Sbjct: 401 NKEDVDSNTTWETEIRDDVIEECNKHGGVLHVYVDKASPQGN-------VYVKCTTIETA 453
Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
+ AL GR FGG + A Y P Y N
Sbjct: 454 LASVAALHGRWFGGRVITAAYVPVTNYHN 482
>gi|350410158|ref|XP_003488966.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Bombus impatiens]
Length = 532
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 149/396 (37%), Gaps = 64/396 (16%)
Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
LPFG G PL R AR V+ L + + FFS S
Sbjct: 149 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 196
Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
G V + N ++F A+VE + E + A+ L G GV + V+ T A
Sbjct: 197 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 250
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
G PNL G GP R++VG L + TE ++ + E FG +
Sbjct: 251 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 300
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
L+ D +TG SKGYGF +++ A LNG ++ + + V R G S
Sbjct: 301 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 360
Query: 380 EQESI------LAQAQQHIAIQKMALQT---------------SGMNTLGGGMSLFGETL 418
+ + + L + + K+A T M+T
Sbjct: 361 DTDELDRSGIDLGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIA 420
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+ L+ + +EI +D+ EEC K+G +++V + DQ + G V+
Sbjct: 421 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 473
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ A N+L GR F G + A Y P Y
Sbjct: 474 VKCPSIATAVAAVNSLHGRWFAGRVITAAYVPVVNY 509
>gi|426241406|ref|XP_004014582.1| PREDICTED: RNA-binding protein 39 isoform 3 [Ovis aries]
Length = 502
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 172 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 214
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 215 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 274
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 275 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 334
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
++A+ + +Q + ALQ SG G L +
Sbjct: 335 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 394
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 395 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 447
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 448 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 485
>gi|270007747|gb|EFA04195.1| hypothetical protein TcasGA2_TC014444 [Tribolium castaneum]
Length = 522
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/400 (23%), Positives = 159/400 (39%), Gaps = 67/400 (16%)
Query: 149 PFGA---TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN 205
PFG + LG PV+ ++ + R AR V+V L + + FFS
Sbjct: 136 PFGRRNRSPLGLRSNSPVEELSPE-ERDARTVFVMQLSQRIRARDLEEFFS--------- 185
Query: 206 SAGPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT 262
S G V + N ++F A++E + E + A+ L G GV + V+ T
Sbjct: 186 SVGKVRDVRLIVCNKTRRFKGIAYIEFKDPESVTLALGLSGQKLLGVPIIVQH------T 239
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
A G PNL GP R++VG L + TE ++ + E FG +
Sbjct: 240 QAEKNRMGNSMPNL----------MPKNMTGPMRLYVGSLHFNITEDMLRSIFEPFGKID 289
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
L+ D +TG SKGYGF +++ A LNG ++ + + V T + +
Sbjct: 290 NIQLIMDPETGRSKGYGFIAFRNCEDAKKALEQLNGFELAGRPMKVGNVTERLDLQQQGP 349
Query: 383 SILAQ-----------AQQHIAIQKMALQTSGMN---------TLGGGMSLFGET----- 417
SIL A + + + +GM ++ G + +
Sbjct: 350 SILDSDELDRSGIDLGATGRLQLMFKLAEGAGMQVPQAAANALSIATGQPVVPQVQTNST 409
Query: 418 --LAKVLCLTEAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
+A + + A + ++ EI +D+ EEC K+G +++V + D+ +
Sbjct: 410 PPIATQCFMLSNMFDPATESTQTWDVEIRDDVIEECNKHGGVLHVYV---DKGSPQ---- 462
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
G V+++ + N+L GR F G + A Y P Y
Sbjct: 463 GNVYVKCPSIATAVASVNSLHGRWFAGRVITAAYVPLLNY 502
>gi|426241402|ref|XP_004014580.1| PREDICTED: RNA-binding protein 39 isoform 1 [Ovis aries]
Length = 530
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 297 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 416
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 417 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 471
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 472 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513
>gi|428182175|gb|EKX51036.1| hypothetical protein GUITHDRAFT_134574 [Guillardia theta CCMP2712]
Length = 458
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/414 (23%), Positives = 169/414 (40%), Gaps = 74/414 (17%)
Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
G T + L+ + M + R+ R+YVG L E + F GP
Sbjct: 75 LGLTVIAQLALVTIAGMVKPPQRN--RLYVGSLHFDLKEADVRAIFQPF---------GP 123
Query: 210 GDAVVNVY---INHEKKFAFVE-MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA 265
+ Y K +AF+E M + + A+DG + G ++V RP +N A
Sbjct: 124 IKTIEMSYEPTTGKSKGYAFIEYMNDAQADACEKAMDGFMIAGRPIKVGRP--HNTVSAN 181
Query: 266 A----------------LGPGQPSPNLNLAAVGLASGAIGG----AEGPDRVFVGGLPYY 305
A L P PS A + A A P R+++G + +
Sbjct: 182 APVHRRLFFLLNFSSVDLQPWPPSLPQQAALAAQKAQAQPLNTPVAGPPARIYIGSVLFD 241
Query: 306 FTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ--DPAVTDIACAALNGLKMG- 362
E+++K++ + FG++ ++ + + G KGYGF Y+ D AV A A+NG ++
Sbjct: 242 VKESEVKQIFQVFGSIKQISMIPNPENGKHKGYGFIEYEKHDDAVQ--AIQAMNGFQLAG 299
Query: 363 -----DKT----------LTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
DKT + T S + + + + ++++++ + + M L
Sbjct: 300 RPLKEDKTSNPIIVAAANAIADKVTTSLVTSSSNDITTVEDEENLSVSSVLQRKEIMCKL 359
Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE-DMREECGKYGTLVNVVIPRPDQ 466
S +V+ L + + E+ + +LE ++ EEC K+G + V+I +
Sbjct: 360 ANRPS-------RVVLLKN------MVEPEDVDPLLEQEIAEECSKFGKVNKVLIVTMVE 406
Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
G + KVF+E+ D A L R FGG VNA Y E+++ +D S
Sbjct: 407 QGSR---LVKVFVEFGDQEAATKAVARLDKRWFGGKIVNASTYEEERFVRQDLS 457
>gi|384252120|gb|EIE25597.1| splicing factor, CC1-like protein [Coccomyxa subellipsoidea C-169]
Length = 497
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/346 (27%), Positives = 142/346 (41%), Gaps = 70/346 (20%)
Query: 34 DRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDR-----HRDYNRDKERRHRHRSRSHS 88
D H+ K DR RD++ +RE RD DR + HRD + D+ER+ H S H
Sbjct: 25 DGTHKREKREKKDRTRDRDS--ERERTRDQDRDRKSSKREHRDKSPDRERKRHHSSHDHH 82
Query: 89 SDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNML 148
R+R S P RS K + R+ P V E +
Sbjct: 83 RSE-RDRKHSSRP-RSLEKRRERT------------------------PPEVREQREK-- 114
Query: 149 PFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG 208
+ ++ R R V+ LP A E+ + FFS+ AG
Sbjct: 115 ---------------ERELKELDRDIRTVFAYNLPLKAEERDLFEFFSK---------AG 150
Query: 209 PGDAVVNVYINHEKK---FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA 265
P + V + + +K FA++E + AMAL G I G AV V+ ++ LA
Sbjct: 151 PIEDVKIIMDRNTRKSKGFAYIEYTNKADIVTAMALTGQILMGQAVMVK-SSEAEKNLAW 209
Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
Q + L ++ +G A GP ++++G L E +K++ E+FG +
Sbjct: 210 EAAQAQNASMLQMSTIGNA------GTGPCKLYIGNLHPNIQEQDLKQVFEAFGAVEYIT 263
Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
L KD TG S+GYGF YQ A L+GL + ++V+ A
Sbjct: 264 LQKD-PTGRSQGYGFVQYQTTPDATKAMQQLDGLDIAGSQISVKIA 308
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 36/69 (52%), Gaps = 8/69 (11%)
Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
+EI D+ EEC KYG + + + D+N G V+L++ G A A+ AL GR F
Sbjct: 425 QEIATDVTEECSKYGPVSHTHV---DKNSK-----GFVYLKFVTVEGSAAAQKALHGRWF 476
Query: 500 GGNTVNAFY 508
G V A +
Sbjct: 477 AGRQVVAEF 485
>gi|115495227|ref|NP_001070127.1| serine/threonine-protein kinase Kist [Danio rerio]
gi|115313546|gb|AAI24287.1| Zgc:153241 [Danio rerio]
gi|182890586|gb|AAI64787.1| Zgc:153241 protein [Danio rerio]
Length = 410
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/94 (44%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L I L +++EYE+I+EDM+EEC KYGT+V+++IP+ E PG G+VF+E
Sbjct: 313 VLRLLNVIDDSHLYNEDEYEDIIEDMKEECQKYGTVVSLLIPK------ENPGKGQVFVE 366
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 367 YANAGDSKEAQRLLTGRTFDGKFVVATFYPLGAY 400
>gi|350410161|ref|XP_003488967.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Bombus impatiens]
Length = 508
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 149/396 (37%), Gaps = 64/396 (16%)
Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
LPFG G PL R AR V+ L + + FFS S
Sbjct: 125 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 172
Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
G V + N ++F A+VE + E + A+ L G GV + V+ T A
Sbjct: 173 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 226
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
G PNL G GP R++VG L + TE ++ + E FG +
Sbjct: 227 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 276
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
L+ D +TG SKGYGF +++ A LNG ++ + + V R G S
Sbjct: 277 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 336
Query: 380 EQESI------LAQAQQHIAIQKMALQT---------------SGMNTLGGGMSLFGETL 418
+ + + L + + K+A T M+T
Sbjct: 337 DTDELDRSGIDLGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIA 396
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+ L+ + +EI +D+ EEC K+G +++V + DQ + G V+
Sbjct: 397 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 449
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ A N+L GR F G + A Y P Y
Sbjct: 450 VKCPSIATAVAAVNSLHGRWFAGRVITAAYVPVVNY 485
>gi|340718900|ref|XP_003397900.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Bombus
terrestris]
Length = 508
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 149/396 (37%), Gaps = 64/396 (16%)
Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
LPFG G PL R AR V+ L + + FFS S
Sbjct: 125 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 172
Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
G V + N ++F A+VE + E + A+ L G GV + V+ T A
Sbjct: 173 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 226
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
G PNL G GP R++VG L + TE ++ + E FG +
Sbjct: 227 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 276
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
L+ D +TG SKGYGF +++ A LNG ++ + + V R G S
Sbjct: 277 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 336
Query: 380 EQESI------LAQAQQHIAIQKMALQT---------------SGMNTLGGGMSLFGETL 418
+ + + L + + K+A T M+T
Sbjct: 337 DTDELDRSGIDLGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIA 396
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+ L+ + +EI +D+ EEC K+G +++V + DQ + G V+
Sbjct: 397 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 449
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ A N+L GR F G + A Y P Y
Sbjct: 450 VKCPSIATAVAAVNSLHGRWFAGRVITAAYVPVVNY 485
>gi|336176064|ref|NP_001229528.1| RNA-binding protein 39 isoform c [Homo sapiens]
gi|296199707|ref|XP_002747281.1| PREDICTED: RNA-binding protein 39 isoform 4 [Callithrix jacchus]
gi|332858228|ref|XP_003316932.1| PREDICTED: uncharacterized protein LOC458443 isoform 2 [Pan
troglodytes]
gi|335304745|ref|XP_003360013.1| PREDICTED: RNA-binding protein 39 [Sus scrofa]
gi|338719242|ref|XP_003363966.1| PREDICTED: RNA-binding protein 39 [Equus caballus]
gi|345789990|ref|XP_003433300.1| PREDICTED: RNA-binding protein 39 [Canis lupus familiaris]
gi|426391511|ref|XP_004062116.1| PREDICTED: RNA-binding protein 39 isoform 3 [Gorilla gorilla
gorilla]
gi|124297482|gb|AAI31544.1| RBM39 protein [Homo sapiens]
gi|194389138|dbj|BAG61586.1| unnamed protein product [Homo sapiens]
Length = 508
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 213 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 272
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 273 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 332
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
+ ++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 333 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 392
Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 393 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 449
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 450 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 491
>gi|340718898|ref|XP_003397899.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Bombus
terrestris]
Length = 520
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 149/396 (37%), Gaps = 64/396 (16%)
Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
LPFG G PL R AR V+ L + + FFS S
Sbjct: 137 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 184
Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
G V + N ++F A+VE + E + A+ L G GV + V+ T A
Sbjct: 185 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 238
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
G PNL G GP R++VG L + TE ++ + E FG +
Sbjct: 239 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 288
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
L+ D +TG SKGYGF +++ A LNG ++ + + V R G S
Sbjct: 289 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 348
Query: 380 EQESI------LAQAQQHIAIQKMALQT---------------SGMNTLGGGMSLFGETL 418
+ + + L + + K+A T M+T
Sbjct: 349 DTDELDRSGIDLGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIA 408
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+ L+ + +EI +D+ EEC K+G +++V + DQ + G V+
Sbjct: 409 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 461
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ A N+L GR F G + A Y P Y
Sbjct: 462 VKCPSIATAVAAVNSLHGRWFAGRVITAAYVPVVNY 497
>gi|383864352|ref|XP_003707643.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Megachile
rotundata]
Length = 530
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 142/372 (38%), Gaps = 61/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N ++F A+VE
Sbjct: 168 RDARTVFCMQLSQRIRARDLEDFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 218
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ E + A+ L G GV + V+ T A G PNL
Sbjct: 219 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 262
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF +++
Sbjct: 263 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 322
Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQHIAIQKM 397
A LNG ++ + + V R G S + + + L + + K+
Sbjct: 323 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQLMFKL 382
Query: 398 ALQT---------------SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
A T M+T + L+ + +EI
Sbjct: 383 AEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIATQCFMLSNMFDPQNETNPNWAKEI 442
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+ EEC K+G +++V + DQ + G V+++ A N+L GR F G
Sbjct: 443 RDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRWFAGR 495
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 496 VITAAYVPVVNY 507
>gi|383864354|ref|XP_003707644.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Megachile
rotundata]
Length = 507
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 142/372 (38%), Gaps = 61/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N ++F A+VE
Sbjct: 145 RDARTVFCMQLSQRIRARDLEDFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 195
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ E + A+ L G GV + V+ T A G PNL
Sbjct: 196 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 239
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF +++
Sbjct: 240 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 299
Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQHIAIQKM 397
A LNG ++ + + V R G S + + + L + + K+
Sbjct: 300 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQLMFKL 359
Query: 398 ALQT---------------SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
A T M+T + L+ + +EI
Sbjct: 360 AEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIATQCFMLSNMFDPQNETNPNWAKEI 419
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+ EEC K+G +++V + DQ + G V+++ A N+L GR F G
Sbjct: 420 RDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRWFAGR 472
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 473 VITAAYVPVVNY 484
>gi|410055060|ref|XP_003953766.1| PREDICTED: uncharacterized protein LOC458443 [Pan troglodytes]
gi|426391517|ref|XP_004062119.1| PREDICTED: RNA-binding protein 39 isoform 6 [Gorilla gorilla
gorilla]
gi|20988961|gb|AAH30493.1| Rbm39 protein [Mus musculus]
gi|34364789|emb|CAE45833.1| hypothetical protein [Homo sapiens]
gi|111598490|gb|AAH82607.1| Rbm39 protein [Mus musculus]
gi|119596570|gb|EAW76164.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_d [Homo
sapiens]
gi|149030835|gb|EDL85862.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_g [Rattus
norvegicus]
Length = 367
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 37 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 79
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 80 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 139
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 140 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 199
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 200 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 259
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 260 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 312
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 313 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 350
>gi|13278367|gb|AAH04000.1| Rbm39 protein [Mus musculus]
Length = 429
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 134 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 193
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 194 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 253
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
+ ++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 254 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 313
Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 314 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 370
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 371 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 412
>gi|328781105|ref|XP_624668.3| PREDICTED: RNA-binding protein 39-like [Apis mellifera]
Length = 506
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 148/396 (37%), Gaps = 64/396 (16%)
Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
LPFG G PL R AR V+ L + + FFS S
Sbjct: 123 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 170
Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
G V + N ++F A+VE + E + A+ L G GV + V+ T A
Sbjct: 171 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 224
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
G PNL G GP R++VG L + TE ++ + E FG +
Sbjct: 225 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 274
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
L+ D +TG SKGYGF +++ A LNG ++ + + V R G S
Sbjct: 275 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 334
Query: 380 EQESILAQAQQHIAIQKMAL---------------------QTSGMNTLGGGMSLFGETL 418
+ + + + A ++ L M+T +
Sbjct: 335 DTDELDRSGIELGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQVAPPIA 394
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+ L+ + +EI +D+ EEC K+G +++V + DQ + G V+
Sbjct: 395 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 447
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ A N+L GR F G + A Y P Y
Sbjct: 448 VKCPSIGTAVAAVNSLHGRWFAGRVITAAYVPVVNY 483
>gi|449509171|ref|XP_002189260.2| PREDICTED: serine/threonine-protein kinase Kist [Taeniopygia
guttata]
Length = 593
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L ++ +L +EEYE+ILED+REEC KYG +V+++IP+ E PG G+VF+E
Sbjct: 496 VLRLLNVLSDASLQCEEEYEDILEDIREECQKYGPVVSLLIPK------ENPGKGQVFVE 549
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+G+ F G V A +YP Y
Sbjct: 550 YANAGDSKAAQKMLTGKIFDGKFVVATFYPLSAY 583
>gi|426241408|ref|XP_004014583.1| PREDICTED: RNA-binding protein 39 isoform 4 [Ovis aries]
Length = 508
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 213 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 272
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 273 FITFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 332
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
+ ++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 333 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 392
Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 393 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 449
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 450 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 491
>gi|224110606|ref|XP_002315575.1| predicted protein [Populus trichocarpa]
gi|222864615|gb|EEF01746.1| predicted protein [Populus trichocarpa]
Length = 251
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 68/242 (28%), Positives = 105/242 (43%), Gaps = 31/242 (12%)
Query: 284 ASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
A AIG + P ++F+GG+ + + E+ +FG L + +D + + F
Sbjct: 27 AIDAIGDIVKDSPHKIFIGGISKVLSSKMLMEIASAFGPLKAYQFENSKDP--DEPFAFL 84
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK-TEQESILAQAQQHIAIQKMALQ 400
Y D +VT ACA LNG+K+G + +T +A + S ++ S Q QH K L
Sbjct: 85 EYADESVTFKACAGLNGMKLGGQVITAIQAVPNASSSGSDGNSQFGQISQH---AKALL- 140
Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALA--DDEEYEEILEDMREECGKYGTLVN 458
E +VL L +++L+ + E EE+LED+R EC +Y +
Sbjct: 141 ---------------EKPTEVLKLKNVFDSESLSSLSNTEVEEVLEDVRLECARYYNVDK 185
Query: 459 V-----VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
V + D N G G VF+E+ A + L GR F V Y+P D
Sbjct: 186 VTDDIEIEEVDDCNLGLIFERGCVFVEFRRTEAACMAAHCLHGRLFDDRAVVVEYFPLDI 245
Query: 514 YF 515
Y
Sbjct: 246 YL 247
>gi|219112083|ref|XP_002177793.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410678|gb|EEC50607.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 109
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/105 (40%), Positives = 63/105 (60%), Gaps = 5/105 (4%)
Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
+ +++V+ L ++ + L D++ Y+E+LED REEC ++G L++VVIP+ GET G G
Sbjct: 10 QVVSRVVELQNMLSDEDLVDEQAYQEVLEDTREECSQFGKLISVVIPKK----GET-GEG 64
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
K+FLEY A A AL GR F G V A E K+ DY+
Sbjct: 65 KIFLEYETTNDAAQAIQALEGRTFDGRRVQATSCAEAKFVAMDYA 109
>gi|189237575|ref|XP_974855.2| PREDICTED: similar to splicing factor [Tribolium castaneum]
Length = 501
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 94/400 (23%), Positives = 159/400 (39%), Gaps = 67/400 (16%)
Query: 149 PFGA---TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN 205
PFG + LG PV+ ++ + R AR V+V L + + FFS
Sbjct: 115 PFGRRNRSPLGLRSNSPVEELSPE-ERDARTVFVMQLSQRIRARDLEEFFS--------- 164
Query: 206 SAGPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT 262
S G V + N ++F A++E + E + A+ L G GV + V+ T
Sbjct: 165 SVGKVRDVRLIVCNKTRRFKGIAYIEFKDPESVTLALGLSGQKLLGVPIIVQH------T 218
Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
A G PNL GP R++VG L + TE ++ + E FG +
Sbjct: 219 QAEKNRMGNSMPNL----------MPKNMTGPMRLYVGSLHFNITEDMLRSIFEPFGKID 268
Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
L+ D +TG SKGYGF +++ A LNG ++ + + V T + +
Sbjct: 269 NIQLIMDPETGRSKGYGFIAFRNCEDAKKALEQLNGFELAGRPMKVGNVTERLDLQQQGP 328
Query: 383 SILAQ-----------AQQHIAIQKMALQTSGMN---------TLGGGMSLFGET----- 417
SIL A + + + +GM ++ G + +
Sbjct: 329 SILDSDELDRSGIDLGATGRLQLMFKLAEGAGMQVPQAAANALSIATGQPVVPQVQTNST 388
Query: 418 --LAKVLCLTEAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
+A + + A + ++ EI +D+ EEC K+G +++V + D+ +
Sbjct: 389 PPIATQCFMLSNMFDPATESTQTWDVEIRDDVIEECNKHGGVLHVYV---DKGSPQ---- 441
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
G V+++ + N+L GR F G + A Y P Y
Sbjct: 442 GNVYVKCPSIATAVASVNSLHGRWFAGRVITAAYVPLLNY 481
>gi|195397963|ref|XP_002057597.1| GJ18017 [Drosophila virilis]
gi|194141251|gb|EDW57670.1| GJ18017 [Drosophila virilis]
Length = 599
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/370 (22%), Positives = 132/370 (35%), Gaps = 59/370 (15%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N K+F A++E
Sbjct: 239 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 289
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L +A P QP +
Sbjct: 290 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQSAPPPFQPKAHT------------ 337
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 338 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDVIQLIMDTETGRSKGYGFITYHNADD 393
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTL 407
A LNG ++ + + V T T + I + LQ
Sbjct: 394 AKKALEQLNGFELAGRPMKVGNVTERLDMNTSSLDTDEMDRSGIDLGATGRLQLMFKLAE 453
Query: 408 GGGMSL-----------------------FGETLAKVLCLTEAITADALADDEEYEEILE 444
G G+++ + L+ + ++ E
Sbjct: 454 GAGLAVPQAAANALLATAPQPAPLQQQQQTPSIATQCFILSNMFDPRTETNPTWDTDVRE 513
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D+ +EC K+G ++++ + G V+++ A NAL GR F G +
Sbjct: 514 DVLDECAKHGGVLHIHV-------DTVSPTGTVYVKCPSTTTAVLAVNALHGRWFAGRVI 566
Query: 505 NAFYYPEDKY 514
A Y P Y
Sbjct: 567 TAAYVPVINY 576
>gi|426241414|ref|XP_004014586.1| PREDICTED: RNA-binding protein 39 isoform 7 [Ovis aries]
Length = 367
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 37 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 79
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 80 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 139
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 140 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 199
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
++A+ + +Q + ALQ SG G L A V L
Sbjct: 200 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 259
Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
T+ + + + E EI +D+ EEC K+G ++++ + D+N + G V+
Sbjct: 260 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 312
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
++ A NAL GR F G + A Y P Y N
Sbjct: 313 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 350
>gi|363736469|ref|XP_422213.3| PREDICTED: serine/threonine-protein kinase Kist [Gallus gallus]
Length = 388
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L ++ +L +EEYE+ILED+REEC KYG +V+++IP+ E PG G+VF+E
Sbjct: 291 VLRLLNVLSDASLQSEEEYEDILEDIREECQKYGPVVSLLIPK------ENPGKGQVFVE 344
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+G+ F G V A +YP Y
Sbjct: 345 YANAGDSKAAQKMLTGKIFDGKFVVATFYPLSAY 378
>gi|307180960|gb|EFN68748.1| RNA-binding protein 39 [Camponotus floridanus]
Length = 529
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 144/372 (38%), Gaps = 61/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N ++F A+VE
Sbjct: 167 RDARTVFCMQLSQRIRARDLEEFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 217
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ E + A+ L G GV + V+ T A G PNL
Sbjct: 218 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 261
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF +++
Sbjct: 262 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 321
Query: 349 TDIACAALNGLKMGDKTLTVRRAT-----ASGQSKTEQESI------LAQAQQHIAIQKM 397
A LNG ++ + + V T G S + + + L + + K+
Sbjct: 322 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQLMFKL 381
Query: 398 --------------ALQTSGMNTLGGGMSLFGETLA-KVLCLTEAITADALADDEEYEEI 442
AL + + T +A + L+ + +EI
Sbjct: 382 AEGTGLEIPPAAANALNMAPVMTQPQPPPQAAPPIATQCFMLSNMFDPQNETNPNWAKEI 441
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+ EEC K+G +++V + DQ + G V+++ A N+L GR F G
Sbjct: 442 RDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRWFAGR 494
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 495 VITAAYVPVVNY 506
>gi|300121045|emb|CBK21427.2| unnamed protein product [Blastocystis hominis]
Length = 457
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 157/384 (40%), Gaps = 45/384 (11%)
Query: 151 GATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPG 210
GAT LGA + M+ +T + + G+P I + +M ++ + GPG
Sbjct: 70 GATGLGAAMMTIADSMSAMST-GVTSLAITGVPATITPDEICNSINILMKSLKL-TTGPG 127
Query: 211 DAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPG 270
+ V + + A EMR+ EA+N +ALDG+ + V RP +Y GP
Sbjct: 128 NPCSGVGMEANGQTAIAEMRSPLEATNGLALDGLTVFNHVMHVNRPDNYT-------GPD 180
Query: 271 QPSPNL--NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD--- 325
P P L NL + + +G+ E + V VG ++LLE++ +
Sbjct: 181 TPPPKLDPNLL-LQICTGSYAEKEYEEIVRVG-----------RKLLETYKEPDPENDAD 228
Query: 326 --LVKDRDTGNSKGYGF--CVYQ-------DPAVTDIACAALNGLK----MGDKTLTVRR 370
+ G GY CV + A C GLK + DK
Sbjct: 229 KPTISSLKDGEKTGYDIEKCVLMHNIPRELEEAEIHTFCEPFGGLKKIYMLKDKNCRFLG 288
Query: 371 ATASGQSKTEQESILAQAQQHIAI-QKMALQTSGMNTLGGGMSLFGETLAK---VLCLTE 426
+ T I + Q + I + ++ + G T++ VL ++
Sbjct: 289 DAVAEYRDTLNYEIAMEGLQDLPIFNDIVIKVEKPDPKWPGFPQRVNTISNPSPVLRMSN 348
Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
I+ + L +DE+YE +LED+RE C K GT++N+ +PR E PG+G F++Y +
Sbjct: 349 IISLEDLEEDEDYEALLEDLREGCEKLGTVLNMHVPRIHSGEKEIPGLGFAFVQYSSVIE 408
Query: 487 CATAKNALSGRKFGGNTVNAFYYP 510
A A L F G V YYP
Sbjct: 409 AAQAAKQLRLLTFNGKQVQVDYYP 432
>gi|357612395|gb|EHJ67964.1| putative RNA-binding region-containing protein [Danaus plexippus]
Length = 536
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 87/370 (23%), Positives = 139/370 (37%), Gaps = 59/370 (15%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R R V+ L + + FFS V G V + N ++F A++E
Sbjct: 179 RDLRTVFCMQLSQRIRAKDLEEFFSSV---------GKVRDVRLITCNKTRRFKGIAYIE 229
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ E A+ L G GV + V+ T A G PNL A
Sbjct: 230 FKDAESVPLALGLTGQKLLGVPIIVQH------TQAEKNRVGNTLPNL----------AP 273
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+ GP R++VG L + TE ++ + E FG + L+ D DTG SKGYGF +
Sbjct: 274 KTSNGPTRLYVGSLHFNITEDMLRGIFEPFGKIDHIQLMTDPDTGKSKGYGFLTFHHATD 333
Query: 349 TDIACAALNGLKMGDKTLTVRRAT--ASGQSKT-------EQESILAQAQQHIAIQKMAL 399
A LNG ++ + + V T A G S T ++ + A + +
Sbjct: 334 AKKAMEQLNGFELAGRPMKVGNVTERADGGSSTRFDADELDRAGVDLGATGRLQLMFKLA 393
Query: 400 QTSGMNTLGGGMSLF---GETLA------------KVLCLTEAITADALADDEEYEEILE 444
+ +G+ S+ G TL + L + ++ EI +
Sbjct: 394 EGTGLQIPPAAASVLMGAGSTLVAPQPQVAPPIATQCFMLNNMFDPSSESNPSWDIEIRD 453
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D+ EC K+G +++V + + G V+ + + N+L GR F G +
Sbjct: 454 DVISECNKHGGVLHVYVDKASPQGN-------VYCKCPTIATAVASVNSLHGRWFAGRVI 506
Query: 505 NAFYYPEDKY 514
A Y P Y
Sbjct: 507 TAAYVPLVNY 516
>gi|323456301|gb|EGB12168.1| hypothetical protein AURANDRAFT_8852, partial [Aureococcus
anophagefferens]
Length = 98
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE-TPGVGKV 477
+KVL L +T L DDE Y ++++D+ +ECG YG + NV IPRP+ PG G V
Sbjct: 2 SKVLQLRHMVTDADLIDDEAYADVVDDVLQECGSYGDVENVEIPRPEPGTTRPAPGQGSV 61
Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
F+ + DA A+ A GR F G T+ A YYP+D +
Sbjct: 62 FVAFGDAFFAQAAREAFEGRAFDGKTIIAGYYPQDLF 98
>gi|340370502|ref|XP_003383785.1| PREDICTED: RNA-binding protein 39-like [Amphimedon queenslandica]
Length = 497
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 131/315 (41%), Gaps = 46/315 (14%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE + A++ G G+ + ++ PT+A LAA
Sbjct: 195 KGIAYVEFQEESSVFTALSFSGQKVHGIPIMIQ------PTMAE---------KNRLAA- 238
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
A+ + AEGP +++VG L Y TE ++ + FG + +++D T S+GY F
Sbjct: 239 --AAENLKKAEGPKKLYVGSLHYNITEDMLQGIFSPFGNVERVSIMRDTATNVSRGYAFV 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
++D + A A LNG ++ + + V T S +S+ + +
Sbjct: 297 EFRDSDSAERAMANLNGFELAGRPMKVNYGTVD-TSLVNIDSLDGEDMDVGVGMTPQSRV 355
Query: 402 SGMNTLGGG----MSLFGETL--------AKVLCLTEAITADALADDEEYE-------EI 442
+ M+ L G MS+ G + C+T D E EI
Sbjct: 356 ALMHKLAAGHNADMSIPGVQVPPPPFAVPTMPTCITSCCFVIGNMFDPSKETGSDWDKEI 415
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
ED+ EEC K+G + ++ + + Q GKV+++ + A + +GR++ GN
Sbjct: 416 REDVLEECVKFGNIFHIHVDKFSQ--------GKVYIKSQTPQTASAAVGSFNGRRYAGN 467
Query: 503 TVNAFYYPEDKYFNK 517
++A PE+ Y K
Sbjct: 468 VIHAELVPENTYHLK 482
>gi|195338839|ref|XP_002036031.1| GM16278 [Drosophila sechellia]
gi|194129911|gb|EDW51954.1| GM16278 [Drosophila sechellia]
Length = 596
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 135/372 (36%), Gaps = 63/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS V G V + N K+F A++E
Sbjct: 236 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 286
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L A QP +
Sbjct: 287 FDDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNATPAFQPKSHT------------ 334
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 335 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 390
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
A LNG ++ + + V T T S+ I A LQ
Sbjct: 391 AKKALEQLNGFELAGRLMKVGNVTERLDMNT--TSLDTDEMDRTGIDLGATGRLQLMFKL 448
Query: 406 TLGGGMS------------------LFGETLAKVLCLTEAITADALADDEEYE-----EI 442
G G++ L + +A + I ++ E EI
Sbjct: 449 AEGAGLAVPQAAANALLATAPQPAPLQQQEVAPSIATQCFILSNMFDPRTETNPTWDVEI 508
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+ EEC K+G ++++ + G V+++ A NAL GR F G
Sbjct: 509 RDDVLEECAKHGGVLHIHV-------DTISHTGTVYVKCPSTTTAVLAVNALHGRWFAGR 561
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 562 VITAAYLPVINY 573
>gi|66475436|ref|XP_627534.1| splicing factor U2AF U2 snRNP auxiliary factor large subunit; 3 RRM
domains [Cryptosporidium parvum Iowa II]
gi|32398751|emb|CAD98711.1| splicing factor, possible [Cryptosporidium parvum]
gi|46228987|gb|EAK89836.1| splicing factor U2AF U2 snRNP auxiliary factor large subunit; 3 RRM
domains [Cryptosporidium parvum Iowa II]
Length = 492
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 155/375 (41%), Gaps = 47/375 (12%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIAT--FFSQVMTAIGGNSAG--PGDAVVNVYINHEK 222
T ++ R VYVG LP Q I + +I NS G+ VV+ +IN +
Sbjct: 108 TSFTSKPLREVYVGNLP-----QGITVTELLEYINRSIIKNSVSHTNGNPVVSAWINSDG 162
Query: 223 KFAFVEMRTVEEASNAMALDGII-FEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K+AF E R++EEA+ + L+ ++ F+G +R+ + P ++ + QPS N L
Sbjct: 163 KYAFCECRSIEEANTLLRLNNLLSFKGNLLRIGK-----PKVSENIIGDQPSNNSTLINQ 217
Query: 282 GLASGAIGG---------AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
S +I + + + + G+ F IKE+L S + +L+ R+
Sbjct: 218 ITQSTSIISPYFNNIPLVLKKKETILITGINKKFVLEDIKEML-SIKNIEILELIDYRN- 275
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA----SGQSKTEQESILAQA 388
Y + + TDI +N L K L ++ + + + + S + +
Sbjct: 276 ----KYKIAICEGDLNTDITDKVVNKLGTEIKILRMKNCNSKVIHAVNNHLKNLSCIVRE 331
Query: 389 QQHIAIQKMALQTSGMNTLGGGMS---LFGETLAKVLCLTEAITADALADDEEYEEILED 445
K+ L+T + L + + + L+ +T + L Y I E+
Sbjct: 332 S-----NKLLLKTEKFENIQSKNVISLLLPQKPCRCILLSNILTVEELLIPSTYSSIHEE 386
Query: 446 MREECGKYGTLVNVVIPRPD-----QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
+ E+C KYG + IP P+ ++ P G+ F+ +Y+ AK L +F
Sbjct: 387 IHEKCLKYGEIYKTTIPIPERALSNKDQFNDPYFGRAFIFFYNVESAIKAKLDLFKMRFL 446
Query: 501 GNTVNAFYYPEDKYF 515
G + YY E ++
Sbjct: 447 GRNMKISYYCEHEFL 461
>gi|19920866|ref|NP_609095.1| CG11266, isoform B [Drosophila melanogaster]
gi|24582412|ref|NP_723243.1| CG11266, isoform A [Drosophila melanogaster]
gi|7297213|gb|AAF52478.1| CG11266, isoform A [Drosophila melanogaster]
gi|15292031|gb|AAK93284.1| LD35730p [Drosophila melanogaster]
gi|22945834|gb|AAN10614.1| CG11266, isoform B [Drosophila melanogaster]
gi|220946034|gb|ACL85560.1| CG11266-PA [synthetic construct]
gi|220955788|gb|ACL90437.1| CG11266-PA [synthetic construct]
Length = 594
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 135/372 (36%), Gaps = 63/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N K+F A++E
Sbjct: 234 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 284
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L A QP +
Sbjct: 285 FDDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 332
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 333 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 388
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
A LNG ++ + + V T T S+ I A LQ
Sbjct: 389 AKKALEQLNGFELAGRLMKVGNVTERLDMNT--TSLDTDEMDRTGIDLGATGRLQLMFKL 446
Query: 406 TLGGGMS------------------LFGETLAKVLCLTEAITADALADDEEYE-----EI 442
G G++ L + +A + I ++ E EI
Sbjct: 447 AEGAGLAVPQAAANALLATAPQPAPLQQQEVAPSIATQCFILSNMFDPRTETNPTWDVEI 506
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+ EEC K+G ++++ + G V+++ A NAL GR F G
Sbjct: 507 RDDVLEECAKHGGVLHIHV-------DTISHTGTVYVKCPSTTTAVLAVNALHGRWFAGR 559
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 560 VITAAYVPVINY 571
>gi|380012525|ref|XP_003690330.1| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein 39-like [Apis
florea]
Length = 506
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 147/396 (37%), Gaps = 64/396 (16%)
Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
LPFG G PL R AR V+ L + + FFS S
Sbjct: 123 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 170
Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
G V + N ++F A+VE + E + A+ L G GV + V+ T A
Sbjct: 171 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 224
Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
G PNL G GP R++VG L + TE ++ + E FG +
Sbjct: 225 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 274
Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
L+ D TG SKGYGF +++ A LNG ++ + + V R G S
Sbjct: 275 QLIMDPXTGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 334
Query: 380 EQESILAQAQQHIAIQKMAL---------------------QTSGMNTLGGGMSLFGETL 418
+ + + + A ++ L M+T +
Sbjct: 335 DTDELDRSGIELGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQVAPPIA 394
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+ L+ + +EI +D+ EEC K+G +++V + DQ + G V+
Sbjct: 395 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 447
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
++ A N+L GR F G + A Y P Y
Sbjct: 448 VKCPSIGTAVAAVNSLHGRWFAGRVITAAYVPVVNY 483
>gi|195577213|ref|XP_002078467.1| GD23448 [Drosophila simulans]
gi|194190476|gb|EDX04052.1| GD23448 [Drosophila simulans]
Length = 608
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 87/370 (23%), Positives = 134/370 (36%), Gaps = 59/370 (15%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N K+F A++E
Sbjct: 248 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 298
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L A QP +
Sbjct: 299 FDDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNATPAFQPKSH------------- 345
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 346 ---TGPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 402
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTL 407
A LNG ++ + + V T T + I + LQ
Sbjct: 403 AKKALEQLNGFELAGRLMKVGNVTERLDMNTTSLDTDEMDRTGIDLGATGRLQLMFKLAE 462
Query: 408 GGGMS------------------LFGETLAKVLCLTEAITADALADDEEYE-----EILE 444
G G++ L + +A + I ++ E EI +
Sbjct: 463 GAGLAVPQAAANALLATAPQPAPLQQQEVAPSIATQCFILSNMFDPRTETNPTWDVEIRD 522
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D+ EEC K+G ++++ + G V+++ A NAL GR F G +
Sbjct: 523 DVLEECAKHGGVLHIHV-------DTISHTGTVYVKCPSTTTAVLAVNALHGRWFAGRVI 575
Query: 505 NAFYYPEDKY 514
A Y P Y
Sbjct: 576 TAAYLPVINY 585
>gi|34365067|emb|CAE45890.1| hypothetical protein [Homo sapiens]
Length = 373
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 136/344 (39%), Gaps = 73/344 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V+ Q N A
Sbjct: 37 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIVVQ--------------ASQAEKN---RAA 79
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 80 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 139
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
+ D A LNG ++ + + V R AS S T +
Sbjct: 140 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 199
Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 200 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 259
Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 260 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 314
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 315 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 356
>gi|31873732|emb|CAD97833.1| hypothetical protein [Homo sapiens]
Length = 373
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 78 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 137
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 138 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDGLERTGIDLGTT 197
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
+ ++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 198 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 257
Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 258 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 314
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 315 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 356
>gi|413920209|gb|AFW60141.1| hypothetical protein ZEAMMB73_955987, partial [Zea mays]
Length = 72
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 41/66 (62%), Positives = 45/66 (68%)
Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
G LV V+IPRPD +G GVGKVFLEY D G A AK AL GRKFGGN V A Y EDK
Sbjct: 5 GNLVKVIIPRPDPSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDK 64
Query: 514 YFNKDY 519
+ N +Y
Sbjct: 65 FANGEY 70
>gi|410055058|ref|XP_003316934.2| PREDICTED: uncharacterized protein LOC458443 isoform 4 [Pan
troglodytes]
gi|410055062|ref|XP_003953767.1| PREDICTED: uncharacterized protein LOC458443 [Pan troglodytes]
gi|426391515|ref|XP_004062118.1| PREDICTED: RNA-binding protein 39 isoform 5 [Gorilla gorilla
gorilla]
gi|426391519|ref|XP_004062120.1| PREDICTED: RNA-binding protein 39 isoform 7 [Gorilla gorilla
gorilla]
gi|119596566|gb|EAW76160.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_b [Homo
sapiens]
gi|119596571|gb|EAW76165.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_b [Homo
sapiens]
Length = 373
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 78 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 137
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 138 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 197
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
+ ++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 198 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 257
Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 258 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 314
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 315 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 356
>gi|294905728|ref|XP_002777665.1| Splicing factor U2AF 65 kDa subunit, putative [Perkinsus marinus
ATCC 50983]
gi|239885556|gb|EER09481.1| Splicing factor U2AF 65 kDa subunit, putative [Perkinsus marinus
ATCC 50983]
Length = 680
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 164/385 (42%), Gaps = 69/385 (17%)
Query: 102 SRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
SRSPS+ ++ FD P A QL S +P Q ++ +T AF
Sbjct: 184 SRSPSEKRKPFKFDSPPKELA--------AQLAAGTSMLP---QTVVSSSSTIKEAFNA- 231
Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
+ + + AR +Y+G +PP + + + + +G N A PG +V+ ++ +
Sbjct: 232 ---TLAAERQKIARELYIGQIPPGISAAELIDVLNDGLMNMGAN-AMPGRPIVHGWLGGD 287
Query: 222 KKFAFVEMRTVEEASNAMA-LDGIIFE--GVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
FAFVE RT EEAS A+ L+G + GV+++V RP Y +GP P ++N
Sbjct: 288 GLFAFVEFRTPEEASIALERLNGHQLKSYGVSIKVGRPKGY-------MGPAAPDDSVNA 340
Query: 279 AAVGLAS------GAIGGAE---GPDRVFVGGLPYYFTETQIKELLE--SFGTLHGFDLV 327
G A+ G I AE R+ + G P +E IK L S G + +L+
Sbjct: 341 YTAGHAATSSTTPGGISAAEVSSDTSRLCLIGFPLKASEHSIKRALRNASKGEIRHLELL 400
Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
K T N + V++ + D + K + E +
Sbjct: 401 K--HTWNDEEIVMAVFECVNIED-----------------------EHRLKKKGEVEVQG 435
Query: 388 AQQHIAIQKMALQTSGMNTLGGGM-SLFGETL--AKVLCLTE-AITADALADDEEYEEIL 443
+ I K A+ MN G M G + +++L +T A + + L DD Y +++
Sbjct: 436 VKARIINPKDAIVKGYMNFDGDIMKKAMGLEIVPSRILVMTNFAGSVEELLDDINYSDLM 495
Query: 444 EDMREECGKY---GTLVNVVIPRPD 465
+D++ EC + +++IPRP+
Sbjct: 496 DDIKVECKSITAGADVRSIIIPRPE 520
>gi|312374824|gb|EFR22303.1| hypothetical protein AND_15459 [Anopheles darlingi]
Length = 560
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 113/513 (22%), Positives = 184/513 (35%), Gaps = 79/513 (15%)
Query: 50 DKNYKYDREG-------IRDHDRTDRHRDYNRDKERRH-RHRSRSHSSDRFRNRSKSLSP 101
D++ + DR+G +D R+ R RD R+K+RR + RS+S S R R++
Sbjct: 63 DRDKERDRDGGEGRSRKDKDRSRSPRPRDKEREKDRRKSKERSKSRSPRRERSKDHKEKD 122
Query: 102 SRSPS---------KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGA 152
RS + +S+ R G + + + M+ G
Sbjct: 123 HRSKNDHHRSVEKRRSRERGGGMIDHRRKSRERDHRRRSRSRDAGRRRRSMSPRHYRRGR 182
Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA 212
G++ Q R AR V+ L + + FFS S G
Sbjct: 183 GGYGSYRDRTPGDEVSQEDRDARTVFCMQLSQRIRARDLEEFFS---------SVGKVRD 233
Query: 213 VVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
V + N K+F A++E + E + A+ L G G+ + V+
Sbjct: 234 VRLITCNKTKRFKGIAYIEFKDPESVALALGLSGQRLLGIPISVQ--------------- 278
Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
N A GP R++VG L + TE ++ + E FG + L+ D
Sbjct: 279 -HTQAEKNRMANQPPPAPPKNPAGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMD 337
Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
DTG SKGYGF + + A LNG ++ + + V T T S+
Sbjct: 338 TDTGRSKGYGFITFHNADDAKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEM 396
Query: 390 QHIAIQKMA---LQTSGMNTLGGGMSL-----------------------FGETLAKVLC 423
I+ A LQ G G+++ +
Sbjct: 397 DRSGIELGATGRLQLMFKLAEGAGLAVPRAAADALLATAPQPIPQQPLQQSPPIATQCFL 456
Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
L+ + EI +D+ EEC K+G +++V + + +G V+++ +
Sbjct: 457 LSNMFDPSTETNPNWDVEIQDDVIEECNKHGGVLHVYVDKLSPSGN-------VYVKCPN 509
Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
A NAL GR F G + A Y P Y+N
Sbjct: 510 VATAVLAVNALHGRWFAGRVIGAAYVPLVNYYN 542
>gi|291397536|ref|XP_002715130.1| PREDICTED: kinase interacting stathmin [Oryctolagus cuniculus]
Length = 419
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 40/94 (42%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L ++EEYE+I+ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEEEYEDIVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409
>gi|159482188|ref|XP_001699155.1| hypothetical protein CHLREDRAFT_106436 [Chlamydomonas reinhardtii]
gi|158273218|gb|EDO99010.1| predicted protein [Chlamydomonas reinhardtii]
Length = 80
Score = 79.0 bits (193), Expect = 7e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 51/76 (67%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++FVGGLP ++E +KELL +GTL F+LV D+ TG SKGY FC Y + + D+
Sbjct: 2 KLFVGGLPCEWSEDMVKELLAPYGTLKSFNLVMDKSTGKSKGYAFCEYSEESSADLLIKN 61
Query: 356 LNGLKMGDKTLTVRRA 371
L+ ++G K LTV+RA
Sbjct: 62 LHMRRVGSKALTVKRA 77
>gi|209154564|gb|ACI33514.1| RNA-binding protein 39 [Salmo salar]
Length = 525
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 116/281 (41%), Gaps = 51/281 (18%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 234 AAAMANNLQKGNAGPMRLYVGSLHFNITEEMLRGIFEPFGKIESIQLMMDSETGRSKGYG 293
Query: 340 FCVYQDPAVTDIACAALNG-------LKMGDKTLTVRRATASG-------------QSKT 379
F + D A LNG +K+G T +TAS T
Sbjct: 294 FITFSDTECAKKALDQLNGFELAGRPMKVGHVTERTDASTASSFLDSDELERTGIDLGTT 353
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMS------------------LFGE 416
+ ++A+ + +Q + ALQ SG +G + L +
Sbjct: 354 GRLQLMARLAEGTGLQIPPAAQQALQMSGAIAIGAMAAVSAAMNPAMNMNMNTAMNLPSQ 413
Query: 417 TLAKVLCLTEAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
LA + D+ +++ +I D+ EEC K+G +V++ + D+N E G
Sbjct: 414 PLATHCFQLSNMFNPQSEDNPDWDVDIQHDVIEECNKHGGVVHIYV---DKNSTE----G 466
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
V+++ A NAL GR F G + A Y P Y N
Sbjct: 467 NVYVKCPSIPAAMAAVNALHGRYFAGKMITAAYVPLPTYHN 507
>gi|307195359|gb|EFN77277.1| RNA-binding protein 39 [Harpegnathos saltator]
Length = 370
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 144/372 (38%), Gaps = 61/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N ++F A+VE
Sbjct: 8 RDARTVFCMQLSQRIRARDLEEFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 58
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ E + A+ L G GV + V+ T A G PNL
Sbjct: 59 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 102
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF +++
Sbjct: 103 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 162
Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQHIAIQKM 397
A LNG ++ + + V R G S + + + L + + K+
Sbjct: 163 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQLMFKL 222
Query: 398 --------------ALQTSGMNTLGGGMSLFGETLA-KVLCLTEAITADALADDEEYEEI 442
AL + + T +A + L+ + +EI
Sbjct: 223 AEGTGLEIPPAAANALNMAPVMTAPQPPPQAAPPIATQCFMLSNMFDPQNETNPNWAKEI 282
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+ EEC K+G +++V + DQ + G V+++ A N+L GR F G
Sbjct: 283 RDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRWFAGR 335
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 336 VITAAYVPVVNY 347
>gi|67593828|ref|XP_665753.1| splicing factor [Cryptosporidium hominis TU502]
gi|54656571|gb|EAL35522.1| splicing factor [Cryptosporidium hominis]
Length = 491
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 86/375 (22%), Positives = 153/375 (40%), Gaps = 45/375 (12%)
Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP--GDAVVNVYINHEKKF 224
T A++ R VYVG LP +A + +I NS G+ VV+ +IN + K+
Sbjct: 107 TSFASKPLREVYVGNLPQGI---TVAELLEYINRSIIKNSVSHTHGNPVVSAWINSDGKY 163
Query: 225 AFVEMRTVEEASNAMALDGII-FEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AF E R++EEA+ + L+ ++ F+G +R+ + P ++ + QPS N L
Sbjct: 164 AFCECRSIEEANALLRLNNLLSFKGNLLRIGK-----PKVSENIIGDQPSNNSTLINQIS 218
Query: 284 ASGAIGG---------AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
S AI + + + + G+ F IKE+ S + +L+ R+
Sbjct: 219 QSTAIISPYFNNIPLVLKKKETILITGINKKFVLEDIKEMF-SIKNIEILELIDYRN--- 274
Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-----ASGQSKTEQESILAQAQ 389
Y + + TDI +N L K L ++ A I+ ++
Sbjct: 275 --KYKIAICEGDLNTDITDKVVNKLGTEIKILRMKSCNSKVIHAVNNHLKNMSCIVRES- 331
Query: 390 QHIAIQKMALQTSGMNTLGGGMS---LFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
K+ L+ N + L + + + L+ + + L Y I +++
Sbjct: 332 -----NKLLLKREKFNNIQNKNVISLLLPQKPCRCILLSNILAVEELLIPSTYSSIHKEI 386
Query: 447 REECGKYGTLVNVVIPRPD-----QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
E+C KYG + IP P+ ++ P G+ F+ +Y+ AK L +F G
Sbjct: 387 HEKCLKYGEIYKTTIPIPERALSSKDQFNDPYFGRAFIFFYNVESAIKAKLDLFRMRFLG 446
Query: 502 NTVNAFYYPEDKYFN 516
+ YY E ++ N
Sbjct: 447 RNIKISYYCEHEFLN 461
>gi|426241412|ref|XP_004014585.1| PREDICTED: RNA-binding protein 39 isoform 6 [Ovis aries]
gi|426241416|ref|XP_004014587.1| PREDICTED: RNA-binding protein 39 isoform 8 [Ovis aries]
Length = 373
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 78 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 137
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
F + D A LNG ++ + + V R AS S T
Sbjct: 138 FITFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 197
Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
+ ++A+ + +Q + ALQ SG G S + ++ TE
Sbjct: 198 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 257
Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
A L++ +EE EI +D+ EEC K+G ++++ + D+N +
Sbjct: 258 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 314
Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
G V+++ A NAL GR F G + A Y P Y N
Sbjct: 315 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 356
>gi|449268165|gb|EMC79035.1| Serine/threonine-protein kinase Kist, partial [Columba livia]
Length = 331
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L ++ +L +EEYE+ILED+REEC KYG +V+++IP+ E PG G+VF+E
Sbjct: 234 VLRLLNVLSDASLQCEEEYEDILEDIREECQKYGPVVSLLIPK------ENPGKGQVFVE 287
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+G+ F G V A +YP Y
Sbjct: 288 YANAGDSKAAQKMLTGKIFDGKFVVATFYPLSAY 321
>gi|432107103|gb|ELK32526.1| Splicing factor U2AF 65 kDa subunit [Myotis davidii]
Length = 243
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 53/84 (63%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++F+GGLP Y + Q+KELL FG L F+LVKD TG SKG F Y D ++ D A A
Sbjct: 128 KLFMGGLPNYMKDDQVKELLTWFGPLKAFNLVKDSTTGLSKGCAFYEYVDISIRDQAMAG 187
Query: 356 LNGLKMGDKTLTVRRATASGQSKT 379
NG+++G K L V+RA ++ T
Sbjct: 188 PNGMQLGVKKLLVQRAGVGAKNAT 211
>gi|170071297|ref|XP_001869868.1| splicing factor [Culex quinquefasciatus]
gi|167867202|gb|EDS30585.1| splicing factor [Culex quinquefasciatus]
Length = 524
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 139/372 (37%), Gaps = 62/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R R V+ L + + FFS V G V + N K+F A++E
Sbjct: 166 RDMRTVFCMQLSQRIRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 216
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R E + A+ L G G+ + V+ LA N+ +
Sbjct: 217 FRDPESVALALGLSGQRLLGIPISVQHTQAEKNRLA------------NIPPPPPPKVIV 264
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G P R++VG L + TE ++ + E FG + L+ D DTG SKGYGF + +
Sbjct: 265 G----PMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDSDTGRSKGYGFITFHNADD 320
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
A LNG ++ + + V T T S+ I A LQ
Sbjct: 321 AKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEMDRSGIDLGATGRLQLMFKL 379
Query: 406 TLGGGMSL----------------------FGETLAKVLCLTEAITADALADDEEYE-EI 442
G G+++ +A L + A + ++ EI
Sbjct: 380 AEGAGLAVPRAAADALLATAPQPAPNQPVQDSPAIATQCFLLSNMFDPATETNPSWDVEI 439
Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D+ EEC K+G +++V + + ++P G V+++ A NAL GR F G
Sbjct: 440 EDDVIEECNKHGGVLHVYVDK------QSPA-GNVYVKCPSIATAVLAVNALHGRWFAGR 492
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 493 VIAAAYVPLVNY 504
>gi|294659352|ref|XP_461720.2| DEHA2G04004p [Debaryomyces hansenii CBS767]
gi|199433897|emb|CAG90172.2| DEHA2G04004p [Debaryomyces hansenii CBS767]
Length = 636
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/307 (22%), Positives = 128/307 (41%), Gaps = 29/307 (9%)
Query: 220 HEKKFAFVEMRTVEEASNAMALDGI---IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
+ +K +F EM+ V+ + I + + + + RP +Y + L P
Sbjct: 346 NSRKLSFNEMKLVKNDDEGNPIHQIGQDTGDDIVLDISRPGEY---VVQCLPP------- 395
Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
+ + + P ++ V +P ET++ + ++ GT+ GF ++++ T S
Sbjct: 396 -YSEIKEDEIEESVTDSPRKITVL-VPSTLDETELIKNIKEVGTIKGFQMLREIGTKKSL 453
Query: 337 GYGFC-VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQ 395
G F Y DP A+ ++ + L S E H +IQ
Sbjct: 454 GIAFLEFYIDPTKYQKTINAIPVIQTLVEDL-------KQSSFIEDAFFSCIIPDHTSIQ 506
Query: 396 KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
+ S + L + ++V+ L +TA L DD ++ I +D+++E K+G
Sbjct: 507 DCPIDLSTLKKLVKNEHVTTHPSSRVIQLINIVTAKDLMDDASFKFIQKDIQQEVSKFGN 566
Query: 456 LVNVVIPRP--DQNGGET----PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
L + IPRP D G + PG+GK+++E+ D A L+GR + TV +Y
Sbjct: 567 LKTIKIPRPANDYTPGISQFTQPGLGKIYIEFDDEETALNAIMGLAGRMYNDRTVLCSFY 626
Query: 510 PEDKYFN 516
D + N
Sbjct: 627 DYDDFKN 633
>gi|301101828|ref|XP_002900002.1| Poly(U)-binding-splicing factor PUF60, putative [Phytophthora
infestans T30-4]
gi|262102577|gb|EEY60629.1| Poly(U)-binding-splicing factor PUF60, putative [Phytophthora
infestans T30-4]
Length = 444
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 155/371 (41%), Gaps = 63/371 (16%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN-SAGPGDAVVNVYINHEKKFAFVEM 229
T ARR+Y+G L E+ I+ F+ T + S PG + K F F+E
Sbjct: 114 TDLARRLYIGNLYYDLKEEDISNVFAPFGTIRSIDLSLEPG-------ASRSKGFCFLEY 166
Query: 230 RTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
V A +A+ L+G A+RV RP N +L GQ AI
Sbjct: 167 EDVLAAESAVQVLNGTPLANRAIRVGRPHRGNTNSNDSLSIGQE--------------AI 212
Query: 289 GGAEGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDL--VKDRDTGNSKGYGFCVYQD 345
P + ++V + ++ + FG +H + V ++G+ +GYGF + +
Sbjct: 213 KNV--PTKCIYVANVRVELNSQHLESIFSPFGAIHSCVMTAVSPLESGH-RGYGFMRFVE 269
Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
+ A +NG ++ + L V +A+ + LA +Q + TSG N
Sbjct: 270 ESCALSAIQHMNGFELAGQALKVGKASEAAMLIN-----LATSQDKVVRDGSGATTSGAN 324
Query: 406 TLGG-GMSLFGE-----------TLAKV-LCLTEAITADALADDEEYEEILEDMREECGK 452
+ FGE T AK LCL + + D E+ +++R ECGK
Sbjct: 325 VIAAPEKKPFGEDDVEKVKDTTETDAKCCLCLVNLVNCGEVDD-----ELEDEVRGECGK 379
Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYP-- 510
+G++ V I E +VF+ + DA G A AK AL GR FGGN V A YYP
Sbjct: 380 FGSVNKVDI-------HELADHVRVFVLFDDAAGAAKAKQALHGRFFGGNQVQAHYYPLR 432
Query: 511 --EDKYFNKDY 519
E K + D+
Sbjct: 433 ELEQKRYTSDF 443
>gi|384498450|gb|EIE88941.1| hypothetical protein RO3G_13652 [Rhizopus delemar RA 99-880]
Length = 454
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/353 (24%), Positives = 140/353 (39%), Gaps = 49/353 (13%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
R R V+V L + FFSQ + + K +VE
Sbjct: 121 RDRRTVFVTQLAARLTTREFDAFFSQ------AGRVREAKIITDRNSRKSKGCGYVEFYD 174
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
NA+AL G G+ V V+ L+ A N A+ A+G
Sbjct: 175 ETSVQNALALSGQKLLGIPVLVQ--------LSEA--------EKNRLAMAAQRNAMGVT 218
Query: 292 EGP--DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
P R+++G L + TE ++++ E FG L +L KD +TG SKG+GF Y++
Sbjct: 219 TEPLYQRLYIGSLHFSLTENDVRQIFEPFGPLDFVNLHKDPETGRSKGFGFIQYKNANDA 278
Query: 350 DIACAALNGLKMGDKTLTV--------RRATASGQSKTEQESI----LAQAQQHIAIQKM 397
A +NG ++ + L V + G E E + L++A+ +
Sbjct: 279 KQALEKMNGFELAGRNLKVGLVSEKSGTTMSTFGLDDEETEGLALNSLSRAELMAKLAAR 338
Query: 398 ALQTSGMNTLGGGMSL---FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
Q S + L + + L + D + ++ D++ EC KYG
Sbjct: 339 DPQNSPPSRHAPAPVLKPNIPTASTRYVMLNNMFNPNEETDPDWVSDLEADIKIECEKYG 398
Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA-TAKNALSGRKFGGNTVNA 506
+ ++ + + +G+VFL++ D VG A A +AL+GR FGG + A
Sbjct: 399 RVEHIKV--------NSDSMGEVFLKF-DRVGSAEKAISALNGRWFGGKQITA 442
>gi|21726713|emb|CAA71714.2| KIS protein kinase [Mus musculus]
gi|117616788|gb|ABK42412.1| Kist [synthetic construct]
Length = 414
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409
>gi|12850652|dbj|BAB28802.1| unnamed protein product [Mus musculus]
Length = 419
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409
>gi|149058099|gb|EDM09256.1| rCG46339, isoform CRA_a [Rattus norvegicus]
Length = 251
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 154 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 207
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 208 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 241
>gi|8393668|ref|NP_058989.1| serine/threonine-protein kinase Kist [Rattus norvegicus]
gi|24211854|sp|Q63285.1|UHMK1_RAT RecName: Full=Serine/threonine-protein kinase Kist; AltName:
Full=Kinase interacting with stathmin; AltName: Full=PAM
COOH-terminal interactor protein 2; Short=P-CIP2;
AltName: Full=U2AF homology motif kinase 1
gi|1403532|emb|CAA67021.1| KIS [Rattus norvegicus]
gi|5821768|gb|AAC53031.2| PAM COOH-terminal interactor protein 2 [Rattus norvegicus]
gi|149058100|gb|EDM09257.1| rCG46339, isoform CRA_b [Rattus norvegicus]
Length = 419
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409
>gi|40254330|ref|NP_034763.3| serine/threonine-protein kinase Kist [Mus musculus]
gi|57015387|sp|P97343.3|UHMK1_MOUSE RecName: Full=Serine/threonine-protein kinase Kist; AltName:
Full=Kinase interacting with stathmin; AltName: Full=PAM
COOH-terminal interactor protein 2; Short=P-CIP2;
AltName: Full=U2AF homology motif kinase 1
gi|27501712|gb|AAO13515.1| KIS kinase [Mus musculus]
gi|37194893|gb|AAH58732.1| U2AF homology motif (UHM) kinase 1 [Mus musculus]
Length = 419
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409
>gi|293354575|ref|XP_574033.2| PREDICTED: serine/threonine-protein kinase Kist-like [Rattus
norvegicus]
Length = 419
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409
>gi|344250069|gb|EGW06173.1| Serine/threonine-protein kinase Kist [Cricetulus griseus]
Length = 375
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 278 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 331
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 332 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 365
>gi|354487456|ref|XP_003505889.1| PREDICTED: serine/threonine-protein kinase Kist-like [Cricetulus
griseus]
Length = 461
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 364 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 417
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 418 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 451
>gi|148707213|gb|EDL39160.1| U2AF homology motif (UHM) kinase 1 [Mus musculus]
Length = 232
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 135 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 188
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 189 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 222
>gi|442570696|pdb|4FXW|A Chain A, Structure Of Phosphorylated Sf1 Complex With U2af65-uhm
Domain
gi|442570698|pdb|4FXW|C Chain C, Structure Of Phosphorylated Sf1 Complex With U2af65-uhm
Domain
Length = 106
Score = 76.3 bits (186), Expect = 3e-11, Method: Composition-based stats.
Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
+VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+
Sbjct: 7 EVLCLXNXVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFV 65
Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
E+ C A L+GRKF V Y D Y +D+
Sbjct: 66 EFTSVFDCQKAXQGLTGRKFANRVVVTKYCDPDSYHRRDF 105
>gi|195434196|ref|XP_002065089.1| GK15272 [Drosophila willistoni]
gi|194161174|gb|EDW76075.1| GK15272 [Drosophila willistoni]
Length = 612
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/372 (23%), Positives = 134/372 (36%), Gaps = 63/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS V G V + N K+F A++E
Sbjct: 252 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRMITCNKTKRFKGIAYIE 302
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E S A+ L G GV + V+ L A QP +
Sbjct: 303 FEDPESVSLALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 350
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 351 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDVIQLIMDTETGRSKGYGFITYHNADD 406
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
A LNG ++ + + V T T S+ I A LQ
Sbjct: 407 AKKALEQLNGFELAGRPMKVGNVTERLDMNT--TSLDTDEMDRTGIDLGATGRLQLMFKL 464
Query: 406 TLGGGMSL----------------------FGETLAKVLCLTEAITADALADDEEYEEIL 443
G G+++ ++A + + A + ++ +
Sbjct: 465 AEGAGLAVPQAAANALLATAPQPAPVQQQQQTPSIATQCFILSNMFDPATETNTTWDSEI 524
Query: 444 ED-MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
D + EEC K+G ++++ + G V+++ A NAL GR F G
Sbjct: 525 RDDVLEECAKHGGVLHIHV-------DTASSTGTVYVKCPSTTTAVLAVNALHGRWFAGR 577
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 578 VITAAYVPLINY 589
>gi|159162769|pdb|1O0P|A Chain A, Solution Structure Of The Third Rna Recognition Motif
(Rrm) Of U2af65 In Complex With An N-Terminal Sf1
Peptide
Length = 104
Score = 76.3 bits (186), Expect = 4e-11, Method: Composition-based stats.
Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
+VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+
Sbjct: 5 EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFV 63
Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
E+ C A L+GRKF V Y D Y +D+
Sbjct: 64 EFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 103
>gi|390980893|pdb|3V4M|A Chain A, Crystal Structure Of A Rna Binding Domain Of A U2 Small
Nuclear Ribonucleoprotein Auxiliary Factor 2 (U2af) From
Mus Musculus At 1.80 A Resolution
gi|390980894|pdb|3V4M|B Chain B, Crystal Structure Of A Rna Binding Domain Of A U2 Small
Nuclear Ribonucleoprotein Auxiliary Factor 2 (U2af) From
Mus Musculus At 1.80 A Resolution
Length = 105
Score = 76.3 bits (186), Expect = 4e-11, Method: Composition-based stats.
Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
+VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+
Sbjct: 6 EVLCLXNXVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFV 64
Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
E+ C A L+GRKF V Y D Y +D+
Sbjct: 65 EFTSVFDCQKAXQGLTGRKFANRVVVTKYCDPDSYHRRDF 104
>gi|159162801|pdb|1OPI|A Chain A, Solution Structure Of The Third Rna Recognition Motif
(Rrm) Of U2af65 In Complex With An N-Terminal Sf1
Peptide
gi|444302011|pdb|2M0G|B Chain B, Structure, Phosphorylation And U2af65 Binding Of The
Nterminal Domain Of Splicing Factor 1 During 3 Splice
Site Recognition
Length = 104
Score = 76.3 bits (186), Expect = 4e-11, Method: Composition-based stats.
Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
+VLCL + + L DDEEYEEI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+
Sbjct: 5 EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFV 63
Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
E+ C A L+GRKF V Y D Y +D+
Sbjct: 64 EFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 103
>gi|26326137|dbj|BAC26812.1| unnamed protein product [Mus musculus]
Length = 330
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +++EYE+++ED++EEC KYG +V++++P+ E PG G+VF+E
Sbjct: 233 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 286
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 287 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 320
>gi|388583572|gb|EIM23873.1| splicing factor, CC1-like protein [Wallemia sebi CBS 633.66]
Length = 459
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/374 (21%), Positives = 145/374 (38%), Gaps = 38/374 (10%)
Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
P +P++ R V+V L N + FF +G S V++
Sbjct: 82 PEIPIEDNVDSLESEQRSVFVSQLSTRTNSSDLRRFFQD---RLGERSIVDARIVMDKNS 138
Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
K +VE++T A+ L G + G+ + V + ++ + A + ++
Sbjct: 139 RRSKGIGYVEVKTASLIDKALELTGELLNGIPMIVTQ-SEADKNRQAKASSSLQTQSVQA 197
Query: 279 AAVGLA---------SGAIGGAEGPD--RVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
V + S I A P +V+VG L Y E ++ + E FG + +L
Sbjct: 198 EEVRRSTKSRDYDNRSSTINPANDPTLYKVYVGSLSYTLKEYDVRSVFEPFGEIEDVELS 257
Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ-SKTEQESILA 386
D D SKGY + Y+ + +AC +N ++ +TL V+ G + ++SI
Sbjct: 258 VD-DQNRSKGYAYVKYKRMEDSRMACEQMNRFELAGRTLKVQLVNYYGDPVRMPEQSIEN 316
Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA-------------KVLCLTEAITADAL 433
+ ++ + L + M + E A K + L A
Sbjct: 317 EGLNLNSVSRHELMKTLMRSHDPNAQFEQELAAREKEKKVQERMKTKGVLLKYMFKASEE 376
Query: 434 ADDEEYEEILEDMREEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
+ +E+ ED++ EC KYG + + + + + G++ +++Y A N
Sbjct: 377 TEAGWEKELAEDVKTECENKYGKVQEIGVDKESEE-------GEIVVKFYTIESAEDAIN 429
Query: 493 ALSGRKFGGNTVNA 506
L+GR FGG V A
Sbjct: 430 GLNGRWFGGRQVKA 443
>gi|302838915|ref|XP_002951015.1| hypothetical protein VOLCADRAFT_48801 [Volvox carteri f.
nagariensis]
gi|300263710|gb|EFJ47909.1| hypothetical protein VOLCADRAFT_48801 [Volvox carteri f.
nagariensis]
Length = 82
Score = 75.9 bits (185), Expect = 5e-11, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
++FVGGLP + E +KELL FGTL F+LV D+ TG SKGY FC Y + + ++
Sbjct: 2 KLFVGGLPCEWGEDMVKELLIPFGTLKSFNLVMDKSTGKSKGYAFCEYVEDSSAEVLIKN 61
Query: 356 LNGLKMGDKTLTVRRA 371
L+ ++G K LTV+RA
Sbjct: 62 LHMRRIGSKALTVKRA 77
>gi|414591752|tpg|DAA42323.1| TPA: hypothetical protein ZEAMMB73_939656 [Zea mays]
Length = 270
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 43/88 (48%), Positives = 51/88 (57%), Gaps = 5/88 (5%)
Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
SGFD AP A ++ +PGQLPGV + +P + N+ A Q + P Q MT
Sbjct: 184 SGFDQAPTQQAVPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPHVIQP-QAMT 242
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFF 195
QQATRHAR VYVGGLPP ANEQ I
Sbjct: 243 QQATRHARPVYVGGLPPTANEQVITWIV 270
>gi|357154605|ref|XP_003576839.1| PREDICTED: splicing factor U2af large subunit A-like [Brachypodium
distachyon]
Length = 177
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/104 (45%), Positives = 65/104 (62%), Gaps = 2/104 (1%)
Query: 420 KVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKV 477
KV+CL + I+ADA L DDE YE++++++ +E K+G L++VVIPRP GVG+V
Sbjct: 66 KVVCLAQMISADAEDLRDDELYEDLVDEVEDEAWKFGHLMSVVIPRPGHAPAAAAGVGRV 125
Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
FLEY D G K L R FGG + A +YP+DK+ DY
Sbjct: 126 FLEYADLEGSDRCKTKLHWRWFGGRRIVAAFYPKDKFAGGDYDV 169
>gi|323451698|gb|EGB07574.1| hypothetical protein AURANDRAFT_64670 [Aureococcus anophagefferens]
Length = 214
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 62/110 (56%)
Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
Q+ + ATR RR+YVG LP + + + F ++ + A G AG + VV+ +++ +KK
Sbjct: 43 QMPSNPATRKERRLYVGNLPQTFDSEQLRIFLNEALRACGAIPAGVDEVVVSSWVSPDKK 102
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPS 273
FAFVE+ TVE A+ ++ L GI G +++ P +Y GP P+
Sbjct: 103 FAFVELSTVEAATTSLGLSGITCMGCQLKICHPNNYVVGALPGTGPTLPA 152
>gi|300637966|gb|ADK26147.1| kinase-interacting stathmin [Oryctolagus cuniculus]
Length = 254
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 58/94 (61%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L ++EEYE+I+ED++EEC KYG +V++++P+ E PG +VF+E
Sbjct: 165 VLRLLNVLDDDYLENEEEYEDIVEDVKEECQKYGPVVSLLVPK------ENPGRRQVFVE 218
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 219 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 252
>gi|403281374|ref|XP_003932163.1| PREDICTED: RNA-binding protein 39 [Saimiri boliviensis boliviensis]
Length = 502
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 122/315 (38%), Gaps = 43/315 (13%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
+ D A LNG ++ + + V T + + + + + I L T
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGID---LGT 353
Query: 402 SG----MNTLGGGMSL-FGETLAKVLCLTEAITADALAD-----DEEYE-------EILE 444
+G M L G L + L ++ ++ A+AD ++ E ++
Sbjct: 354 TGRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQ 413
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGV---GKVFLEYYDAVGCATAKNALSGRKFGG 501
+ +C + + N P+ N P G V+++ A NAL GR F G
Sbjct: 414 PLATQCFQLSNMFN---PQTQWNNYHKPFCGYRGNVYVKCPSIAAAIAAVNALHGRWFAG 470
Query: 502 NTVNAFYYPEDKYFN 516
+ A Y P Y N
Sbjct: 471 KMITAAYVPLPTYHN 485
>gi|60599450|gb|AAX26270.1| unknown [Schistosoma japonicum]
Length = 156
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 89/164 (54%), Gaps = 11/164 (6%)
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQTSGMNTLGGGMSLF--G 415
+++GDK L V+RA+ + T +L Q + ++ +Q NT G G G
Sbjct: 1 MQLGDKKLIVQRASVGAKHTT---GVLPQTLLSLPGLEDGTVQ----NTTGSGNITIRSG 53
Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
+VLCL I L DDEEYE+I+ED+R EC KYG + ++ IPRP G E PGVG
Sbjct: 54 GPPTEVLCLMNMIETSELEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIP-GVEVPGVG 112
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
K+++E+ + C A AL+GRKF V ++ D Y +++
Sbjct: 113 KIYVEFASLIDCQKAATALTGRKFNQRLVVTSFFSPDNYHRREF 156
>gi|307108143|gb|EFN56384.1| expressed protein [Chlorella variabilis]
Length = 404
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 54/101 (53%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
Q TR RR+YVGGLP + + TF +Q + A+G ++ + E+ FAF+E
Sbjct: 246 QMTRPMRRLYVGGLPQPCYDFMLTTFLNQALMALGICQVAGKAPIIACQVTPERNFAFIE 305
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
+A+ A+ LDGI F G ++++RP DY P A P
Sbjct: 306 FGDTSDATAALQLDGIPFRGNTLKIKRPKDYTPPFGAPPDP 346
>gi|340506971|gb|EGR33003.1| u2 small nuclear ribonucleoprotein auxiliary factor 2, putative
[Ichthyophthirius multifiliis]
Length = 302
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP---DQNGGE--TPG 473
+VL L I L DEEY++I ED+++EC K+G +V++ IPRP D G+ G
Sbjct: 192 TQVLVLKNMINDGELIIDEEYKQIEEDVKDECSKHGKVVSIAIPRPSVDDVKAGKEHVLG 251
Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
GK+++EY A+ L+GR F TV Y+ KY +DY
Sbjct: 252 KGKIYVEYESIEAAREARRYLNGRLFSNRTVQVSYFNYQKYLEQDY 297
>gi|358342556|dbj|GAA49996.1| RNA-binding protein 39 [Clonorchis sinensis]
Length = 730
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 88/208 (42%), Gaps = 27/208 (12%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK---FAFVE 228
R AR V+V L ++ + FF+ V G V + N K+ A+VE
Sbjct: 76 RDARTVFVWQLSARIRQRDLEDFFTSV---------GKIRDVRLIMDNKTKRSKGIAYVE 126
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R VE A A+ L G GV +++++ + A P P P
Sbjct: 127 FREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRMNAI--PSVPKPTQQ----------- 173
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP ++++G L Y TE +K + E FG + L+KD TG S+GYGF Y +
Sbjct: 174 --NRGPMKLYIGSLHYNITEEMLKGIFEPFGKIDDIKLIKDPATGRSQGYGFVTYANSDD 231
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQ 376
A LNG ++ + + V T G+
Sbjct: 232 AKKALDQLNGFELAGRPMKVNHVTERGE 259
>gi|345496803|ref|XP_003427819.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 2
[Nasonia vitripennis]
Length = 425
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|156553552|ref|XP_001601896.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 1
[Nasonia vitripennis]
Length = 434
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|195116809|ref|XP_002002944.1| GI10246 [Drosophila mojavensis]
gi|193913519|gb|EDW12386.1| GI10246 [Drosophila mojavensis]
Length = 617
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 87/370 (23%), Positives = 134/370 (36%), Gaps = 59/370 (15%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS V G V + N K+F A++E
Sbjct: 257 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 307
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L +A P QP +
Sbjct: 308 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQSAPPPFQPKAHT------------ 355
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 356 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 411
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTL 407
A LNG ++ + + V T T + I + LQ
Sbjct: 412 AKKALEQLNGFELAGRPMKVGNVTERLDMNTSSLDTDEMDRSGIDLGATGRLQLMFKLAE 471
Query: 408 GGGMSLFGETL----------AKVL-------CLTEAITADALAD--DEEYEEILEDMRE 448
G G+++ A VL T+ + D E D+R+
Sbjct: 472 GAGLAVPQAAANALLATAPQPAPVLQQQQTPSIATQCFILSNMFDPRTETNPTWATDVRD 531
Query: 449 E----CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
+ C K+G ++++ + G V+++ A NAL GR F G +
Sbjct: 532 DVLDECAKHGGVLHIHV-------DTVSPTGTVYVKCPSTTTAVLAVNALHGRWFAGRVI 584
Query: 505 NAFYYPEDKY 514
A Y P Y
Sbjct: 585 TAAYVPVINY 594
>gi|307202383|gb|EFN81811.1| Cleavage stimulation factor 64 kDa subunit [Harpegnathos saltator]
Length = 439
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|350401751|ref|XP_003486249.1| PREDICTED: cleavage stimulation factor subunit 2-like [Bombus
impatiens]
Length = 441
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|170058744|ref|XP_001865056.1| splicing factor [Culex quinquefasciatus]
gi|167877732|gb|EDS41115.1| splicing factor [Culex quinquefasciatus]
Length = 546
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 138/372 (37%), Gaps = 62/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R R V+ L + + FFS V G V + N K+F A++E
Sbjct: 188 RDMRTVFCMQLSQRIRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 238
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
R E + A+ L G G+ + V+ LA N+ +
Sbjct: 239 FRDPESVALALGLSGQRLLGIPISVQHTQAEKNRLA------------NIPPPPPPKVIV 286
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G P R++VG L + TE ++ + E FG + L+ D DTG SKGYGF + +
Sbjct: 287 G----PMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDSDTGRSKGYGFITFHNADD 342
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
A LNG ++ + + V T T S+ I A LQ
Sbjct: 343 AKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEMDRSGIDLGATGRLQLMFKL 401
Query: 406 TLGGGMSL----------------------FGETLAKVLCLTEAITADALADDEEYEEIL 443
G G+++ +A L + A + ++ +
Sbjct: 402 AEGAGLAVPRAAADALLATAPQPAPNQPVQDSPAIATQCFLLSNMFDPATETNPSWDVEI 461
Query: 444 E-DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
E D+ EEC K+G +++V + + ++P G V+++ A NAL GR F G
Sbjct: 462 EDDVIEECNKHGGVLHVYVDK------QSPA-GNVYVKCPSIATAVLAVNALHGRWFAGR 514
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 515 VIAAAYVPLVNY 526
>gi|66516308|ref|XP_623321.1| PREDICTED: cleavage stimulation factor subunit 2 [Apis mellifera]
Length = 441
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|432103844|gb|ELK30681.1| Serine/threonine-protein kinase Kist [Myotis davidii]
Length = 435
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 57/94 (60%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + D L +E YE+ +ED+REEC KYG +V++++P+ E+PG G+VF+E
Sbjct: 338 VLRLLNVLDGDYLESEEGYEDAVEDVREECQKYGPVVSLLVPK------ESPGRGQVFVE 391
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ L+GR F G V A +YP Y
Sbjct: 392 YANAGDSKAAQKLLTGRLFDGKFVVATFYPLSAY 425
>gi|193598819|ref|XP_001951232.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 1
[Acyrthosiphon pisum]
gi|328706164|ref|XP_003243012.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 2
[Acyrthosiphon pisum]
Length = 386
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 54/89 (60%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 14 VFVGNIPYEATEEKLKDIFSEVGPVISFKLVYDRETGKPKGYGFCEYKDQETALSAMRNL 73
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S++
Sbjct: 74 NGYEIGGRTLRVDNA-CTEKSRLEMQSLM 101
>gi|380028061|ref|XP_003697730.1| PREDICTED: cleavage stimulation factor subunit 2-like [Apis florea]
Length = 441
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|116793682|gb|ABK26841.1| unknown [Picea sitchensis]
Length = 347
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 98/209 (46%), Gaps = 29/209 (13%)
Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY---INHEKKFAFVEMRTV 232
++YVG LP + + +A F + +G + V +Y + FAFV M TV
Sbjct: 160 KLYVGNLPFDIDSEGLAKMFDE---------SGVVEMVEVIYDRSSGRSRGFAFVTMSTV 210
Query: 233 EEASNAMA-LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
EEA A+ +G +G ++RV P P L P + N G
Sbjct: 211 EEAEAAIKKFNGFEIDGRSLRVNFPE--VPRLQNGRSPARSPSNFG-----------GFV 257
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ P +V+VG L + T ++E L G + G +++DR+TG S+G+GF + A +
Sbjct: 258 DSPHKVYVGNLAWSVTSETLREALNGKGNVLGAKVIQDRETGRSRGFGFVSFSSEAEVEA 317
Query: 352 ACAALNGLKMGDKTLTVRRA---TASGQS 377
A + ++GL++ +++ V A + GQS
Sbjct: 318 AVSEMDGLEVEGRSIRVNVAKSRSTEGQS 346
>gi|281209343|gb|EFA83511.1| RNA-binding region RNP-1 domain-containing protein [Polysphondylium
pallidum PN500]
Length = 1109
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 102/236 (43%), Gaps = 25/236 (10%)
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
+R++VG +P+ E QIK + S G + L+ + ++G KG+GF Y + + A A
Sbjct: 873 NRIYVGSIPWNVNEDQIKVIFSSIGNVVSCSLMPNLESGRHKGFGFIDYDNSKSAEDAIA 932
Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
LNG +G + L V R + + ES + ++ + G +++ ++L
Sbjct: 933 TLNGYDIGGRQLKVGRPIKNASISSSNESKQTTPLSTPMVPTLS-SSVGTDSIEDDVTLS 991
Query: 415 GETLAKVLCLTEAITADA------------LADDEEYEEIL-EDMREECGKYGTLVNVVI 461
E ++L + + D L +E ++ E+++ EC +G + V+
Sbjct: 992 TE--QRILLTQKLLRQDISRTSNRCLVLRNLGSPKEIDDFFEEEIKAECMSFGQVEKFVL 1049
Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
K F+ + + CAT N +GR F G V A YY + FNK
Sbjct: 1050 TH--------DASVKAFILFKEPAACATCFNKQNGRYFSGYIVKAEYY-DISLFNK 1096
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 37/75 (49%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
R+++G + + TET + + FG + L KD TG SKGY F Y P + A +
Sbjct: 686 RIYIGNIHFNLTETDLTSIFSPFGPIKSLSLSKDPATGKSKGYCFIEYSYPEAANNAISH 745
Query: 356 LNGLKMGDKTLTVRR 370
+N + + + V R
Sbjct: 746 MNHQSLAGRQIKVGR 760
>gi|383860217|ref|XP_003705587.1| PREDICTED: cleavage stimulation factor subunit 2-like [Megachile
rotundata]
Length = 441
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|307172466|gb|EFN63915.1| Cleavage stimulation factor 64 kDa subunit [Camponotus floridanus]
Length = 438
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|322788027|gb|EFZ13868.1| hypothetical protein SINV_14012 [Solenopsis invicta]
Length = 291
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|193671655|ref|XP_001946102.1| PREDICTED: cleavage stimulation factor subunit 2-like
[Acyrthosiphon pisum]
Length = 388
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 54/89 (60%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 14 VFVGNIPYEATEEKLKDIFNEVGPVISFKLVYDRETGKPKGYGFCEYKDQETALSAMRNL 73
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S++
Sbjct: 74 NGYEIGGRTLRVDNA-CTEKSRLEMQSLM 101
>gi|332026262|gb|EGI66401.1| Cleavage stimulation factor 64 kDa subunit [Acromyrmex echinatior]
Length = 480
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 19 VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E +S+L
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106
>gi|242018247|ref|XP_002429590.1| A-kinase anchor protein, putative [Pediculus humanus corporis]
gi|212514557|gb|EEB16852.1| A-kinase anchor protein, putative [Pediculus humanus corporis]
Length = 408
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +P+ TE ++KE+ G + F LV DR+ G KGYGFC Y+D + A L
Sbjct: 16 VFVGNIPFDLTEEKLKEIFSEVGPVLSFKLVYDRENGKPKGYGFCEYKDIETANSAMRNL 75
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
NG ++G + L V A A+ +++ E ++++ QA + I
Sbjct: 76 NGFEIGGRVLKVDNA-ANEKTRMEMQNMI-QANEPI 109
>gi|195471585|ref|XP_002088083.1| GE14328 [Drosophila yakuba]
gi|194174184|gb|EDW87795.1| GE14328 [Drosophila yakuba]
Length = 590
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/372 (23%), Positives = 135/372 (36%), Gaps = 63/372 (16%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS V G V + N K+F A++E
Sbjct: 230 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 280
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L A QP +
Sbjct: 281 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 328
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 329 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 384
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
A LNG ++ + + V T T S+ I A LQ
Sbjct: 385 AKKALEQLNGFELAGRLMKVGNVTERLDMNT--TSLDTDEMDRTGIDLGATGRLQLMFKL 442
Query: 406 TLGGGMSL-----------------FGETLAKVLCLTEAITADALAD-----DEEYEEIL 443
G G+++ + A T+ + D + ++ +
Sbjct: 443 AEGAGLAVPQAAANALLATAPQPAPLQQQEAAPSIATQCFILSNMFDPRTETNPTWDVEI 502
Query: 444 ED-MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
+D + EEC K+G ++++ + G V+++ A NAL GR F G
Sbjct: 503 KDDVLEECAKHGGVLHIHV-------DTISPTGTVYVKCPSTTTAVLAVNALHGRWFAGR 555
Query: 503 TVNAFYYPEDKY 514
+ A Y P Y
Sbjct: 556 VITAAYVPVINY 567
>gi|116781814|gb|ABK22250.1| unknown [Picea sitchensis]
Length = 355
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 98/209 (46%), Gaps = 29/209 (13%)
Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY---INHEKKFAFVEMRTV 232
++YVG LP + + +A F + +G + V +Y + FAFV M TV
Sbjct: 168 KLYVGNLPFDIDSEGLAKMFDE---------SGVVEMVEVIYDRSSGRSRGFAFVTMSTV 218
Query: 233 EEASNAMA-LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
EEA A+ +G +G ++RV P P L P + N G
Sbjct: 219 EEAEAAIKKFNGFEIDGRSLRVNFPE--VPRLQNGRSPARSPSNFG-----------GFV 265
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ P +V+VG L + T ++E L G + G +++DR+TG S+G+GF + A +
Sbjct: 266 DSPHKVYVGNLAWSVTSETLREALNGKGNVLGAKVIQDRETGRSRGFGFVSFSSEAEVEA 325
Query: 352 ACAALNGLKMGDKTLTVRRA---TASGQS 377
A + ++GL++ +++ V A + GQS
Sbjct: 326 AVSEMDGLEVEGRSIRVNVAKSRSTEGQS 354
>gi|2459426|gb|AAB80661.1| putative splicing factor U2AF large chain [Arabidopsis thaliana]
Length = 475
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 122/557 (21%), Positives = 204/557 (36%), Gaps = 141/557 (25%)
Query: 2 SGRDIRYDAVGEGSRHKSSWV--SGRSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREG 59
S + +R V + R +SS +G R + G + +R+ +R D + E
Sbjct: 9 SKKRLRSLVVADVPRDESSIKPDNGDKRKNQNGNHKKNREINMS---KRHDPGKVHSVEV 65
Query: 60 IRDHDRTDRHRDYNRD-KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAP 118
+R ++ + RD +E+R R RSR H DR ++ SKS SP K + ++ A
Sbjct: 66 SERWERREQPKSRQRDLREKRRRSRSRDHGQDRQKSASKSELGGYSPRKRREQASTKAAS 125
Query: 119 P------------------AAAMLPGAAVPGQLPGVPSAVPEMAQNML-----------P 149
P A M + G +A P +++ L P
Sbjct: 126 PPNLSSEKKSAKWGLAATVTAGMFSDSVFSGLQAATQTAYPTISEASLTLLKPLMVMDAP 185
Query: 150 F---GATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
F A Q +F V ++TR RR+Y +P A+E+++ F+ M + G N
Sbjct: 186 FRTPPARQTTSFD----SVQLTESTRRMRRLYAENVPDSASEKSLIECFNGYMLSSGSNH 241
Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
+ ++ T ++AS A++LDG F G +++RRP DY + +
Sbjct: 242 IKGSEPCISCI-----------FLTPQDASAALSLDGCSFAGSNLKIRRPKDYLMEIVSV 290
Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
GP L +
Sbjct: 291 FGP---------------------------------------------------LKAYRF 299
Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES--- 383
V + D ++ F Y D +VT ACA LNG+++G +T A S E+
Sbjct: 300 VSNNDL--NQRCAFLEYTDGSVTLKACAGLNGMRLGGSVITAVCAFPDASSVAVNENPPF 357
Query: 384 --ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA--DDEEY 439
I + A+ + K +L L + + L ++E
Sbjct: 358 YGIPSHAKPLLGKPK-----------------------NILKLKNVVDPEDLTSFSEQEV 394
Query: 440 EEILEDMREECGKY--GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGR 497
+EILED+R EC ++ G + ++ ET G +F+EY A ++L GR
Sbjct: 395 KEILEDVRLECARWDAGDKIEEEQEEDPEDVFET---GCIFIEYRRPEATCDAAHSLHGR 451
Query: 498 KFGGNTVNAFYYPEDKY 514
+ V A Y ++ Y
Sbjct: 452 LYDNRIVKAEYVSKELY 468
>gi|391347243|ref|XP_003747874.1| PREDICTED: cleavage stimulation factor subunit 2-like [Metaseiulus
occidentalis]
Length = 422
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K + E G + F LV DR+TG KGYGFC ++D A L
Sbjct: 20 VFVGNIPYDATEEQLKTIFEEVGPVVNFRLVYDRETGKPKGYGFCEFKDQETAMSAMRNL 79
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
N ++G + L V A AS ++K E +++ AQ
Sbjct: 80 NSFEIGGRALRVDHA-ASERNKEELKALYAQ 109
>gi|330793087|ref|XP_003284617.1| hypothetical protein DICPUDRAFT_148416 [Dictyostelium purpureum]
gi|325085416|gb|EGC38823.1| hypothetical protein DICPUDRAFT_148416 [Dictyostelium purpureum]
Length = 829
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/223 (25%), Positives = 108/223 (48%), Gaps = 19/223 (8%)
Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
+R+++G + + TE QI+ + FG + L+++ +TG KGYGF +++ D A
Sbjct: 622 NRIYIGSINWNVTEEQIRGIFSQFGKIISCFLMQNTETGKHKGYGFIDFENKKSADDAL- 680
Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
A+NG ++ + + V R T + T I + +++ A+ T+ + L
Sbjct: 681 AMNGFELLGRAMKVGRPTKGASANT----ISNGSIDKTSLEGEAMLTTSDQRIQLTQKLL 736
Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
G K L L A + D + D +E ED+R C ++G + +VI + D +
Sbjct: 737 GNE-NKCLVLRNAGSPDDI--DPSFE---EDIRSGCNEFGEIEKLVI-KTDSS------T 783
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
+V++ + +A C ++ L+G+ F + + A +Y + FNK
Sbjct: 784 VRVYIVFKEAPSCVACQSKLNGKYFSYHCIKAEFY-DINLFNK 825
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 1/81 (1%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
RV++G L + E I + FGT+ L KD + G SKGY F Y+ P A +
Sbjct: 464 RVYIGNLHFSLAEDAIIQAFSQFGTVKSILLGKDAN-GKSKGYAFIEYESPDSATKAIES 522
Query: 356 LNGLKMGDKTLTVRRATASGQ 376
++ M + + V R A GQ
Sbjct: 523 MSNYVMAGRVIKVNRPLAGGQ 543
>gi|194862772|ref|XP_001970115.1| GG10454 [Drosophila erecta]
gi|190661982|gb|EDV59174.1| GG10454 [Drosophila erecta]
Length = 593
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/374 (22%), Positives = 133/374 (35%), Gaps = 67/374 (17%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N K+F A++E
Sbjct: 233 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 283
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L A QP +
Sbjct: 284 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 331
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 332 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 387
Query: 349 TDIACAALNGLKMGDKTLTVRRAT-----------------------ASGQS----KTEQ 381
A LNG ++ + + V T A+G+ K +
Sbjct: 388 AKKALEQLNGFELAGRLMKVGNVTERLDMNTTSLDTDEMDRTGIDLGATGRLQLMFKLAE 447
Query: 382 ESILAQAQQHIAIQKMAL-QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
+ LA Q Q + M S+ + + D E +
Sbjct: 448 GAGLAVPQAAANALLATAPQPAPMQQQEAAPSIATQCFILSNMFDPRTETNPTWDVEIRD 507
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
++LE EC K+G ++++ + G V+++ A NAL GR F
Sbjct: 508 DVLE----ECAKHGGVLHIHV-------DTISPTGTVYVKCPSTTTAVLAVNALHGRWFA 556
Query: 501 GNTVNAFYYPEDKY 514
G + A Y P Y
Sbjct: 557 GRVITAAYVPVINY 570
>gi|30685698|ref|NP_850209.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
gi|330253740|gb|AEC08834.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
Length = 979
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 76/165 (46%), Gaps = 18/165 (10%)
Query: 117 APPAAAMLPGAAVPGQLPGVPSAVPEMAQNML-----------PFG---ATQLGAFPLMP 162
A A M + G +A P +++ L PF A Q +F
Sbjct: 810 ATVTAGMFSDSVFSGLQAATQTAYPTISEASLTLLKPLMVMDAPFRTPPARQTTSFD--- 866
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
V ++TR RR+Y +P A+E+++ F+ M + G N + ++ IN EK
Sbjct: 867 -SVQLTESTRRMRRLYAENVPDSASEKSLIECFNGYMLSSGSNHIKGSEPCISCIINKEK 925
Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAAL 267
A VE T ++AS A++LDG F G +++RRP DY T +++
Sbjct: 926 SQALVEFLTPQDASAALSLDGCSFAGSNLKIRRPKDYVRTTVSSI 970
>gi|262348230|gb|ACY56333.1| putative splicing factor u2af large subunit, partial [Monascus
ruber]
Length = 90
Score = 73.2 bits (178), Expect = 3e-10, Method: Composition-based stats.
Identities = 35/75 (46%), Positives = 54/75 (72%), Gaps = 2/75 (2%)
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALSGRKF 499
EI++D+R+EC KYGT++ + IPRP G ++PGVGK++++ +D+V AT A AL+GRKF
Sbjct: 12 EIMDDVRDECSKYGTILELKIPRPTTGGRQSPGVGKIYVK-FDSVKSATEALKALAGRKF 70
Query: 500 GGNTVNAFYYPEDKY 514
TV Y+ E+ +
Sbjct: 71 SDRTVVTTYFSEENF 85
>gi|426252723|ref|XP_004020052.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
isoform 1 [Ovis aries]
Length = 572
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|126342916|ref|XP_001364467.1| PREDICTED: cleavage stimulation factor subunit 2-like [Monodelphis
domestica]
Length = 551
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|156363387|ref|XP_001626026.1| predicted protein [Nematostella vectensis]
gi|156212886|gb|EDO33926.1| predicted protein [Nematostella vectensis]
Length = 468
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 105/248 (42%), Gaps = 37/248 (14%)
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
GP R++VG L + TE +K + E FGT+ L+ D +T SKGYGF +++ A
Sbjct: 213 GPTRLYVGSLHFNITEAMVKAVFEPFGTVDSVQLIYDSETNRSKGYGFVQFREAEAAKRA 272
Query: 353 CAALNGLKMGDKTLTVRRATASGQS-------------------KTEQESILAQAQQ-HI 392
+NG ++ + L + T G S + + +++A+ Q H
Sbjct: 273 MEQMNGFELAGRPLKIGPVTERGDSSAYSFLDDEEYEKGGVELNSSARAALMAKLSQGHS 332
Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALAD-----DEEYE-EILEDM 446
A L G + G+ T V T + D D ++ +I +D+
Sbjct: 333 A----GLSVPGAPPIVSGVQQALATPVAVSLPTPCFMLTNMFDPTKERDAGWDLDIRDDV 388
Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
EEC K+G +V++ + D+N + G V+++ +A +L GR F G + A
Sbjct: 389 LEECNKFGPIVHIHV---DKNSPQ----GIVYVKCATPDIAISASKSLHGRWFAGKQIIA 441
Query: 507 FYYPEDKY 514
P Y
Sbjct: 442 APVPLSNY 449
>gi|88682979|gb|AAI05553.1| CSTF2 protein [Bos taurus]
Length = 632
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|22478042|gb|AAH36719.1| Cstf2 protein [Mus musculus]
Length = 510
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|332372985|gb|AEE61634.1| unknown [Dendroctonus ponderosae]
Length = 409
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 55/91 (60%), Gaps = 1/91 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 16 VFVGNIPYEATEEKLKDIFGEVGQVLSFKLVFDRETGKPKGYGFCEYRDQETALSAMRNL 75
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
NG ++G ++L V A + +S+ E +++L Q
Sbjct: 76 NGYEIGGRSLRVDNA-CTEKSRMEMQNLLNQ 105
>gi|157821159|ref|NP_001101056.1| cleavage stimulation factor subunit 2 tau variant [Rattus
norvegicus]
gi|149062701|gb|EDM13124.1| rCG47773 [Rattus norvegicus]
Length = 629
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|449498377|ref|XP_002191180.2| PREDICTED: cleavage stimulation factor subunit 2 [Taeniopygia
guttata]
Length = 575
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|344275007|ref|XP_003409305.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
[Loxodonta africana]
Length = 609
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|301788792|ref|XP_002929813.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
[Ailuropoda melanoleuca]
Length = 552
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|197246895|gb|AAI69065.1| Cleavage stimulation factor, 3' pre-RNA subunit 2, tau [Rattus
norvegicus]
Length = 629
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|196115100|ref|NP_001124486.1| cleavage stimulation factor, 3' pre-RNA subunit 2 [Rattus
norvegicus]
gi|195539770|gb|AAI68251.1| Cstf2 protein [Rattus norvegicus]
Length = 575
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|336088638|ref|NP_001229511.1| cleavage stimulation factor subunit 2 tau variant [Bos taurus]
gi|296472872|tpg|DAA14987.1| TPA: CSTF2 protein-like [Bos taurus]
Length = 642
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|426252727|ref|XP_004020054.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
isoform 3 [Ovis aries]
Length = 623
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|426223062|ref|XP_004005698.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
[Ovis aries]
Length = 607
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|426252729|ref|XP_004020055.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
isoform 4 [Ovis aries]
Length = 646
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|47225344|emb|CAG09844.1| unnamed protein product [Tetraodon nigroviridis]
Length = 570
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 28 VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 87
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 88 NGREFSGRALRVDNA-ASEKNKEELKSL 114
>gi|194332803|ref|NP_001123707.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Xenopus
(Silurana) tropicalis]
gi|189442601|gb|AAI67314.1| LOC100170457 protein [Xenopus (Silurana) tropicalis]
Length = 498
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|431895723|gb|ELK05144.1| Cleavage stimulation factor 64 kDa subunit [Pteropus alecto]
Length = 577
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|50510589|dbj|BAD32280.1| mKIAA0689 protein [Mus musculus]
Length = 643
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 29 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 88
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 89 NGREFSGRALRVDNA-ASEKNKEELKSL 115
>gi|20072518|gb|AAH26995.1| Cstf2t protein [Mus musculus]
Length = 637
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|26328597|dbj|BAC28037.1| unnamed protein product [Mus musculus]
Length = 580
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|68481591|ref|XP_715304.1| potential spliceosomal U2AF large subunit [Candida albicans SC5314]
gi|46436920|gb|EAK96275.1| potential spliceosomal U2AF large subunit [Candida albicans SC5314]
Length = 719
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 90/214 (42%), Gaps = 30/214 (14%)
Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
I+ L ++ G L G ++ + T N G F ++D ++ L L + R
Sbjct: 524 IESLEKNVGALQGIQFLRQKGTKNLLGLVFVEFKD--SSNDVIGKLRRLPF------ITR 575
Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
A S IL IQK + + L ++ ++V+ L A+T
Sbjct: 576 AFHS--------CILPNK---TPIQKGPIDFHSLKNLVENKNVAPHPSSRVIRLLNAVTE 624
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV---------GKVFLEY 481
LADD Y I DM E KYG +VNV IPRP +N TPG+ G +++E+
Sbjct: 625 SELADDATYSFIRNDMYNEASKYGEVVNVRIPRPSRN--HTPGILQFNTSTGLGTIYIEF 682
Query: 482 YDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
D A L+G+ + TV A +Y D +
Sbjct: 683 KDEKIALAAMMELAGKSYNDRTVLATFYDFDDFL 716
>gi|18875338|ref|NP_573459.1| cleavage stimulation factor subunit 2 [Mus musculus]
gi|71153229|sp|Q8BIQ5.2|CSTF2_MOUSE RecName: Full=Cleavage stimulation factor subunit 2; AltName:
Full=CF-1 64 kDa subunit; AltName: Full=Cleavage
stimulation factor 64 kDa subunit; Short=CSTF 64 kDa
subunit; Short=CstF-64
gi|11139720|gb|AAG31814.1|AF317552_1 polyadenylation protein CSTF64 [Mus musculus]
gi|26353226|dbj|BAC40243.1| unnamed protein product [Mus musculus]
Length = 580
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|148709763|gb|EDL41709.1| cleavage stimulation factor, 3' pre-RNA subunit 2, tau [Mus
musculus]
Length = 644
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 30 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 89
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 90 NGREFSGRALRVDNA-ASEKNKEELKSL 116
>gi|426252725|ref|XP_004020053.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
isoform 2 [Ovis aries]
Length = 612
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|270000744|gb|EEZ97191.1| hypothetical protein TcasGA2_TC004378 [Tribolium castaneum]
Length = 409
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 16 VFVGNIPYEATEEKLKDIFGEVGQVLSFKLVFDRETGKPKGYGFCEYRDQETALSAMRNL 75
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
NG ++G + L V A + +S+ E +++L Q
Sbjct: 76 NGYEIGGRNLRVDNA-CTEKSRMEMQNLLNQ 105
>gi|148277061|ref|NP_112539.2| cleavage stimulation factor subunit 2 tau variant [Mus musculus]
gi|71153235|sp|Q8C7E9.2|CSTFT_MOUSE RecName: Full=Cleavage stimulation factor subunit 2 tau variant;
AltName: Full=CF-1 64 kDa subunit tau variant; AltName:
Full=Cleavage stimulation factor 64 kDa subunit tau
variant; Short=CSTF 64 kDa subunit tau variant; AltName:
Full=TauCstF-64
gi|26330250|dbj|BAC28855.1| unnamed protein product [Mus musculus]
gi|26350087|dbj|BAC38683.1| unnamed protein product [Mus musculus]
Length = 632
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|26341156|dbj|BAC34240.1| unnamed protein product [Mus musculus]
Length = 632
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|410043862|ref|XP_003951699.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Pan
troglodytes]
Length = 576
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|301788318|ref|XP_002929575.1| PREDICTED: cleavage stimulation factor subunit 2-like [Ailuropoda
melanoleuca]
Length = 582
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|192764314|gb|ACF05701.1| alphaCstF-64 variant 4 [Mus musculus]
Length = 554
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|194373969|dbj|BAG62297.1| unnamed protein product [Homo sapiens]
Length = 553
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|194042423|ref|XP_001926989.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
isoform 1 [Sus scrofa]
Length = 615
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|351715564|gb|EHB18483.1| Cleavage stimulation factor 64 kDa subunit, tau variant
[Heterocephalus glaber]
Length = 642
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|348570430|ref|XP_003471000.1| PREDICTED: cleavage stimulation factor subunit 2-like [Cavia
porcellus]
Length = 577
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|291408103|ref|XP_002720432.1| PREDICTED: cleavage stimulation factor subunit 2 [Oryctolagus
cuniculus]
Length = 576
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|57530169|ref|NP_001006433.1| cleavage stimulation factor subunit 2 [Gallus gallus]
gi|53128673|emb|CAG31323.1| hypothetical protein RCJMB04_5b8 [Gallus gallus]
Length = 475
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|51258747|gb|AAH80037.1| Cstf-64-prov protein [Xenopus laevis]
Length = 498
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|74007924|ref|XP_861405.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 5 [Canis
lupus familiaris]
Length = 577
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|26351239|dbj|BAC39256.1| unnamed protein product [Mus musculus]
Length = 642
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|221111756|ref|XP_002159647.1| PREDICTED: RNA-binding protein 39-like [Hydra magnipapillata]
Length = 528
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 144/363 (39%), Gaps = 48/363 (13%)
Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
Q R +R V++ L + I FFS+V + + K +VE
Sbjct: 178 QEERDSRTVFIMQLAKQVTIRDIQDFFSKV------GQVRDVRLISDRNSRRSKGIGYVE 231
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ A+ L G GV + V +PT+A N A A A+
Sbjct: 232 FTDASAVTLAIKLSGQKLLGVPIMV------SPTMAEK----------NRYAA--AQAAL 273
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+GP +++VG L Y TE ++ + E FGT+ L D +T SKG+GF +++
Sbjct: 274 VKPQGPMKLYVGSLHYNITEPMLRAIFEPFGTVESVQLQYDSETNRSKGFGFVNFREAGA 333
Query: 349 TDIACAALNGLKMGDKTL---TVRRATASGQS-----KTEQESILAQAQQHIAI-QKMA- 398
A +NG ++ + + TV T S +TE+ I AQ ++ QK+A
Sbjct: 334 AKRAMEQMNGFELAGRPMKVNTVSERTDGSMSFLDDEETEKGGIEMNAQSRASLMQKLAQ 393
Query: 399 -----LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDE-EYE-EILEDMREECG 451
LQ + + +A CL + D + E ++E +I D+ EE
Sbjct: 394 THGSGLQVPTAPIIPAMLPTPMMNVAGSTCLILSNLFDPRKETESDWELDIRNDVLEEVT 453
Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
K G +V++ I D+ E G V+++ +GR F G T+ A P
Sbjct: 454 KMGIVVHISI---DKISAE----GNVYIKTLIPDTAQKILQTFNGRWFAGRTIRAVAIPV 506
Query: 512 DKY 514
Y
Sbjct: 507 ANY 509
>gi|410927616|ref|XP_003977237.1| PREDICTED: cleavage stimulation factor subunit 2-like [Takifugu
rubripes]
Length = 497
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 31 VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 90
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 91 NGREFSGRALRVDNA-ASEKNKEELKSL 117
>gi|348515337|ref|XP_003445196.1| PREDICTED: cleavage stimulation factor subunit 2-like [Oreochromis
niloticus]
Length = 478
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 31 VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 90
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 91 NGREFSGRALRVDNA-ASEKNKEELKSL 117
>gi|27807239|ref|NP_777110.1| cleavage stimulation factor subunit 2 [Bos taurus]
gi|71153228|sp|Q8HXM1.1|CSTF2_BOVIN RecName: Full=Cleavage stimulation factor subunit 2; AltName:
Full=CF-1 64 kDa subunit; AltName: Full=Cleavage
stimulation factor 64 kDa subunit; Short=CSTF 64 kDa
subunit; Short=CstF-64
gi|24416593|gb|AAN05427.1| CstF-64 [Bos taurus]
gi|296470997|tpg|DAA13112.1| TPA: cleavage stimulation factor 64 kDa subunit [Bos taurus]
Length = 572
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|68481460|ref|XP_715369.1| potential spliceosomal U2AF large subunit [Candida albicans SC5314]
gi|46436988|gb|EAK96342.1| potential spliceosomal U2AF large subunit [Candida albicans SC5314]
gi|238882073|gb|EEQ45711.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 717
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 90/214 (42%), Gaps = 30/214 (14%)
Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
I+ L ++ G L G ++ + T N G F ++D ++ L L + R
Sbjct: 522 IESLEKNVGALQGIQFLRQKGTKNLLGLVFVEFKD--SSNDVIGKLRRLPF------ITR 573
Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
A S IL IQK + + L ++ ++V+ L A+T
Sbjct: 574 AFHS--------CILPNK---TPIQKGPIDFHSLKNLVENKNVAPHPSSRVIRLLNAVTE 622
Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV---------GKVFLEY 481
LADD Y I DM E KYG +VNV IPRP +N TPG+ G +++E+
Sbjct: 623 SELADDATYSFIRNDMYNEASKYGEVVNVRIPRPSRN--HTPGILQFNTSTGLGTIYIEF 680
Query: 482 YDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
D A L+G+ + TV A +Y D +
Sbjct: 681 KDEKIALAAMMELAGKSYNDRTVLATFYDFDDFL 714
>gi|426257829|ref|XP_004022524.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Ovis
aries]
Length = 572
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|344296752|ref|XP_003420068.1| PREDICTED: cleavage stimulation factor subunit 2-like [Loxodonta
africana]
Length = 582
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|632500|gb|AAB50269.1| polyadenylation factor 64 kDa subunit [Xenopus laevis]
Length = 497
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|260790683|ref|XP_002590371.1| hypothetical protein BRAFLDRAFT_216236 [Branchiostoma floridae]
gi|229275563|gb|EEN46382.1| hypothetical protein BRAFLDRAFT_216236 [Branchiostoma floridae]
Length = 222
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 55/103 (53%), Gaps = 1/103 (0%)
Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
+AAVGL VFVG +PY TE Q+K++ G + F LV DR+TG KG
Sbjct: 1 MAAVGLQGLNPAQDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVISFRLVYDRETGKPKG 60
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
YGFC Y+D A LNG ++ + L V A AS +SK E
Sbjct: 61 YGFCEYKDQETALSAMRNLNGHELNGRQLRVDNA-ASEKSKEE 102
>gi|147898871|ref|NP_001080179.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Xenopus
laevis]
gi|27735464|gb|AAH41291.1| Cstf-64 protein [Xenopus laevis]
Length = 518
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|395820735|ref|XP_003783716.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
[Otolemur garnettii]
Length = 601
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|332254742|ref|XP_003276491.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1
[Nomascus leucogenys]
Length = 577
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|21619877|gb|AAH33135.1| Similar to cleavage stimulation factor, 3' pre-RNA, subunit 2,
64kD, partial [Homo sapiens]
Length = 559
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 17 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 76
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 77 NGREFSGRALRVDNA-ASEKNKEELKSL 103
>gi|4557493|ref|NP_001316.1| cleavage stimulation factor subunit 2 [Homo sapiens]
gi|332861154|ref|XP_003317595.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Pan
troglodytes]
gi|397478196|ref|XP_003810439.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Pan
paniscus]
gi|426396653|ref|XP_004064546.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Gorilla
gorilla gorilla]
gi|461847|sp|P33240.1|CSTF2_HUMAN RecName: Full=Cleavage stimulation factor subunit 2; AltName:
Full=CF-1 64 kDa subunit; AltName: Full=Cleavage
stimulation factor 64 kDa subunit; Short=CSTF 64 kDa
subunit; Short=CstF-64
gi|181139|gb|AAA35724.1| cleavage stimulation factor [Homo sapiens]
gi|17389334|gb|AAH17712.1| Cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Homo
sapiens]
gi|32879899|gb|AAP88780.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Homo
sapiens]
gi|61359609|gb|AAX41742.1| cleavage stimulation factor 3' pre-RNA subunit 2 [synthetic
construct]
gi|61359616|gb|AAX41743.1| cleavage stimulation factor 3' pre-RNA subunit 2 [synthetic
construct]
gi|119623223|gb|EAX02818.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, isoform
CRA_a [Homo sapiens]
gi|123981258|gb|ABM82458.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa
[synthetic construct]
gi|123996091|gb|ABM85647.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa
[synthetic construct]
gi|261860120|dbj|BAI46582.1| Cleavage stimulation factor 64 kDa subunit [synthetic construct]
gi|410256936|gb|JAA16435.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Pan
troglodytes]
gi|410289934|gb|JAA23567.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Pan
troglodytes]
Length = 577
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|194205912|ref|XP_001917732.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
[Equus caballus]
Length = 619
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|444707906|gb|ELW49054.1| Cleavage stimulation factor subunit 2 tau variant [Tupaia
chinensis]
Length = 654
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|431839028|gb|ELK00957.1| Cleavage stimulation factor 64 kDa subunit, tau variant [Pteropus
alecto]
Length = 601
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|402910773|ref|XP_003918026.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Papio
anubis]
gi|402910775|ref|XP_003918027.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Papio
anubis]
Length = 577
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|102269210|gb|ABF55966.2| cleavage stimulation factor 64-kDa subunit [Bombyx mori]
Length = 326
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 55/93 (59%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 20 VFVGNIPYEATEEKLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 79
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G ++L V A + +S+ E ++++ Q
Sbjct: 80 NGYEIGGRSLRVDNA-CTEKSRMEMQALMQGPQ 111
>gi|74007936|ref|XP_549135.2| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Canis
lupus familiaris]
Length = 597
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|417412278|gb|JAA52529.1| Putative mrna cleavage and polyadenylation factor i complex subunit
rna15, partial [Desmodus rotundus]
Length = 678
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 52 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 111
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 112 NGREFSGRALRVDNA-ASEKNKEELKSL 138
>gi|355681345|gb|AER96778.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Mustela
putorius furo]
Length = 582
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|197100099|ref|NP_001125111.1| cleavage stimulation factor subunit 2 [Pongo abelii]
gi|71153230|sp|Q5RDA3.1|CSTF2_PONAB RecName: Full=Cleavage stimulation factor subunit 2; AltName:
Full=CF-1 64 kDa subunit; AltName: Full=Cleavage
stimulation factor 64 kDa subunit; Short=CSTF 64 kDa
subunit; Short=CstF-64
gi|55726993|emb|CAH90254.1| hypothetical protein [Pongo abelii]
Length = 577
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|410988989|ref|XP_004000752.1| PREDICTED: LOW QUALITY PROTEIN: cleavage stimulation factor subunit
2 [Felis catus]
Length = 577
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|348576422|ref|XP_003473986.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
[Cavia porcellus]
Length = 630
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|335306285|ref|XP_003360436.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 2
[Sus scrofa]
Length = 592
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|440901693|gb|ELR52585.1| Cleavage stimulation factor subunit 2 [Bos grunniens mutus]
Length = 619
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|395850633|ref|XP_003797884.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 2
[Otolemur garnettii]
Length = 596
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|311276614|ref|XP_003135279.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 1
[Sus scrofa]
Length = 572
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|380798899|gb|AFE71325.1| cleavage stimulation factor subunit 2, partial [Macaca mulatta]
Length = 575
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 16 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 75
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 76 NGREFSGRALRVDNA-ASEKNKEELKSL 102
>gi|332254744|ref|XP_003276492.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2
[Nomascus leucogenys]
Length = 597
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|119224067|gb|AAI26544.1| CSTF2 protein [Bos taurus]
Length = 592
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|403298766|ref|XP_003940178.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 3 [Saimiri
boliviensis boliviensis]
Length = 597
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|291404363|ref|XP_002718535.1| PREDICTED: cleavage stimulation factor, 3' pre-RNA, subunit 2,
64kDa, tau variant [Oryctolagus cuniculus]
Length = 601
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|192764310|gb|ACF05699.1| betaCstF-64 variant 2 [Homo sapiens]
Length = 597
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|444518829|gb|ELV12414.1| Cleavage stimulation factor subunit 2 [Tupaia chinensis]
Length = 409
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|332861156|ref|XP_529072.3| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Pan
troglodytes]
gi|397478198|ref|XP_003810440.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Pan
paniscus]
gi|426396655|ref|XP_004064547.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Gorilla
gorilla gorilla]
Length = 597
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|296235968|ref|XP_002763125.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1
[Callithrix jacchus]
Length = 597
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|281348298|gb|EFB23882.1| hypothetical protein PANDA_020101 [Ailuropoda melanoleuca]
Length = 612
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|410353769|gb|JAA43488.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Pan
troglodytes]
Length = 577
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|390480015|ref|XP_003735829.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2
[Callithrix jacchus]
Length = 577
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|432877328|ref|XP_004073146.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
isoform 1 [Oryzias latipes]
Length = 494
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 31 VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 90
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 91 NGREFSGRALRVDNA-ASEKNKEELKSL 117
>gi|11762098|gb|AAG40327.1|AF322194_1 variant polyadenylation protein CSTF-64 [Mus musculus]
Length = 630
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|395850631|ref|XP_003797883.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 1
[Otolemur garnettii]
Length = 576
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|192764312|gb|ACF05700.1| betaCstF-64 variant 3 [Mus musculus]
Length = 630
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|297304342|ref|XP_001089558.2| PREDICTED: cleavage stimulation factor subunit 2-like [Macaca
mulatta]
gi|402910777|ref|XP_003918028.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 3 [Papio
anubis]
gi|355704983|gb|EHH30908.1| hypothetical protein EGK_20728 [Macaca mulatta]
gi|355757534|gb|EHH61059.1| hypothetical protein EGM_18986 [Macaca fascicularis]
Length = 597
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|432877330|ref|XP_004073147.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
isoform 2 [Oryzias latipes]
Length = 479
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 31 VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 90
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 91 NGREFSGRALRVDNA-ASEKNKEELKSL 117
>gi|426257831|ref|XP_004022525.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Ovis
aries]
Length = 592
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|403298762|ref|XP_003940176.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403298764|ref|XP_003940177.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 577
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|351709510|gb|EHB12429.1| Cleavage stimulation factor 64 kDa subunit [Heterocephalus glaber]
Length = 597
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|402880821|ref|XP_003903988.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Papio
anubis]
Length = 620
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|432111364|gb|ELK34639.1| Cleavage stimulation factor subunit 2 tau variant [Myotis davidii]
Length = 641
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|417403608|gb|JAA48603.1| Putative mrna cleavage and polyadenylation factor i complex subunit
rna15 [Desmodus rotundus]
Length = 647
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|332212188|ref|XP_003255200.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
[Nomascus leucogenys]
Length = 622
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|326673799|ref|XP_003199996.1| PREDICTED: cleavage stimulation factor subunit 2-like [Danio rerio]
Length = 488
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 26 VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 85
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 86 NGREFSGRALRVDNA-ASEKNKEELKSL 112
>gi|192764316|gb|ACF05702.1| betaCstF-64 variant 1 [Mus musculus]
Length = 604
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|76253771|ref|NP_956408.2| cleavage stimulation factor subunit 2 [Danio rerio]
gi|41107668|gb|AAH65442.1| Cleavage stimulation factor, 3' pre-RNA, subunit 2 [Danio rerio]
Length = 488
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 26 VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 85
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 86 NGREFSGRALRVDNA-ASEKNKEELKSL 112
>gi|20380061|gb|AAH28239.1| Cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
variant [Homo sapiens]
gi|325463311|gb|ADZ15426.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
variant [synthetic construct]
Length = 616
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|62896707|dbj|BAD96294.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
variant variant [Homo sapiens]
Length = 616
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGSIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|354476121|ref|XP_003500273.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
isoform 1 [Cricetulus griseus]
Length = 614
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|355562595|gb|EHH19189.1| hypothetical protein EGK_19854 [Macaca mulatta]
Length = 610
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 8 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 67
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 68 NGREFSGRALRVDNA-ASEKNKEELKSL 94
>gi|14149675|ref|NP_056050.1| cleavage stimulation factor subunit 2 tau variant [Homo sapiens]
gi|71153234|sp|Q9H0L4.1|CSTFT_HUMAN RecName: Full=Cleavage stimulation factor subunit 2 tau variant;
AltName: Full=CF-1 64 kDa subunit tau variant; AltName:
Full=Cleavage stimulation factor 64 kDa subunit tau
variant; Short=CSTF 64 kDa subunit tau variant; AltName:
Full=TauCstF-64
gi|12053011|emb|CAB66681.1| hypothetical protein [Homo sapiens]
gi|24416591|gb|AAN05429.1| tCstF-64 [Homo sapiens]
gi|119574527|gb|EAW54142.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
variant [Homo sapiens]
gi|189067256|dbj|BAG36966.1| unnamed protein product [Homo sapiens]
Length = 616
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|410974961|ref|XP_003993907.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Felis
catus]
Length = 613
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|388452994|ref|NP_001253727.1| cleavage stimulation factor subunit 2 tau variant [Macaca mulatta]
gi|355758732|gb|EHH61510.1| hypothetical protein EGM_21244 [Macaca fascicularis]
gi|383416951|gb|AFH31689.1| cleavage stimulation factor subunit 2 tau variant [Macaca mulatta]
gi|384946038|gb|AFI36624.1| cleavage stimulation factor subunit 2 tau variant [Macaca mulatta]
gi|387541538|gb|AFJ71396.1| cleavage stimulation factor subunit 2 tau variant [Macaca mulatta]
Length = 620
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|397469483|ref|XP_003806381.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Pan
paniscus]
Length = 615
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|358421036|ref|XP_001254105.2| PREDICTED: cleavage stimulation factor subunit 2-like [Bos taurus]
Length = 331
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|410043860|ref|XP_001163035.2| PREDICTED: cleavage stimulation factor subunit 2 tau variant
isoform 4 [Pan troglodytes]
gi|410335357|gb|JAA36625.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
variant [Pan troglodytes]
gi|410335359|gb|JAA36626.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
variant [Pan troglodytes]
gi|410335361|gb|JAA36627.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
variant [Pan troglodytes]
gi|410335363|gb|JAA36628.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
variant [Pan troglodytes]
Length = 615
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|426364779|ref|XP_004049473.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
[Gorilla gorilla gorilla]
gi|426364781|ref|XP_004049474.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
[Gorilla gorilla gorilla]
Length = 617
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|354476123|ref|XP_003500274.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
isoform 2 [Cricetulus griseus]
Length = 623
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|344234537|gb|EGV66405.1| hypothetical protein CANTEDRAFT_91566 [Candida tenuis ATCC 10573]
Length = 641
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 100/225 (44%), Gaps = 17/225 (7%)
Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ----DP--AVTDIACAALNGLKM 361
E ++++ +E + F +K++ S G F +Q DP A+T + LN L
Sbjct: 413 EDEVRKSIEEHIPIKQFQFLKEKYNKESMGIAFANFQLQSYDPTSAITVVQQVLLN-LTQ 471
Query: 362 GDKTLTVRRATASGQSKT---EQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
G + ++T EQ S +A Q + Q ++ + N +
Sbjct: 472 GSSIFSKADFACIVPNQTSIQEQPSNMASLQSFVKNQLISTTVTDNNPNIVSEVVRDNRK 531
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN------GGETP 472
+KV+ + A+T L DDE Y I D+ +E K+G ++ V IPRP + TP
Sbjct: 532 SKVIQIINAVTTKDLKDDETYGFISSDVEQEVKKFGEVIRVKIPRPANDFTPGLTESSTP 591
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
G+G++F+E+ + A L+GR + TV Y+ D FNK
Sbjct: 592 GLGRIFVEFSNEDSAFKAILGLAGRMYNDRTVLCSYFDVDD-FNK 635
>gi|297686917|ref|XP_002820977.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Pongo
abelii]
Length = 625
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|344241215|gb|EGV97318.1| Cleavage stimulation factor 64 kDa subunit, tau variant [Cricetulus
griseus]
Length = 645
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|357627631|gb|EHJ77269.1| hypothetical protein KGM_03087 [Danaus plexippus]
Length = 425
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 8 VFVGNIPYEATEEKLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 67
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G ++L V A + +S+ E ++++
Sbjct: 68 NGYEIGGRSLRVDNA-CTEKSRMEMQALM 95
>gi|403260042|ref|XP_003922497.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
[Saimiri boliviensis boliviensis]
Length = 621
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|149055454|gb|EDM07038.1| rCG38164 [Rattus norvegicus]
Length = 363
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|91094479|ref|XP_970762.1| PREDICTED: similar to cleavage stimulation factor 64-kDa subunit
[Tribolium castaneum]
Length = 424
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 16 VFVGNIPYEATEEKLKDIFGEVGQVLSFKLVFDRETGKPKGYGFCEYRDQETALSAMRNL 75
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
NG ++G + L V A + +S+ E +++L Q
Sbjct: 76 NGYEIGGRNLRVDNA-CTEKSRMEMQNLLNQ 105
>gi|59808930|gb|AAH89996.1| U2af2 protein, partial [Rattus norvegicus]
Length = 88
Score = 72.0 bits (175), Expect = 7e-10, Method: Composition-based stats.
Identities = 34/79 (43%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
EI+ED+R+EC KYG + ++ IPRP +G E PG GK+F+E+ C A L+GRKF
Sbjct: 10 EIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFA 68
Query: 501 GNTVNAFYYPEDKYFNKDY 519
V Y D Y +D+
Sbjct: 69 NRVVVTKYCDPDSYHRRDF 87
>gi|294936223|ref|XP_002781665.1| Heterogeneous nuclear ribonucleoprotein A1, putative [Perkinsus
marinus ATCC 50983]
gi|239892587|gb|EER13460.1| Heterogeneous nuclear ribonucleoprotein A1, putative [Perkinsus
marinus ATCC 50983]
Length = 482
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 100/212 (47%), Gaps = 18/212 (8%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP-GDAVVNV--YINHEKKFAFVEMRT 231
++V+VGGLP A++ A+ +FSQ GP D+VV + + + F FV T
Sbjct: 171 KKVFVGGLPREADKPALDAYFSQF---------GPVEDSVVMMDRFTGRSRGFGFVTFET 221
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA----ALGPGQPSPNLNLAAVGLASGA 287
E+ +A + G +V VRR + + T A + G G +P N G G
Sbjct: 222 KEQMLGCVAAAPHVIMGKSVEVRRSINDDGTSTAHERRSAGKGAGAPR-NYDDYGSGKGG 280
Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
+ P+++FVGGLP T +++ +G L ++ DR TG S+G+G+ Y+D +
Sbjct: 281 HRD-QNPNKLFVGGLPREITSEALRDFFIQYGNLVDCTVITDRMTGQSRGFGYVTYEDSS 339
Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKT 379
+ A + + K + V+ T G ++
Sbjct: 340 AAEAAISNSANNIIDGKWVDVKHTTREGPRRS 371
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
+VFVGGLP + + FG + ++ DR TG S+G+GF ++ AA
Sbjct: 172 KVFVGGLPREADKPALDAYFSQFGPVEDSVVMMDRFTGRSRGFGFVTFETKEQMLGCVAA 231
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQE 382
+ MG K++ VRR+ + T E
Sbjct: 232 APHVIMG-KSVEVRRSINDDGTSTAHE 257
>gi|195050249|ref|XP_001992854.1| GH13506 [Drosophila grimshawi]
gi|193899913|gb|EDV98779.1| GH13506 [Drosophila grimshawi]
Length = 628
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 55/200 (27%), Positives = 82/200 (41%), Gaps = 28/200 (14%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS V G V + N K+F A++E
Sbjct: 268 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 318
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L +A P QP +
Sbjct: 319 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQSAPPPFQPKLHT------------ 366
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF Y +
Sbjct: 367 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDVIQLIMDNETGRSKGYGFITYHNADD 422
Query: 349 TDIACAALNGLKMGDKTLTV 368
A LNG ++ + + V
Sbjct: 423 AKKALEQLNGFELAGRPMKV 442
>gi|443694236|gb|ELT95429.1| hypothetical protein CAPTEDRAFT_160825 [Capitella teleta]
Length = 548
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 49/88 (55%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY +E Q+KE+ + G + F LV DR+TG KGYGFC YQD A L
Sbjct: 26 VFVGNIPYEASEEQLKEVFQQAGPVISFRLVYDRETGKPKGYGFCEYQDVETAQSAMRNL 85
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
N + L V A A QSK E +S+
Sbjct: 86 NNYDYNGRPLRVGVA-AGEQSKDENKSM 112
>gi|147852616|emb|CAN81690.1| hypothetical protein VITISV_009755 [Vitis vinifera]
Length = 544
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 88/190 (46%), Gaps = 35/190 (18%)
Query: 97 KSLSPS-RSPSKSKRRSGFDMAPPAAAMLPGAAVPGQL----PGV-------PSAVPEMA 144
K+ SP+ RSP K + +G+D+ P + +V L P V PSAVP
Sbjct: 343 KTPSPTNRSPEK--KNAGWDLPPSRTDGMNAGSVLSSLQVLKPTVSSNADELPSAVPVA- 399
Query: 145 QNMLPFGATQLGAFPLMPV---------------QVMTQQATRHARRVYVGGLPPLANEQ 189
+P AT A P +P + QATR RR+YV LP ++E+
Sbjct: 400 ---VPVTATT--AKPPLPRIYSDAVSKNKNVSIDSIQLTQATRPMRRLYVENLPVSSSEK 454
Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
A+ + + + G N ++ I+ EK A VE T E+AS A++ DGI F G
Sbjct: 455 ALMECLNNFLLSSGINHVQGTPPCISCIIHKEKGQALVEFLTPEDASAALSFDGISFSGS 514
Query: 250 AVRVRRPTDY 259
+++RRP D+
Sbjct: 515 ILKIRRPKDF 524
>gi|321479154|gb|EFX90110.1| hypothetical protein DAPPUDRAFT_299933 [Daphnia pulex]
Length = 381
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 1/91 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F +V DR+TG KGYGFC Y+D A L
Sbjct: 16 VFVGNIPYDVTEEKLKDIFSEAGPVVSFKIVYDRETGKPKGYGFCEYRDQETALCAMRNL 75
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
NG ++ +TL V A + +S+ E +S++ +
Sbjct: 76 NGYEIAGRTLRVDNA-CTEKSRLEMQSLMQE 105
>gi|148688462|gb|EDL20409.1| cleavage stimulation factor, 3' pre-RNA subunit 2 [Mus musculus]
Length = 363
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|303272315|ref|XP_003055519.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463493|gb|EEH60771.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 394
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 109/284 (38%), Gaps = 61/284 (21%)
Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNL----------------------AAVGLAS 285
G R+RRP+ + T ++ PG LNL A V AS
Sbjct: 126 GYVPRIRRPSKHGITQPSS-APGASDAALNLVDPAERQRQAQQWILQQQAGSLARVQEAS 184
Query: 286 G-AIGGAEGPDRVFVGGLPYYFTETQIKELLE-----SFGTLHGFDL--VKDRDTGNSKG 337
A+GG +FVG + T+ + L + +F D V + G+
Sbjct: 185 TVALGGPRKNREIFVGNIDAVVTKQALTALFDDALAVAFPNATSGDAKPVVNIQLGDQAT 244
Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
YGF + A A LNGL + LT+ R T A +
Sbjct: 245 YGFVELLSEELATAAIAGLNGLVFCGRPLTIARPTGWVDPAAAATITARAAAER------ 298
Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
GE + ++CL+ +T LAD+E Y E+L D+R EC K G +
Sbjct: 299 -----------------GEEHSTIVCLSNIVTESDLADEEAYAELLADVRTECAKCGEVK 341
Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
++ IPR GG VG VF++ D G + ++GR+F G
Sbjct: 342 DIRIPR----GGP---VGSVFVKMGDESGANKVQTEMAGRRFDG 378
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 2/90 (2%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVM-TAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
R R ++VG + + +QA+ F + A ++G VVN+ + + + FVE+
Sbjct: 192 RKNREIFVGNIDAVVTKQALTALFDDALAVAFPNATSGDAKPVVNIQLGDQATYGFVELL 251
Query: 231 TVEEASNAMA-LDGIIFEGVAVRVRRPTDY 259
+ E A+ A+A L+G++F G + + RPT +
Sbjct: 252 SEELATAAIAGLNGLVFCGRPLTIARPTGW 281
>gi|384247050|gb|EIE20538.1| hypothetical protein COCSUDRAFT_54348 [Coccomyxa subellipsoidea
C-169]
Length = 213
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 61/104 (58%), Gaps = 3/104 (2%)
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLVNVVIPRPDQNG--GETPGVG 475
++VL L +T + L D EEY +I++D+ E KYGTL ++VIP+P Q G + GVG
Sbjct: 110 SRVLRLANMVTREELLDPEEYSDIVDDITSELESKYGTLSSLVIPQPSQKGPASDPSGVG 169
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
VF+++ A+ L+GRKFG +++ ++ E + K +
Sbjct: 170 LVFVQFPKLSDAVKAQEKLNGRKFGAGNIHSEFFDEGLFQRKHF 213
>gi|343429703|emb|CBQ73275.1| related to Cleavage stimulation factor [Sporisorium reilianum SRZ2]
Length = 391
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query: 293 GPDR----VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R VFVG +PY +E Q+ ++ G + GF LV DRDTG KGYGFC ++DP
Sbjct: 3 GPQRGSRVVFVGNIPYDMSEEQLTDVFREVGKVVGFRLVNDRDTGKFKGYGFCEFEDPET 62
Query: 349 TDIACAALNGLKMGDKTLTVRRA 371
A LN +++G + L + A
Sbjct: 63 AASAVRNLNEVEVGGRPLRISFA 85
>gi|237829727|ref|XP_002364161.1| U2 small nuclear ribonucleoprotein auxiliary factor U2AF
[Toxoplasma gondii ME49]
gi|211961825|gb|EEA97020.1| U2 small nuclear ribonucleoprotein auxiliary factor U2AF
[Toxoplasma gondii ME49]
gi|221481075|gb|EEE19483.1| U2 small nuclear ribonucleoprotein auxiliary factor U2AF, putative
[Toxoplasma gondii GT1]
gi|221507020|gb|EEE32624.1| U2 small nuclear ribonucleoprotein auxiliary factor U2AF, putative
[Toxoplasma gondii VEG]
Length = 704
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 116/285 (40%), Gaps = 75/285 (26%)
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
P+R+ + LP +E ++++L+E+FG ++ F L+K D S+ Y D + A
Sbjct: 302 PERLCILDLPPLMSEEKVRQLVETFGPVNAFHLLKKDD--GSEMVCIVEYVDLESQEQAM 359
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
L+ + + A + Q + +H+ K + + L G +
Sbjct: 360 DILHS----NSPYRILLAEEAIQQEVIAPFFKKAKAKHL---KTEDEEEEEDDLADGERM 412
Query: 414 FGETL------AKVLCLTEAITADALADDEEYEEILEDMR---EECGKYGTLVNVVIPRP 464
++L +VL L+ + + L DD+EYE+I+ED+R EECG G +++V IPRP
Sbjct: 413 NIQSLLRPQVCTRVLLLSNIVEVEDLLDDKEYEDIVEDIRLECEECG--GPVLSVNIPRP 470
Query: 465 ----------------------------------DQNGGETPG----------------- 473
+Q GE
Sbjct: 471 VRGFEHESKPEFQQQQEREALAKKEEVTVKQEVTEQTAGEEDAGEKGRQEKEKAKSSEEQ 530
Query: 474 ----VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+G ++E+ D A A+ AL+GRKFGG V A Y+ E K+
Sbjct: 531 KPATIGFAYVEFEDCEWSAKARKALNGRKFGGKIVEAHYFSEVKF 575
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 48/85 (56%), Gaps = 1/85 (1%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
R +YVG LPP + F + M A+GG + PG V + + + +AFVE RT+EE
Sbjct: 113 RELYVGNLPPSLEVPQLMEFLNAAMAAVGG-ALLPGPPAVKAWRSTDGHYAFVEFRTMEE 171
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDY 259
ASN M L+G+ G +R+ RP Y
Sbjct: 172 ASNGMQLNGLNCMGFNLRIGRPKTY 196
>gi|430811846|emb|CCJ30702.1| unnamed protein product [Pneumocystis jirovecii]
Length = 486
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 105/263 (39%), Gaps = 62/263 (23%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
R++VG + + TE ++++ E FG L L K+ DTG S+GYGF Y+DPA A
Sbjct: 227 RLYVGNIHFNLTEDDLRQIFEPFGELEFVQLQKEPDTGRSRGYGFVQYRDPAQARDALEK 286
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS---GMNTLG---- 408
+NG ++ + + V G K ES A + A + G +G
Sbjct: 287 MNGFELAGRAIRV----GLGNDKFTPESTSAVLARFSGFTGSAFENKNRGGTERIGGPRD 342
Query: 409 ------------GGMSLFG---ETLAKVLCLTEAITADALAD------------------ 435
GG+S + L + L E + LA
Sbjct: 343 GSSSVSLDDNEAGGVSFNNISRDALMRKLAREEGKESIGLAPPKPTPAVQMTSRCVLLKN 402
Query: 436 ---------DEEYEEILEDMREEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
D E+ +D++ EC KYG ++++ + E G V++++ + V
Sbjct: 403 MFNPQEESGDNWIRELEDDVKAECENKYGKVLHIHV--------EENSPGDVYIKFDNVV 454
Query: 486 GCATAKNALSGRKFGGNTVNAFY 508
A L+GR FGG T++A +
Sbjct: 455 AGERAIQGLNGRWFGGRTISASF 477
>gi|125774537|ref|XP_001358527.1| GA20525 [Drosophila pseudoobscura pseudoobscura]
gi|54638266|gb|EAL27668.1| GA20525 [Drosophila pseudoobscura pseudoobscura]
Length = 418
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E + +L
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105
>gi|34810648|pdb|1P1T|A Chain A, Nmr Structure Of The N-Terminal Rrm Domain Of Cleavage
Stimulation Factor 64 Kda Subunit
Length = 104
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 11 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 70
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 71 NGREFSGRALRVDNA-ASEKNKEELKSL 97
>gi|195145742|ref|XP_002013849.1| GL24357 [Drosophila persimilis]
gi|194102792|gb|EDW24835.1| GL24357 [Drosophila persimilis]
Length = 418
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E + +L
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105
>gi|327284085|ref|XP_003226769.1| PREDICTED: serine/threonine-protein kinase Kist-like [Anolis
carolinensis]
Length = 217
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 55/94 (58%), Gaps = 6/94 (6%)
Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
VL L + + +EE+E+I++D++EEC KYG +V++ +P+ E PG G VF+E
Sbjct: 120 VLRLLNILNDASFQSEEEFEDIVDDIKEECSKYGQIVSLFVPK------ENPGKGHVFVE 173
Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
Y +A A+ AL+G++F V +YP Y
Sbjct: 174 YTNAGDSKAAQQALTGKRFDCKFVVTTFYPLSAY 207
>gi|222619898|gb|EEE56030.1| hypothetical protein OsJ_04814 [Oryza sativa Japonica Group]
Length = 658
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 102/244 (41%), Gaps = 46/244 (18%)
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ P ++F+ G+P + +++++ SFG L + + + D G + F Y D ++T
Sbjct: 375 DSPHKIFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGA--CAFLEYIDHSITSK 432
Query: 352 ACAALNGLKMGDKTLTVRRATAS--GQSKTEQ---ESILAQAQQHIAIQKMALQTSGMNT 406
ACA LNG+K+G +T GQ+ E I A + +A+ LQ
Sbjct: 433 ACAGLNGMKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLK---- 488
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL--VNVVIPRP 464
++F + +L E + +LED+R +C +YG + +NVV
Sbjct: 489 -----NVFDQ------------EEYSLLSKYEVDAVLEDVRVKCARYGAVKSINVVEYPA 531
Query: 465 DQNGGETPGV----------------GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + P V G + +E+ A ++L GR FG V+A Y
Sbjct: 532 GSDNTKAPAVDARDNALASNNTALEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGY 591
Query: 509 YPED 512
P D
Sbjct: 592 APYD 595
>gi|241997552|ref|XP_002433425.1| RNA recognition motif-containing protein [Ixodes scapularis]
gi|215490848|gb|EEC00489.1| RNA recognition motif-containing protein [Ixodes scapularis]
Length = 466
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/90 (42%), Positives = 51/90 (56%), Gaps = 1/90 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILA 386
N + + L V A AS +SK E +++ A
Sbjct: 78 NAFDLNGRPLRVDNA-ASEKSKEELKNLQA 106
>gi|198424504|ref|XP_002131946.1| PREDICTED: similar to poly-U binding splicing factor 60KDa [Ciona
intestinalis]
Length = 491
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 60/106 (56%), Gaps = 6/106 (5%)
Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE-TPGV 474
E +KV+ L + + + DD E E + EECGK+G++ VVI + Q+ E P
Sbjct: 390 EEASKVMVLHNMVDVEEIDDDLESE-----VTEECGKFGSVSRVVIYQEKQSEAEDAPVT 444
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
K+++E+ D+V C A +L+GR FGG + A YP+ K+ + D +
Sbjct: 445 VKIYVEFTDSVFCKKAVESLNGRWFGGRKIEAIIYPQHKFNHNDLT 490
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 87/210 (41%), Gaps = 31/210 (14%)
Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAV---VNVYINHEKKF 224
QQA RVYVG + E+ I FS GP ++ + K F
Sbjct: 91 QQALVLMCRVYVGSIYYDLKEEIIRNAFSPF---------GPFKSINMSFDPITGKHKGF 141
Query: 225 AFVEMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
AF+E T E A ++ + G++ G +++V RP + Q P ++L
Sbjct: 142 AFIEYETPEAAQLSLDQMGGVMLGGRSIKVGRPANM----------PQSHPVIDLL---- 187
Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
+ ++ R+++ + +K + +FG + LV D TG KGYGF Y
Sbjct: 188 ----LDESKMQKRIYISSVHTDLNTEDLKSVFSAFGNILSCALVPDVLTGKHKGYGFIEY 243
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATA 373
+ A A++N +G + L V +A A
Sbjct: 244 DTLQAANDAVASMNLFDLGGQYLRVGKAIA 273
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 58/130 (44%), Gaps = 3/130 (2%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
RV+VG + Y E I+ FG ++ D TG KG+ F Y+ P ++
Sbjct: 99 RVYVGSIYYDLKEEIIRNAFSPFGPFKSINMSFDPITGKHKGFAFIEYETPEAAQLSLDQ 158
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT--LGGGMSL 413
+ G+ +G +++ V R QS + +L +++ I ++ T +NT L S
Sbjct: 159 MGGVMLGGRSIKVGRPANMPQSHPVIDLLLDESKMQKRIYISSVHTD-LNTEDLKSVFSA 217
Query: 414 FGETLAKVLC 423
FG L+ L
Sbjct: 218 FGNILSCALV 227
>gi|17137710|ref|NP_477453.1| cleavage stimulation factor 64 kilodalton subunit [Drosophila
melanogaster]
gi|5713194|gb|AAD47839.1|AF170082_1 cleavage stimulation factor 64 kilodalton subunit [Drosophila
melanogaster]
gi|23171661|gb|AAF55577.2| cleavage stimulation factor 64 kilodalton subunit [Drosophila
melanogaster]
gi|205360993|gb|ACI03573.1| FI01908p [Drosophila melanogaster]
Length = 419
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G +TL V A + +S+ E + +L Q
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109
>gi|150864148|ref|XP_001382862.2| hypothetical protein PICST_42021 [Scheffersomyces stipitis CBS
6054]
gi|149385404|gb|ABN64833.2| predicted protein, partial [Scheffersomyces stipitis CBS 6054]
Length = 533
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 91/219 (41%), Gaps = 22/219 (10%)
Query: 307 TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC-VYQDPAV---TDIACAALNGLKMG 362
TETQI L + + F + ++ T S G F + DPA TD L L
Sbjct: 321 TETQIITELNLYSPVRAFQMFREVGTKVSLGMAFVEFFIDPASYKHTDQVIERLQEL--- 377
Query: 363 DKTLTVRRATASGQSKTEQESILAQAQQH-IAIQKMALQTSGMNTLGGGMSLFGETLAKV 421
QS+ E+ + H +IQ + + L ++ ++V
Sbjct: 378 --------LQKLDQSQIIDEAFFSCIIPHKTSIQDCQINFDSLKHLVRNENVSTHPKSRV 429
Query: 422 LCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG------ETPGVG 475
+ L +T L +D Y+ IL+D++ E + GT+V++ IPRP PG+G
Sbjct: 430 IQLLNVVTPKDLVEDSNYQFILKDIKREASRIGTVVSIKIPRPANEFTPGLAQFSVPGLG 489
Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
KVF+E+ D A L+GR + V +Y D Y
Sbjct: 490 KVFIEFEDEEVAFRAIMELAGRSYNDRCVICAFYNVDDY 528
>gi|195343246|ref|XP_002038209.1| GM17877 [Drosophila sechellia]
gi|194133059|gb|EDW54627.1| GM17877 [Drosophila sechellia]
Length = 419
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G +TL V A + +S+ E + +L Q
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109
>gi|195395200|ref|XP_002056224.1| GJ10336 [Drosophila virilis]
gi|194142933|gb|EDW59336.1| GJ10336 [Drosophila virilis]
Length = 427
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G +TL V A + +S+ E + +L Q
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109
>gi|194900156|ref|XP_001979623.1| GG22991 [Drosophila erecta]
gi|190651326|gb|EDV48581.1| GG22991 [Drosophila erecta]
Length = 416
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G +TL V A + +S+ E + +L Q
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109
>gi|195497709|ref|XP_002096214.1| GE25546 [Drosophila yakuba]
gi|194182315|gb|EDW95926.1| GE25546 [Drosophila yakuba]
Length = 414
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E + +L
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105
>gi|332019312|gb|EGI59819.1| RNA-binding protein 39 [Acromyrmex echinatior]
Length = 528
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/376 (23%), Positives = 144/376 (38%), Gaps = 69/376 (18%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N ++F A+VE
Sbjct: 166 RDARTVFCMQLSQRIRARDLEEFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 216
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ E + A+ L G GV + V+ T A G PNL
Sbjct: 217 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 260
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF +++
Sbjct: 261 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 320
Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
A LNG ++ + + V R G S + + + A ++ L
Sbjct: 321 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQL---- 376
Query: 404 MNTLGGGMSL-------FGETLAKVL------------CLTEAITADALAD--DEEYEEI 442
M L G L +A V+ T+ + D +E
Sbjct: 377 MFKLAEGTGLEIPPAAANALNMAPVMAQPQPPPQAAPPIATQCFMLSNMFDPQNETNPNW 436
Query: 443 LEDMREE----CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
+++R++ C K+G +++V + DQ + G V+++ A N+L GR
Sbjct: 437 AKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRW 489
Query: 499 FGGNTVNAFYYPEDKY 514
F G + A Y P Y
Sbjct: 490 FAGRVITAAYVPVVNY 505
>gi|195569859|ref|XP_002102926.1| GD19237 [Drosophila simulans]
gi|194198853|gb|EDX12429.1| GD19237 [Drosophila simulans]
Length = 419
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G +TL V A + +S+ E + +L Q
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109
>gi|403364994|gb|EJY82272.1| Snrnp splicing factor (U2AF), putative [Oxytricha trifallax]
Length = 411
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 102/219 (46%), Gaps = 20/219 (9%)
Query: 80 HRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG-----FDMAPPAAAMLPGAAVPGQLP 134
HR +SR + +R R+R + R +K+ G FD +PP L
Sbjct: 48 HREKSRDRNGNRGRDREREKDRDRGGRDNKKGGGRDDFRFD-SPPKDHELT--------K 98
Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
G+ +A + + + L + M + Q + R++YVG LPP ++ +
Sbjct: 99 GIMAAAASIGGGTIANAQSILQSIHSMSMA----QTAKIDRKLYVGNLPPGITQRMLIDV 154
Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
++ M ++ PG+ VV+ +I+ + +AFVE RT EEA++ L G+ + +++
Sbjct: 155 VNEAMLSLNVIEE-PGNPVVSAWISSDSHYAFVEFRTAEEANHGFNLQGMNIQNNEIKIG 213
Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
RP Y+ T+ A+G + +N+ A+ A+ G +G
Sbjct: 214 RPKAYSGTM-NAIGLMASAGGMNVQGGSFANAALMGMKG 251
>gi|195108753|ref|XP_001998957.1| GI23336 [Drosophila mojavensis]
gi|193915551|gb|EDW14418.1| GI23336 [Drosophila mojavensis]
Length = 428
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G +TL V A + +S+ E + +L Q
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109
>gi|218189760|gb|EEC72187.1| hypothetical protein OsI_05261 [Oryza sativa Indica Group]
Length = 485
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 102/244 (41%), Gaps = 46/244 (18%)
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ P ++F+ G+P + +++++ SFG L + + + D G + F Y D ++T
Sbjct: 259 DSPHKIFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGA--CAFLEYIDHSITSK 316
Query: 352 ACAALNGLKMGDKTLTVRRATAS--GQSKTEQ---ESILAQAQQHIAIQKMALQTSGMNT 406
ACA LNG+K+G +T GQ+ E I A + +A+ LQ
Sbjct: 317 ACAGLNGMKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLK---- 372
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL--VNVVIPRP 464
++F + +L E + +LED+R +C +YG + +NVV
Sbjct: 373 -----NVFDQEEYSLL------------SKYEVDAVLEDVRVKCARYGAVKSINVVEYPA 415
Query: 465 DQNGGETPGV----------------GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + P V G + +E+ A ++L GR FG V+A Y
Sbjct: 416 GSDNTKAPAVDARDNALASNNTALEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGY 475
Query: 509 YPED 512
P D
Sbjct: 476 APYD 479
>gi|195452858|ref|XP_002073531.1| GK14167 [Drosophila willistoni]
gi|194169616|gb|EDW84517.1| GK14167 [Drosophila willistoni]
Length = 401
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 19 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 78
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G +TL V A + +S+ E + +L Q
Sbjct: 79 NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 110
>gi|194743216|ref|XP_001954096.1| GF16912 [Drosophila ananassae]
gi|190627133|gb|EDV42657.1| GF16912 [Drosophila ananassae]
Length = 415
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E + +L
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105
>gi|195037535|ref|XP_001990216.1| GH18352 [Drosophila grimshawi]
gi|193894412|gb|EDV93278.1| GH18352 [Drosophila grimshawi]
Length = 430
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++G +TL V A + +S+ E + +L
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105
>gi|28557621|gb|AAO45216.1| RE27227p [Drosophila melanogaster]
Length = 437
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++KE+ G + LV DR++G KG+GFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G +TL V A + +S+ E + +L Q
Sbjct: 78 NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109
>gi|115442323|ref|NP_001045441.1| Os01g0956600 [Oryza sativa Japonica Group]
gi|57900079|dbj|BAD88141.1| splicing factor family protein-like [Oryza sativa Japonica Group]
gi|57900192|dbj|BAD88277.1| splicing factor family protein-like [Oryza sativa Japonica Group]
gi|113534972|dbj|BAF07355.1| Os01g0956600 [Oryza sativa Japonica Group]
gi|215736836|dbj|BAG95765.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 608
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 102/244 (41%), Gaps = 46/244 (18%)
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ P ++F+ G+P + +++++ SFG L + + + D G + F Y D ++T
Sbjct: 375 DSPHKIFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGA--CAFLEYIDHSITSK 432
Query: 352 ACAALNGLKMGDKTLTVRRATAS--GQSKTEQ---ESILAQAQQHIAIQKMALQTSGMNT 406
ACA LNG+K+G +T GQ+ E I A + +A+ LQ
Sbjct: 433 ACAGLNGMKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLK---- 488
Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL--VNVVIPRP 464
++F + +L E + +LED+R +C +YG + +NVV
Sbjct: 489 -----NVFDQ------------EEYSLLSKYEVDAVLEDVRVKCARYGAVKSINVVEYPA 531
Query: 465 DQNGGETPGV----------------GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
+ + P V G + +E+ A ++L GR FG V+A Y
Sbjct: 532 GSDNTKAPAVDARDNALASNNTALEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGY 591
Query: 509 YPED 512
P D
Sbjct: 592 APYD 595
>gi|301781268|ref|XP_002926045.1| PREDICTED: probable RNA-binding protein 23-like isoform 1
[Ailuropoda melanoleuca]
Length = 446
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 209 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 252
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 253 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 311
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V + T
Sbjct: 312 TFSDSECARRALEQLNGFELAGRPMRVGQVT 342
>gi|291228918|ref|XP_002734426.1| PREDICTED: cleavage stimulation factor subunit 2-like [Saccoglossus
kowalevskii]
Length = 220
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 55/99 (55%), Gaps = 5/99 (5%)
Query: 286 GAIGGAEGPDR----VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
A+G + DR VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC
Sbjct: 2 SAVGQSAATDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFC 61
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
Y+D A L+G ++ + L V A AS ++K E
Sbjct: 62 EYKDQETALSAMRNLSGYELNGRQLRVDNA-ASEKNKEE 99
>gi|301100496|ref|XP_002899338.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104255|gb|EEY62307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 414
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/85 (41%), Positives = 47/85 (55%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +KE+ G++ F LV DR+TG KGYGFC Y D A A L
Sbjct: 15 VFVGNIPYDVTEDMLKEIFSEAGSVMNFRLVTDRETGKPKGYGFCEYADGATALSAMRNL 74
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQ 381
NG ++ + L V A +S + +
Sbjct: 75 NGYEINGRNLRVDFADGGDKSNSAE 99
>gi|427798067|gb|JAA64485.1| Putative mrna cleavage and polyadenylation factor i complex subunit
rna15, partial [Rhipicephalus pulchellus]
Length = 377
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 38/90 (42%), Positives = 51/90 (56%), Gaps = 1/90 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYKDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILA 386
N + + L V A AS +SK E +++ A
Sbjct: 78 NAFDLNGRPLRVDNA-ASEKSKEELKNLQA 106
>gi|255089613|ref|XP_002506728.1| predicted protein [Micromonas sp. RCC299]
gi|226522001|gb|ACO67986.1| predicted protein [Micromonas sp. RCC299]
Length = 317
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 48/81 (59%)
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
P +VFVG +PY TE +++++ G +H F LV DR+TG KGYGFC Y D A + A
Sbjct: 5 PYQVFVGNVPYDATEERLRDMFSEVGPVHDFRLVTDRETGKLKGYGFCEYMDLATAESAK 64
Query: 354 AALNGLKMGDKTLTVRRATAS 374
LNG + + L V A A+
Sbjct: 65 RNLNGREYNGRNLRVDFADAA 85
>gi|124360616|gb|ABN08615.1| hypothetical protein MtrDRAFT_AC157507g26v2 [Medicago truncatula]
Length = 64
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/39 (82%), Positives = 36/39 (92%)
Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
GVAVRVRRPTDYNP+LAA LGP QPS NLNL+AVGL++G
Sbjct: 26 GVAVRVRRPTDYNPSLAAVLGPCQPSANLNLSAVGLSAG 64
>gi|301781270|ref|XP_002926046.1| PREDICTED: probable RNA-binding protein 23-like isoform 2
[Ailuropoda melanoleuca]
Length = 430
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 193 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 237 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 295
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V + T
Sbjct: 296 TFSDSECARRALEQLNGFELAGRPMRVGQVT 326
>gi|402875680|ref|XP_003901625.1| PREDICTED: probable RNA-binding protein 23 [Papio anubis]
Length = 497
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 262 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 305
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG+SKGYGF
Sbjct: 306 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 364
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 365 TFSDSECARRALEQLNGFELAGRPMRVGHVT 395
>gi|73962357|ref|XP_537365.2| PREDICTED: probable RNA-binding protein 23 isoform 1 [Canis lupus
familiaris]
Length = 445
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 208 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 251
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 252 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 310
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 311 TFSDSECARRALEQLNGFELAGRPMRVGHVT 341
>gi|395859409|ref|XP_003802032.1| PREDICTED: probable RNA-binding protein 23 [Otolemur garnettii]
Length = 449
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 212 KGIAYVEFCEIQSVPLAIGLTGQQLLGVPIIVQASQAEKNRLAA---------------- 255
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP +FVG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 256 -MANNLQKGSSGPMHLFVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 314
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V + T
Sbjct: 315 TFSDSECARRALEQLNGFELAGRPMKVGQVT 345
>gi|281343373|gb|EFB18957.1| hypothetical protein PANDA_015655 [Ailuropoda melanoleuca]
Length = 369
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 132 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 175
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 176 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 234
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V + T
Sbjct: 235 TFSDSECARRALEQLNGFELAGRPMRVGQVT 265
>gi|388853962|emb|CCF52460.1| related to Cleavage stimulation factor [Ustilago hordei]
Length = 402
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query: 293 GPDR----VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R VFVG +PY +E Q+ ++ G + GF LV DR+TG KGYGFC ++DP
Sbjct: 3 GPQRGSRVVFVGNIPYDMSEEQLTDVFREVGKVVGFRLVNDRETGKFKGYGFCEFEDPET 62
Query: 349 TDIACAALNGLKMGDKTLTVRRA 371
A LN +++G + L + A
Sbjct: 63 AASAVRNLNEVEVGGRALRISFA 85
>gi|320582580|gb|EFW96797.1| splicing factor u2af large subunit [Ogataea parapolymorpha DL-1]
Length = 281
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 96/219 (43%), Gaps = 25/219 (11%)
Query: 301 GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF--CVYQDPAVTDIACAALNG 358
+PY ++ ELL+ G + V D+ + SKG F V +D V LN
Sbjct: 75 NVPYGTPREKLLELLQPLGKVRSLAQVLDKLSYESKGVAFFEMVSEDSEVL----TKLNE 130
Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
L + D+ L + RA + + K EQ IL+ A+Q +
Sbjct: 131 LAIDDQELQIFRACENPERKYEQAVILSAETLFGALQSDKISPHAP-------------- 176
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD---QNGGET-PGV 474
++++ + + L D +Y +I EC +G V+IPRP+ + G T P V
Sbjct: 177 SEIVQFLNCVAVEDLVDSVKYNDIKVAFEAECSCHGHPEKVLIPRPEGDFRPGMPTKPEV 236
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTV-NAFYYPED 512
G++F+++ + AL+GRKF G T+ AFY ED
Sbjct: 237 GRIFVKFATSEEAQKCAEALAGRKFNGRTILAAFYETED 275
>gi|405963791|gb|EKC29337.1| RNA-binding protein 39 [Crassostrea gigas]
Length = 557
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 58/272 (21%), Positives = 103/272 (37%), Gaps = 54/272 (19%)
Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
G GP R++VG L + TE ++ + E FG + L++D +T S+GYGF + D
Sbjct: 255 GNIGPMRLYVGSLHFNITEEMLRGIFEPFGKIDDIKLIRDHETNRSQGYGFITFHDSEDA 314
Query: 350 DIACAALNGLKMGDKTLTV-----RRATASGQSKTEQE---------------SILAQAQ 389
A LNG ++ + + V R+ G S + + ++A+
Sbjct: 315 KKALEQLNGFELAGRPMKVGHVTERQGEIQGASMLDSDEMDRAGIDLGATGRLQLMAKLA 374
Query: 390 QHIAIQKMALQTSGMNTLGGG----------------MSLFGETLA-----------KVL 422
+ Q S +N ++ G A +
Sbjct: 375 EGTGFQIPEYAVSALNITQQAPGVASAAPPAGPAPNVSAILGSQAAGQDNTAPPIATQCF 434
Query: 423 CLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYY 482
L+ +A + +EI +D+ EEC K+G ++++ + + G V+++
Sbjct: 435 MLSNMFDPNAESRSSWDQEIRDDVIEECNKHGGVLHLYVDKASPQGN-------VYVKCP 487
Query: 483 DAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ AL GR FGG + A Y P Y
Sbjct: 488 TISAAVASVRALHGRYFGGKMITAAYVPLPNY 519
>gi|73962355|ref|XP_848788.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Canis lupus
familiaris]
Length = 429
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 192 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 235
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 236 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 294
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 295 TFSDSECARRALEQLNGFELAGRPMRVGHVT 325
>gi|444728803|gb|ELW69245.1| putative RNA-binding protein 23 [Tupaia chinensis]
Length = 450
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 69/159 (43%), Gaps = 17/159 (10%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 195 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 238
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 239 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 297
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
+ D A LNG ++ + + V T T+
Sbjct: 298 TFSDSECARRALEQLNGFELAGRPMRVGHVTERADGSTD 336
>gi|88911212|gb|ABD58896.1| chloroplast single strand DNA binding protein [Mesostigma viride]
Length = 299
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 97/214 (45%), Gaps = 8/214 (3%)
Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
+T + ++YVG L +++ + FSQ + V++ + FAFV M
Sbjct: 85 STAASTKLYVGNLAWSCDDEMLNQAFSQF------GEVKAAEVVLDRESGRSRGFAFVTM 138
Query: 230 RTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
+ + A A LDG G A+RV P A + + G
Sbjct: 139 ASPDAAEKARRGLDGTELAGRAIRVNFPQPKGERAPRAERGERSERSERSERTYTPRGD- 197
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G A +R++VG LP+ + +++L FGT++ +V DRD+G S+G+ F P
Sbjct: 198 GEAGDANRLYVGNLPWSMDDGMLEDLFMEFGTVNYARVVMDRDSGRSRGFAFVALSTPEE 257
Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
+ A A L+G ++G +T+ V AT S ++ +E
Sbjct: 258 ANEAMANLDGEEIGGRTIRVNLATKSSGNREGRE 291
>gi|301781272|ref|XP_002926047.1| PREDICTED: probable RNA-binding protein 23-like isoform 3
[Ailuropoda melanoleuca]
Length = 412
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 175 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 218
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 219 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 277
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V + T
Sbjct: 278 TFSDSECARRALEQLNGFELAGRPMRVGQVT 308
>gi|325189600|emb|CCA24085.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 358
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 32/72 (44%), Positives = 41/72 (56%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++E+ G + F LV DRD+G KGYGFC Y D A A L
Sbjct: 17 VFVGNIPYDVTEEMLREIFSEAGAVMNFRLVTDRDSGKPKGYGFCEYADGATALSAMRNL 76
Query: 357 NGLKMGDKTLTV 368
NG ++ + L V
Sbjct: 77 NGYEINGRNLRV 88
>gi|302757135|ref|XP_002961991.1| hypothetical protein SELMODRAFT_9008 [Selaginella moellendorffii]
gi|302775356|ref|XP_002971095.1| hypothetical protein SELMODRAFT_9010 [Selaginella moellendorffii]
gi|300161077|gb|EFJ27693.1| hypothetical protein SELMODRAFT_9010 [Selaginella moellendorffii]
gi|300170650|gb|EFJ37251.1| hypothetical protein SELMODRAFT_9008 [Selaginella moellendorffii]
Length = 80
Score = 69.3 bits (168), Expect = 5e-09, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 46/75 (61%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
++VG LP + + +K L SFG + ++KDR TG SKGYGF + DPA A ++
Sbjct: 6 LYVGYLPATYDDESLKRLFSSFGQIEEVKVIKDRTTGASKGYGFVKFTDPAAASQAVFSM 65
Query: 357 NGLKMGDKTLTVRRA 371
NG K+ DKTL VR A
Sbjct: 66 NGWKIEDKTLAVRIA 80
>gi|90085597|dbj|BAE91539.1| unnamed protein product [Macaca fascicularis]
Length = 295
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 47/78 (60%), Gaps = 5/78 (6%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G++ F LV DR+TG KGYGFC YQD + A +A+
Sbjct: 18 VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQD---QETALSAM 74
Query: 357 NGLKMGDKTLTVRRATAS 374
L D ++ RA AS
Sbjct: 75 RNLN--DAPESITRAVAS 90
>gi|354474899|ref|XP_003499667.1| PREDICTED: cleavage stimulation factor subunit 2 [Cricetulus
griseus]
gi|344238061|gb|EGV94164.1| Cleavage stimulation factor 64 kDa subunit [Cricetulus griseus]
Length = 558
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 1/88 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
V VG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A L
Sbjct: 18 VCVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
NG + + L V A AS ++K E +S+
Sbjct: 78 NGREFSGRALRVDNA-ASEKNKEELKSL 104
>gi|294877868|ref|XP_002768167.1| Nucleolysin TIAR, putative [Perkinsus marinus ATCC 50983]
gi|239870364|gb|EER00885.1| Nucleolysin TIAR, putative [Perkinsus marinus ATCC 50983]
Length = 474
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 98/208 (47%), Gaps = 22/208 (10%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP-GDAVVNV--YINHEKKFAFVEMRT 231
++V+VGGLP A++ A+ +FSQ GP D+VV + + + F FV T
Sbjct: 157 KKVFVGGLPREADKPALDEYFSQF---------GPVEDSVVMMDRFTGRSRGFGFVTFET 207
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA----ALGPGQPSPNLNLAAVGLASGA 287
E+ +A + G V VRR + + T A + G G +P + +SG
Sbjct: 208 KEQMLGCVAAAPHVIMGKTVEVRRSINDDGTSTANERRSAGKGSGAPR---SYDDYSSGK 264
Query: 288 IGGA---EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
G + P+++FVGGLP T +++ +G L ++ DR TG S+G+G+ Y+
Sbjct: 265 GKGGHRDQNPNKLFVGGLPREVTSDVLRDFFIQYGNLVDCTVITDRMTGQSRGFGYITYE 324
Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT 372
D A + A + + K + V+ T
Sbjct: 325 DLAAAEAAISNSANNVIDGKWVDVKHTT 352
>gi|401410983|ref|XP_003884939.1| putative U2 small nuclear ribonucleoprotein auxiliary factor U2AF
[Neospora caninum Liverpool]
gi|325119358|emb|CBZ54911.1| putative U2 small nuclear ribonucleoprotein auxiliary factor U2AF
[Neospora caninum Liverpool]
Length = 588
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 61/117 (52%), Gaps = 3/117 (2%)
Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
R +YVG LPP + F + M A+GG + PG V + + + +AFVE RT+EE
Sbjct: 117 RELYVGNLPPSLEVPQLMEFLNAAMAAVGG-ALLPGPPAVKAWRSTDGHYAFVEFRTMEE 175
Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
ASN M L+G+ G +R+ RP Y + + P P+ L + +G +GGA
Sbjct: 176 ASNGMQLNGLNCMGFNLRIGRPKTYPQDMNHLIPP--PTIPLLHPQAAMGAGIVGGA 230
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 81/174 (46%), Gaps = 11/174 (6%)
Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
P+R+ + LP +E ++++L+E+FG ++ F L+K D S+ Y D + A
Sbjct: 313 PERLCILDLPPLMSEEKVRQLVETFGPVNAFHLLKKDD--GSEMVCIVEYVDLESQEQAM 370
Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
L+ + + A + Q + + + + + + L
Sbjct: 371 DILHS----NSPYRILLAEEAIQQEVIAPFFKKAKAKQMKAEDEEEEEMDGEEMSIQALL 426
Query: 414 FGETLAKVLCLTEAITADALADDEEYEEILEDMR---EECGKYGTLVNVVIPRP 464
+ +VL L+ + + L DD+EYEEI+ED+R EECG G +++V IPRP
Sbjct: 427 RPQVCTRVLLLSNIVDVEDLLDDKEYEEIVEDIRLECEECG--GPVLSVNIPRP 478
>gi|449667931|ref|XP_002156035.2| PREDICTED: poly(U)-binding-splicing factor PUF60-like [Hydra
magnipapillata]
Length = 597
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 96/397 (24%), Positives = 165/397 (41%), Gaps = 64/397 (16%)
Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
++ + Q+A ++AR +YV + P +E I + F + I P + K
Sbjct: 225 IEQILQEARQYAR-IYVSSIHPDLSESDIKSVF-EAFGEILSCKLAP-----DQLTGKHK 277
Query: 223 KFAFVEMRTVEEASNAM-ALDGIIFEGVAVRVRR---PTDYNPTLAAALGPGQPSPNLNL 278
+ F+E A++A+ A++ G +RV R P D+ AL G P P +L
Sbjct: 278 GYGFIEYANQSSANDAIVAMNLFDLGGQYIRVGRAITPPDH------ALKQG-PPPAASL 330
Query: 279 AAVGLASGAIGGAEG-------------------PDRVFVGGLPYY---FTETQIKELLE 316
A + S +I G E PD + +P F+ T
Sbjct: 331 LAANVISASIQGQEAVSAHGATALHMNPVTPSLSPDPFGLPPIPLQIPGFSPT------V 384
Query: 317 SFGTLHGFDLVKDRDTG---NSKGYGFCVYQ-----DPAVTDIACAALNGLKM---GDKT 365
S G+++ + YG Q P + ++LN G++
Sbjct: 385 SNGSVYNIQTSSQAQVSYGQTTLSYGLSTMQSPFSQQPVSYSQSTSSLNNTSYPTPGNQP 444
Query: 366 LTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGET-LAKVLCL 424
+ R T+ Q K ++ A + ++ L+ SG + M + + VL L
Sbjct: 445 VPEERLTSRQQRKKQELLDHKHAGEQNLEREEKLEISGKDARYMMMQKLARSGSSPVLVL 504
Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV-GKVFLEYYD 483
+T+D E EE+ ++ EEC ++G +V VVI + Q + V K+F+E+
Sbjct: 505 KNMVTSD-----EVDEELQTEVTEECSRFGDVVRVVIYQERQGEEDNAEVIVKIFVEFSK 559
Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
+A++AL+GR FGGN++ A Y EDKY ++DY+
Sbjct: 560 HSEAESAQSALNGRWFGGNSIQADIYDEDKYKSQDYT 596
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/201 (26%), Positives = 91/201 (45%), Gaps = 33/201 (16%)
Query: 176 RVYVGGLPPLANEQAI-ATF--FSQV-MTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
R Y+G + NE+++ A+F F + M + +SA + H K FAFVE
Sbjct: 140 RTYIGSINFQLNEESVRASFLPFGPIKMIDLSWDSAT---------MKH-KGFAFVEYEI 189
Query: 232 VEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
E A A+ ++ ++ G ++V RP+ N+ AA + +
Sbjct: 190 PEAAQLALEQMNNVLMGGRNIKVGRPS-----------------NVPQAAPWIEQ-ILQE 231
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
A R++V + +E+ IK + E+FG + L D+ TG KGYGF Y + + +
Sbjct: 232 ARQYARIYVSSIHPDLSESDIKSVFEAFGEILSCKLAPDQLTGKHKGYGFIEYANQSSAN 291
Query: 351 IACAALNGLKMGDKTLTVRRA 371
A A+N +G + + V RA
Sbjct: 292 DAIVAMNLFDLGGQYIRVGRA 312
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 46/99 (46%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
R ++G + + E ++ FG + DL D T KG+ F Y+ P +A
Sbjct: 140 RTYIGSINFQLNEESVRASFLPFGPIKMIDLSWDSATMKHKGFAFVEYEIPEAAQLALEQ 199
Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
+N + MG + + V R + Q+ E IL +A+Q+ I
Sbjct: 200 MNNVLMGGRNIKVGRPSNVPQAAPWIEQILQEARQYARI 238
>gi|302834772|ref|XP_002948948.1| hypothetical protein VOLCADRAFT_89331 [Volvox carteri f.
nagariensis]
gi|300265693|gb|EFJ49883.1| hypothetical protein VOLCADRAFT_89331 [Volvox carteri f.
nagariensis]
Length = 729
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/104 (34%), Positives = 58/104 (55%), Gaps = 9/104 (8%)
Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKY--GTLVNVVIPRPDQ----NGGETPG 473
+ C+ + AD L DDEEYE +++D+++EC ++ G +V V +PRP + + G
Sbjct: 544 RFFCVLGMLNADMLLDDEEYEAVIDDLKDECDRHAPGNVVAVKVPRPPEEVRAQTADFIG 603
Query: 474 V---GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
+ GK F+ + DA A A+ GR F GNTV Y E+++
Sbjct: 604 IGQYGKAFVCFKDATSAQRAHAAIHGRLFAGNTVQVQYITEEEF 647
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 3/87 (3%)
Query: 175 RRVYVGGLPPLA-NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
R +Y+G L P A + A+ F+ + A G + VVNV ++ + ++AFVE RT E
Sbjct: 230 RELYIGNLVPGAVTDVALRQLFNTTLVA-AFPVTGSAEPVVNVNLHSDGRYAFVEFRTPE 288
Query: 234 EASNAMALDG-IIFEGVAVRVRRPTDY 259
A+ A+AL+ + G + V RP+ Y
Sbjct: 289 MATAALALNAQVQLLGQTISVGRPSGY 315
>gi|50553814|ref|XP_504318.1| YALI0E23628p [Yarrowia lipolytica]
gi|49650187|emb|CAG79917.1| YALI0E23628p [Yarrowia lipolytica CLIB122]
Length = 621
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 92/204 (45%), Gaps = 19/204 (9%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAV--VNVYINHEKKFAFVEM 229
R R VYV + P + FF++ AGP V V + + AFVE
Sbjct: 327 RDKRTVYVQQVAPHVQSTELFDFFAE---------AGPVHDVSLVKDRSSRCRGVAFVEF 377
Query: 230 RTVEEASNAMALDGIIFEGVAVRVR-----RPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
VE S A+ L G G A+ +R R + + A+ G S + +A V +
Sbjct: 378 EDVESVSRAIGLTGRSLHGQALLIRCTDSARNREEQQSEASFNSSGAGSTHA-VANVNAS 436
Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
+ AI R++VG + + TE +I ++ E+FG + DL K++ TG SKGY F Y
Sbjct: 437 TSAIDSVRF-HRLYVGNIYFGVTEGEIIQIFEAFGPIEFADLQKEK-TGKSKGYCFIQYV 494
Query: 345 DPAVTDIACAALNGLKMGDKTLTV 368
+P A +NG ++ + L V
Sbjct: 495 NPDDAKTALEKMNGFELAGRKLRV 518
>gi|313234527|emb|CBY10484.1| unnamed protein product [Oikopleura dioica]
Length = 333
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 6/88 (6%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA- 355
VFVG +PY TE QI+++ G + F LV DR+TG KGYGFC Y+D TD A +A
Sbjct: 27 VFVGNIPYEATEEQIRDIFNEVGVVLSFRLVYDRETGKPKGYGFCEYKD---TDTAMSAM 83
Query: 356 --LNGLKMGDKTLTVRRATASGQSKTEQ 381
LN ++ + L V AT + EQ
Sbjct: 84 RNLNTRELHGRNLRVDHATRDHGVEKEQ 111
>gi|383419607|gb|AFH33017.1| putative RNA-binding protein 23 isoform 1 [Macaca mulatta]
Length = 441
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|388490330|ref|NP_001253303.1| probable RNA-binding protein 23 [Macaca mulatta]
gi|380814244|gb|AFE78996.1| putative RNA-binding protein 23 isoform 1 [Macaca mulatta]
gi|384947950|gb|AFI37580.1| putative RNA-binding protein 23 isoform 1 [Macaca mulatta]
Length = 441
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|387539272|gb|AFJ70263.1| putative RNA-binding protein 23 isoform 1 [Macaca mulatta]
Length = 439
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|355693134|gb|EHH27737.1| hypothetical protein EGK_18008 [Macaca mulatta]
Length = 441
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|355778434|gb|EHH63470.1| hypothetical protein EGM_16442, partial [Macaca fascicularis]
Length = 366
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 132 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 175
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 176 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 234
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 235 TFSDSECARRALEQLNGFELAGRPMRVGHVT 265
>gi|387539270|gb|AFJ70262.1| putative RNA-binding protein 23 isoform 2 [Macaca mulatta]
Length = 423
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 235 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|149756178|ref|XP_001494868.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Equus
caballus]
Length = 446
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 210 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 253
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 254 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 312
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 313 TFSDSECARRALEQLNGFELAGRPMRVGHVT 343
>gi|345804022|ref|XP_003435135.1| PREDICTED: probable RNA-binding protein 23 [Canis lupus familiaris]
Length = 411
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 174 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 217
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 218 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 276
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 277 TFSDSECARRALEQLNGFELAGRPMRVGHVT 307
>gi|332222976|ref|XP_003260645.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Nomascus
leucogenys]
Length = 442
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|348667221|gb|EGZ07047.1| hypothetical protein PHYSODRAFT_251824 [Phytophthora sojae]
Length = 419
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 42/72 (58%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +KE+ G++ F LV DR+TG KGYGFC Y D A A L
Sbjct: 15 VFVGNIPYDVTEDMLKEIFSEAGSVVNFRLVTDRETGKPKGYGFCEYADGATALSAMRNL 74
Query: 357 NGLKMGDKTLTV 368
NG ++ + L V
Sbjct: 75 NGYEINGRNLRV 86
>gi|390342940|ref|XP_001198098.2| PREDICTED: RNA-binding protein 39-like [Strongylocentrotus
purpuratus]
Length = 666
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 86/215 (40%), Gaps = 23/215 (10%)
Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
TR AR V+V L A E+ + FFS V + + K +VE
Sbjct: 204 TRDARTVFVMQLSQRAKERELKEFFSSV------GKVRTVKIITDRNSRRSKGVGYVEYD 257
Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
+ A+ L+ GV + V+ P+ +A GQ N+ L V
Sbjct: 258 VADSVPLALGLNNQKLLGVPIIVQ-PSHAEKNRSA----GQ---NVTLQKVN-------- 301
Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
GP R++VG L Y TE ++ + E FG + L+ D D SKGYGF + D
Sbjct: 302 -SGPMRLYVGSLHYNITEAMLRGIFEPFGKIDNIQLMMDTDANRSKGYGFITFHDAEDAK 360
Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
A LNG ++ + + V T + + S L
Sbjct: 361 RALDQLNGFELAGRPMKVNHVTERNEQGQQAPSFL 395
>gi|384947948|gb|AFI37579.1| putative RNA-binding protein 23 isoform 2 [Macaca mulatta]
Length = 425
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 235 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|380814242|gb|AFE78995.1| putative RNA-binding protein 23 isoform 2 [Macaca mulatta]
gi|383419605|gb|AFH33016.1| putative RNA-binding protein 23 isoform 2 [Macaca mulatta]
Length = 425
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 235 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|224125470|ref|XP_002329813.1| predicted protein [Populus trichocarpa]
gi|222870875|gb|EEF08006.1| predicted protein [Populus trichocarpa]
Length = 59
Score = 68.6 bits (166), Expect = 8e-09, Method: Composition-based stats.
Identities = 34/58 (58%), Positives = 35/58 (60%), Gaps = 4/58 (6%)
Query: 116 MAPPAAAMLPGAAV----PGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
MAP MLPGAAV GQLP VP +P M QN L FG TQ G PLMP MTQQ
Sbjct: 1 MAPSMVGMLPGAAVTVNDAGQLPSVPQTMPGMIQNTLQFGTTQFGVLPLMPAHAMTQQ 58
>gi|119586626|gb|EAW66222.1| RNA binding motif protein 23, isoform CRA_a [Homo sapiens]
Length = 483
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 251 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 294
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 295 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 353
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 354 TFSDSECARRALEQLNGFELAGRPMRVGHVT 384
>gi|71017595|ref|XP_759028.1| hypothetical protein UM02881.1 [Ustilago maydis 521]
gi|46098750|gb|EAK83983.1| hypothetical protein UM02881.1 [Ustilago maydis 521]
Length = 403
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 47/83 (56%)
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
G G VFVG +PY +E Q+ ++ G + GF LV DR+TG KGYGFC ++DP
Sbjct: 3 GAQRGSRVVFVGNIPYDMSEEQLTDVFREVGKVVGFRLVNDRETGKFKGYGFCEFEDPET 62
Query: 349 TDIACAALNGLKMGDKTLTVRRA 371
A LN +++G + L + A
Sbjct: 63 AASAVRNLNEVEVGGRPLRISFA 85
>gi|298710792|emb|CBJ32209.1| RNA-binding protein SiahBP [Ectocarpus siliculosus]
Length = 696
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 82/315 (26%), Positives = 129/315 (40%), Gaps = 62/315 (19%)
Query: 111 RSGFDMAPPAAAMLPG---------------AAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
+ G+ APP A ++P AA GQ P V S A L GA
Sbjct: 209 KKGWGDAPPVAPVVPMTPLQELQRKLAEEQVAAAMGQAPPVTSVSAGAAALGLTMGAAMQ 268
Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
Q Q + + RR+YVG L E I + F+ A+
Sbjct: 269 APGAGTRTQTAPAQPS-NPRRIYVGSLHYELKESDITSIFANF------------GALKL 315
Query: 216 VYINHE------KKFAFVEMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAAL- 267
V ++H+ K F F+E V+ A A+ A++G G A++V RP + + +A +
Sbjct: 316 VDMSHDSSTGRHKGFCFIEYVDVKSADAALRAMNGFELAGRAIKVGRPLNTDSGVAGGIE 375
Query: 268 GPGQPS----PNLNLAAVGLASGAIGGAEGP-------------------DRVFVGGLPY 304
G G P+ P + ++ A GGA+G +++VG +
Sbjct: 376 GMGLPAAMQLPGMAAFMAQHSTSAAGGAQGSGVAAEQLKAAMGMTTAPAQTKIYVGNVEP 435
Query: 305 YFTETQIKELLESFGTLHGFDLVKD-RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGD 363
+ T IK + E FG + G ++V+D + GN KG+GF Y +V ++ ++
Sbjct: 436 HITTEMIKTVFEPFGMVVGAEMVQDPSNPGNHKGFGFIQYAQESVARTVIDTMSSFELAG 495
Query: 364 KTLTVRRATASGQSK 378
+TL R A A QSK
Sbjct: 496 RTL--RVAWAQDQSK 508
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 70/267 (26%), Positives = 108/267 (40%), Gaps = 24/267 (8%)
Query: 263 LAAALGPGQP--SPNLNLAAVGLASGAIGGAEG--------------PDRVFVGGLPYYF 306
+AAA+G P S + AA+GL GA A G P R++VG L Y
Sbjct: 239 VAAAMGQAPPVTSVSAGAAALGLTMGAAMQAPGAGTRTQTAPAQPSNPRRIYVGSLHYEL 298
Query: 307 TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTL 366
E+ I + +FG L D+ D TG KG+ F Y D D A A+NG ++ + +
Sbjct: 299 KESDITSIFANFGALKLVDMSHDSSTGRHKGFCFIEYVDVKSADAALRAMNGFELAGRAI 358
Query: 367 TVRRA--TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG--GMSLFGETLAKVL 422
V R T SG + + L A Q + Q S + GG G + E L +
Sbjct: 359 KVGRPLNTDSGVAGGIEGMGLPAAMQLPGMAAFMAQHS-TSAAGGAQGSGVAAEQLKAAM 417
Query: 423 CLTEAITADAL-ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEY 481
+T A + + E E ++ +G +V + + N G G G F++Y
Sbjct: 418 GMTTAPAQTKIYVGNVEPHITTEMIKTVFEPFGMVVGAEMVQDPSNPGNHKGFG--FIQY 475
Query: 482 YDAVGCATAKNALSGRKFGGNTVNAFY 508
T + +S + G T+ +
Sbjct: 476 AQESVARTVIDTMSSFELAGRTLRVAW 502
>gi|410961880|ref|XP_003987506.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Felis catus]
Length = 445
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 208 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 251
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 252 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 310
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 311 TFSDSECARRALEQLNGFELAGRPMRVGHVT 341
>gi|281354415|gb|EFB29999.1| hypothetical protein PANDA_019778 [Ailuropoda melanoleuca]
Length = 568
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 49/87 (56%), Gaps = 1/87 (1%)
Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALN 357
VG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A LN
Sbjct: 5 IVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNLN 64
Query: 358 GLKMGDKTLTVRRATASGQSKTEQESI 384
G + + L V A AS ++K E +S+
Sbjct: 65 GREFSGRALRVDNA-ASEKNKEELKSL 90
>gi|343427636|emb|CBQ71163.1| related to HRP1-subunit of cleavage factor I [Sporisorium reilianum
SRZ2]
Length = 588
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 100/237 (42%), Gaps = 18/237 (7%)
Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEA 235
++++GGL E ++ +FSQ G + + + FAF+ +
Sbjct: 168 KMFIGGLNWDTTEDSLRRYFSQF------GEVGHCTVMRDNMTGRSRGFAFLNFVNPKAV 221
Query: 236 SNAMA----LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
+ + LDG + + R D N G GQ S N N G G +
Sbjct: 222 NTVVVREHYLDGKVIDPKRAIPRPQRDSNFNAHHNGGQGQASYNNNGGGAGGGGGYNAQS 281
Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
+ ++FVGGLP T + E FGTL + DR+TGN +G+GF YQD A
Sbjct: 282 Q---KLFVGGLPASVTPASFRMFFEQFGTLAECTCMMDRETGNPRGFGFLTYQDDAALQH 338
Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
+ L K + V+RA QSK + +S+ + QQ I +MA+ GM G
Sbjct: 339 VLST-RPLVFDGKEVDVKRA----QSKNDPQSLQIRRQQRIDNPEMAMGGMGMQQPG 390
>gi|332222974|ref|XP_003260644.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Nomascus
leucogenys]
Length = 426
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 235 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|114652057|ref|XP_001159475.1| PREDICTED: probable RNA-binding protein 23 isoform 10 [Pan
troglodytes]
Length = 442
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG+SKGYGF
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|410218752|gb|JAA06595.1| RNA binding motif protein 23 [Pan troglodytes]
gi|410307724|gb|JAA32462.1| RNA binding motif protein 23 [Pan troglodytes]
gi|410307728|gb|JAA32464.1| RNA binding motif protein 23 [Pan troglodytes]
Length = 442
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG+SKGYGF
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|116734696|ref|NP_001070819.1| probable RNA-binding protein 23 isoform 1 [Homo sapiens]
gi|34925229|sp|Q86U06.1|RBM23_HUMAN RecName: Full=Probable RNA-binding protein 23; AltName:
Full=RNA-binding motif protein 23; AltName:
Full=RNA-binding region-containing protein 4; AltName:
Full=Splicing factor SF2
gi|28071058|emb|CAD61910.1| unnamed protein product [Homo sapiens]
gi|119586629|gb|EAW66225.1| RNA binding motif protein 23, isoform CRA_d [Homo sapiens]
Length = 439
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|28059803|gb|AAO30095.1| splicing factor-like protein [Arabidopsis thaliana]
Length = 527
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 149/389 (38%), Gaps = 78/389 (20%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
R R V+ +P A E+ + FFS+ +++ K ++E
Sbjct: 165 RDQRTVFAYQMPLKATERDVYEFFSK------AGKVRDVRLIMDRNSRRSKGVGYIEFYD 218
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
V A+AL G +F G V V+ P++ LA + S +GG
Sbjct: 219 VMSVPMAIALSGQLFLGQPVMVK-PSEAEKNLAQS-----------------NSTTVGGT 260
Query: 292 EGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
DR ++VG L + +E Q++++ E+FG + L D +TG KG+GF + +
Sbjct: 261 GPADRKLYVGNLHFNMSELQLRQIFEAFGPVELVQLPLDPETGQCKGFGFIQFVQLEHSK 320
Query: 351 IACAALNG-LKMGDKTLTVR----------RATASGQSKTEQESILAQAQQHIAIQKMAL 399
A ALNG L++ +T+ V A S + LA Q A+ L
Sbjct: 321 AAQIALNGKLEIAGRTIKVSSVSDHIGTQDSAPKSADFDDDDGGGLALNAQSRAMLMQKL 380
Query: 400 QTSGMNT----------LGG------GM---------------SLFGETL---AKVLCLT 425
SG+ T L G GM S E + ++ L L
Sbjct: 381 DRSGIATSIVGSLGVPGLNGAAFNQPGMNPSFPTSVLPTTAIPSFVNEHVGLPSECLLLK 440
Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
+ EI +D+ +EC KYG + ++ + D+N G V+L +
Sbjct: 441 NMFDPATETEPNFDLEIRDDVADECSKYGPVNHIYV---DKNSA-----GFVYLRFQSVE 492
Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A A+ A+ R F ++A + P +Y
Sbjct: 493 AAAAAQRAMHMRWFAQKMISATFMPPHEY 521
>gi|410961882|ref|XP_003987507.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Felis catus]
Length = 429
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 192 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 235
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 236 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 294
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 295 TFSDSECARRALEQLNGFELAGRPMRVGHVT 325
>gi|397473313|ref|XP_003808159.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Pan paniscus]
Length = 442
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|296214508|ref|XP_002753659.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Callithrix
jacchus]
Length = 439
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 206 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 249
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 250 -MANNLQKGTGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 308
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 309 TFSDSECARRAMDQLNGFELAGRPMRVGHVT 339
>gi|149756180|ref|XP_001494897.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Equus
caballus]
Length = 430
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 194 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 237
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 238 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 297 TFSDSECARRALEQLNGFELAGRPMRVGHVT 327
>gi|410261842|gb|JAA18887.1| RNA binding motif protein 23 [Pan troglodytes]
gi|410349771|gb|JAA41489.1| RNA binding motif protein 23 [Pan troglodytes]
Length = 442
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG+SKGYGF
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|12803481|gb|AAH02566.1| RNA binding motif protein 23 [Homo sapiens]
gi|189055004|dbj|BAG37988.1| unnamed protein product [Homo sapiens]
gi|312151810|gb|ADQ32417.1| RNA binding motif protein 23 [synthetic construct]
Length = 424
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|119586634|gb|EAW66230.1| RNA binding motif protein 23, isoform CRA_h [Homo sapiens]
Length = 467
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 235 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 278
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 279 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 337
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 338 TFSDSECARRALEQLNGFELAGRPMRVGHVT 368
>gi|116734694|ref|NP_060577.3| probable RNA-binding protein 23 isoform 2 [Homo sapiens]
gi|18848317|gb|AAH24208.1| RNA binding motif protein 23 [Homo sapiens]
gi|119586627|gb|EAW66223.1| RNA binding motif protein 23, isoform CRA_b [Homo sapiens]
gi|119586633|gb|EAW66229.1| RNA binding motif protein 23, isoform CRA_b [Homo sapiens]
Length = 423
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|410218750|gb|JAA06594.1| RNA binding motif protein 23 [Pan troglodytes]
gi|410307726|gb|JAA32463.1| RNA binding motif protein 23 [Pan troglodytes]
Length = 426
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG+SKGYGF
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|119586630|gb|EAW66226.1| RNA binding motif protein 23, isoform CRA_e [Homo sapiens]
Length = 449
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 217 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 260
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 261 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 319
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 320 TFSDSECARRALEQLNGFELAGRPMRVGHVT 350
>gi|18416114|ref|NP_568220.1| RNA-binding protein 39 [Arabidopsis thaliana]
gi|15451046|gb|AAK96794.1| splicing factor-like protein [Arabidopsis thaliana]
gi|332004077|gb|AED91460.1| RNA-binding protein 39 [Arabidopsis thaliana]
Length = 527
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 149/389 (38%), Gaps = 78/389 (20%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
R R V+ +P A E+ + FFS+ +++ K ++E
Sbjct: 165 RDQRTVFAYQMPLKATERDVYEFFSK------AGKVRDVRLIMDRNSRRSKGVGYIEFYD 218
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
V A+AL G +F G V V+ P++ LA + S +GG
Sbjct: 219 VMSVPMAIALSGQLFLGQPVMVK-PSEAEKNLAQS-----------------NSTTVGGT 260
Query: 292 EGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
DR ++VG L + +E Q++++ E+FG + L D +TG KG+GF + +
Sbjct: 261 GPADRKLYVGNLHFNMSELQLRQIFEAFGPVELVQLPLDPETGQCKGFGFIQFVQLEHSK 320
Query: 351 IACAALNG-LKMGDKTLTVR----------RATASGQSKTEQESILAQAQQHIAIQKMAL 399
A ALNG L++ +T+ V A S + LA Q A+ L
Sbjct: 321 AAQIALNGKLEIAGRTIKVSSVSDHIGTQDSAPKSADFDDDDGGGLALNAQSRAMLMQKL 380
Query: 400 QTSGMNT----------LGG------GM---------------SLFGETL---AKVLCLT 425
SG+ T L G GM S E + ++ L L
Sbjct: 381 DRSGIATSIVGSLGVPGLNGAAFNQPGMNPSFPTSVLPTTAIPSFVNEHVGLPSECLLLK 440
Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
+ EI +D+ +EC KYG + ++ + D+N G V+L +
Sbjct: 441 NMFDPATETEPNFDLEIRDDVADECSKYGPVNHIYV---DKNSA-----GFVYLRFQSVE 492
Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A A+ A+ R F ++A + P +Y
Sbjct: 493 AAAAAQRAMHMRWFAQKMISATFMPPHEY 521
>gi|114652069|ref|XP_001159523.1| PREDICTED: probable RNA-binding protein 23 isoform 11 [Pan
troglodytes]
Length = 426
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG+SKGYGF
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|410261840|gb|JAA18886.1| RNA binding motif protein 23 [Pan troglodytes]
gi|410349769|gb|JAA41488.1| RNA binding motif protein 23 [Pan troglodytes]
gi|410349773|gb|JAA41490.1| RNA binding motif protein 23 [Pan troglodytes]
Length = 426
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG+SKGYGF
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|255085602|ref|XP_002505232.1| predicted protein [Micromonas sp. RCC299]
gi|226520501|gb|ACO66490.1| predicted protein [Micromonas sp. RCC299]
Length = 211
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/205 (27%), Positives = 89/205 (43%), Gaps = 33/205 (16%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK-----FAFVE 228
A ++YVG LP N + + F + ++V + E++ FAFV
Sbjct: 32 AAKLYVGHLPSTMNAERMLEMFKPFGRVLQ----------IDVIPDRERQLSCKGFAFVL 81
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
T EEA A AL+G + EG ++ VR + P P +N + A
Sbjct: 82 FSTPEEAIAAKALNGHVVEGKSIDVRLKAE----------PRAPREPVNAPVAPVNDDA- 130
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
+++V +P ++ ++K LL+ +G ++ DR+TG S+G+GF D
Sbjct: 131 -------KLYVAYMPDHYRAEELKMLLQPYGLPSDVRVITDRETGRSRGFGFAQMMDEQQ 183
Query: 349 TDIACAALNGLKMGDKTLTVRRATA 373
A LNG + KTL VR A A
Sbjct: 184 AMAAIQGLNGQMLDGKTLVVRIAGA 208
>gi|221129809|ref|XP_002164481.1| PREDICTED: RNA-binding motif protein, X-linked 2-like [Hydra
magnipapillata]
Length = 255
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 44/72 (61%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+F+GGLPY TE + + +G + +LV+D+ TG KGYGF Y+D T +A
Sbjct: 39 IFIGGLPYDLTEGDVLAVFSQYGEIVNINLVRDKKTGKFKGYGFLCYEDQRSTILAVDNF 98
Query: 357 NGLKMGDKTLTV 368
NG+K+G +T+ V
Sbjct: 99 NGIKLGGRTIRV 110
>gi|326431687|gb|EGD77257.1| hypothetical protein PTSG_08350 [Salpingoeca sp. ATCC 50818]
Length = 397
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 47/84 (55%)
Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
P V+VG +PY TE ++ + + G + F L+ D++TG SKG+GFC + D A + A
Sbjct: 3 APTSVWVGNIPYEATEEELIKFFSAVGDVKNFHLITDQNTGRSKGFGFCYFLDAAAAESA 62
Query: 353 CAALNGLKMGDKTLTVRRATASGQ 376
L+G + D+ L V AT Q
Sbjct: 63 VRNLSGQPLRDRPLRVDLATPRSQ 86
>gi|261858408|dbj|BAI45726.1| RNA binding motif protein 23 [synthetic construct]
Length = 406
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306
>gi|225711846|gb|ACO11769.1| Cleavage stimulation factor 64 kDa subunit [Lepeophtheirus
salmonis]
Length = 330
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 50/93 (53%), Gaps = 4/93 (4%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + F LV DR+ G KGYGFC Y+D + A L
Sbjct: 18 VFVGNIPYEATEEKLKDIFSEVGPVTSFKLVYDRENGKPKGYGFCEYKDADMALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQ----SKTEQESIL 385
NG ++ +TL V A +K E E I+
Sbjct: 78 NGYEIEGRTLRVDNACTEKNRLEMAKGEAEEIV 110
>gi|397473311|ref|XP_003808158.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Pan paniscus]
Length = 426
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|198476543|ref|XP_001357388.2| GA10876 [Drosophila pseudoobscura pseudoobscura]
gi|198137744|gb|EAL34457.2| GA10876 [Drosophila pseudoobscura pseudoobscura]
Length = 625
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 80/200 (40%), Gaps = 28/200 (14%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
R AR V+ L + + FFS S G V + N K+F A++E
Sbjct: 264 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 314
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
E + A+ L G GV + V+ L A QP ++
Sbjct: 315 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNATPAFQPKSHV------------ 362
Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
GP R++VG L + TE ++ + E FG + L+ D +T SKGYGF Y +
Sbjct: 363 ----GPMRLYVGSLHFDITEEMLRGIFEPFGKIDAIQLIMDTETNRSKGYGFITYHNAED 418
Query: 349 TDIACAALNGLKMGDKTLTV 368
A LNG ++ + + V
Sbjct: 419 AKKALEQLNGFELAGRPMKV 438
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 34/74 (45%), Gaps = 7/74 (9%)
Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
EI +D+ EEC K+G ++++ + G V+++ A NAL GR F
Sbjct: 535 EIRDDVLEECAKHGGVLHIHV-------DTASPTGTVYVKCPSTTTAVLAVNALHGRWFA 587
Query: 501 GNTVNAFYYPEDKY 514
G + A Y P Y
Sbjct: 588 GRVITAAYVPVVNY 601
>gi|395547925|ref|XP_003775192.1| PREDICTED: cleavage stimulation factor subunit 2 [Sarcophilus
harrisii]
Length = 556
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
VG +PY TE Q+K++ G + F LV DR+TG KGYGFC YQD A LNG
Sbjct: 5 VGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNLNG 64
Query: 359 LKMGDKTLTVRRATASGQSKTEQESI 384
+ + L V A AS ++K E +S+
Sbjct: 65 REFSGRALRVDNA-ASEKNKEELKSL 89
>gi|7022544|dbj|BAA91638.1| unnamed protein product [Homo sapiens]
Length = 406
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306
>gi|256085765|ref|XP_002579083.1| rna recognition motif containing protein [Schistosoma mansoni]
gi|360043212|emb|CCD78624.1| putative rna recognition motif containing protein [Schistosoma
mansoni]
Length = 412
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 2/88 (2%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+FVG +PY TE ++ EL G + GF LV DR++G KGYGFC Y +PA+ A L
Sbjct: 18 IFVGNIPYEATEEKLIELFSKAGPVIGFRLVYDRESGKPKGYGFCEYNNPAIAASALRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
++ + L R A+G+ + + ++
Sbjct: 78 QNIEFNGRPL--RIGPAAGEQNSAELAL 103
>gi|351697087|gb|EHB00006.1| Putative RNA-binding protein 23 [Heterocephalus glaber]
Length = 436
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 85/203 (41%), Gaps = 35/203 (17%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI------NHEKKFA 225
R AR V+ L + + FFS AIG V +V I K A
Sbjct: 163 RDARTVFCMQLAARIRPRDLEDFFS----AIG--------KVHDVRIISDRNSRRSKGIA 210
Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLAS 285
+VE ++ A+ L G GV + V+ LAA +A+
Sbjct: 211 YVEFCDIQSVPLAIGLTGQRLLGVPIVVQASQAEKNRLAA-----------------MAN 253
Query: 286 GAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
G+ GP R++VG L + TE ++ + E FG + L+KD +TG+SKGYGF + +
Sbjct: 254 NLQKGSGGPKRLYVGCLHFNITEDMLRGIFEPFGKIENIVLMKDSETGHSKGYGFITFSE 313
Query: 346 PAVTDIACAALNGLKMGDKTLTV 368
A LNG ++ + + V
Sbjct: 314 SECARRAVEQLNGFELAGRPMRV 336
>gi|149756182|ref|XP_001494921.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Equus
caballus]
Length = 412
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 176 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 219
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 220 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 278
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 279 TFSDSECARRALEQLNGFELAGRPMRVGHVT 309
>gi|335310533|ref|XP_003362077.1| PREDICTED: probable RNA-binding protein 23 [Sus scrofa]
Length = 443
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 209 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 252
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 253 -MANNLQKGTGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 311
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + + T
Sbjct: 312 TFSDSECARRALEQLNGFELAGRPMRIGHVT 342
>gi|116734698|ref|NP_001070820.1| probable RNA-binding protein 23 isoform 3 [Homo sapiens]
gi|119586628|gb|EAW66224.1| RNA binding motif protein 23, isoform CRA_c [Homo sapiens]
Length = 405
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306
>gi|426376372|ref|XP_004054975.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Gorilla
gorilla gorilla]
Length = 437
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340
>gi|294882869|ref|XP_002769861.1| splicing factor, putative [Perkinsus marinus ATCC 50983]
gi|239873674|gb|EER02579.1| splicing factor, putative [Perkinsus marinus ATCC 50983]
Length = 364
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 86/208 (41%), Gaps = 28/208 (13%)
Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV--NVYINHEKK 223
M QQA R V V G+ P E+ + F SQ N+ D V + N K
Sbjct: 146 MVQQAHRDDCTVMVMGIHPKCTEKEVYVFMSQ-------NAGKVRDVQVIRDPRTNRSKG 198
Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
A+VE T + A+A +G G +R++ Q N A +
Sbjct: 199 VAYVEFYTPDSILKALACNGQALMGHPIRIQ--------------ASQAEKNRAAEAARV 244
Query: 284 ASGAIGGAEGPDRVFVGGLP---YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
+ P RV+VGGL + E++I++L FG + ++ K TG +G+ F
Sbjct: 245 VQNQ--QQDLPMRVYVGGLTGVLIHLQESEIRKLFAPFGDIQCIEIAKSPYTGRPRGFAF 302
Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTV 368
+Y +A AA++ ++ D TL V
Sbjct: 303 VIYSRACDARVAIAAMHKYRIADTTLEV 330
>gi|296483628|tpg|DAA25743.1| TPA: RNA binding motif protein 23 [Bos taurus]
Length = 463
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 80/190 (42%), Gaps = 24/190 (12%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 222 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 265
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L TE ++ +LE FG + L+KD +TG SKGYGF
Sbjct: 266 -MANNLQKGSGGPVRLYVGSLHCNITEDMLRGILEPFGKIDNIVLMKDSETGRSKGYGFI 324
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-------EQESILAQAQQHIAI 394
+ D A LNG ++ + + + T T +QE L A H+ +
Sbjct: 325 TFSDSECARRALEQLNGFELAGRPMRIGHVTERPDGGTDITFPDGDQELDLGSAGGHLQL 384
Query: 395 QKMALQTSGM 404
+ SG+
Sbjct: 385 MAKLAEGSGI 394
>gi|198418855|ref|XP_002123179.1| PREDICTED: similar to cleavage stimulation factor, 3 pre-RNA,
subunit 2, 64kDa [Ciona intestinalis]
Length = 455
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 49/92 (53%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+K++ G + F LV DR++G KGYGF YQD + L
Sbjct: 18 VFVGNIPYEATEEQLKDIFNEVGNVISFRLVFDRESGKPKGYGFAEYQDKETALSSMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
NG ++ + L V AT+ E ++ + A
Sbjct: 78 NGRELHGRPLRVDHATSERNRNDEFNNLRSMA 109
>gi|10880789|gb|AAG24388.1|AF275678_1 PP239 protein [Homo sapiens]
Length = 418
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 185 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 228
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 229 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 287
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 288 TFSDSECARRALEQLNGFELAGRPMRVGHVT 318
>gi|255082273|ref|XP_002508355.1| predicted protein [Micromonas sp. RCC299]
gi|226523631|gb|ACO69613.1| predicted protein [Micromonas sp. RCC299]
Length = 518
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 94/396 (23%), Positives = 156/396 (39%), Gaps = 81/396 (20%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH---EKKFAFVE 228
R R V+ L A+E+ I FFS+ AG + V +Y + K A++E
Sbjct: 133 RDTRTVFAYNLSTKADERDIYQFFSK---------AGTVNDVRIIYDRNTPRSKGMAYIE 183
Query: 229 MRTVEEASNAMALDGIIFEGVAVRVR-RPTDYNPTLAAALGPGQPSPNLNLAAVG----- 282
++A+AL G + V V+ + N A Q L + A+G
Sbjct: 184 FADKANITDALALTGQMLRNQVVMVKASEAEKNIAWEAE----QAQKKLEMKALGATDPA 239
Query: 283 ---LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A A GP ++ V GL ET +K + E FG + +D TG S+G G
Sbjct: 240 SAAAAVNAQAHGNGPCKLQVHGLDVNIGETDLKAVFEPFGETDFISIQRD-STGRSRGVG 298
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-------------EQESILA 386
F Y+ +A + LNGL++ ++L V A + + EQE +
Sbjct: 299 FVQYKQTQHAVLAISQLNGLELVGQSLKVTMAPIAASTLNAAQAASMVTDKIDEQEGVRL 358
Query: 387 QAQQHIAIQ-KMALQ--TSGMNTLGG-----GMSLFGETLA------------------- 419
++ A+ K+A Q T G GG G+ + E +A
Sbjct: 359 DSRSRAALMAKLAGQDETQGALYSGGIDPKTGLPVSAEEMAAAQRAAHMTEVEFAQGVLG 418
Query: 420 -------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
+ L L + E + +I ED+++EC K+G + ++ + + +
Sbjct: 419 PASPIPTQCLLLKNMFDPAEETEPEWWIDIGEDVKDECSKHGPVSHIHVDKESR------ 472
Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
G V+L++ G + A+ AL GR F G + A +
Sbjct: 473 --GFVYLKFGSTEGASAARQALHGRWFAGKMIAAEF 506
>gi|410961884|ref|XP_003987508.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Felis catus]
Length = 411
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 174 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 217
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 218 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 276
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 277 TFSDSECARRALEQLNGFELAGRPMRVGHVT 307
>gi|115497272|ref|NP_001069104.1| probable RNA-binding protein 23 [Bos taurus]
gi|113911797|gb|AAI22594.1| RNA binding motif protein 23 [Bos taurus]
Length = 463
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 80/190 (42%), Gaps = 24/190 (12%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 222 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 265
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L TE ++ +LE FG + L+KD +TG SKGYGF
Sbjct: 266 -MANNLQKGSGGPVRLYVGSLHCNITEDMLRGILEPFGKIDNIVLMKDSETGRSKGYGFI 324
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-------EQESILAQAQQHIAI 394
+ D A LNG ++ + + + T T +QE L A H+ +
Sbjct: 325 TFSDSECARRALEQLNGFELAGRPMRIGHVTERPDGGTDITFPDGDQELDLGSAGGHLQL 384
Query: 395 QKMALQTSGM 404
+ SG+
Sbjct: 385 MAKLAEGSGI 394
>gi|114652071|ref|XP_522797.2| PREDICTED: probable RNA-binding protein 23 isoform 12 [Pan
troglodytes]
Length = 408
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG+SKGYGF
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 275
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306
>gi|426376370|ref|XP_004054974.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Gorilla
gorilla gorilla]
Length = 421
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|397473315|ref|XP_003808160.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Pan paniscus]
Length = 408
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306
>gi|291403543|ref|XP_002718110.1| PREDICTED: RNA binding motif protein 23 isoform 3 [Oryctolagus
cuniculus]
Length = 428
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 193 KGIAYVEFCDIQAVPLAIGLTGQRLLGVPIMVQASQAEKNRLAA---------------- 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 237 -MANNLQKGSGGPLRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 295
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 296 TFSDSECGRRALEQLNGFELAGRPMRVGHVT 326
>gi|226466604|emb|CAX69437.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa
[Schistosoma japonicum]
Length = 414
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 2/88 (2%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+FVG +PY TE ++ EL G + GF LV DR++G KGYGFC Y +PA+ A L
Sbjct: 18 IFVGNIPYEATEEKLIELFSKAGPVIGFRLVYDRESGKPKGYGFCEYNNPAIAASALRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
++ + L R A+G+ + + ++
Sbjct: 78 QNIEFNGRPL--RIGPAAGEQNSAELAL 103
>gi|291403541|ref|XP_002718109.1| PREDICTED: RNA binding motif protein 23 isoform 2 [Oryctolagus
cuniculus]
Length = 444
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 209 KGIAYVEFCDIQAVPLAIGLTGQRLLGVPIMVQASQAEKNRLAA---------------- 252
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 253 -MANNLQKGSGGPLRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 311
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 312 TFSDSECGRRALEQLNGFELAGRPMRVGHVT 342
>gi|224081877|ref|XP_002306512.1| predicted protein [Populus trichocarpa]
gi|222855961|gb|EEE93508.1| predicted protein [Populus trichocarpa]
Length = 367
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 90/347 (25%), Positives = 138/347 (39%), Gaps = 76/347 (21%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL-NLAA 280
K +VE A+AL G + G V V+ P + NL +A
Sbjct: 35 KGVGYVEFYDAMSVPMAIALSGQLLFGQPVMVK--------------PSEAEKNLVQSSA 80
Query: 281 VGLASGAIGGAEGP-DR-VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
+ + G GP DR ++VG L + TE Q+++L E FGT+ L D +TG KG+
Sbjct: 81 SSGGTSGVAGPFGPVDRKLYVGNLHFNMTEMQLRQLFEPFGTVELVQLPLDLETGQCKGF 140
Query: 339 GFCVYQDPAVTDIACAALNG-LKMGDKTLTVRRATA-SGQSKTEQES---------ILAQ 387
GF + A +ALNG L++ +T+ V T GQ T +S LA
Sbjct: 141 GFVQFTQLENAKAAQSALNGKLEIAGRTIKVSSVTEHGGQQDTGAKSADFDDDDGGGLAL 200
Query: 388 AQQHIAIQKMALQTSGMNTLGGG------------------MSLFGETL-------AKVL 422
Q A+ L +G+ T G + + G+T A VL
Sbjct: 201 NAQSRALLMQKLDRTGIATSIAGSLGVPLLNGSASNQQAISLPIIGQTAIGAAALPAPVL 260
Query: 423 -------------CLTEAITADALADDE-EYE-EILEDMREECGKYGTLVNVVIPRPDQN 467
CL D + E +++ +I ED+ EEC KYG + ++ + D+N
Sbjct: 261 SSPAYEPIGQPSECLMLKNMFDPATETEPDFDLDIKEDVEEECSKYGQVEHIFV---DKN 317
Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
G V+L + A A+ A+ R F + A + P +Y
Sbjct: 318 -----STGCVYLRFGSIEAAAGAQRAMHMRWFARRLILAVFMPTREY 359
>gi|196005405|ref|XP_002112569.1| hypothetical protein TRIADDRAFT_56724 [Trichoplax adhaerens]
gi|190584610|gb|EDV24679.1| hypothetical protein TRIADDRAFT_56724 [Trichoplax adhaerens]
Length = 316
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 50/91 (54%)
Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
+AS I VFVG +PY TE Q+K++ S G + F LV DR++G KGYGFC
Sbjct: 1 MASATISRERSLRSVFVGNIPYEATEEQLKDIFGSAGPVVSFRLVYDRESGKPKGYGFCE 60
Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
+QD A L+G ++ ++L V A +
Sbjct: 61 FQDKETALSAMRNLSGYELNGRSLRVDSAAS 91
>gi|91085985|ref|XP_972080.1| PREDICTED: similar to CG10466 CG10466-PA [Tribolium castaneum]
gi|270010180|gb|EFA06628.1| hypothetical protein TcasGA2_TC009547 [Tribolium castaneum]
Length = 266
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 46/72 (63%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VF+GGLP+ TE I + +G + +L++D+D+G SKG+ F Y+D TD+A
Sbjct: 36 VFIGGLPFDLTEGDIICIFSQYGEVVNINLIRDKDSGKSKGFCFLCYEDQRSTDLAVDNF 95
Query: 357 NGLKMGDKTLTV 368
NG+K+ ++T+ V
Sbjct: 96 NGIKILNRTIRV 107
>gi|255088499|ref|XP_002506172.1| predicted protein [Micromonas sp. RCC299]
gi|226521443|gb|ACO67430.1| predicted protein [Micromonas sp. RCC299]
Length = 628
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/258 (24%), Positives = 110/258 (42%), Gaps = 24/258 (9%)
Query: 139 AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQV 198
AV E A+ A + A L+ +V + T+ ARRV++G + + A
Sbjct: 213 AVREEAKGKRGMNAAMVDAL-LLHNKVRWRDDTKPARRVHIGNVNAGVKAEEFARVLETR 271
Query: 199 MTAIGGNSA----------------GPGDAVV-NVYINHEKKFAFVEMRTVEEASNAMAL 241
+ + + PG V+ ++Y+N +K F F+E +E+ +AL
Sbjct: 272 IRTLSPEAVPWHYPLDKRGRVDERRAPGTRVIEHLYLN-DKGFGFLETTALEDVPAILAL 330
Query: 242 DGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGG 301
+G+ G R RRP DY+P + G + S + + P +VFVGG
Sbjct: 331 NGVRVNGGVTRFRRPKDYDPDNNPLVRDGSYRDVFQRVFTAVLSDEV--VDSPTKVFVGG 388
Query: 302 L-PYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD-PAVTDIACAALNGL 359
+ P T+ + E++ SFG L F D G +G+ + Y + +V A A L+G
Sbjct: 389 VEPRALTKLDLLEIVSSFGALTAFRCETD-GAGLCRGFAWMEYAEGESVAAKAVAGLSGY 447
Query: 360 KMGDKTLTVRRATASGQS 377
++ K + AT ++
Sbjct: 448 QLRGKPIAAALATPRAEA 465
>gi|313243391|emb|CBY42167.1| unnamed protein product [Oikopleura dioica]
Length = 199
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 6/88 (6%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA- 355
VFVG +PY TE QI+++ G + F LV DR+TG KGYGFC Y+D TD A +A
Sbjct: 27 VFVGNIPYEATEEQIRDIFNEVGVVLSFRLVYDRETGKPKGYGFCEYKD---TDTAMSAM 83
Query: 356 --LNGLKMGDKTLTVRRATASGQSKTEQ 381
LN ++ + L V AT + EQ
Sbjct: 84 RNLNTRELHGRNLRVDHATRDHGVEKEQ 111
>gi|255724230|ref|XP_002547044.1| predicted protein [Candida tropicalis MYA-3404]
gi|240134935|gb|EER34489.1| predicted protein [Candida tropicalis MYA-3404]
Length = 700
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 53/104 (50%), Gaps = 7/104 (6%)
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ-------NGGET 471
+KV+ L A+T L+D E Y+ D+ E KYG + VVIPRP + +
Sbjct: 593 SKVIRLLNAVTERELSDVETYKFTKNDIYREASKYGVVEQVVIPRPIRGRTPGILKLNRS 652
Query: 472 PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
PG+G VF+EY D TA LSGR + TV A ++ D Y
Sbjct: 653 PGMGSVFIEYKDEKTALTAMMELSGRTYNDRTVLATFFDYDDYL 696
>gi|227206234|dbj|BAH57172.1| AT5G09880 [Arabidopsis thaliana]
Length = 505
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 149/389 (38%), Gaps = 78/389 (20%)
Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
R R V+ +P A E+ + FFS+ +++ K ++E
Sbjct: 143 RDQRTVFAYQMPLKATERDVYEFFSK------AGKVRDVRLIMDRNSRRSKGVGYIEFYD 196
Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
V A+AL G +F G V V+ P++ LA + S +GG
Sbjct: 197 VMSVPMAIALSGQLFLGQPVMVK-PSEAEKNLAQS-----------------NSTTVGGT 238
Query: 292 EGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
DR ++VG L + +E Q++++ E+FG + L D +TG KG+GF + +
Sbjct: 239 GPADRKLYVGNLHFNMSELQLRQIFEAFGPVELVQLPLDPETGQCKGFGFIQFVQLEHSK 298
Query: 351 IACAALNG-LKMGDKTLTVR----------RATASGQSKTEQESILAQAQQHIAIQKMAL 399
A ALNG L++ +T+ V A S + LA Q A+ L
Sbjct: 299 AAQIALNGKLEIAGRTIKVSSVSDHIGTQDSAPKSADFDDDDGGGLALNAQSRAMLMQKL 358
Query: 400 QTSGMNT----------LGG------GM---------------SLFGETL---AKVLCLT 425
SG+ T L G GM S E + ++ L L
Sbjct: 359 DRSGIATSIVGSLGVPGLNGAAFNQPGMNPSFPTSVLPTTAIPSFVNEHVGLPSECLLLK 418
Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
+ EI +D+ +EC KYG + ++ + D+N G V+L +
Sbjct: 419 NMFDPATETEPNFDLEIRDDVADECSKYGPVNHIYV---DKNSA-----GFVYLRFQSVE 470
Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKY 514
A A+ A+ R F ++A + P +Y
Sbjct: 471 AAAAAQRAMHMRWFAQKMISATFMPPHEY 499
>gi|426376374|ref|XP_004054976.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Gorilla
gorilla gorilla]
Length = 403
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306
>gi|291400895|ref|XP_002716702.1| PREDICTED: RNA binding motif protein, X-linked 2 [Oryctolagus
cuniculus]
Length = 198
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 45/76 (59%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVGGLPY TE I + +G + LV+D+ TG S+G+GF Y+D T +A
Sbjct: 37 VFVGGLPYELTEGDILCVFSQYGEIVNIHLVRDKKTGKSRGFGFICYEDQRSTVLAVDNF 96
Query: 357 NGLKMGDKTLTVRRAT 372
NG+K+ +T+ V A+
Sbjct: 97 NGIKIKGRTIRVDHAS 112
>gi|440797518|gb|ELR18604.1| RNA recognition motif domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 696
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 99/251 (39%), Gaps = 60/251 (23%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV---NVYINHEKKFAFVEMR 230
A R+YVG L +E+ I T FS GP +V + K FAFVE
Sbjct: 250 ACRIYVGSLNFELSEEDIKTAFSPF---------GPVKSVSLTKDPLTQRSKGFAFVEYA 300
Query: 231 TVEEASNAMA-LDGIIFEGVAVRVRRP------------------------TDYNPTLAA 265
+ A+ A+ ++G + G ++V RP NP+L
Sbjct: 301 YPDAATAALKHMNGFMLAGRQLKVGRPHTPGAGLPGMPGMPGVMMPGLSPFPQLNPSLPV 360
Query: 266 ALGPG----------------------QPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLP 303
+ P QP+P + L A +R++VG +
Sbjct: 361 -MNPSILLQANAAIEAQKAAAAAANGSQPTPEMMQEFTKLTGKTAADATASNRIYVGSIH 419
Query: 304 YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGD 363
+ T IK + E+FGT+ L+ + +TG KGYGF Y++ + A +NG +G
Sbjct: 420 WDLTSDDIKTVFEAFGTVKSCVLMPNPETGKHKGYGFVEYEESKSAEEAIQQMNGWDLGG 479
Query: 364 KTLTVRRATAS 374
+ + V RA +S
Sbjct: 480 RPIKVGRAISS 490
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 71/147 (48%), Gaps = 22/147 (14%)
Query: 377 SKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADD 436
S E ++ Q++ +QK+A T+ GG S + + +A
Sbjct: 567 SHEENLTLSTPNQRYALMQKLA-----RGTITGGKS------------SRCVVLKDMAGP 609
Query: 437 EEYEEILE-DMREECGKYGTLVNVVIPRPDQNGGETPG--VGKVFLEYYDAVGCATAKNA 493
E+ ++ LE ++ +E KYG + VVI + Q+ E PG + K+F+ + A A +
Sbjct: 610 EDVDDELEGEITDEATKYGIVERVVIYQERQS--EKPGDVIIKIFILFQSADQAQKALTS 667
Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDYS 520
L+GR FGG + A +Y E K+ +DYS
Sbjct: 668 LNGRWFGGRQIKAAFYDEKKFLAEDYS 694
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 88/240 (36%), Gaps = 40/240 (16%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
R++VG L + +E IK FG + L KD T SKG+ F Y P A
Sbjct: 252 RIYVGSLNFELSEEDIKTAFSPFGPVKSVSLTKDPLTQRSKGFAFVEYAYPDAATAALKH 311
Query: 356 LNGLKMGDKTLTVRRATASGQS-------------------------KTEQESILAQAQQ 390
+NG + + L V R G SIL QA
Sbjct: 312 MNGFMLAGRQLKVGRPHTPGAGLPGMPGMPGVMMPGLSPFPQLNPSLPVMNPSILLQANA 371
Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEY------EEILE 444
I QK A + G E + + LT ADA A + Y + +
Sbjct: 372 AIEAQKAAAAAA------NGSQPTPEMMQEFTKLTGKTAADATASNRIYVGSIHWDLTSD 425
Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
D++ +GT+ + V+ P+ G+ G G F+EY ++ A ++G GG +
Sbjct: 426 DIKTVFEAFGTVKSCVL-MPNPETGKHKGYG--FVEYEESKSAEEAIQQMNGWDLGGRPI 482
>gi|260942693|ref|XP_002615645.1| hypothetical protein CLUG_04527 [Clavispora lusitaniae ATCC 42720]
gi|238850935|gb|EEQ40399.1| hypothetical protein CLUG_04527 [Clavispora lusitaniae ATCC 42720]
Length = 559
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 93/215 (43%), Gaps = 12/215 (5%)
Query: 307 TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA-ALNGLKM-GDK 364
TET + + L+ + F LV+ T S G F + ++ C + LK+ G
Sbjct: 342 TETMLLDDLQKIAKVKAFKLVRAVGTKESLGVAFVEF---YISSKECTNTKSALKLIGTY 398
Query: 365 TLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCL 424
++ + + I +IQ + + +L + +KV+ L
Sbjct: 399 VEEAKKLDIVSKIEFSCIKIGENYTSLTSIQDCPIDFKTLKSLVRNEYVQFHPKSKVIQL 458
Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG------ETPGVGKVF 478
+T + L +DE Y+ I D+ EE +GT++++ IP+P PGVGKVF
Sbjct: 459 INIVTIEDLCNDETYKFIYSDIFEEAKTFGTVLSLKIPKPSYKKSPGVEEVNEPGVGKVF 518
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTV-NAFYYPED 512
+EY D +A L+GR + TV AF+ ED
Sbjct: 519 VEYEDEKTALSAIMGLAGRSYNDRTVLCAFFNHED 553
>gi|440906315|gb|ELR56591.1| Putative RNA-binding protein 23 [Bos grunniens mutus]
Length = 463
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 79/190 (41%), Gaps = 24/190 (12%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 222 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 265
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L TE ++ +LE FG + L+KD +TG SKGYGF
Sbjct: 266 -MANNLQKGNGGPVRLYVGSLHCNITEDMLRGILEPFGKIDNIVLMKDSETGRSKGYGFI 324
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-------EQESILAQAQQHIAI 394
+ D A LNG ++ + + + T T +QE L A H+ +
Sbjct: 325 TFSDSECARRALEQLNGFELAGRPMRIGHVTERPDGGTDITFPDGDQELDLGSAAGHLQL 384
Query: 395 QKMALQTSGM 404
+ SG+
Sbjct: 385 MAKLAEGSGI 394
>gi|170054071|ref|XP_001862961.1| cleavage stimulation factor 64 kDa subunit [Culex quinquefasciatus]
gi|167874431|gb|EDS37814.1| cleavage stimulation factor 64 kDa subunit [Culex quinquefasciatus]
Length = 400
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 52/93 (55%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + LV DR++G KGYGFC Y+D A L
Sbjct: 17 VFVGNIPYEATEEKLKDIFSEVGPVISLKLVFDRESGKPKGYGFCEYKDQETALSAMRNL 76
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G + L V A + +S+ E ++L Q
Sbjct: 77 NGYEIGGRALRVDNA-CTEKSRMEMAALLQGPQ 108
>gi|324514401|gb|ADY45855.1| Cleavage stimulation factor subunit 2 [Ascaris suum]
Length = 324
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 44/84 (52%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG + Y E Q+K++ G + LV DR+TG KGYGFC Y DP + A L
Sbjct: 25 VFVGNISYEVGEEQLKQVFSQVGPVVHLRLVHDRETGKPKGYGFCEYNDPQTAESAIRNL 84
Query: 357 NGLKMGDKTLTVRRATASGQSKTE 380
NG ++ + L V A +S E
Sbjct: 85 NGYELNGRQLRVDSAAGGERSADE 108
>gi|298248970|ref|ZP_06972774.1| RNP-1 like RNA-binding protein [Ktedonobacter racemifer DSM 44963]
gi|297546974|gb|EFH80841.1| RNP-1 like RNA-binding protein [Ktedonobacter racemifer DSM 44963]
Length = 104
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 44/76 (57%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
R++VGGLPY TE + +L E G + ++ DR+TG SKG+GF T A
Sbjct: 2 RIYVGGLPYQSTEQDLIQLFEQIGQVTSATVITDRETGRSKGFGFVEMSSDDETRAAIEQ 61
Query: 356 LNGLKMGDKTLTVRRA 371
LNG +GD+T+TV A
Sbjct: 62 LNGSTLGDRTITVNEA 77
>gi|291403539|ref|XP_002718108.1| PREDICTED: RNA binding motif protein 23 isoform 1 [Oryctolagus
cuniculus]
Length = 410
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 175 KGIAYVEFCDIQAVPLAIGLTGQRLLGVPIMVQASQAEKNRLAA---------------- 218
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 219 -MANNLQKGSGGPLRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 277
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 278 TFSDSECGRRALEQLNGFELAGRPMRVGHVT 308
>gi|148231281|ref|NP_001085808.1| RNA binding motif protein 23 [Xenopus laevis]
gi|49118375|gb|AAH73374.1| MGC80803 protein [Xenopus laevis]
Length = 416
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 66/154 (42%), Gaps = 17/154 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE + A+ L G GV + V+ LAA S NL
Sbjct: 194 KGIAYVEFCEIHSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAAM------SNNLQ---- 243
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
G GP R++VG L + TE ++ + E FG + L+K+ DTG SKG+GF
Sbjct: 244 -------RGNFGPMRLYVGSLHFNITEEMLRGIFEPFGKIENIQLLKEPDTGRSKGFGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
+ D A LNG ++ K + V T G
Sbjct: 297 TFTDAECARRALEQLNGFELAGKPMKVGHVTGGG 330
>gi|156543304|ref|XP_001603981.1| PREDICTED: RNA-binding motif protein, X-linked 2-like [Nasonia
vitripennis]
Length = 139
Score = 67.0 bits (162), Expect = 3e-08, Method: Composition-based stats.
Identities = 30/72 (41%), Positives = 45/72 (62%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+F+GGLPY TE + + +G + +LV+D+DTG KGYGF Y+D T +A L
Sbjct: 36 IFIGGLPYDLTEGDVIAVFSQYGEIVNINLVRDKDTGKQKGYGFLCYEDQRSTILAVDNL 95
Query: 357 NGLKMGDKTLTV 368
NG+K+ +T+ V
Sbjct: 96 NGIKILGRTIRV 107
>gi|426232776|ref|XP_004010396.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Ovis aries]
Length = 447
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 206 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAAT--------------- 250
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
AS G+ GP R++VG L + TE ++ + E FG + L+KD +TG SKGYGF
Sbjct: 251 --ASNLQKGSGGPVRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSETGCSKGYGFI 308
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + + T
Sbjct: 309 TFSDSECARRALEQLNGFELAGRPMRIGHVT 339
>gi|426232774|ref|XP_004010395.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Ovis aries]
Length = 463
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 222 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAAT--------------- 266
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
AS G+ GP R++VG L + TE ++ + E FG + L+KD +TG SKGYGF
Sbjct: 267 --ASNLQKGSGGPVRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSETGCSKGYGFI 324
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + + T
Sbjct: 325 TFSDSECARRALEQLNGFELAGRPMRIGHVT 355
>gi|358254188|dbj|GAA54213.1| cleavage stimulation factor subunit 2 [Clonorchis sinensis]
Length = 437
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 42/72 (58%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+FVG +PY TE ++ EL G + GF LV DR++G KGYGFC Y +PA+ A L
Sbjct: 23 IFVGNIPYEATEEKLIELFGKAGPVIGFRLVYDRESGKPKGYGFCEYNNPAIAASALRNL 82
Query: 357 NGLKMGDKTLTV 368
++ + L +
Sbjct: 83 QNIEFNGRPLRI 94
>gi|348676634|gb|EGZ16451.1| hypothetical protein PHYSODRAFT_314245 [Phytophthora sojae]
Length = 449
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 152/373 (40%), Gaps = 70/373 (18%)
Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE------KKFAFV 227
ARR+Y+G L E+ I + F+ P A+ ++ ++ E K F F+
Sbjct: 119 ARRLYIGNLYYDLKEEDIRSAFA------------PFGAIHSIDLSLEPGASRSKGFCFL 166
Query: 228 EMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
E V A +A+ L+G A+RV RP G +PN +L+ +
Sbjct: 167 EYEDVLAAESAVQVLNGTPLANRAMRVGRPHR-----------GNTNPNDSLS---IGQE 212
Query: 287 AIGGAEGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDL--VKDRDTGNSKGYGFCVY 343
AI P + +++ + ++ + FG + + V ++G +GYGF +
Sbjct: 213 AIKNV--PTKCIYIANVRVELNSQHLESIFSPFGAIRSSVMAAVSPLESGVHRGYGFMEF 270
Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
+ + A +NG ++ + L V +A+ + LA +Q + T G
Sbjct: 271 VEESCAASAIQHMNGFELAGQPLKVGKASEAAMLIN-----LATSQDKVVRDGPGATTDG 325
Query: 404 MNTL---GGGMSLFGETLAK------------VLCLTEAITADALADDEEYEEILEDMRE 448
N L + F E + LCL + + ++ E E +R
Sbjct: 326 ANGLVPEPKKTATFAEDDVEGVKDVADGDDKCCLCLMNLVNRGEVDEELEDE-----VRG 380
Query: 449 ECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
ECGK+G + V I E +VF+ + +A G + AK AL GR FGGN V A Y
Sbjct: 381 ECGKFGNVNKVEI-------HELADHVRVFVLFDEAFGASKAKQALHGRFFGGNQVQAHY 433
Query: 509 YPEDKYFNKDYSA 521
YP + + Y++
Sbjct: 434 YPLRELEQQRYTS 446
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 53/116 (45%)
Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
+L ++ L S + + R+++G L Y E I+ FG +H DL + SK
Sbjct: 102 SLLSLNLPSAKVSPNDLARRLYIGNLYYDLKEEDIRSAFAPFGAIHSIDLSLEPGASRSK 161
Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
G+ F Y+D + A LNG + ++ + V R + + SI +A +++
Sbjct: 162 GFCFLEYEDVLAAESAVQVLNGTPLANRAMRVGRPHRGNTNPNDSLSIGQEAIKNV 217
>gi|341882558|gb|EGT38493.1| hypothetical protein CAEBREN_09163 [Caenorhabditis brenneri]
Length = 757
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 61/102 (59%), Gaps = 10/102 (9%)
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+ V+ L +T D D +EY E ++REECGK+GT+++VVI + GV K+F
Sbjct: 665 SSVIVLRNMVTPD---DIDEYLE--GEIREECGKFGTVLDVVIAN-----FASSGVVKIF 714
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
++Y D++ AK AL GR FGGN+V A Y + + + DY+
Sbjct: 715 VKYADSMQVDRAKAALDGRFFGGNSVKAEAYDQILFDHADYT 756
Score = 45.1 bits (105), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 19/73 (26%), Positives = 38/73 (52%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
R++VG + + E +++ + FG + ++ D TG+ K + F Y+ P +A +
Sbjct: 103 RIYVGSISFEIREDMLRKAFDPFGPIKSINMSWDPATGHHKTFAFVEYEIPEAALLAQES 162
Query: 356 LNGLKMGDKTLTV 368
+NG +G + L V
Sbjct: 163 MNGQMLGGRNLKV 175
>gi|312371125|gb|EFR19385.1| hypothetical protein AND_22609 [Anopheles darlingi]
Length = 377
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +KE+ G + LV DR+TG KGYGFC Y+D A L
Sbjct: 18 VFVGNIPYDATEEALKEIFCEVGLVMSMKLVYDRETGKPKGYGFCEYKDKETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG G + L V A +S+ E ++L
Sbjct: 78 NGYVFGGRPLRVDNACTE-KSRLEMAALL 105
>gi|410989439|ref|XP_004000969.1| PREDICTED: RNA-binding motif protein, X-linked 2 [Felis catus]
Length = 507
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 22/129 (17%)
Query: 262 TLAAALGPGQPSPNLNLAAV--------------------GLASGAIGGAEGPDR--VFV 299
LAA GP + P+L A + G+A +E D +F+
Sbjct: 162 CLAAERGPAEMKPHLTSACLCSPLTKVKLINELNEREVQLGVADKVSWHSEYKDSAWIFL 221
Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
GGLPY TE I + +G + +LV+D+ TG SKG+ F Y+D T +A NG+
Sbjct: 222 GGLPYELTEGDIICVFSQYGEIVNINLVRDKKTGKSKGFCFLCYEDQRSTVLAVDNFNGI 281
Query: 360 KMGDKTLTV 368
K+ +T+ V
Sbjct: 282 KIKGRTIRV 290
>gi|449665135|ref|XP_002159315.2| PREDICTED: cleavage stimulation factor subunit 2-like [Hydra
magnipapillata]
Length = 413
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 47/89 (52%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY ++ Q+K++ G + F LV DR+TG KGYGFC Y+D A L
Sbjct: 28 VFVGNIPYEASDDQLKDIFSQAGPVLSFRLVYDRETGKPKGYGFCEYKDSETAQSAMRNL 87
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG ++ + L V A + + E L
Sbjct: 88 NGTEIHGRQLRVDSAASQKGNGVEDPKAL 116
>gi|395503034|ref|XP_003755878.1| PREDICTED: probable RNA-binding protein 23 [Sarcophilus harrisii]
Length = 451
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 219 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 262
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKG+GF
Sbjct: 263 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDPDTGRSKGFGFL 321
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 322 TFSDSECARRALEQLNGFELAGRPMRVGHVT 352
>gi|126277396|ref|XP_001369125.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Monodelphis
domestica]
Length = 449
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 217 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 260
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKG+GF
Sbjct: 261 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDPDTGRSKGFGFL 319
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 320 TFSDSECARRALEQLNGFELAGRPMRVGHVT 350
>gi|197102126|ref|NP_001124751.1| probable RNA-binding protein 23 [Pongo abelii]
gi|55725769|emb|CAH89665.1| hypothetical protein [Pongo abelii]
Length = 423
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 69/151 (45%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ A+ +P+
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQ---------ASQAEKNRPA-------- 233
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 234 AMANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|221044666|dbj|BAH14010.1| unnamed protein product [Homo sapiens]
Length = 270
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 37 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 80
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 81 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 139
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 140 TFSDSECARRALEQLNGFELAGRPMRVGHVT 170
>gi|158292144|ref|XP_313699.3| AGAP004414-PA [Anopheles gambiae str. PEST]
gi|157017295|gb|EAA09129.3| AGAP004414-PA [Anopheles gambiae str. PEST]
Length = 390
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE +KE+ G + LV DR+TG KGYGFC Y+D A L
Sbjct: 18 VFVGNIPYDATEEALKEIFCEVGLVLSMKLVYDRETGKPKGYGFCEYKDKETALSAMRNL 77
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
NG G + L V A +S+ E ++L
Sbjct: 78 NGYVFGGRPLRVDNACTE-KSRMEMAALL 105
>gi|126277398|ref|XP_001369153.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Monodelphis
domestica]
Length = 433
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 201 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 244
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+KD DTG SKG+GF
Sbjct: 245 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDPDTGRSKGFGFL 303
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 304 TFSDSECARRALEQLNGFELAGRPMRVGHVT 334
>gi|119586631|gb|EAW66227.1| RNA binding motif protein 23, isoform CRA_f [Homo sapiens]
Length = 269
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 37 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 80
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYGF
Sbjct: 81 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 139
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 140 TFSDSECARRALEQLNGFELAGRPMRVGHVT 170
>gi|213408745|ref|XP_002175143.1| RNA-binding protein [Schizosaccharomyces japonicus yFS275]
gi|212003190|gb|EEB08850.1| RNA-binding protein [Schizosaccharomyces japonicus yFS275]
Length = 308
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 38/72 (52%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE Q+ ++ GT+ F LV D +T KGYGFC + DP A L
Sbjct: 9 VFVGNIPYDATEKQMADIFHQIGTVRSFKLVLDPETNQPKGYGFCEFHDPETAASAVRNL 68
Query: 357 NGLKMGDKTLTV 368
N G + L V
Sbjct: 69 NNFPFGARKLRV 80
>gi|321460847|gb|EFX71885.1| hypothetical protein DAPPUDRAFT_93311 [Daphnia pulex]
Length = 81
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY +E Q+K + G + F +V+DR+TG S+G+GFC +Q P A L
Sbjct: 4 VFVGNIPYGVSEDQLKAIFSEAGPVVSFRIVQDRETGRSRGFGFCEFQSPDSAQTAMRNL 63
Query: 357 NGLKMGDKTLTVRRAT 372
NG ++ ++L V A
Sbjct: 64 NGYELNGRSLRVDSAN 79
>gi|449460375|ref|XP_004147921.1| PREDICTED: zinc finger CCCH domain-containing protein 25-like
[Cucumis sativus]
Length = 395
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 51/89 (57%), Gaps = 2/89 (2%)
Query: 275 NLNLAAVGLASGAIGGAEGPDR--VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
N AA+G++ A A+ D VFVGG+PY TE + + +G + +L++D+ T
Sbjct: 14 NSQEAALGISEEASWHAKYKDSAYVFVGGIPYDLTEGDLLAVFAQYGEIVDVNLIRDKGT 73
Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKM 361
G SKGY F Y+D T++A LNG ++
Sbjct: 74 GKSKGYAFVAYEDQRSTNLAVDNLNGAQI 102
>gi|349604477|gb|AEQ00017.1| RNA-binding protein 39-like protein, partial [Equus caballus]
Length = 374
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 64/151 (42%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 177 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 219
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 220 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 279
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 280 TFSDSECAKKALEQLNGFELAGRPMKVGHVT 310
>gi|190402270|gb|ACE77680.1| RNA binding motif protein 39 isoform a (predicted) [Sorex araneus]
Length = 435
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 64/151 (42%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVT 327
>gi|281351269|gb|EFB26853.1| hypothetical protein PANDA_007127 [Ailuropoda melanoleuca]
Length = 334
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 56/117 (47%), Gaps = 5/117 (4%)
Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDR--VFVGGLPYYFTETQI 311
R P NP L N +G+A +E D +F+GGLPY TE I
Sbjct: 5 REPKKMNPLTKVKL---INELNEREVQLGVADKVSWHSEYKDSAWIFLGGLPYELTEGDI 61
Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
+ +G + +LV+D+ TG SKG+ F Y+D T +A NG+K+ +T+ V
Sbjct: 62 ICVFSQYGEIVNINLVRDKKTGKSKGFCFLCYEDQRSTILAVDNFNGIKIKGRTIRV 118
>gi|441628930|ref|XP_003275701.2| PREDICTED: ELAV-like protein 3 [Nomascus leucogenys]
Length = 364
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 92/206 (44%), Gaps = 16/206 (7%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+ V LP T+ + K L S G + LV+D+ TG S GYGF Y DP D A L
Sbjct: 157 LIVNYLPQNMTQDEFKSLFGSIGDIESCKLVRDKITGQSLGYGFVNYSDPNDADKAINTL 216
Query: 357 NGLKMGDKTLTVRRATASGQSKT--EQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
NGLK+ KT+ V + ++ + T E +++L A +A+ GM+ L G
Sbjct: 217 NGLKLQTKTIKVGASFSNPPNSTTLELDNLLNMAYGVKRFSPIAID--GMSGLAGVGLSG 274
Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
G A ++ +A DE + + G +G + NV + R D + G
Sbjct: 275 GAAGAGWCIFVYNLSPEA---DESV------LWQLFGPFGAVTNVKVIR-DFTTNKCKGF 324
Query: 475 GKVFLEYYDAVGCATAKNALSGRKFG 500
G V + YD A A +L+G + G
Sbjct: 325 GFVTMTNYDEAAMAIA--SLNGYRLG 348
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 39/77 (50%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+FV L E+ + +L FG + +++D T KG+GF + +A A+L
Sbjct: 283 IFVYNLSPEADESVLWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVTMTNYDEAAMAIASL 342
Query: 357 NGLKMGDKTLTVRRATA 373
NG ++G++ L V T+
Sbjct: 343 NGYRLGERVLQVSFKTS 359
>gi|341895702|gb|EGT51637.1| CBN-RNP-6 protein [Caenorhabditis brenneri]
Length = 757
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 10/102 (9%)
Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
+ V+ L +T D D +EY E ++REECGK+GT+++VVI + GV K+F
Sbjct: 665 SSVIVLRNMVTPD---DIDEYLE--GEIREECGKFGTVLDVVIAN-----FASSGVVKIF 714
Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
++Y D++ AK AL GR FGGN V A Y + + + DY+
Sbjct: 715 VKYADSMQVDRAKAALDGRFFGGNIVKAEAYDQILFDHADYT 756
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/73 (26%), Positives = 38/73 (52%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
R++VG + + E +++ + FG + ++ D TG+ K + F Y+ P +A +
Sbjct: 103 RIYVGSISFEIREDMLRKAFDPFGPIKSINMSWDPATGHHKTFAFVEYEIPEAALLAQES 162
Query: 356 LNGLKMGDKTLTV 368
+NG +G + L V
Sbjct: 163 MNGQMLGGRNLKV 175
>gi|157137809|ref|XP_001664044.1| RNA-binding protein [Aedes aegypti]
gi|108869641|gb|EAT33866.1| AAEL013869-PA [Aedes aegypti]
Length = 399
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 52/93 (55%), Gaps = 1/93 (1%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
VFVG +PY TE ++K++ G + LV DR++G KGYGFC Y+D A L
Sbjct: 16 VFVGNIPYEATEEKLKDIFCEVGPVISLKLVFDRESGKPKGYGFCEYKDQETALSAMRNL 75
Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
NG ++G + L V A + +S+ E ++L Q
Sbjct: 76 NGYEIGGRALRVDNA-CTEKSRMEMAALLQGPQ 107
>gi|27734072|ref|NP_775552.1| RNA-binding motif protein, X-linked 2 [Mus musculus]
gi|61230302|sp|Q8R0F5.1|RBMX2_MOUSE RecName: Full=RNA-binding motif protein, X-linked 2
gi|20071694|gb|AAH26976.1| RNA binding motif protein, X-linked 2 [Mus musculus]
gi|26345092|dbj|BAC36195.1| unnamed protein product [Mus musculus]
gi|74207503|dbj|BAE40004.1| unnamed protein product [Mus musculus]
Length = 326
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 43/72 (59%)
Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
+FVGGLPY TE I + +G + +LV+D+ TG SKG+ F Y+D T +A
Sbjct: 38 IFVGGLPYELTEGDIICVFSQYGEIVNINLVRDKKTGKSKGFCFLCYEDQRSTVLAVDNF 97
Query: 357 NGLKMGDKTLTV 368
NG+K+ +T+ V
Sbjct: 98 NGIKIKGRTIRV 109
>gi|119596572|gb|EAW76166.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_e [Homo
sapiens]
Length = 445
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 64/151 (42%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE V A+ L G GV + V Q S A
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYGF
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVT 327
>gi|48146631|emb|CAG33538.1| RNPC4 [Homo sapiens]
Length = 423
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 65/151 (43%), Gaps = 17/151 (11%)
Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
K A+VE ++ A+ L G GV + V+ LAA
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234
Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
+A+ G GP R++VG L + TE ++ + E FG + L+KD DTG SKGYG
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGLI 293
Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
+ D A LNG ++ + + V T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324
>gi|17510025|ref|NP_491176.1| Protein RNP-6, isoform b [Caenorhabditis elegans]
gi|373220165|emb|CCD72565.1| Protein RNP-6, isoform b [Caenorhabditis elegans]
Length = 749
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 57/97 (58%), Gaps = 6/97 (6%)
Query: 425 TEAITADALADDEEYEEILE-DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
+ I + ++ +E LE ++REECGKYG +++VVI + G+ K+F++Y D
Sbjct: 657 SNVIVLRNMVTPQDIDEFLEGEIREECGKYGNVIDVVI-----ANFASSGLVKIFVKYSD 711
Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
++ AK AL GR FGGNTV A Y + + + DY+
Sbjct: 712 SMQVDRAKAALDGRFFGGNTVKAEAYDQILFDHADYT 748
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/73 (26%), Positives = 37/73 (50%)
Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
R++VG + + E ++ + FG + ++ D TG+ K + F Y+ P +A +
Sbjct: 103 RIYVGSISFEIREDMLRRAFDPFGPIKSINMSWDPATGHHKTFAFVEYEVPEAALLAQES 162
Query: 356 LNGLKMGDKTLTV 368
+NG +G + L V
Sbjct: 163 MNGQMLGGRNLKV 175
>gi|328850276|gb|EGF99443.1| hypothetical protein MELLADRAFT_31912 [Melampsora larici-populina
98AG31]
Length = 79
Score = 65.9 bits (159), Expect = 5e-08, Method: Composition-based stats.
Identities = 33/77 (42%), Positives = 46/77 (59%), Gaps = 9/77 (11%)
Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN---------GGETPGVGKVFLEYYD 483
L DDEEY+EILED+ EEC KY + +V IPRP +N G+GKVF+++
Sbjct: 2 LVDDEEYKEILEDIIEECSKYVKIEDVKIPRPKKNQKGRIHSKASESVEGLGKVFIKFEQ 61
Query: 484 AVGCATAKNALSGRKFG 500
C A +A++GR+F
Sbjct: 62 IEDCGQALSAIAGRQFA 78
>gi|119596573|gb|EAW76167.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_f [Homo
sapiens]
Length = 423
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 48/93 (51%)
Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
A +A+ G+ GP R++VG L + TE ++ + E FG + L+ D +TG SKGYG
Sbjct: 235 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 294
Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
F + D A LNG ++ + + V T
Sbjct: 295 FITFSDSECAKKALEQLNGFELAGRPMKVGHVT 327
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.134 0.391
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,788,772,934
Number of Sequences: 23463169
Number of extensions: 397342696
Number of successful extensions: 1680132
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 13389
Number of HSP's successfully gapped in prelim test: 12927
Number of HSP's that attempted gapping in prelim test: 1410265
Number of HSP's gapped (non-prelim): 158424
length of query: 521
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 374
effective length of database: 8,910,109,524
effective search space: 3332380961976
effective search space used: 3332380961976
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 79 (35.0 bits)