BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 009954
         (521 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|388520789|gb|AFK48456.1| unknown [Lotus japonicus]
          Length = 527

 Score =  630 bits (1625), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 328/501 (65%), Positives = 389/501 (77%), Gaps = 15/501 (2%)

Query: 25  RSRTGERG--RDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRH 82
           R + GER    DRHHRD+K GG + R D+   Y+R   RD+DR   H D  +D ERRH++
Sbjct: 34  RKQDGERRDFHDRHHRDYKDGGFNGR-DRYNSYNRHRSRDYDR---HNDRVKDGERRHKY 89

Query: 83  RSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPE 142
            + S    R R  S+S S S S S+SKR SGFDMAPP+A     +AV GQ  G+      
Sbjct: 90  EAHS---KRSRGESRSPSRSPSRSESKRVSGFDMAPPSADGT--SAVSGQTLGINHLNQG 144

Query: 143 MAQNMLPFGATQ--LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMT 200
            AQN   FG +Q  +GA  LM VQ MTQQATRHARRVY+GGLPPL NEQ+IATFFS VMT
Sbjct: 145 TAQNFSLFGISQPQIGALSLMQVQPMTQQATRHARRVYIGGLPPLTNEQSIATFFSHVMT 204

Query: 201 AIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN 260
           AIGGNSAG GD+VVNVYINHEKKFAF+EMRTVEEASNAM+LDGI+FEGV+VRVRRPTDYN
Sbjct: 205 AIGGNSAGAGDSVVNVYINHEKKFAFLEMRTVEEASNAMSLDGIVFEGVSVRVRRPTDYN 264

Query: 261 PTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGT 320
           PTLAAALGP QPSP LNL+AVGL+ G IGG +G DR+FVGGLPYYF E QI+ELL++FG 
Sbjct: 265 PTLAAALGPCQPSPYLNLSAVGLSGGTIGGTDGLDRIFVGGLPYYFAEEQIRELLQAFGP 324

Query: 321 LHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
           L  FDLV+D++TGNSKGYGFC+YQDPAVTD+ACA+LNGLK+GDKTLTVRRAT SG SKTE
Sbjct: 325 LRSFDLVRDKETGNSKGYGFCIYQDPAVTDMACASLNGLKVGDKTLTVRRATVSGHSKTE 384

Query: 381 QESILAQAQQHIAIQKMALQTSGMNTLGGGM--SLFGETLAKVLCLTEAITADALADDEE 438
           QE I AQAQQ+I +QK+AL+  G+N  G     +   E+  KVLCLTEAIT D L D+ E
Sbjct: 385 QEHIFAQAQQNITMQKVALEVVGLNIPGVERVPTTIDESATKVLCLTEAITTDELMDNGE 444

Query: 439 YEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
           YEEI+EDMR+ECGK+GTL+NVVIPRP+ +G +TPG+GKVFLEY D      AK+AL+GRK
Sbjct: 445 YEEIVEDMRDECGKFGTLMNVVIPRPNPSGEQTPGIGKVFLEYSDTAASFAAKSALNGRK 504

Query: 499 FGGNTVNAFYYPEDKYFNKDY 519
           FGGN V A+YYPE+K+ N ++
Sbjct: 505 FGGNMVTAYYYPEEKFHNMEF 525


>gi|359497050|ref|XP_002267854.2| PREDICTED: splicing factor U2af large subunit B-like, partial
           [Vitis vinifera]
          Length = 410

 Score =  612 bits (1579), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 308/411 (74%), Positives = 330/411 (80%), Gaps = 45/411 (10%)

Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQAT 171
           SGFDMAPP AA+LPGAAVPG+LPGVP  VP M QNM PFGATQLGA PLMPVQ MTQQAT
Sbjct: 44  SGFDMAPPVAALLPGAAVPGELPGVPQMVPGMIQNMFPFGATQLGALPLMPVQAMTQQAT 103

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           RHARRVYVGGLPPLANEQ IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR+
Sbjct: 104 RHARRVYVGGLPPLANEQTIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRS 163

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           VEEASNAMALDGI+FEGV+VRVRRPTDYNP+LAAALGP QPSP+LNLAAVGL  G IGGA
Sbjct: 164 VEEASNAMALDGIMFEGVSVRVRRPTDYNPSLAAALGPSQPSPHLNLAAVGLMPGVIGGA 223

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTDI
Sbjct: 224 EGPDRIFVGGLPYYFTEEQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 283

Query: 352 ACAALNGLKMGDKTLTVRRATA-SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
           ACAALNGLKMGDKTLTVRRAT  SGQ+K+EQ++ILAQAQQHIAIQK+ALQ  G+N  G  
Sbjct: 284 ACAALNGLKMGDKTLTVRRATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLNLPGA- 342

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                                                      G LV+VVIPRP  NG  
Sbjct: 343 -------------------------------------------GALVHVVIPRPSPNGDL 359

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
            PGVGKVFLEY D  G ++A+NALSGRKFGGN V+A YYPEDKY++ DY A
Sbjct: 360 IPGVGKVFLEYSDTAGSSSARNALSGRKFGGNVVSAVYYPEDKYYDGDYGA 410


>gi|156070760|gb|ABU45175.1| unknown [Solanum melongena]
          Length = 553

 Score =  588 bits (1515), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 302/458 (65%), Positives = 353/458 (77%), Gaps = 26/458 (5%)

Query: 67  DRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPG 126
           DR R+ ++D+E RHRH+  S +  R          SRSPSKSKR SGFDMAPP +A+L G
Sbjct: 117 DRRRNNDKDREDRHRHKPSSRARSR----------SRSPSKSKRISGFDMAPPTSALLSG 166

Query: 127 AA-VPGQLPGVPS-AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPP 184
           A  V GQ+PG+ + ++P M  NM P  A Q GA P+MPVQ MTQQATRHARRVYVGGLPP
Sbjct: 167 ATDVAGQVPGITNPSIPGMFSNMFPVAAGQFGALPVMPVQAMTQQATRHARRVYVGGLPP 226

Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
            ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDG+
Sbjct: 227 TANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGV 286

Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
           IFEG  V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL  G+ GG EGPDR+FVGGLPY
Sbjct: 287 IFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSGGLEGPDRIFVGGLPY 346

Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
           YFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDK
Sbjct: 347 YFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTDIACAALNGIKMGDK 406

Query: 365 TLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLC 423
           TLTVRRA     Q   EQES+L  AQQ IA+Q+  LQ   + T             K+LC
Sbjct: 407 TLTVRRANQGITQPNPEQESVLLHAQQQIALQRFMLQPGALAT-------------KILC 453

Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
           LT+ ++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ NG  TPG+GKVFLEY D
Sbjct: 454 LTQVVSVDELKDDDDYQDILEDMRIECGKFGALLNVVIPRPNPNGEPTPGLGKVFLEYAD 513

Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
               + A+  L+GRKFGGN V A +YPE+K+   DY A
Sbjct: 514 VDSSSKARQGLNGRKFGGNQVIAVFYPENKFSEGDYEA 551


>gi|188998293|gb|ACD67872.1| U2 snRNP auxiliary factor large subunit [Solanum melongena]
          Length = 554

 Score =  588 bits (1515), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 302/458 (65%), Positives = 353/458 (77%), Gaps = 26/458 (5%)

Query: 67  DRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPG 126
           DR R+ ++D+E RHRH+  S +  R          SRSPSKSKR SGFDMAPP +A+L G
Sbjct: 118 DRRRNNDKDREDRHRHKPSSRARSR----------SRSPSKSKRISGFDMAPPTSALLSG 167

Query: 127 AA-VPGQLPGVPS-AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPP 184
           A  V GQ+PG+ + ++P M  NM P  A Q GA P+MPVQ MTQQATRHARRVYVGGLPP
Sbjct: 168 ATDVAGQVPGITNPSIPGMFSNMFPVAAGQFGALPVMPVQAMTQQATRHARRVYVGGLPP 227

Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
            ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDG+
Sbjct: 228 TANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGV 287

Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
           IFEG  V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL  G+ GG EGPDR+FVGGLPY
Sbjct: 288 IFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSGGLEGPDRIFVGGLPY 347

Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
           YFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDK
Sbjct: 348 YFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTDIACAALNGIKMGDK 407

Query: 365 TLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLC 423
           TLTVRRA     Q   EQES+L  AQQ IA+Q+  LQ   + T             K+LC
Sbjct: 408 TLTVRRANQGITQPNPEQESVLLHAQQQIALQRFMLQPGALAT-------------KILC 454

Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
           LT+ ++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ NG  TPG+GKVFLEY D
Sbjct: 455 LTQVVSVDELKDDDDYQDILEDMRIECGKFGALLNVVIPRPNPNGEPTPGLGKVFLEYAD 514

Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
               + A+  L+GRKFGGN V A +YPE+K+   DY A
Sbjct: 515 VDSSSKARQGLNGRKFGGNQVIAVFYPENKFSEGDYEA 552


>gi|75338884|sp|Q9ZR40.1|U2A2B_NICPL RecName: Full=Splicing factor U2af large subunit B; AltName:
           Full=NpU2AF65b; AltName: Full=U2 auxiliary factor 65 kDa
           subunit B; AltName: Full=U2 small nuclear
           ribonucleoprotein auxiliary factor large subunit B;
           Short=U2 snRNP auxiliary factor large subunit B
 gi|3850821|emb|CAA77135.1| U2 snRNP auxiliary factor, large subunit [Nicotiana
           plumbaginifolia]
          Length = 573

 Score =  584 bits (1505), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 284/410 (69%), Positives = 328/410 (80%), Gaps = 15/410 (3%)

Query: 112 SGFDMAPPAAAMLPG-AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA 170
           SGFDMAPP +AMLPG  A  GQ+PG    +P M  NM P  + Q GA P+MP+Q MTQQA
Sbjct: 175 SGFDMAPPTSAMLPGITAAAGQVPGTNPPIPGMFPNMFPLASGQFGALPVMPIQAMTQQA 234

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGLP  ANEQ++ATFFS VM+AIGGN+AGPGDAVVNVYIN+EKKFAFVEMR
Sbjct: 235 TRHARRVYVGGLPAHANEQSVATFFSHVMSAIGGNTAGPGDAVVNVYINYEKKFAFVEMR 294

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +VEEASNAMALDGIIFEG   +VRRP+DYNP+LAA LGP QP+PNLNLAAVGL+ G+ GG
Sbjct: 295 SVEEASNAMALDGIIFEGAPCKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLSPGSAGG 354

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 355 LEGPDRIFVGGLPYYFTEAQIRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDVSVTD 414

Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
           IACAALNG+KMGDKTLTVRRA   + Q K EQES+L  AQQ IA+Q++ LQ + + T   
Sbjct: 415 IACAALNGIKMGDKTLTVRRANQGTTQPKPEQESVLLHAQQQIALQRLMLQPATLAT--- 471

Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
                     KVL LTE I+AD L DDE+Y++ILEDMR ECGK+G+LVNVVIPRP  NG 
Sbjct: 472 ----------KVLSLTEVISADELNDDEDYQDILEDMRTECGKFGSLVNVVIPRPSPNGE 521

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            TPGVGKVFLEY D    + A+ +L+GRKFGGN V A +YPE+K++  DY
Sbjct: 522 PTPGVGKVFLEYADVDSSSKARQSLNGRKFGGNQVVAVFYPENKFYEGDY 571


>gi|75338883|sp|Q9ZR39.1|U2A2A_NICPL RecName: Full=Splicing factor U2af large subunit A; AltName:
           Full=NpU2AF65a; AltName: Full=U2 auxiliary factor 65 kDa
           subunit A; AltName: Full=U2 small nuclear
           ribonucleoprotein auxiliary factor large subunit A;
           Short=U2 snRNP auxiliary factor large subunit A
 gi|3850823|emb|CAA77136.1| U2 snRNP auxiliary factor, large subunit [Nicotiana
           plumbaginifolia]
          Length = 555

 Score =  584 bits (1505), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 284/412 (68%), Positives = 325/412 (78%), Gaps = 15/412 (3%)

Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA 170
           SGFDMAPP  A+LPGA    GQ+PG   A+P +  NM P  ++Q GA P+MPVQ MTQQA
Sbjct: 157 SGFDMAPPTTALLPGATDAAGQVPGTNPAIPGLFSNMFPLASSQFGALPMMPVQAMTQQA 216

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 217 TRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 276

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +VEEASNAMALDG+IFEG  V+VRRP+DYNP+LAA LGP QPSPNLNLAAVG   G+ GG
Sbjct: 277 SVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGSTPGSSGG 336

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 337 LEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTD 396

Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
           IACAALNG+KMGDKTLTVRRA   + Q   EQES+L  AQQ IA+Q+  LQ   + T   
Sbjct: 397 IACAALNGIKMGDKTLTVRRANQGTTQPNPEQESVLLHAQQQIALQRFMLQPGALAT--- 453

Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
                     KVLCLTE +T D L DD++Y++ILEDMR EC K+G LVNVVIPRP+ NG 
Sbjct: 454 ----------KVLCLTEVVTVDELNDDDDYQDILEDMRTECEKFGALVNVVIPRPNPNGV 503

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
            TPG+GKVFLEY D  G + A+  L+GRKFGGN V A +YPE+K+   DY A
Sbjct: 504 PTPGLGKVFLEYADVDGSSKARQGLNGRKFGGNQVVAVFYPENKFSEGDYEA 555


>gi|156070776|gb|ABU45190.1| unknown [Capsicum frutescens]
          Length = 551

 Score =  583 bits (1503), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 282/412 (68%), Positives = 328/412 (79%), Gaps = 15/412 (3%)

Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA 170
           SGFDMAPP +A+LPGA  V GQ+PG   ++P M  NM P  A Q GA P+MPVQ MTQQA
Sbjct: 153 SGFDMAPPTSALLPGATDVTGQVPGANPSIPGMFSNMFPLAAGQFGALPIMPVQAMTQQA 212

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 213 TRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 272

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +VEEASNAMALDG+IFEG  V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL  G+ GG
Sbjct: 273 SVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSGG 332

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 333 LEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTD 392

Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
           IACAALNG+KMGDKTLTVRRA   + Q   EQES+L  AQQ IA+Q+  LQ   + T   
Sbjct: 393 IACAALNGIKMGDKTLTVRRANQGTNQPNPEQESVLLHAQQQIALQRFMLQPGALAT--- 449

Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
                     KVLCLT+ ++ D L DD++Y++ILEDMR ECGK+G+L+NVVIPRP+ +G 
Sbjct: 450 ----------KVLCLTQVVSVDELNDDDDYQDILEDMRVECGKFGSLLNVVIPRPNPSGE 499

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
            TPG+GKVFLEY D    + A+  L+GRKFGGN V A +YPE+K+   +Y A
Sbjct: 500 PTPGLGKVFLEYADVESSSRARQGLNGRKFGGNEVIAVFYPENKFSEGEYEA 551


>gi|156070797|gb|ABU45209.1| unknown [Solanum bulbocastanum]
          Length = 558

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 282/411 (68%), Positives = 326/411 (79%), Gaps = 16/411 (3%)

Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPS-AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
           SGFDMAPP +A+L GA  V GQ+PG  + ++P M  NM P  A Q GA P+MPVQ MTQQ
Sbjct: 159 SGFDMAPPTSALLSGATDVAGQVPGTTNPSIPGMFSNMFPLAAGQFGALPIMPVQAMTQQ 218

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           ATRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEM
Sbjct: 219 ATRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEM 278

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R+VEEASNAMALDG++FEG  V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL  G+ G
Sbjct: 279 RSVEEASNAMALDGVVFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSG 338

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VT
Sbjct: 339 GLEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVT 398

Query: 350 DIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           DIACAALNG+KMGDKTLTVRRA   + Q   EQES+L  AQQ IA+Q+  LQ   + T  
Sbjct: 399 DIACAALNGIKMGDKTLTVRRANQGTTQPNPEQESVLLHAQQQIALQRFMLQPGALAT-- 456

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
                      KVLCLTE ++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ NG
Sbjct: 457 -----------KVLCLTEVVSVDELKDDDDYQDILEDMRIECGKFGALLNVVIPRPNPNG 505

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             TPG+GKVFLEY D    + A+  L+GRKFGGN V A +YPE+K+   DY
Sbjct: 506 EPTPGLGKVFLEYADVDSSSKARQGLNGRKFGGNQVIAVFYPENKFSEGDY 556


>gi|297840477|ref|XP_002888120.1| hypothetical protein ARALYDRAFT_475241 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297333961|gb|EFH64379.1| hypothetical protein ARALYDRAFT_475241 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 589

 Score =  578 bits (1489), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 284/411 (69%), Positives = 324/411 (78%), Gaps = 17/411 (4%)

Query: 113 GFDMAPPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPF-GATQLGAFPLMPVQVMTQQ 169
           GFDMAPP        A  GQ+P VP+   +P M  NM P     QLGA P++PVQ MTQQ
Sbjct: 190 GFDMAPPDMLAATAVAAAGQVPSVPTTATIPGMFPNMFPMVPGQQLGALPVLPVQAMTQQ 249

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           ATRHARRVYVGGLPP ANEQ++ATFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVEM
Sbjct: 250 ATRHARRVYVGGLPPTANEQSVATFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVEM 309

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R+VEEASNAMALDGII EGV V+VRRPTDYNP+LAA LGP QP+PNLNLAAVGL+SG+ G
Sbjct: 310 RSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLAAVGLSSGSTG 369

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G EGPDR+FVGGLPYYFTE QI+ELLESFG L GF+LVKDR+TGNSKGY FCVYQDP+VT
Sbjct: 370 GLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAFCVYQDPSVT 429

Query: 350 DIACAALNGLKMGDKTLTVRRATASG-QSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           DIACAALNG+KMGDKTLTVRRA     Q K EQE +L  AQQ IA+Q++ LQ  G     
Sbjct: 430 DIACAALNGIKMGDKTLTVRRAIQGVIQPKPEQEEVLLHAQQQIALQRLMLQPGG----- 484

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
                   T  K++CLT+ +TAD L DDEEY +I+EDMR+E GK+G LVNVVIPRP+ + 
Sbjct: 485 --------TPTKIVCLTQVVTADDLRDDEEYADIMEDMRQEGGKFGNLVNVVIPRPNPDH 536

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             TPGVGKVFLEY D  G + A++ ++GRKFGGN V A YYPEDKY   DY
Sbjct: 537 DPTPGVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYLQGDY 587


>gi|356509477|ref|XP_003523474.1| PREDICTED: splicing factor U2af large subunit B-like [Glycine max]
          Length = 600

 Score =  577 bits (1486), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 286/415 (68%), Positives = 324/415 (78%), Gaps = 20/415 (4%)

Query: 112 SGFDMAPPAAAMLPGA-AVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMT 167
           SGFDMAPPA+AML GA AV GQ+ G    +P M  NM P   +Q+    A P+MPVQ MT
Sbjct: 197 SGFDMAPPASAMLTGASAVAGQITGANPTIPGMFPNMFPLATSQMQQFSALPVMPVQAMT 256

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ++ATFFSQVM  IGGN+AGPGDAVVNVYINH+KKFAFV
Sbjct: 257 QQATRHARRVYVGGLPPTANEQSVATFFSQVMAKIGGNTAGPGDAVVNVYINHDKKFAFV 316

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAMALDGIIFEG  V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL  G+
Sbjct: 317 EMRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLTPGS 376

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG +GPDRVFVGGLPYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD A
Sbjct: 377 AGGLDGPDRVFVGGLPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLA 436

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASG---QSKTEQESILAQAQQHIAIQKMALQTSGM 404
           VTDIACAALNG+KMGDKTLTVRRA       Q K EQESIL  AQQ IA+QK+ LQ + +
Sbjct: 437 VTDIACAALNGIKMGDKTLTVRRANQGANPQQPKPEQESILMHAQQQIALQKLMLQPALV 496

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
            T             KV+CLT A+++D L DDE+YEEIL+DMR+EC K+GTLVNVVIPRP
Sbjct: 497 AT-------------KVVCLTHAVSSDELKDDEDYEEILDDMRQECSKFGTLVNVVIPRP 543

Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             +G    GVGKVFLEY D  G   A+  L+GRKF GN V A +YPE+K+   DY
Sbjct: 544 PSDGEPAAGVGKVFLEYVDIDGATKARAGLNGRKFDGNQVVAVFYPENKFAQGDY 598


>gi|168028774|ref|XP_001766902.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681881|gb|EDQ68304.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 486

 Score =  577 bits (1486), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 287/411 (69%), Positives = 323/411 (78%), Gaps = 21/411 (5%)

Query: 110 RRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQ 168
           + SGFDMAPP A +     V GQ+PG+P A+P +   M PFG TQ  G  P MP Q MTQ
Sbjct: 96  KTSGFDMAPPGATV-----VAGQIPGMPPAMPGVFPAMFPFGGTQQFGGLPGMPAQAMTQ 150

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGLPP+ANEQ IAT+FSQVM A+GGN+AGPGDAVVNVYIN EKKFAFVE
Sbjct: 151 QATRHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFVE 210

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MRTVEEASNAMALDGIIFEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL  GA 
Sbjct: 211 MRTVEEASNAMALDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAA 270

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GGA+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP+V
Sbjct: 271 GGADGPDRIFVGGLPYYLTEVQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPSV 330

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
            DIACA LNG+KM DKTL VRRATASGQ K +Q ++LA AQQ IAIQK+ALQ        
Sbjct: 331 VDIACATLNGMKMDDKTLNVRRATASGQPKPDQANVLAHAQQQIAIQKLALQA------- 383

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
                  +T  KV+ LTE +T + L DDEEY+EI+EDM  ECGKYGTLVN VIPRP ++G
Sbjct: 384 -------KTPTKVVALTEVVTPNQLEDDEEYQEIMEDMGTECGKYGTLVNCVIPRP-RSG 435

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
              PG+GKVFLEY D  G + AK +L GR+F  N V A YYPEDK+   DY
Sbjct: 436 ENVPGLGKVFLEYSDIAGASKAKASLHGRRFDENLVVAVYYPEDKFAAGDY 486


>gi|30696485|ref|NP_176287.3| Splicing factor U2af large subunit B [Arabidopsis thaliana]
 gi|209572798|sp|Q8L716.2|U2A2B_ARATH RecName: Full=Splicing factor U2af large subunit B; AltName:
           Full=U2 auxiliary factor 65 kDa subunit B; AltName:
           Full=U2 small nuclear ribonucleoprotein auxiliary factor
           large subunit B; Short=U2 snRNP auxiliary factor large
           subunit B
 gi|332195625|gb|AEE33746.1| Splicing factor U2af large subunit B [Arabidopsis thaliana]
          Length = 589

 Score =  573 bits (1477), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 282/411 (68%), Positives = 322/411 (78%), Gaps = 17/411 (4%)

Query: 113 GFDMAPPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPF-GATQLGAFPLMPVQVMTQQ 169
           GFDMAPP        A  GQ+P VP+   +P M  NM P     QLGA P++PVQ MTQQ
Sbjct: 190 GFDMAPPDMLAATAVAAAGQVPSVPTTATIPGMFSNMFPMVPGQQLGALPVLPVQAMTQQ 249

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           ATRHARRVYVGGLPP ANEQ+++TFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVEM
Sbjct: 250 ATRHARRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVEM 309

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R+VEEASNAMALDGII EGV V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL+SG+ G
Sbjct: 310 RSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLSSGSTG 369

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G EGPDR+FVGGLPYYFTE QI+ELLESFG L GF+LVKDR+TGNSKGY FCVYQDP+VT
Sbjct: 370 GLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAFCVYQDPSVT 429

Query: 350 DIACAALNGLKMGDKTLTVRRATASG-QSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           DIACAALNG+KMGDKTLTVRRA     Q K EQE +L  AQQ IA+Q++  Q  G     
Sbjct: 430 DIACAALNGIKMGDKTLTVRRAIQGAIQPKPEQEEVLLYAQQQIALQRLMFQPGG----- 484

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
                   T  K++CLT+ +TAD L DDEEY EI+EDMR+E GK+G LVNVVIPRP+ + 
Sbjct: 485 --------TPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFGNLVNVVIPRPNPDH 536

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             TPGVGKVFLEY D  G + A++ ++GRKFGGN V A YYPEDKY   DY
Sbjct: 537 DPTPGVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDY 587


>gi|357455533|ref|XP_003598047.1| Splicing factor U2af large subunit B [Medicago truncatula]
 gi|355487095|gb|AES68298.1| Splicing factor U2af large subunit B [Medicago truncatula]
          Length = 626

 Score =  573 bits (1477), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 281/414 (67%), Positives = 322/414 (77%), Gaps = 19/414 (4%)

Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMTQ 168
           SGFDMAPP +A+L    V GQ+ G   A+P M  NM P    Q+    A P++PVQ MTQ
Sbjct: 224 SGFDMAPPTSAILGATGVAGQITGASPAIPGMFPNMFPLPTNQVQPFSALPVLPVQAMTQ 283

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGL P ANEQ++ATFFSQVM  IGGN+AGPGDAVVNVYINH+KKFAFVE
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVE 343

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MR+VEEASNAMALDGIIFEG  V+VRRPTDYNP+LAAALGP QP+PNLNL  VGL+ G+ 
Sbjct: 344 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 403

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GG +GPDR+FVGG+PYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD AV
Sbjct: 404 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLAV 463

Query: 349 TDIACAALNGLKMGDKTLTVRRA---TASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
           TDIACAALNG+KMGDKTLTVRRA   T   Q K EQESIL  AQQ IA+QK+ LQ + + 
Sbjct: 464 TDIACAALNGIKMGDKTLTVRRANQNTNPMQPKPEQESILMHAQQQIALQKLMLQPALVA 523

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
           T             KVLCLT A++ D L DDE+YEEIL+DMR+EC K+G LVNVVIPRP 
Sbjct: 524 T-------------KVLCLTHAVSPDELKDDEDYEEILDDMRQECSKFGNLVNVVIPRPR 570

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +G   PGVGKVFLEY D  G   A++ L+GRKFGGN V A +YPE+K+   DY
Sbjct: 571 PDGELCPGVGKVFLEYADVDGSTKARSGLNGRKFGGNQVIAVFYPENKFAQGDY 624


>gi|356517814|ref|XP_003527581.1| PREDICTED: splicing factor U2af large subunit A-like [Glycine max]
          Length = 605

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 284/415 (68%), Positives = 323/415 (77%), Gaps = 20/415 (4%)

Query: 112 SGFDMAPPAAAMLPGA-AVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMT 167
           SGFDMAPPA+AML GA AV GQ+ G    +P M  NM P    Q+    A P+MPVQ MT
Sbjct: 202 SGFDMAPPASAMLAGASAVAGQITGANPTIPGMFPNMFPLATNQMQQFSALPVMPVQAMT 261

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ++ATFFSQVM  IGGN+AGPGDAVVNVYINH+KKFAFV
Sbjct: 262 QQATRHARRVYVGGLPPTANEQSVATFFSQVMAKIGGNTAGPGDAVVNVYINHDKKFAFV 321

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAMALDGIIFEG  V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL  G+
Sbjct: 322 EMRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLTPGS 381

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG +GPDR+FVGGLPYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD A
Sbjct: 382 AGGLDGPDRIFVGGLPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLA 441

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASG---QSKTEQESILAQAQQHIAIQKMALQTSGM 404
           VTDIACAALNG+KMGDKTLTVRRA       Q K EQESIL  AQQ IA+QK+ LQ + +
Sbjct: 442 VTDIACAALNGIKMGDKTLTVRRANQGANPQQPKPEQESILMHAQQQIALQKLMLQPALV 501

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
            T             KV+CLT A+++D L DDE+Y+EIL+DMR+EC K+GTLVNVVIPRP
Sbjct: 502 AT-------------KVVCLTHAVSSDELKDDEDYDEILDDMRQECSKFGTLVNVVIPRP 548

Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             +G    GVGKVFLEY D  G   A+  L+GRKF GN V A +YPE+K+   DY
Sbjct: 549 PPDGEPAAGVGKVFLEYVDIDGATKARAGLNGRKFDGNQVVAVFYPENKFAQGDY 603


>gi|156070781|gb|ABU45194.1| unknown [Petunia integrifolia subsp. inflata]
          Length = 557

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 279/412 (67%), Positives = 323/412 (78%), Gaps = 19/412 (4%)

Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA 170
           SGFDMAPP +A+LPGA    GQ+PG   ++P M  NM P  + Q GA P+MP+Q MTQQA
Sbjct: 162 SGFDMAPPTSALLPGATDTAGQVPGASPSIPGMFSNMFPLTSGQFGALPVMPIQAMTQQA 221

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 222 TRHARRVYVGGLPPSANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 281

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +VEEASNAMALDG+IFEG  V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL  G+ GG
Sbjct: 282 SVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGSSGG 341

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 342 LEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVSVTD 401

Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
           IACAALNG+KMGDKTLTVRRA   + Q   EQES+L  AQQ IA+QK   Q   + T   
Sbjct: 402 IACAALNGIKMGDKTLTVRRANQGTPQPNPEQESVLLHAQQQIALQKFMFQPGALAT--- 458

Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
                     KVLCLT+A++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ NG 
Sbjct: 459 ----------KVLCLTQAVSVDELNDDDDYQDILEDMRTECGKFGALLNVVIPRPNPNGE 508

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
            TPG+GK    Y D  G + A+  L+GRKFGGN V A +YPE+K+   DY A
Sbjct: 509 PTPGIGK----YADVDGSSKARQGLNGRKFGGNQVVAVFYPENKFSEGDYEA 556


>gi|22655131|gb|AAM98156.1| putative U2 snRNP auxiliary factor [Arabidopsis thaliana]
          Length = 589

 Score =  570 bits (1469), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 281/411 (68%), Positives = 321/411 (78%), Gaps = 17/411 (4%)

Query: 113 GFDMAPPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPF-GATQLGAFPLMPVQVMTQQ 169
           GFDMAPP        A  GQ+P VP+   +P M  NM P     QLGA P++PVQ MTQQ
Sbjct: 190 GFDMAPPDMLAATAVAAAGQVPSVPTTATIPGMFSNMFPMVPGQQLGALPVLPVQAMTQQ 249

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           ATRHA RVYVGGLPP ANEQ+++TFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVEM
Sbjct: 250 ATRHAPRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVEM 309

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R+VEEASNAMALDGII EGV V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL+SG+ G
Sbjct: 310 RSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLSSGSTG 369

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G EGPDR+FVGGLPYYFTE QI+ELLESFG L GF+LVKDR+TGNSKGY FCVYQDP+VT
Sbjct: 370 GLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAFCVYQDPSVT 429

Query: 350 DIACAALNGLKMGDKTLTVRRATASG-QSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           DIACAALNG+KMGDKTLTVRRA     Q K EQE +L  AQQ IA+Q++  Q  G     
Sbjct: 430 DIACAALNGIKMGDKTLTVRRAIQGAIQPKPEQEEVLLYAQQQIALQRLMFQPGG----- 484

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
                   T  K++CLT+ +TAD L DDEEY EI+EDMR+E GK+G LVNVVIPRP+ + 
Sbjct: 485 --------TPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFGNLVNVVIPRPNPDH 536

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             TPGVGKVFLEY D  G + A++ ++GRKFGGN V A YYPEDKY   DY
Sbjct: 537 DPTPGVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDY 587


>gi|224134362|ref|XP_002327819.1| predicted protein [Populus trichocarpa]
 gi|222836904|gb|EEE75297.1| predicted protein [Populus trichocarpa]
          Length = 541

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 284/426 (66%), Positives = 327/426 (76%), Gaps = 23/426 (5%)

Query: 108 SKRRSGFDMAPPAAAMLPGAAVP------------GQLPGVPSAVPEMAQNMLPFGATQ- 154
           SKR SGFDMAPP++A+LP AA              GQ+ G    +P M  NM P G +Q 
Sbjct: 123 SKRMSGFDMAPPSSAILPNAAAAAAAAAAASAAASGQIAGTTPPIPGMFPNMFPLGTSQQ 182

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
            GA P+MPVQ MTQQATRHARRVYVGGLPP ANEQ++ATFFSQVM AIGGN+AGPGDAVV
Sbjct: 183 FGALPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVV 242

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
           NVYINHEKKFAFVEMR+VEEASNAMALDGIIFEG  V+VRRP+DYNP+LAA LGP QP+P
Sbjct: 243 NVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNP 302

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           NLNL+AVGLA G+ GG EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGN
Sbjct: 303 NLNLSAVGLAPGSAGGLEGPDRIFVGGLPYYFTESQIRELLESFGPLRGFDLVKDRETGN 362

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIA 393
           SKGY FCVYQD +VTDIACAALNG+KMGDKTLTVRRA   + Q K EQE++L  AQQ IA
Sbjct: 363 SKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQGTNQPKPEQENVLLHAQQQIA 422

Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           +Q++ LQ                   KV+CLT+ +T D L DD+EYE+ILEDMR E GK+
Sbjct: 423 LQRLMLQPQPQQQ---------PVPTKVVCLTQVVTGDELKDDDEYEDILEDMRTEAGKF 473

Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
           G LVNVVIPRP  NG   PGVGKVFLEY D  G + A+  ++GRKF GN V A +YPE+K
Sbjct: 474 GLLVNVVIPRPRPNGENAPGVGKVFLEYADTEGSSKARAGMNGRKFDGNQVVAVFYPENK 533

Query: 514 YFNKDY 519
           +   +Y
Sbjct: 534 FSQGEY 539


>gi|12323333|gb|AAG51641.1|AC018908_7 putative U2 snRNP auxiliary factor; 19096-22891 [Arabidopsis
           thaliana]
          Length = 568

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 284/412 (68%), Positives = 324/412 (78%), Gaps = 18/412 (4%)

Query: 113 GFDMAPP-AAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPF-GATQLGAFPLMPVQVMTQ 168
           GFDMAPP   A    AA  GQ+P VP+   +P M  NM P     QLGA P++PVQ MTQ
Sbjct: 168 GFDMAPPDMLAATAVAAAAGQVPSVPTTATIPGMFSNMFPMVPGQQLGALPVLPVQAMTQ 227

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGLPP ANEQ+++TFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVE
Sbjct: 228 QATRHARRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVE 287

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MR+VEEASNAMALDGII EGV V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL+SG+ 
Sbjct: 288 MRSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLSSGST 347

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GG EGPDR+FVGGLPYYFTE QI+ELLESFG L GF+LVKDR+TGNSKGY FCVYQDP+V
Sbjct: 348 GGLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGNSKGYAFCVYQDPSV 407

Query: 349 TDIACAALNGLKMGDKTLTVRRATASG-QSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           TDIACAALNG+KMGDKTLTVRRA     Q K EQE +L  AQQ IA+Q++  Q  G    
Sbjct: 408 TDIACAALNGIKMGDKTLTVRRAIQGAIQPKPEQEEVLLYAQQQIALQRLMFQPGG---- 463

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
                    T  K++CLT+ +TAD L DDEEY EI+EDMR+E GK+G LVNVVIPRP+ +
Sbjct: 464 ---------TPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFGNLVNVVIPRPNPD 514

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
              TPGVGKVFLEY D  G + A++ ++GRKFGGN V A YYPEDKY   DY
Sbjct: 515 HDPTPGVGKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDY 566


>gi|357455535|ref|XP_003598048.1| Splicing factor U2af large subunit B [Medicago truncatula]
 gi|355487096|gb|AES68299.1| Splicing factor U2af large subunit B [Medicago truncatula]
          Length = 629

 Score =  568 bits (1463), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 281/417 (67%), Positives = 322/417 (77%), Gaps = 22/417 (5%)

Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMTQ 168
           SGFDMAPP +A+L    V GQ+ G   A+P M  NM P    Q+    A P++PVQ MTQ
Sbjct: 224 SGFDMAPPTSAILGATGVAGQITGASPAIPGMFPNMFPLPTNQVQPFSALPVLPVQAMTQ 283

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGL P ANEQ++ATFFSQVM  IGGN+AGPGDAVVNVYINH+KKFAFVE
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVE 343

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MR+VEEASNAMALDGIIFEG  V+VRRPTDYNP+LAAALGP QP+PNLNL  VGL+ G+ 
Sbjct: 344 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 403

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GG +GPDR+FVGG+PYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD AV
Sbjct: 404 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLAV 463

Query: 349 TDIACAALNGLKMGDKTLTVRRA---TASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
           TDIACAALNG+KMGDKTLTVRRA   T   Q K EQESIL  AQQ IA+QK+ LQ + + 
Sbjct: 464 TDIACAALNGIKMGDKTLTVRRANQNTNPMQPKPEQESILMHAQQQIALQKLMLQPALVA 523

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG---TLVNVVIP 462
           T             KVLCLT A++ D L DDE+YEEIL+DMR+EC K+G    LVNVVIP
Sbjct: 524 T-------------KVLCLTHAVSPDELKDDEDYEEILDDMRQECSKFGNICNLVNVVIP 570

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           RP  +G   PGVGKVFLEY D  G   A++ L+GRKFGGN V A +YPE+K+   DY
Sbjct: 571 RPRPDGELCPGVGKVFLEYADVDGSTKARSGLNGRKFGGNQVIAVFYPENKFAQGDY 627


>gi|357455537|ref|XP_003598049.1| Splicing factor U2af large subunit B [Medicago truncatula]
 gi|355487097|gb|AES68300.1| Splicing factor U2af large subunit B [Medicago truncatula]
          Length = 627

 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 281/415 (67%), Positives = 321/415 (77%), Gaps = 20/415 (4%)

Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQQA 170
           SGFDMAPP +A+L    V GQ+ G   A+P M  NM P    Q   A P++PVQ MTQQA
Sbjct: 224 SGFDMAPPTSAILGATGVAGQITGASPAIPGMFPNMFPLPTNQPFSALPVLPVQAMTQQA 283

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGL P ANEQ++ATFFSQVM  IGGN+AGPGDAVVNVYINH+KKFAFVEMR
Sbjct: 284 TRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVEMR 343

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +VEEASNAMALDGIIFEG  V+VRRPTDYNP+LAAALGP QP+PNLNL  VGL+ G+ GG
Sbjct: 344 SVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSAGG 403

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            +GPDR+FVGG+PYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD AVTD
Sbjct: 404 LDGPDRIFVGGVPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLAVTD 463

Query: 351 IACAALNGLKMGDKTLTVRRA---TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           IACAALNG+KMGDKTLTVRRA   T   Q K EQESIL  AQQ IA+QK+ LQ + + T 
Sbjct: 464 IACAALNGIKMGDKTLTVRRANQNTNPMQPKPEQESILMHAQQQIALQKLMLQPALVAT- 522

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG---TLVNVVIPRP 464
                       KVLCLT A++ D L DDE+YEEIL+DMR+EC K+G    LVNVVIPRP
Sbjct: 523 ------------KVLCLTHAVSPDELKDDEDYEEILDDMRQECSKFGNICNLVNVVIPRP 570

Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             +G   PGVGKVFLEY D  G   A++ L+GRKFGGN V A +YPE+K+   DY
Sbjct: 571 RPDGELCPGVGKVFLEYADVDGSTKARSGLNGRKFGGNQVIAVFYPENKFAQGDY 625


>gi|449496757|ref|XP_004160219.1| PREDICTED: splicing factor U2af large subunit B-like [Cucumis
           sativus]
          Length = 587

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 282/412 (68%), Positives = 321/412 (77%), Gaps = 17/412 (4%)

Query: 113 GFDMAPPAAAMLPGA-AVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQQA 170
           GFDMAPP  A+L GA A  GQ+PG   A+P M   M P    Q  GA P+MPVQ MTQQA
Sbjct: 190 GFDMAPPTTAILSGATAAAGQIPGTTPAIPGMFPTMFPLATGQPFGALPVMPVQAMTQQA 249

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGLPP ANEQ++ATFFSQVM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 250 TRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 309

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +VEEASNAMALDGIIFEG  V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL  G+ GG
Sbjct: 310 SVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGG 369

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 370 LEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLSVTD 429

Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
           IACAALNG+KMGDKTLTVRRA   + Q K EQES+L  AQQ IA+QK+ LQ   ++T   
Sbjct: 430 IACAALNGIKMGDKTLTVRRANQGANQPKPEQESVLLHAQQQIALQKLMLQPGAVST--- 486

Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
                     KVLCLT+ +T + L +DE+YE+I+EDMR E GK+GTLVNVVIPRP  N  
Sbjct: 487 ----------KVLCLTQVVTPEELINDEDYEDIMEDMRGEGGKFGTLVNVVIPRPRPNEA 536

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
             PGVGKVFLEY D      A+  L+GRKFGGN V A +YPE+K+   +Y A
Sbjct: 537 -APGVGKVFLEYADIDSATKARAGLNGRKFGGNQVMAVFYPENKFAQGEYDA 587


>gi|449441167|ref|XP_004138355.1| PREDICTED: splicing factor U2af large subunit B-like [Cucumis
           sativus]
          Length = 587

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 282/412 (68%), Positives = 321/412 (77%), Gaps = 17/412 (4%)

Query: 113 GFDMAPPAAAMLPGA-AVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQQA 170
           GFDMAPP  A+L GA A  GQ+PG   A+P M   M P    Q  GA P+MPVQ MTQQA
Sbjct: 190 GFDMAPPTTAILSGATAAAGQIPGTTPAIPGMFPTMFPLATGQPFGALPVMPVQAMTQQA 249

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGLPP ANEQ++ATFFSQVM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR
Sbjct: 250 TRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVVNVYINHEKKFAFVEMR 309

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +VEEASNAMALDGIIFEG  V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL  G+ GG
Sbjct: 310 SVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGG 369

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTD
Sbjct: 370 LEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLSVTD 429

Query: 351 IACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
           IACAALNG+KMGDKTLTVRRA   + Q K EQES+L  AQQ IA+QK+ LQ   ++T   
Sbjct: 430 IACAALNGIKMGDKTLTVRRANQGANQPKPEQESVLLHAQQQIALQKLMLQPGAVST--- 486

Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
                     KVLCLT+ +T + L +DE+YE+I+EDMR E GK+GTLVNVVIPRP  N  
Sbjct: 487 ----------KVLCLTQVVTPEELINDEDYEDIMEDMRGEGGKFGTLVNVVIPRPRPNEA 536

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
             PGVGKVFLEY D      A+  L+GRKFGGN V A +YPE+K+   +Y A
Sbjct: 537 -APGVGKVFLEYADIDSATKARAGLNGRKFGGNQVMAVFYPENKFAQGEYDA 587


>gi|297798226|ref|XP_002866997.1| hypothetical protein ARALYDRAFT_490965 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312833|gb|EFH43256.1| hypothetical protein ARALYDRAFT_490965 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 567

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 312/539 (57%), Positives = 358/539 (66%), Gaps = 65/539 (12%)

Query: 35  RHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKER---RHRHRSRSHSSDR 91
           R H    S   DR R+K    DRE   +  R  R RD  + KER   + R R R H S R
Sbjct: 42  RDHERETSRSKDREREKGRDRDRERDSEVSRRSRDRDGEKGKERSREKDRDRERHHRSSR 101

Query: 92  FRNRSKSLSPSRS-------------------------------------------PSKS 108
            R+ S+  S  R                                            PSKS
Sbjct: 102 HRDHSRDRSERRERGGRDDDDYRRSRDRDHDRRRDDRGGRRIRRSRSRSKDRSERSPSKS 161

Query: 109 -KRRSGFDMAPP-AAAMLPGAAVPGQLPGVPSAVP--EMAQNMLPFGATQ-LGAFPLMPV 163
            KR SGFDMAPP +A +  GAAV GQ+P  P  +P   M  NM P    Q  G   +MP+
Sbjct: 162 NKRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPI 221

Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
           Q MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKK
Sbjct: 222 QAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKK 281

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           FAFVEMR+VEEASNAM+LDGIIFEG  V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL
Sbjct: 282 FAFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGL 341

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
             GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVY
Sbjct: 342 TPGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGALKGFDLVKDRETGNSKGYAFCVY 401

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           QD +VTDIACAALNG+KMGDKTLTVRRA   + Q K EQES+L  AQQ IA Q++ LQ  
Sbjct: 402 QDLSVTDIACAALNGIKMGDKTLTVRRANQGTMQPKPEQESVLLHAQQQIAFQRIMLQPG 461

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
            M T              V+CLT+ +T D L DDEEYE+I+EDMR+E GK+G L NVVIP
Sbjct: 462 VMATT-------------VVCLTQVVTEDELRDDEEYEDIMEDMRQEGGKFGALTNVVIP 508

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           RP  NG   PG+GKVFL+Y D  G   A+  ++GRKFGGN V A YYPEDK+   DY A
Sbjct: 509 RPSPNGEPVPGLGKVFLKYADTDGSTRARTGMNGRKFGGNEVVAVYYPEDKFEQGDYGA 567


>gi|168021052|ref|XP_001763056.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685868|gb|EDQ72261.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 503

 Score =  564 bits (1453), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 282/414 (68%), Positives = 330/414 (79%), Gaps = 14/414 (3%)

Query: 109 KRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMT 167
           +++SGFDMAPP AA++ GAA+ GQ+PG+   +P +   M PFG TQ  G  P MP Q MT
Sbjct: 99  RKQSGFDMAPPGAAVVSGAALAGQIPGIAQPMPGVYPGMFPFGGTQQFGGIPGMPAQAMT 158

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPPLANEQ IAT+FSQVM A+GGN+AGPGDAVVNVYIN EKKFAFV
Sbjct: 159 QQATRHARRVYVGGLPPLANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFV 218

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG- 286
           EMRTVEEASNAMALDGI+FEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL  G 
Sbjct: 219 EMRTVEEASNAMALDGIMFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGN 278

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
           A GGA+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP
Sbjct: 279 AAGGADGPDRIFVGGLPYYLTEIQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDP 338

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
           +V DIACA LNG+KM DKTL VRRATA   + K +Q ++LA AQQ IAIQ   L  S M+
Sbjct: 339 SVVDIACATLNGMKMDDKTLNVRRATARLARPKPDQANVLAHAQQQIAIQ--VLVYSWMS 396

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
            +        ET   V+ LT+ ++ D L DDEEY++ILEDM+EECGKYG LV +VIPRP 
Sbjct: 397 PV--------ETPTNVVALTQVVSPDELKDDEEYQDILEDMKEECGKYGNLVKLVIPRP- 447

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           ++G + PGVGKVF+EY D  G A AK +L GR+FGG++V A YYP +K+  +DY
Sbjct: 448 RDGEDVPGVGKVFVEYSDTAGAAKAKASLHGRRFGGHSVVAVYYPAEKFSIEDY 501


>gi|168026451|ref|XP_001765745.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682922|gb|EDQ69336.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 491

 Score =  562 bits (1448), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 289/408 (70%), Positives = 331/408 (81%), Gaps = 15/408 (3%)

Query: 113 GFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQVMTQQAT 171
           GFDMAPP AA++ GAAVPGQLPG+   +P +   M PFG TQ  G  P MP Q MTQQAT
Sbjct: 96  GFDMAPPGAAVIAGAAVPGQLPGMAQPMPGVFPGMFPFGGTQQFGGIPGMPAQAMTQQAT 155

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           RHARRVYVGGLPP+ANEQ IAT+FSQVM A+GGN+AGPGDAVVNVYIN EKKFAFVEMRT
Sbjct: 156 RHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFVEMRT 215

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           VEEASNAMALDGI+FEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL  GA GGA
Sbjct: 216 VEEASNAMALDGIMFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAAGGA 275

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           +GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTD+
Sbjct: 276 DGPDRIFVGGLPYYLTEIQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTDV 335

Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGM 411
           A AALNGLKMGDKTL+VRRA+ASGQ K +Q ++LA AQQ IAIQ        M+ L    
Sbjct: 336 AIAALNGLKMGDKTLSVRRASASGQPKPDQANVLAHAQQQIAIQVFW-----MSPL---- 386

Query: 412 SLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGET 471
               ET  KV+ LT+ ++ D L DDEEY++ILEDM+EECGKYG L+ VVIPRP ++G + 
Sbjct: 387 ----ETSTKVVALTQVVSPDELKDDEEYQDILEDMKEECGKYGNLLRVVIPRP-RDGEDV 441

Query: 472 PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PGVGKVF+EY D  G A AK +L GR+FGG++V A YYPE+K+   DY
Sbjct: 442 PGVGKVFVEYSDTAGAAKAKASLHGRRFGGHSVVAVYYPEEKFAAGDY 489


>gi|147840634|emb|CAN68321.1| hypothetical protein VITISV_032193 [Vitis vinifera]
          Length = 565

 Score =  561 bits (1445), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 283/413 (68%), Positives = 323/413 (78%), Gaps = 15/413 (3%)

Query: 109 KRRSGFDMAPPAAAMLPGAAVP-GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMT 167
           KR SGFDMAPPA+AML GAA   GQ+PG    +P M  NM P  + Q GA P+MPVQ MT
Sbjct: 164 KRVSGFDMAPPASAMLAGAAAAAGQIPGTTPTIPGMFPNMFPLASGQFGALPVMPVQAMT 223

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGL P ANEQ++ATFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFV
Sbjct: 224 QQATRHARRVYVGGLSPTANEQSVATFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFV 283

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAMALDGIIFEG  V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL  G+
Sbjct: 284 EMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGS 343

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +
Sbjct: 344 AGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLS 403

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q K EQE++L  AQQ IA+Q++  Q   + T
Sbjct: 404 VTDIACAALNGIKMGDKTLTVRRANQGASQPKPEQENVLLHAQQQIALQRLMFQPGALAT 463

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
                        KV+CLT+ + AD L DDE YE+I+EDMR E GK+G LVNV IPRP  
Sbjct: 464 -------------KVVCLTQVVNADELQDDEAYEDIVEDMRIEGGKFGNLVNVAIPRPKP 510

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           NG  TPG+GKVFLEY D  G   A+  L+GRKF GN V A +YPE+K+   +Y
Sbjct: 511 NGEPTPGLGKVFLEYADIDGAXKARTGLNGRKFDGNQVVAVFYPENKFSQGEY 563


>gi|338762830|gb|AEI98617.1| hypothetical protein 111O18.4 [Coffea canephora]
          Length = 570

 Score =  560 bits (1444), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 287/443 (64%), Positives = 327/443 (73%), Gaps = 21/443 (4%)

Query: 78  RRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVP 137
           +RHR RSR   ++                  KR SGFDMAPP   ++ GA    Q+ G  
Sbjct: 146 QRHRSRSREGRAEHRSRSRSRSR------SKKRISGFDMAPPTNPLMTGATSLPQVTGAA 199

Query: 138 SAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQ 197
            AVP +  NM      QLGA P+MPVQ MTQQATRHARRVYVGGLPP ANEQ++ATFFS 
Sbjct: 200 PAVPGVFPNMFSLPTGQLGALPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSH 259

Query: 198 VMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPT 257
           VM+AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDGIIFEG  V+VRRP+
Sbjct: 260 VMSAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPS 319

Query: 258 DYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLES 317
           DYNP+LAA LGP QP+PNLNLAAVGL  G+ GG EGPDR+FVGGLPYYFTE QI+ELLES
Sbjct: 320 DYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEGQIRELLES 379

Query: 318 FGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQ 376
           FG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDKTLTVRRA     Q
Sbjct: 380 FGPLRGFDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQGVTQ 439

Query: 377 SKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADD 436
            K EQES+L  AQQ IA+QK+ LQ   + T             KVLCLT+ ++AD L DD
Sbjct: 440 PKPEQESVLLHAQQQIALQKLMLQPGTLAT-------------KVLCLTQVVSADELRDD 486

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           E+Y +ILEDMR ECGK+ TLVN+VIPRP   G  TPGVGKVFLEY D      A+  L G
Sbjct: 487 EDYADILEDMRLECGKF-TLVNLVIPRPSPTGDPTPGVGKVFLEYADVESANKARQGLHG 545

Query: 497 RKFGGNTVNAFYYPEDKYFNKDY 519
           R+FGGN V A +YPE+++   DY
Sbjct: 546 RRFGGNQVVAVFYPENRFSQGDY 568


>gi|359476715|ref|XP_002271463.2| PREDICTED: splicing factor U2af large subunit B-like [Vitis
           vinifera]
          Length = 568

 Score =  559 bits (1441), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 283/416 (68%), Positives = 323/416 (77%), Gaps = 18/416 (4%)

Query: 109 KRRSGFDMAPPAAAMLPGAA----VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
           KR SGFDMAPPA+AML GAA      GQ+PG    +P M  NM P  + Q GA P+MPVQ
Sbjct: 164 KRVSGFDMAPPASAMLAGAAAAADFTGQIPGTTPTIPGMFPNMFPLASGQFGALPVMPVQ 223

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
            MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM+AIGGN+AGPGDAVVNVYINHEKKF
Sbjct: 224 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMSAIGGNTAGPGDAVVNVYINHEKKF 283

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           AFVEMR+VEEASNAMALDGIIFEG  V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL 
Sbjct: 284 AFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLT 343

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
            G+ GG EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 344 PGSAGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQ 403

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
           D +VTDIACAALNG+KMGDKTLTVRRA   + Q K EQE++L  AQQ IA+Q++  Q   
Sbjct: 404 DLSVTDIACAALNGIKMGDKTLTVRRANQGASQPKPEQENVLLHAQQQIALQRLMFQPGA 463

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
           + T             KV+CLT+ + AD L DDE YE+I+EDMR E GK+G LVNV IPR
Sbjct: 464 LAT-------------KVVCLTQVVNADELQDDEAYEDIVEDMRIEGGKFGNLVNVAIPR 510

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           P  NG  TPG+GKVFLEY D  G   A+  L+GRKF GN V A +YPE+K+   +Y
Sbjct: 511 PKPNGEPTPGLGKVFLEYADIDGATKARTGLNGRKFDGNQVVAVFYPENKFSQGEY 566


>gi|302816055|ref|XP_002989707.1| hypothetical protein SELMODRAFT_160385 [Selaginella moellendorffii]
 gi|300142484|gb|EFJ09184.1| hypothetical protein SELMODRAFT_160385 [Selaginella moellendorffii]
          Length = 421

 Score =  556 bits (1434), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 291/418 (69%), Positives = 333/418 (79%), Gaps = 16/418 (3%)

Query: 116 MAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-----LGAFPLMPVQVMTQQA 170
           MAPP AA++ G   PGQLPG+   VP +  +M PF  TQ         P MP Q MTQQA
Sbjct: 1   MAPPGAAVVTGT-TPGQLPGITQPVPGVF-SMFPFAGTQARLLFFAGLPTMPAQAMTQQA 58

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGLPPLANEQ IATFFSQVM+AIGGN+AGPGDAVVNVYIN EKKFAFVEMR
Sbjct: 59  TRHARRVYVGGLPPLANEQTIATFFSQVMSAIGGNTAGPGDAVVNVYINQEKKFAFVEMR 118

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           TVEEASNAMALDGIIFEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL  GA GG
Sbjct: 119 TVEEASNAMALDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAAGG 178

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
           A+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTD
Sbjct: 179 ADGPDRIFVGGLPYYLTEGQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTD 238

Query: 351 IACAALNGLKMGDKTLTVRRATA---SGQSKTEQESILAQAQQHIAIQKMALQTSG---- 403
           +ACAALNGLKMGDKTLTVRRATA   SGQ K +Q ++LAQAQQ IA+QK+ALQ +     
Sbjct: 239 VACAALNGLKMGDKTLTVRRATASVHSGQPKPDQANVLAQAQQQIALQKLALQGAPYYNM 298

Query: 404 -MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
            M  +  GM++  ET  KV+CL + ++ D L +D+EYEEILEDMREECGKYG++  +V+P
Sbjct: 299 MMPGVDNGMTM-PETPTKVVCLKQVVSPDELKEDDEYEEILEDMREECGKYGSVATLVLP 357

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           RP  +G E  GVGKVF+EY        AKN+L+GRKFGGN V A Y+PEDK+   +Y+
Sbjct: 358 RPKSDGEEVAGVGKVFVEYATIEEAIKAKNSLNGRKFGGNIVAAVYFPEDKFLQGEYN 415


>gi|168056046|ref|XP_001780033.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668531|gb|EDQ55136.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 536

 Score =  552 bits (1423), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 277/416 (66%), Positives = 319/416 (76%), Gaps = 18/416 (4%)

Query: 110 RRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
           + SGFDMAPP   ++PGAAVPGQ+ G+P  +P +  +M PFG  Q G  P MP Q MTQQ
Sbjct: 131 KTSGFDMAPPGGTIVPGAAVPGQISGMPPQMPGVFPSMFPFGGAQFGGLPGMPAQAMTQQ 190

Query: 170 -ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
            ATRHARRVYVGGLPP+ANEQ IAT+FSQVM A+GGN+AGPGDAVVNVYIN EKKFAFVE
Sbjct: 191 QATRHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFVE 250

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MRTVEEASNAM+LDGIIFEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL  GA 
Sbjct: 251 MRTVEEASNAMSLDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAA 310

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GGA+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP+V
Sbjct: 311 GGADGPDRIFVGGLPYYLTEPQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPSV 370

Query: 349 -TDIACAALNGLKMGDKTLTVRRATAS---GQSKTEQESILAQAQQHIAIQKMALQTSGM 404
            TD+A AALNGLKMGDKTL+VRRA+A    GQ K +Q ++L  AQQ IA+Q         
Sbjct: 371 TTDVAIAALNGLKMGDKTLSVRRASARYGIGQPKPDQANVLIHAQQQIALQVTLKMLLHR 430

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
            T     + + +           +T + L DDEEY+EILEDMR ECGKYG L+NVVIPRP
Sbjct: 431 KTFTAAWTFYAQV----------VTPNQLEDDEEYQEILEDMRMECGKYGNLLNVVIPRP 480

Query: 465 DQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
               GET PG+GKVFL+Y D  G   AK +L GR+F  N V A +YPEDK+  KD+
Sbjct: 481 --RAGETVPGLGKVFLDYSDTTGATKAKTSLHGRRFDENLVVAVFYPEDKFAAKDF 534


>gi|334187224|ref|NP_001190937.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
 gi|332661290|gb|AEE86690.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
          Length = 551

 Score =  552 bits (1422), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 302/523 (57%), Positives = 349/523 (66%), Gaps = 49/523 (9%)

Query: 35  RHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHR----HRSRSHSSD 90
           R H    S   DR R+K    DRE   +  R  R RD  + KER       HR R H S 
Sbjct: 42  RDHERETSRSKDREREKGRDKDRERDSEVSRRSRDRDGEKSKERSRDKDRDHRERHHRSS 101

Query: 91  RFRNRSKSLSPSRSPSK---------------------------SKRRSGFDMAPPAAAM 123
           R R+ S+     R                                +R SGFDMAPPA+AM
Sbjct: 102 RHRDHSRERGERRERGGGRRSRRSRSRSKDRSERRTRSRSPSKSKQRVSGFDMAPPASAM 161

Query: 124 LPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQVMTQQATRHARRVYV 179
           L   A V GQ+P  P  +P   M  NM P    Q  G   +MP+Q MTQQATRHARRVYV
Sbjct: 162 LAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQAMTQQATRHARRVYV 221

Query: 180 GGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAM 239
           GGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAM
Sbjct: 222 GGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAM 281

Query: 240 ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFV 299
           +LDGIIFEG  V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL  GA GG EGPDR+FV
Sbjct: 282 SLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLTPGASGGLEGPDRIFV 341

Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
           GGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+
Sbjct: 342 GGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGI 401

Query: 360 KMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
           KMGDKTLTVRRA   +   K EQE++L  AQQ IA Q++ LQ   + T            
Sbjct: 402 KMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGAVATT----------- 450

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
             V+CLT+ +T D L DDEEY +I+EDMR+E GK+G L NVVIPRP  NG    G+GKVF
Sbjct: 451 --VVCLTQVVTEDELRDDEEYGDIMEDMRQEGGKFGALTNVVIPRPSPNGEPVAGLGKVF 508

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           L+Y D  G   A+  ++GRKFGGN V A YYPEDK+   DY A
Sbjct: 509 LKYADTDGSTRARFGMNGRKFGGNEVVAVYYPEDKFEQGDYGA 551


>gi|302820212|ref|XP_002991774.1| hypothetical protein SELMODRAFT_161898 [Selaginella moellendorffii]
 gi|300140455|gb|EFJ07178.1| hypothetical protein SELMODRAFT_161898 [Selaginella moellendorffii]
          Length = 420

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 287/416 (68%), Positives = 325/416 (78%), Gaps = 13/416 (3%)

Query: 116 MAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-----LGAFPLMPVQVMTQQA 170
           MAPP AA++ G   PGQLPG+   VP +  +M PF  TQ         P MP Q MTQQA
Sbjct: 1   MAPPGAAVVTGT-TPGQLPGITQPVPGVF-SMFPFAGTQASLLFFAGLPTMPAQAMTQQA 58

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TRHARRVYVGGLPPLANEQ IATFFSQVM+AIGGN+AGPGDAVVNVYIN EKKFAFVEMR
Sbjct: 59  TRHARRVYVGGLPPLANEQTIATFFSQVMSAIGGNTAGPGDAVVNVYINQEKKFAFVEMR 118

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           TVEEASNAMALDGIIFEGV+VRVRRP+DYNP++AA LGP QPSP+LNLAAVGL  GA GG
Sbjct: 119 TVEEASNAMALDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAAGG 178

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
           A+GPDR+FVGGLPYY TE QIKELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTD
Sbjct: 179 ADGPDRIFVGGLPYYLTEGQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTD 238

Query: 351 IACAALNGLKMGDKTLTVRRATA---SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           +ACAALNGLKMGDKTLTVRRATA   SGQ K +Q ++LAQAQQ IA+Q         N +
Sbjct: 239 VACAALNGLKMGDKTLTVRRATASVHSGQPKPDQANVLAQAQQQIALQLALQGAPYYNMM 298

Query: 408 GGGMS---LFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
             G+       ET  KV+CL + ++ D L +D+EYEEILEDMREECGKYG++  +V+PRP
Sbjct: 299 MPGVDNGMTMPETPTKVVCLKQVVSPDELKEDDEYEEILEDMREECGKYGSVATLVLPRP 358

Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
             NG E  GVGKVF+EY        AKN+L+GRKFGGN V A Y+PEDK+   +Y+
Sbjct: 359 KSNGEEVAGVGKVFVEYATIEEAIKAKNSLNGRKFGGNIVAAVYFPEDKFLQGEYN 414


>gi|297735185|emb|CBI17547.3| unnamed protein product [Vitis vinifera]
          Length = 393

 Score =  547 bits (1409), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 266/390 (68%), Positives = 305/390 (78%), Gaps = 14/390 (3%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+PG    +P M  NM P  + Q GA P+MPVQ MTQQATRHARRVYVGGL P ANEQ+
Sbjct: 15  GQIPGTTPTIPGMFPNMFPLASGQFGALPVMPVQAMTQQATRHARRVYVGGLSPTANEQS 74

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +ATFFSQVM+AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDGIIFEG  
Sbjct: 75  VATFFSQVMSAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAP 134

Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
           V+VRRP+DYNP+LAA LGP QP+PNLNLAAVGL  G+ GG EGPDR+FVGGLPYYFTE Q
Sbjct: 135 VKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQ 194

Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
           I+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDKTLTVRR
Sbjct: 195 IRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRR 254

Query: 371 AT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAIT 429
           A   + Q K EQE++L  AQQ IA+Q++  Q   + T             KV+CLT+ + 
Sbjct: 255 ANQGASQPKPEQENVLLHAQQQIALQRLMFQPGALAT-------------KVVCLTQVVN 301

Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
           AD L DDE YE+I+EDMR E GK+G LVNV IPRP  NG  TPG+GKVFLEY D  G   
Sbjct: 302 ADELQDDEAYEDIVEDMRIEGGKFGNLVNVAIPRPKPNGEPTPGLGKVFLEYADIDGATK 361

Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           A+  L+GRKF GN V A +YPE+K+   +Y
Sbjct: 362 ARTGLNGRKFDGNQVVAVFYPENKFSQGEY 391


>gi|122245120|sp|Q2QKB4.1|U2A2B_WHEAT RecName: Full=Splicing factor U2af large subunit B; AltName:
           Full=U2 auxiliary factor 65 kDa subunit B; AltName:
           Full=U2 small nuclear ribonucleoprotein auxiliary factor
           large subunit B; Short=U2 snRNP auxiliary factor large
           subunit B
 gi|68036764|gb|AAY84880.1| U2AF large subunit [Triticum aestivum]
          Length = 543

 Score =  543 bits (1400), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 283/424 (66%), Positives = 325/424 (76%), Gaps = 20/424 (4%)

Query: 102 SRSPSKSKRRSGFDMAPPAAAMLP---GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
           SRS SKSKR SGFD+ P A ++LP       P QLPG  S++P M  NMLPF   Q+   
Sbjct: 134 SRSHSKSKRVSGFDLGPTAQSVLPQFPTIPTPSQLPG--SSIPGMFPNMLPFADGQINPL 191

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
            + P Q MTQQATRHARRVYVGGLPP ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYI
Sbjct: 192 VMQP-QAMTQQATRHARRVYVGGLPPSANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYI 250

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           NH+KKFAFVEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QPS NLNL
Sbjct: 251 NHDKKFAFVEMRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSLAAALGPSQPSSNLNL 310

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
           AAVGL  G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY
Sbjct: 311 AAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGY 370

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKM 397
            FCVYQD  VTDIACAALNG+KMGDKTLTVRRA   S Q + EQE+IL QAQQ + +QK+
Sbjct: 371 AFCVYQDLNVTDIACAALNGIKMGDKTLTVRRANQGSAQPRPEQENILLQAQQQVQLQKL 430

Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
             Q   + T             KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYG LV
Sbjct: 431 VYQVGALPT-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGNLV 477

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
            VVIPRP  +G    GVGKVFLEY D  G   AK A+ GRKFGGN V A +YPE+K+ ++
Sbjct: 478 KVVIPRPHPSGEPVSGVGKVFLEYADVDGSTKAKTAMHGRKFGGNPVVAVFYPENKFADE 537

Query: 518 DYSA 521
           DY A
Sbjct: 538 DYDA 541


>gi|224094725|ref|XP_002310209.1| predicted protein [Populus trichocarpa]
 gi|222853112|gb|EEE90659.1| predicted protein [Populus trichocarpa]
          Length = 394

 Score =  543 bits (1399), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 265/387 (68%), Positives = 306/387 (79%), Gaps = 15/387 (3%)

Query: 135 GVPSAVPEMAQNMLPFGA-TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIAT 193
           G    +P M  NM P G   Q GA P+MPVQ MTQQATRHARRVYVGGLPP+ANEQ++AT
Sbjct: 19  GTTPPIPGMFPNMFPLGTGQQFGALPVMPVQAMTQQATRHARRVYVGGLPPIANEQSVAT 78

Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
           FFSQVM AIGGN+AGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDGIIFEG  V+V
Sbjct: 79  FFSQVMAAIGGNTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKV 138

Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
           RRP+DYNP+LAA LGP QP+PNLNLAAVGL  G+ GG EGPDR+FVGGLPYYFTE QI+E
Sbjct: 139 RRPSDYNPSLAATLGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQIRE 198

Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT- 372
           LLESFG L GFDLVKDR+TGNSKGY FCVYQD +VTDIACAALNG+KMGDKTLTVRRA  
Sbjct: 199 LLESFGALRGFDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQ 258

Query: 373 ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
            + Q K EQE++L  AQQ IA+Q++ LQ   + T             KV+CLT+ +T D 
Sbjct: 259 GTNQPKPEQENVLLHAQQQIALQRLMLQPPPVVT-------------KVVCLTQVVTVDE 305

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
           L DD+EYE+ILED+R E GK+G LVNVVIPRP  +G   PGVGKVFLEY D  G + A+ 
Sbjct: 306 LKDDDEYEDILEDIRMEAGKFGQLVNVVIPRPRPDGENAPGVGKVFLEYADTEGSSKARA 365

Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            ++GRKFGGN V A ++PE+K+   +Y
Sbjct: 366 GMNGRKFGGNHVVAVFFPENKFSQGEY 392


>gi|357156009|ref|XP_003577312.1| PREDICTED: splicing factor U2af large subunit B-like [Brachypodium
           distachyon]
          Length = 576

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 273/413 (66%), Positives = 311/413 (75%), Gaps = 18/413 (4%)

Query: 113 GFDMAPPAAAML--PGAAVPGQLPGVPSAVPEMAQNMLPFGA-TQLGAFPLMPVQVMTQQ 169
           GFD  P     L  PGA  PGQLP V   +P M  NM  F A TQ     + P Q MTQQ
Sbjct: 178 GFDQGPSQGVPLVTPGA-TPGQLPAVAPLIPGMLPNMFNFTAPTQFNPLAMQP-QAMTQQ 235

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           ATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVEM
Sbjct: 236 ATRHARRVYVGGLPPTANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEM 295

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QP+PNLNL AVGL  G+ G
Sbjct: 296 RSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGAVGLTPGSAG 355

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  VT
Sbjct: 356 GLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVT 415

Query: 350 DIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           DIACAALNG+KMGDKTLTVRRA   + Q + EQE+IL QA Q + +Q++ LQ      +G
Sbjct: 416 DIACAALNGIKMGDKTLTVRRANQGASQPRPEQETILMQAHQQVQMQRLVLQ------VG 469

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
           G +        KV+CLT+ ++AD L DDEEYE+ILEDMREE  KYG LV  VIPRPD +G
Sbjct: 470 GALP------TKVVCLTQVVSADELRDDEEYEDILEDMREEGRKYGNLVKAVIPRPDPSG 523

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
              PGVGKVFLEY D  G   AK  + GRKFGGN V A +YPE+K+   DY A
Sbjct: 524 APVPGVGKVFLEYLDVDGSTKAKTGMHGRKFGGNQVVAVFYPENKFAEGDYDA 576


>gi|15234495|ref|NP_195387.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
 gi|75318082|sp|O23212.2|U2A2A_ARATH RecName: Full=Splicing factor U2af large subunit A; AltName:
           Full=U2 auxiliary factor 65 kDa subunit A; AltName:
           Full=U2 small nuclear ribonucleoprotein auxiliary factor
           large subunit A; Short=U2 snRNP auxiliary factor large
           subunit A
 gi|18087531|gb|AAL58899.1|AF462805_1 At4g35590/C7A10_670 [Arabidopsis thaliana]
 gi|4006898|emb|CAB16828.1| splicing factor-like protein [Arabidopsis thaliana]
 gi|7270617|emb|CAB80335.1| splicing factor-like protein [Arabidopsis thaliana]
 gi|23506119|gb|AAN28919.1| At4g35590/C7A10_670 [Arabidopsis thaliana]
 gi|24030414|gb|AAN41365.1| putative splicing factor [Arabidopsis thaliana]
 gi|332661287|gb|AEE86687.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
          Length = 573

 Score =  541 bits (1394), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 276/418 (66%), Positives = 318/418 (76%), Gaps = 18/418 (4%)

Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
           +R SGFDMAPPA+AML   A V GQ+P  P  +P   M  NM P    Q  G   +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
            MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           AFVEMR+VEEASNAM+LDGIIFEG  V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL 
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLT 348

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
            GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 349 PGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQ 408

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
           D +VTDIACAALNG+KMGDKTLTVRRA   +   K EQE++L  AQQ IA Q++ LQ   
Sbjct: 409 DLSVTDIACAALNGIKMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGA 468

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
           + T              V+CLT+ +T D L DDEEY +I+EDMR+E GK+G L NVVIPR
Sbjct: 469 VATT-------------VVCLTQVVTEDELRDDEEYGDIMEDMRQEGGKFGALTNVVIPR 515

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           P  NG    G+GKVFL+Y D  G   A+  ++GRKFGGN V A YYPEDK+   DY A
Sbjct: 516 PSPNGEPVAGLGKVFLKYADTDGSTRARFGMNGRKFGGNEVVAVYYPEDKFEQGDYGA 573


>gi|156070782|gb|ABU45195.1| unknown [Petunia integrifolia subsp. inflata]
          Length = 506

 Score =  536 bits (1381), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 283/487 (58%), Positives = 339/487 (69%), Gaps = 60/487 (12%)

Query: 68  RHRDYNRDKERRHRHRSRSH-SSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPG 126
           R R+ ++D E  HRHR  S   +DR          S+SPSKS+R SGFDMAPP +A+LPG
Sbjct: 46  RRRENDKDIEDPHRHRPGSRGKTDR----------SQSPSKSRRISGFDMAPPTSALLPG 95

Query: 127 AA-VPGQLPGVPSAVPEMAQNMLPF-----------------------------GATQLG 156
           A    GQ+PG   ++P M  NM P                              G  Q G
Sbjct: 96  ATDAAGQVPGTNPSIPGMFSNMFPLASDQVLPQIPSYYTSNGLLIFSFLIHLVCGFFQCG 155

Query: 157 AFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
            FP+MP+Q MTQQATRHARRVYVGGLP  ANEQ++ATFFS VM AIGGN+AGPGDAV++V
Sbjct: 156 PFPVMPIQEMTQQATRHARRVYVGGLPSSANEQSVATFFSHVMYAIGGNTAGPGDAVIDV 215

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
           YINHEKKFAFVEMR+VEEASNAMALDG+IFEG  VRVRRP+DYN +LAA LGP QPSPNL
Sbjct: 216 YINHEKKFAFVEMRSVEEASNAMALDGVIFEGEPVRVRRPSDYNASLAATLGPSQPSPNL 275

Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
           NLAAVGL  G+ GG EGPD +F+GGLP YFTE QI+ELLESFG L GF+LVKDR++GNSK
Sbjct: 276 NLAAVGLTPGSSGGLEGPDCIFIGGLPDYFTEAQIRELLESFGPLRGFNLVKDRESGNSK 335

Query: 337 GYGFCVYQDPAVTDIACAALNGLK-MGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAI 394
           G+ F VYQD +VT+IAC ALNG+K M DKTL VRRA   + Q   EQES+L    Q I++
Sbjct: 336 GHAFFVYQDVSVTEIACGALNGIKIMHDKTLIVRRANQGTQQLNPEQESVL----QQISL 391

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
           Q++ L    + T             KVLCLTEA+  D L DD++Y++ILEDMR ECGK+G
Sbjct: 392 QRLMLLPGALAT-------------KVLCLTEAVRLDELNDDDDYQDILEDMRTECGKFG 438

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            L+NV+IPRP+ NG  TPGVGKVFLEY D    + A+  L+GRKFGGN V A +YPE+K+
Sbjct: 439 ALLNVIIPRPNPNGEPTPGVGKVFLEYADVDSSSKAQQGLNGRKFGGNQVIAVFYPENKF 498

Query: 515 FNKDYSA 521
              +Y A
Sbjct: 499 SEGNYEA 505


>gi|122232770|sp|Q2QZL4.2|U2A2B_ORYSJ RecName: Full=Splicing factor U2af large subunit B; AltName:
           Full=U2 auxiliary factor 65 kDa subunit B; AltName:
           Full=U2 small nuclear ribonucleoprotein auxiliary factor
           large subunit B; Short=U2 snRNP auxiliary factor large
           subunit B
 gi|108864649|gb|ABA95281.2| transposon protein, putative, CACTA, En/Spm sub-class, expressed
           [Oryza sativa Japonica Group]
 gi|222616418|gb|EEE52550.1| hypothetical protein OsJ_34796 [Oryza sativa Japonica Group]
          Length = 548

 Score =  535 bits (1379), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 276/414 (66%), Positives = 318/414 (76%), Gaps = 20/414 (4%)

Query: 112 SGFDMAPPAAAMLP---GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
           SGFDMAPPA A++P       P Q PG  +A+P M  NMLP G  Q     + P Q MTQ
Sbjct: 151 SGFDMAPPAQAVVPQFPAIPTPSQFPG--TAIPGMFPNMLPMGVGQFNPLVIQP-QAMTQ 207

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGLPP ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVE
Sbjct: 208 QATRHARRVYVGGLPPTANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 267

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL  G+ 
Sbjct: 268 MRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTPGSA 327

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  V
Sbjct: 328 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNV 387

Query: 349 TDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           TDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q   + T 
Sbjct: 388 TDIACAALNGIKMGDKTLTVRRANQGAAQPRPEQESILLQAQQQVQLQKLVYQVGALPT- 446

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
                       KV+CLT+ ++AD L DDEEYE+I+EDMR E GKYG L+ VVIPRPD +
Sbjct: 447 ------------KVVCLTQVVSADELKDDEEYEDIMEDMRLEAGKYGNLIKVVIPRPDPS 494

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           G    GVGKVFLEY D  G   AK A+ GRKFGGN V A +YPE+K+ + +Y A
Sbjct: 495 GLPVAGVGKVFLEYADVDGATKAKTAMHGRKFGGNPVVAVFYPENKFSSAEYDA 548


>gi|218191627|gb|EEC74054.1| hypothetical protein OsI_09051 [Oryza sativa Indica Group]
          Length = 548

 Score =  535 bits (1379), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 276/414 (66%), Positives = 318/414 (76%), Gaps = 20/414 (4%)

Query: 112 SGFDMAPPAAAMLP---GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
           SGFDMAPPA A++P       P Q PG  +A+P M  NMLP G  Q     + P Q MTQ
Sbjct: 151 SGFDMAPPAQAVVPQFPAIPTPSQFPG--TAIPGMFPNMLPMGVGQFNPLVIQP-QAMTQ 207

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGLPP ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVE
Sbjct: 208 QATRHARRVYVGGLPPTANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 267

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL  G+ 
Sbjct: 268 MRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTPGSA 327

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  V
Sbjct: 328 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNV 387

Query: 349 TDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           TDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q   + T 
Sbjct: 388 TDIACAALNGIKMGDKTLTVRRANQGAAQPRPEQESILLQAQQQVQLQKLVYQVGALPT- 446

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
                       KV+CLT+ ++AD L DDEEYE+I+EDMR E GKYG L+ VVIPRPD +
Sbjct: 447 ------------KVVCLTQVVSADELKDDEEYEDIMEDMRLESGKYGNLIKVVIPRPDPS 494

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           G    GVGKVFLEY D  G   AK A+ GRKFGGN V A +YPE+K+ + +Y A
Sbjct: 495 GLPVAGVGKVFLEYADVDGATKAKTAMHGRKFGGNPVVAVFYPENKFASAEYDA 548


>gi|357470349|ref|XP_003605459.1| Splicing factor U2af large subunit B [Medicago truncatula]
 gi|355506514|gb|AES87656.1| Splicing factor U2af large subunit B [Medicago truncatula]
          Length = 634

 Score =  533 bits (1374), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 263/400 (65%), Positives = 305/400 (76%), Gaps = 28/400 (7%)

Query: 124 LPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMTQQATRHARRVYVG 180
           +P  A+ G LP           NM P GA Q+    A P+MP+Q MTQQATRHARRVYVG
Sbjct: 257 VPNPAISGVLP-----------NMFPMGANQMPQFSALPMMPIQAMTQQATRHARRVYVG 305

Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMA 240
           GLPP ANEQ++A FFSQVM  IGGN+AGPGDAVVNVYINH+KKFAFVEMR+VEEASNAMA
Sbjct: 306 GLPPTANEQSVAIFFSQVMANIGGNTAGPGDAVVNVYINHDKKFAFVEMRSVEEASNAMA 365

Query: 241 LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVG 300
           LDGIIFEG  V+VRRPTDYNP+LAA LGP QP+PNLNL AVGL  G+ GG EGPDR+FVG
Sbjct: 366 LDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLTPGSAGGLEGPDRIFVG 425

Query: 301 GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLK 360
           GLPYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVY D AVTDIACAALNG+K
Sbjct: 426 GLPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYADLAVTDIACAALNGIK 485

Query: 361 MGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
           MGDKTLTVRRA   + Q K EQESIL  AQQ IA+QK+  Q + + T             
Sbjct: 486 MGDKTLTVRRANQGTTQPKPEQESILMHAQQQIALQKLIFQPALVAT------------- 532

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
           KV+CLT A+  + L +DE++EEI++DMR+EC K+G+LVNVVIPRP  +G  + GVGKVFL
Sbjct: 533 KVVCLTNAVAPEELKEDEDFEEIIDDMRQECSKFGSLVNVVIPRPQPDGDLSGGVGKVFL 592

Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           EY D  G   A+  L+GRKFGGN V A +Y E+K+   DY
Sbjct: 593 EYVDIEGATKARTGLNGRKFGGNEVIAVFYSENKFAQGDY 632


>gi|357438349|ref|XP_003589450.1| Splicing factor U2af large subunit B [Medicago truncatula]
 gi|355478498|gb|AES59701.1| Splicing factor U2af large subunit B [Medicago truncatula]
          Length = 611

 Score =  532 bits (1371), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 267/414 (64%), Positives = 307/414 (74%), Gaps = 34/414 (8%)

Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL---GAFPLMPVQVMTQ 168
           SGFDMAPP +A+L    V GQ+ G   A+P M  NM P    Q+    A P++PVQ MTQ
Sbjct: 224 SGFDMAPPTSAILGATGVAGQITGASPAIPGMFPNMFPLPTNQVQPFSALPVLPVQAMTQ 283

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGL P ANEQ++ATFFSQVM  IGGN+AGPGDAV               
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAV--------------- 328

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MR+VEEASNAMALDGIIFEG  V+VRRPTDYNP+LAAALGP QP+PNLNL  VGL+ G+ 
Sbjct: 329 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 388

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GG +GPDR+FVGG+PYYFTETQI+ELLE+FG L GFDLVKDR+TGNSKGY FCVYQD AV
Sbjct: 389 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRGFDLVKDRETGNSKGYAFCVYQDLAV 448

Query: 349 TDIACAALNGLKMGDKTLTVRRA---TASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
           TDIACAALNG+KMGDKTLTVRRA   T   Q K EQESIL  AQQ IA+QK+ LQ + + 
Sbjct: 449 TDIACAALNGIKMGDKTLTVRRANQNTNPMQPKPEQESILMHAQQQIALQKLMLQPALVA 508

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
           T             KVLCLT A++ D L DDE+YEEIL+DMR+EC K+G LVNVVIPRP 
Sbjct: 509 T-------------KVLCLTHAVSPDELKDDEDYEEILDDMRQECSKFGNLVNVVIPRPR 555

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +G   PGVGKVFLEY D  G   A++ L+GRKFGGN V A +YPE+K+   DY
Sbjct: 556 PDGELCPGVGKVFLEYADVDGSTKARSGLNGRKFGGNQVIAVFYPENKFAQGDY 609


>gi|357446501|ref|XP_003593528.1| Splicing factor U2af large subunit B [Medicago truncatula]
 gi|355482576|gb|AES63779.1| Splicing factor U2af large subunit B [Medicago truncatula]
          Length = 593

 Score =  531 bits (1369), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 296/542 (54%), Positives = 358/542 (66%), Gaps = 79/542 (14%)

Query: 49  RDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKS 108
           R K   Y+R+  RD+DR   H DY+RD++ R+R+ + S  S R              S+ 
Sbjct: 58  RGKYDSYNRQRGRDYDR---HNDYDRDRDTRNRYGAHSKRSRRESRSRSRSRSPSQ-SEG 113

Query: 109 KRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
           KR SGFDMAPPA  + P   V GQ+PG+   +    QN  P+G +Q+GA  LM VQ MTQ
Sbjct: 114 KRTSGFDMAPPATGVTP--TVSGQMPGIAHMIQGATQNFSPYGISQIGALSLMQVQPMTQ 171

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGLPP ANEQ+IA+FFSQVM AIGGNSAG GD+VVNVYINHEKKFAFVE
Sbjct: 172 QATRHARRVYVGGLPPFANEQSIASFFSQVMIAIGGNSAGSGDSVVNVYINHEKKFAFVE 231

Query: 229 MRTVEEASNAMALDGIIFEGVAV---------RV-------RRPTD-------------- 258
           MRTVEEASNAMALDGI+FEG+ V         R+       RRP D              
Sbjct: 232 MRTVEEASNAMALDGIVFEGIGVAPIVKMVENRLRWFGHVERRPIDSVARRVDQMEDSQM 291

Query: 259 ------------------YNPTL-------AAALGPGQPSP-NLNLAAV--------GLA 284
                             Y+ TL         A+   +P+  N +LAAV         L 
Sbjct: 292 DKTIRKDLEINKLDRNMVYDRTLWRNLIHVGVAVRVRRPTDYNPSLAAVLGPCQPSANLN 351

Query: 285 SGAIGGAEGP-------DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
             A+G + G        DR+FVGGLPYYFTE Q++ELL++FG L  FD+V+D++TGNSKG
Sbjct: 352 LSAVGLSAGTIGGAEGLDRIFVGGLPYYFTEVQMRELLQAFGPLRSFDIVRDKETGNSKG 411

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           YGFC+YQDPAVTDIACAALNGLKMGDKTLTVRRAT S  SK E+++I A+AQQHIA+QK+
Sbjct: 412 YGFCIYQDPAVTDIACAALNGLKMGDKTLTVRRATVSAHSKPEEDNIFARAQQHIAMQKI 471

Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
           AL+  G+N    G+    E+  KVLCLTEA+T + L D+ EYEEILEDMR+EC K+GTLV
Sbjct: 472 ALEVVGLNI--PGVPTNDESPTKVLCLTEAVTTEQLTDNGEYEEILEDMRDECRKFGTLV 529

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
           NVVIPRP+ NG  + G+GKVFLEY D   C  AKNAL+GRKFGG+ V AFYYPE+KY + 
Sbjct: 530 NVVIPRPNPNGELSTGIGKVFLEYSDCTACLAAKNALNGRKFGGSIVTAFYYPEEKYHSM 589

Query: 518 DY 519
           DY
Sbjct: 590 DY 591


>gi|168030966|ref|XP_001767993.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680835|gb|EDQ67268.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 499

 Score =  525 bits (1352), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 263/414 (63%), Positives = 312/414 (75%), Gaps = 31/414 (7%)

Query: 112 SGFDMAPPAAAMLPGAAVPGQLPGV--PSAVPEMAQNMLPF-GATQLGAFPLMPVQVMTQ 168
           SGFDMAPP   +LP +A+ GQ+ G+  PS        + PF G TQ+G FPL     +  
Sbjct: 109 SGFDMAPPGVTVLPASALSGQIAGMGFPS--------IFPFAGGTQVGPFPLH-FHAIGL 159

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
             TRHARRVYVGGLPP+ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYIN EK+FAFVE
Sbjct: 160 SFTRHARRVYVGGLPPMANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINQEKRFAFVE 219

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MRTVEEASNAMALDGI++EGV+VRVRRP+DYNP++AA LGP QPS +LNL AVGL  GA+
Sbjct: 220 MRTVEEASNAMALDGIVYEGVSVRVRRPSDYNPSMAATLGPSQPSSHLNLTAVGLTPGAL 279

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GGA+GPDR+FVGGLPYY +E QI +LL SFG L  FDLVKDRDTGNSKGYGFCVYQDP+V
Sbjct: 280 GGADGPDRIFVGGLPYYLSEEQIMDLLSSFGHLRAFDLVKDRDTGNSKGYGFCVYQDPSV 339

Query: 349 TDIACAALNGLKMGDKTLTVRRATAS---GQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
            DIACAALNGLKMGD+TLTVRRA+A    GQ K +Q +I+ QAQQ IA+Q  A       
Sbjct: 340 MDIACAALNGLKMGDRTLTVRRASARLRFGQPKPDQSNIIVQAQQQIALQVAA------- 392

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
                     ET  KV+CL++ ++   L DD E++EI+EDM+EECGKYG+L+NVVIPRP 
Sbjct: 393 ---------PETATKVICLSQVVSIVDLKDDVEFDEIVEDMKEECGKYGSLLNVVIPRPS 443

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +  + PG+G VF+EY D  G A AK AL  RKFGG  V A YY EDK+ N DY
Sbjct: 444 YDEEDVPGIGMVFVEYSDLEGAAKAKQALHNRKFGGKLVIASYYSEDKFLNGDY 497


>gi|242069419|ref|XP_002449986.1| hypothetical protein SORBIDRAFT_05g026740 [Sorghum bicolor]
 gi|241935829|gb|EES08974.1| hypothetical protein SORBIDRAFT_05g026740 [Sorghum bicolor]
          Length = 545

 Score =  522 bits (1345), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 278/414 (67%), Positives = 315/414 (76%), Gaps = 23/414 (5%)

Query: 113 GFDMAPPAA--AMLPGAAVPGQLPGVPSAVPE---MAQNMLPFG-ATQLGAFPLMPVQVM 166
           GFD  PP A  +  P    P QLPG  +++P    M  NMLPFG A Q     + P Q M
Sbjct: 146 GFDAPPPQAMGSPFPVIPTPSQLPG--TSLPNIGGMFPNMLPFGVAGQFNPLVIQP-QAM 202

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
           TQQATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAF
Sbjct: 203 TQQATRHARRVYVGGLPPSANEQTVAVYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAF 262

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           VEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL +G
Sbjct: 263 VEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAG 322

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
           + GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD 
Sbjct: 323 STGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDL 382

Query: 347 AVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
            VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q   + 
Sbjct: 383 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGALP 442

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
           T             KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYGTLV V+IPRPD
Sbjct: 443 T-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGTLVKVIIPRPD 489

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +G    GVGKVFLEY D  G A AK AL GRKFGGN V A  Y EDK+ N +Y
Sbjct: 490 PSGQPVAGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDKFANGEY 543


>gi|223950169|gb|ACN29168.1| unknown [Zea mays]
 gi|413920349|gb|AFW60281.1| splicing factor U2AF subunit [Zea mays]
          Length = 594

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 270/413 (65%), Positives = 312/413 (75%), Gaps = 18/413 (4%)

Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD APP  A  ++   A+PGQLPG+ + +P +    N+    A Q     + P Q MT
Sbjct: 194 SGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQP-QAMT 252

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 253 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 312

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL  G+
Sbjct: 313 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 372

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  
Sbjct: 373 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 432

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q      
Sbjct: 433 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKLVYQ------ 486

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
           +GG +        KV+CLT+ +TAD L DDEEY +I+EDMREE  KYG LV VVIPRPD 
Sbjct: 487 VGGALP------TKVVCLTQVVTADELRDDEEYNDIVEDMREEGRKYGNLVKVVIPRPDP 540

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           +     GVGKVFLEY D  G   AK  + GRKFGGN V A +YPEDK+  + Y
Sbjct: 541 SDAPVAGVGKVFLEYADVEGSTKAKTGMHGRKFGGNQVVAVFYPEDKFAAEQY 593


>gi|212723502|ref|NP_001131562.1| uncharacterized protein LOC100192903 [Zea mays]
 gi|194691860|gb|ACF80014.1| unknown [Zea mays]
 gi|195646366|gb|ACG42651.1| splicing factor U2AF 65 kDa subunit [Zea mays]
 gi|413920213|gb|AFW60145.1| Splicing factor U2AF subunit [Zea mays]
          Length = 539

 Score =  520 bits (1338), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 278/414 (67%), Positives = 314/414 (75%), Gaps = 23/414 (5%)

Query: 113 GFDMAPPAA--AMLPGAAVPGQLPGVPSAVPE---MAQNMLPFG-ATQLGAFPLMPVQVM 166
           GFD  PP A  +  P    P QLPG  S++P    M  NMLPFG A Q     + P Q M
Sbjct: 140 GFDAPPPQAMGSTFPVIPTPSQLPG--SSLPNIGGMFPNMLPFGVAGQFNPLVIQP-QAM 196

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
           TQQATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAF
Sbjct: 197 TQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAF 256

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           VEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL +G
Sbjct: 257 VEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAG 316

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
           + GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD 
Sbjct: 317 SNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDL 376

Query: 347 AVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
            VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q   + 
Sbjct: 377 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGALP 436

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
           T             KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYG LV V+IPRPD
Sbjct: 437 T-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGNLVKVIIPRPD 483

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +G    GVGKVFLEY D  G A AK AL GRKFGGN V A  Y EDK+ N +Y
Sbjct: 484 PSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDKFANGEY 537


>gi|226497766|ref|NP_001152419.1| LOC100286059 [Zea mays]
 gi|195656099|gb|ACG47517.1| splicing factor U2AF 65 kDa subunit [Zea mays]
          Length = 596

 Score =  519 bits (1336), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 269/413 (65%), Positives = 311/413 (75%), Gaps = 18/413 (4%)

Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD APP  A  ++   A+PGQLPG+ + +P +    N+    A Q     + P Q MT
Sbjct: 196 SGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQP-QAMT 254

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 255 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 314

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL  G+
Sbjct: 315 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 374

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  
Sbjct: 375 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 434

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTV RA   + Q + EQESIL QAQQ + +QK+  Q      
Sbjct: 435 VTDIACAALNGIKMGDKTLTVSRANQGASQPRPEQESILLQAQQQVQMQKLVYQ------ 488

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
           +GG +        KV+CLT+ +TAD L DDEEY +I+EDMREE  KYG LV VVIPRPD 
Sbjct: 489 VGGALP------TKVVCLTQVVTADELRDDEEYNDIVEDMREEGRKYGNLVKVVIPRPDP 542

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           +     GVGKVFLEY D  G   AK  + GRKFGGN V A +YPEDK+  + Y
Sbjct: 543 SDAPVAGVGKVFLEYADVEGSTKAKTGMHGRKFGGNQVVAVFYPEDKFAAEQY 595


>gi|115486631|ref|NP_001068459.1| Os11g0682300 [Oryza sativa Japonica Group]
 gi|113645681|dbj|BAF28822.1| Os11g0682300, partial [Oryza sativa Japonica Group]
          Length = 378

 Score =  518 bits (1333), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 265/394 (67%), Positives = 306/394 (77%), Gaps = 17/394 (4%)

Query: 129 VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANE 188
           + GQ PG  +A+P M  NMLP G  Q     + P Q MTQQATRHARRVYVGGLPP ANE
Sbjct: 1   IAGQFPG--TAIPGMFPNMLPMGVGQFNPLVIQP-QAMTQQATRHARRVYVGGLPPTANE 57

Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG 248
           Q++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG
Sbjct: 58  QSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGILFEG 117

Query: 249 VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTE 308
             V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL  G+ GG EGPDR+FVGGLPYYFTE
Sbjct: 118 APVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTE 177

Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
            Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  VTDIACAALNG+KMGDKTLTV
Sbjct: 178 AQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTV 237

Query: 369 RRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
           RRA   + Q + EQESIL QAQQ + +QK+  Q   + T             KV+CLT+ 
Sbjct: 238 RRANQGAAQPRPEQESILLQAQQQVQLQKLVYQVGALPT-------------KVVCLTQV 284

Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
           ++AD L DDEEYE+I+EDMR E GKYG L+ VVIPRPD +G    GVGKVFLEY D  G 
Sbjct: 285 VSADELKDDEEYEDIMEDMRLEAGKYGNLIKVVIPRPDPSGLPVAGVGKVFLEYADVDGA 344

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
             AK A+ GRKFGGN V A +YPE+K+ + +Y A
Sbjct: 345 TKAKTAMHGRKFGGNPVVAVFYPENKFSSAEYDA 378


>gi|414591747|tpg|DAA42318.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
          Length = 590

 Score =  511 bits (1316), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 264/413 (63%), Positives = 307/413 (74%), Gaps = 18/413 (4%)

Query: 112 SGFDMAPPAAAM--LPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD AP   A+  +    +PGQLPGV + +P +    N+    A Q     + P Q MT
Sbjct: 190 SGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPLAIQP-QAMT 248

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 249 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 308

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAM LDGI+FEG  V++RRPTDYNP+LAAALGP QP+PNLNL+AVGL  G+
Sbjct: 309 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 368

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGL YYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  
Sbjct: 369 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 428

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK   Q      
Sbjct: 429 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKFVYQ------ 482

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
           +GG +        KV+CLT+ +T D L DDEEY++I+EDMREE  KYG LV V IPRPD 
Sbjct: 483 VGGALP------TKVVCLTQVVTEDELRDDEEYDDIVEDMREEGHKYGNLVKVAIPRPDP 536

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           +G    GVGKVFLEY D  G   AK  + GRKFGGN V A +YPEDK+  + Y
Sbjct: 537 SGAPVAGVGKVFLEYADVEGSTKAKTGMHGRKFGGNQVVAVFYPEDKFAAEQY 589


>gi|193848546|gb|ACF22733.1| U2AF large subunit [Brachypodium distachyon]
          Length = 569

 Score =  510 bits (1314), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 265/397 (66%), Positives = 307/397 (77%), Gaps = 22/397 (5%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ-----ATRHARRVYVGGLPPL 185
           GQLPG  S++P M  NMLPF   Q     + P Q MTQQ     ATRHARRVYVGGLPP 
Sbjct: 189 GQLPG--SSIPGMFPNMLPFAVGQFNPLVMQP-QAMTQQHIFPQATRHARRVYVGGLPPT 245

Query: 186 ANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGII 245
           ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+
Sbjct: 246 ANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIL 305

Query: 246 FEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYY 305
           FEG  V+VRRPTDYNP+LA+ALGP QPS NLNLAAVGL  G+ GG EGPDR+FVGGLPYY
Sbjct: 306 FEGAPVKVRRPTDYNPSLASALGPSQPSSNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYY 365

Query: 306 FTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKT 365
           FTE Q++ELLESFG+L GFDLVKDR+TGNSKGY FCVYQD  VTDIACAALNG+KMGDKT
Sbjct: 366 FTEAQVRELLESFGSLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKT 425

Query: 366 LTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCL 424
           LTVRRA   S Q + EQE+IL QAQQ + +QK+  Q   + T             KV+CL
Sbjct: 426 LTVRRANQGSAQPRPEQENILLQAQQQVQLQKLVYQVGALPT-------------KVICL 472

Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
           T+ +TAD L DDEEYE+I+EDMR E GKYGTLV VVIPRP  +G    GVGKVFLEY D 
Sbjct: 473 TQVVTADELKDDEEYEDIMEDMRLEAGKYGTLVKVVIPRPHPSGEPVAGVGKVFLEYADV 532

Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
            G   AK A+ GRKFGGN V A +YPE+K+ ++++ A
Sbjct: 533 DGSTKAKTAMHGRKFGGNPVVAVFYPENKFSDEEFDA 569


>gi|108864648|gb|ABG22574.1| transposon protein, putative, CACTA, En/Spm sub-class, expressed
           [Oryza sativa Japonica Group]
          Length = 366

 Score =  508 bits (1307), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 259/380 (68%), Positives = 297/380 (78%), Gaps = 15/380 (3%)

Query: 143 MAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAI 202
           M  NMLP G  Q     + P Q MTQQATRHARRVYVGGLPP ANEQ++A +F+QVM AI
Sbjct: 1   MFPNMLPMGVGQFNPLVIQP-QAMTQQATRHARRVYVGGLPPTANEQSVAIYFNQVMAAI 59

Query: 203 GGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT 262
           GGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+
Sbjct: 60  GGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGILFEGAPVKVRRPTDYNPS 119

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
           LAAALGP QPSPNLNLAAVGL  G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L 
Sbjct: 120 LAAALGPSQPSPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLR 179

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQ 381
           GFDLVKDR+TGNSKGY FCVYQD  VTDIACAALNG+KMGDKTLTVRRA   + Q + EQ
Sbjct: 180 GFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRANQGAAQPRPEQ 239

Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEE 441
           ESIL QAQQ + +QK+  Q   + T             KV+CLT+ ++AD L DDEEYE+
Sbjct: 240 ESILLQAQQQVQLQKLVYQVGALPT-------------KVVCLTQVVSADELKDDEEYED 286

Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
           I+EDMR E GKYG L+ VVIPRPD +G    GVGKVFLEY D  G   AK A+ GRKFGG
Sbjct: 287 IMEDMRLEAGKYGNLIKVVIPRPDPSGLPVAGVGKVFLEYADVDGATKAKTAMHGRKFGG 346

Query: 502 NTVNAFYYPEDKYFNKDYSA 521
           N V A +YPE+K+ + +Y A
Sbjct: 347 NPVVAVFYPENKFSSAEYDA 366


>gi|9858779|gb|AAG01126.1|AF273333_11 BAC19.11 [Solanum lycopersicum]
          Length = 532

 Score =  506 bits (1304), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 260/415 (62%), Positives = 300/415 (72%), Gaps = 46/415 (11%)

Query: 112 SGFDMAPPAAAMLPGAA-VPGQLPGVPS-AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
           SGFDMAPP +A+L GA  V GQ+PG  + ++P M  NM P  A Q               
Sbjct: 159 SGFDMAPPTSALLSGATDVAGQVPGTTNPSIPGMFSNMFPLAAGQ--------------- 203

Query: 170 ATRHARRVYVGGLPPLANEQAIATF--FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           ATRHARRVYVGGLPP ANEQ +     FS             GDAVVNVYINHEKKFAFV
Sbjct: 204 ATRHARRVYVGGLPPTANEQVLKILLKFS-------------GDAVVNVYINHEKKFAFV 250

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAMALDG+IFEG  V+VRRP+DYNP+LAA LGP QPSPNLNLAAVGL  G+
Sbjct: 251 EMRSVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGLTPGS 310

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGLPYYFTE+QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +
Sbjct: 311 SGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRGFDLVKDRETGNSKGYAFCVYQDVS 370

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q   EQES+L  AQQ IA+Q+  LQ   + T
Sbjct: 371 VTDIACAALNGIKMGDKTLTVRRANQGTTQPNPEQESVLLHAQQQIALQRFMLQPGALAT 430

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
                        KVLCLTE ++ D L DD++Y++ILEDMR ECGK+G L+NVVIPRP+ 
Sbjct: 431 -------------KVLCLTEVVSVDELKDDDDYQDILEDMRIECGKFGALLNVVIPRPNP 477

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           NG  TPG+GKVFLEY D    + A+  L+GRKFGGN V A +YPE+K+   DY A
Sbjct: 478 NGEPTPGLGKVFLEYADVDSSSKARQGLNGRKFGGNQVIAVFYPENKFSEGDYEA 532


>gi|413920211|gb|AFW60143.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
          Length = 367

 Score =  506 bits (1302), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 264/379 (69%), Positives = 297/379 (78%), Gaps = 16/379 (4%)

Query: 143 MAQNMLPFG-ATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTA 201
           M  NMLPFG A Q     + P Q MTQQATRHARRVYVGGLPP ANEQ +A +F+QVM A
Sbjct: 1   MFPNMLPFGVAGQFNPLVIQP-QAMTQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAA 59

Query: 202 IGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP 261
           IGGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP
Sbjct: 60  IGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNP 119

Query: 262 TLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTL 321
           +LAAALGP QPSPNLNLAAVGL +G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L
Sbjct: 120 SLAAALGPSQPSPNLNLAAVGLTAGSNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPL 179

Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTE 380
            GFDLVKDR+TGNSKGY FCVYQD  VTDIACAALNG+KMGDKTLTVRRA   + Q + E
Sbjct: 180 RGFDLVKDRETGNSKGYAFCVYQDLTVTDIACAALNGIKMGDKTLTVRRANQGASQPRPE 239

Query: 381 QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
           QESIL QAQQ + +QK+  Q   + T             KV+CLT+ +TAD L DDEEYE
Sbjct: 240 QESILLQAQQQVQLQKLVYQVGALPT-------------KVVCLTQVVTADELKDDEEYE 286

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
           +I+EDMR E GKYG LV V+IPRPD +G    GVGKVFLEY D  G A AK AL GRKFG
Sbjct: 287 DIMEDMRLEAGKYGNLVKVIIPRPDPSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFG 346

Query: 501 GNTVNAFYYPEDKYFNKDY 519
           GN V A  Y EDK+ N +Y
Sbjct: 347 GNPVVAVCYAEDKFANGEY 365


>gi|226532558|ref|NP_001140768.1| uncharacterized protein LOC100272843 [Zea mays]
 gi|194701008|gb|ACF84588.1| unknown [Zea mays]
 gi|414591744|tpg|DAA42315.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
          Length = 583

 Score =  505 bits (1300), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 263/413 (63%), Positives = 305/413 (73%), Gaps = 25/413 (6%)

Query: 112 SGFDMAPPAAAM--LPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD AP   A+  +    +PGQLPGV + +P +    N+    A Q         Q MT
Sbjct: 190 SGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQ--------PQAMT 241

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 242 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 301

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAM LDGI+FEG  V++RRPTDYNP+LAAALGP QP+PNLNL+AVGL  G+
Sbjct: 302 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 361

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGL YYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  
Sbjct: 362 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 421

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK   Q      
Sbjct: 422 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKFVYQ------ 475

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
           +GG +        KV+CLT+ +T D L DDEEY++I+EDMREE  KYG LV V IPRPD 
Sbjct: 476 VGGALP------TKVVCLTQVVTEDELRDDEEYDDIVEDMREEGHKYGNLVKVAIPRPDP 529

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           +G    GVGKVFLEY D  G   AK  + GRKFGGN V A +YPEDK+  + Y
Sbjct: 530 SGAPVAGVGKVFLEYADVEGSTKAKTGMHGRKFGGNQVVAVFYPEDKFAAEQY 582


>gi|218186084|gb|EEC68511.1| hypothetical protein OsI_36782 [Oryza sativa Indica Group]
          Length = 574

 Score =  504 bits (1297), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 249/389 (64%), Positives = 288/389 (74%), Gaps = 13/389 (3%)

Query: 132 QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
           Q+P V  A+  M  NM     T      + P Q MTQQATRHARRVYVGGLPP ANE  +
Sbjct: 196 QVPVVAPAISGMLPNMFNLTQTPFTPLVIQP-QAMTQQATRHARRVYVGGLPPTANEHTV 254

Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAV 251
           A +F+QVM A+GGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG  V
Sbjct: 255 AVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPV 314

Query: 252 RVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI 311
           +VRRPTDYNP+LAAALGP QP+PNLNLAAVGL  G+ GG EGPDR+FVGGLPYYFTE Q+
Sbjct: 315 KVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQV 374

Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
           +ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  VTDIACAALNG+KMGDKTLTVRRA
Sbjct: 375 RELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRA 434

Query: 372 T-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
              + Q + EQES+L   QQ   +QK+  Q  G           G    KV+CLT+ I+ 
Sbjct: 435 NQGASQPRPEQESMLLHVQQQAQMQKLMFQVGG-----------GALPTKVVCLTQVISP 483

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
           D L DDEEYE+I++DMREE  +YG LV VVIPRPD +G    GVG+VFLE+ D      A
Sbjct: 484 DELRDDEEYEDIVQDMREEGCRYGNLVKVVIPRPDPSGAPVAGVGRVFLEFADIESSTKA 543

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           KN + GRKF  N V A +YPEDK+    Y
Sbjct: 544 KNGMHGRKFANNQVVAVFYPEDKFAEGQY 572


>gi|115486373|ref|NP_001068330.1| Os11g0636900 [Oryza sativa Japonica Group]
 gi|122248736|sp|Q2R0Q1.2|U2A2A_ORYSJ RecName: Full=Splicing factor U2af large subunit A; AltName:
           Full=U2 auxiliary factor 65 kDa subunit A; AltName:
           Full=U2 small nuclear ribonucleoprotein auxiliary factor
           large subunit A; Short=U2 snRNP auxiliary factor large
           subunit A
 gi|108864607|gb|ABA94914.2| U2 snRNP auxilliary factor, large subunit, splicing factor family
           protein, expressed [Oryza sativa Japonica Group]
 gi|113645552|dbj|BAF28693.1| Os11g0636900 [Oryza sativa Japonica Group]
 gi|222616290|gb|EEE52422.1| hypothetical protein OsJ_34542 [Oryza sativa Japonica Group]
          Length = 574

 Score =  503 bits (1296), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 248/389 (63%), Positives = 288/389 (74%), Gaps = 13/389 (3%)

Query: 132 QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
           Q+P V  A+  M  NM     T      + P Q MTQQATRHARRVYVGGLPP ANE  +
Sbjct: 196 QVPVVAPAISGMLPNMFNLTQTPFTPLVIQP-QAMTQQATRHARRVYVGGLPPTANEHTV 254

Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAV 251
           A +F+QVM A+GGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG  V
Sbjct: 255 AVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPV 314

Query: 252 RVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI 311
           +VRRPTDYNP+LAAALGP QP+PNLNLAAVGL  G+ GG EGPDR+FVGGLPYYFTE Q+
Sbjct: 315 KVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQV 374

Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
           +ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  VTDIACAALNG+KMGDKTLTVRRA
Sbjct: 375 RELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRA 434

Query: 372 T-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
              + Q + EQES+L   QQ   +QK+  Q  G           G    KV+CLT+ ++ 
Sbjct: 435 NQGASQPRPEQESMLLHVQQQAQMQKLMFQVGG-----------GALPTKVVCLTQVVSP 483

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
           D L DDEEYE+I++DMREE  +YG LV VVIPRPD +G    GVG+VFLE+ D      A
Sbjct: 484 DELRDDEEYEDIVQDMREEGCRYGNLVKVVIPRPDPSGAPVAGVGRVFLEFADVESSTKA 543

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           KN + GRKF  N V A +YPEDK+    Y
Sbjct: 544 KNGMHGRKFANNQVVAVFYPEDKFAEGQY 572


>gi|296088196|emb|CBI35712.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  500 bits (1287), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 248/301 (82%), Positives = 263/301 (87%), Gaps = 4/301 (1%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           G+LPGVP  VP M QNM PFGATQLGA PLMPVQ MTQQATRHARRVYVGGLPPLANEQ 
Sbjct: 105 GELPGVPQMVPGMIQNMFPFGATQLGALPLMPVQAMTQQATRHARRVYVGGLPPLANEQT 164

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR+VEEASNAMALDGI+FE   
Sbjct: 165 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIMFEACL 224

Query: 251 VRV-RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
             +   PTDYNP+LAAALGP QPSP+LNLAAVGL  G IGGAEGPDR+FVGGLPYYFTE 
Sbjct: 225 TLIFSLPTDYNPSLAAALGPSQPSPHLNLAAVGLMPGVIGGAEGPDRIFVGGLPYYFTEE 284

Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
           QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR
Sbjct: 285 QIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 344

Query: 370 RATA-SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL--FGETLAKVLCLTE 426
           RAT  SGQ+K+EQ++ILAQAQQHIAIQK+ALQ  G+N  G GM+     ET  KVLCLTE
Sbjct: 345 RATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLNLPGAGMAFTAIAETPTKVLCLTE 404

Query: 427 A 427
            
Sbjct: 405 V 405


>gi|413920210|gb|AFW60142.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
          Length = 364

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 254/357 (71%), Positives = 286/357 (80%), Gaps = 14/357 (3%)

Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
           Q MTQQATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KK
Sbjct: 19  QAMTQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKK 78

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           FAFVEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL
Sbjct: 79  FAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGL 138

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
            +G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVY
Sbjct: 139 TAGSNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVY 198

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           QD  VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q  
Sbjct: 199 QDLTVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVG 258

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
            + T             KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYG LV V+IP
Sbjct: 259 ALPT-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGNLVKVIIP 305

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           RPD +G    GVGKVFLEY D  G A AK AL GRKFGGN V A  Y EDK+ N +Y
Sbjct: 306 RPDPSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDKFANGEY 362


>gi|357160098|ref|XP_003578657.1| PREDICTED: splicing factor U2af large subunit B-like [Brachypodium
           distachyon]
          Length = 534

 Score =  494 bits (1273), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 269/480 (56%), Positives = 327/480 (68%), Gaps = 49/480 (10%)

Query: 69  HRDYNRDKERRHR--------------------HRSRSHSSDRFRNRSKS-------LSP 101
           H D NRD++R H+                     RSR+H S+R R R +         S 
Sbjct: 73  HGDRNRDRDRHHQEHRERSERREHRGRSDDHDYRRSRNHESER-RERDRDGHRRQRSRSR 131

Query: 102 SRSPSKSKRRSGFDMAPPAAAMLPGAAV-PGQLPGVPSAVPEMAQNMLPFGATQLGAFPL 160
           SRS ++SKR SGFD  P     +   +V PG LP VP+A+P M  NM       +   P 
Sbjct: 132 SRSRAQSKRVSGFDQGPSQTISIAAPSVTPGLLPAVPAAIPAMLPNMF---NIPIAGQP- 187

Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
              Q MTQQATRHARRVYVGGLPP ANEQ +A +F+ VM AIGGN+AG GDAVVNVYINH
Sbjct: 188 ---QAMTQQATRHARRVYVGGLPPSANEQTVAIYFNHVMAAIGGNAAGLGDAVVNVYINH 244

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           +KKFAFVEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+ AA LGP QP+PNLNLAA
Sbjct: 245 DKKFAFVEMRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSQAAVLGPSQPNPNLNLAA 304

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
           VGL  G+ GG EGPDR+FVGGLPYYFTE Q++ELLE+FG L GFD+VKDR+TGNSKGY F
Sbjct: 305 VGLTPGSAGGLEGPDRIFVGGLPYYFTEAQVQELLETFGPLRGFDIVKDRETGNSKGYAF 364

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMAL 399
           CVYQD AVTDIACAALNG+++GD+TLTVRRA   + + + E E+IL QAQ    ++K+  
Sbjct: 365 CVYQDLAVTDIACAALNGIQLGDRTLTVRRANQGAAEPRPEHENILLQAQHQAQMKKLVY 424

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
           +      +GG +        KV+CLT+ ++ D L +DEEY++ILEDM  E  KYG LV  
Sbjct: 425 E------VGGAIP------TKVVCLTQVVSEDDLRNDEEYKDILEDMTFEGRKYGNLVQA 472

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           VIPRP  NG    GVGKVFLEY D  G   AK  + GR+F G  V+A +YPE K+ + +Y
Sbjct: 473 VIPRPHPNGVPVAGVGKVFLEYADVDGSTNAKAGMHGRRFDGKVVDAVFYPEKKFADGEY 532


>gi|413920214|gb|AFW60146.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
          Length = 536

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 270/417 (64%), Positives = 307/417 (73%), Gaps = 32/417 (7%)

Query: 113 GFDMAPPAA--AMLPGAAVPGQLPGVPSAVPE---MAQNMLPFGATQLGAFPLMPVQVMT 167
           GFD  PP A  +  P    P QLPG  S++P    M  NMLPFG    G F   P+ +  
Sbjct: 140 GFDAPPPQAMGSTFPVIPTPSQLPG--SSLPNIGGMFPNMLPFGVA--GQF--NPLVIQP 193

Query: 168 QQAT----RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
           Q  T    RHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KK
Sbjct: 194 QAMTQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKK 253

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           FAFVEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL
Sbjct: 254 FAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGL 313

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
            +G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVY
Sbjct: 314 TAGSNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVY 373

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           QD  VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q  
Sbjct: 374 QDLTVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVG 433

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
            + T             KV+CLT+ +TAD L DDEEYE+I+EDMR E G    LV V+IP
Sbjct: 434 ALPT-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGN---LVKVIIP 477

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           RPD +G    GVGKVFLEY D  G A AK AL GRKFGGN V A  Y EDK+ N +Y
Sbjct: 478 RPDPSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDKFANGEY 534


>gi|224077136|ref|XP_002305148.1| predicted protein [Populus trichocarpa]
 gi|222848112|gb|EEE85659.1| predicted protein [Populus trichocarpa]
          Length = 296

 Score =  487 bits (1253), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 242/296 (81%), Positives = 256/296 (86%), Gaps = 3/296 (1%)

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP+LAA LGP QPSP LNLAAVGL  G I
Sbjct: 1   MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPSLAATLGPSQPSPLLNLAAVGLVPGTI 60

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            GAEGPDRVFVGGLPYYFTE QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAV
Sbjct: 61  SGAEGPDRVFVGGLPYYFTEIQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAV 120

Query: 349 TDIACAALNGLKMGDKTLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           TDIACAALNGLKMGDKTLTVRRAT S GQSK+EQE+ILAQAQQHIAIQKMALQ   MN  
Sbjct: 121 TDIACAALNGLKMGDKTLTVRRATESGGQSKSEQENILAQAQQHIAIQKMALQAGVMNLP 180

Query: 408 GGGMSLF--GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
           G G+ L     T +KVLCLTEAIT + LADDEEYEEILEDMREEC K+GTL+NVVIPRP 
Sbjct: 181 GVGIPLAESAYTPSKVLCLTEAITMEVLADDEEYEEILEDMREECCKFGTLINVVIPRPS 240

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           QN  + PG GKVFLEY D + CA A+NAL+GRKFGGNTVNAFYYPE+KY N DY A
Sbjct: 241 QNEEKMPGAGKVFLEYSDTISCANARNALNGRKFGGNTVNAFYYPEEKYSNGDYGA 296


>gi|255568277|ref|XP_002525113.1| splicing factor u2af large subunit, putative [Ricinus communis]
 gi|223535572|gb|EEF37240.1| splicing factor u2af large subunit, putative [Ricinus communis]
          Length = 549

 Score =  487 bits (1253), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 243/347 (70%), Positives = 273/347 (78%), Gaps = 21/347 (6%)

Query: 113 GFDMAPPAAAMLPGAAVP----GQLPGVPSAVPEMAQNMLPFGA-TQLGAFPLMPVQVMT 167
           GFDMAPP +AML GAA      GQ+PG   A+P M  NM P G   Q G  P+MPVQ MT
Sbjct: 189 GFDMAPPPSAMLTGAAAVAAAAGQIPGTAPAIPGMFPNMFPLGTGQQFGTLPVMPVQAMT 248

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ++ATFFS VM AIGGN+AGPGDAVVNVYINHEKKFAFV
Sbjct: 249 QQATRHARRVYVGGLPPTANEQSVATFFSHVMAAIGGNTAGPGDAVVNVYINHEKKFAFV 308

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAMALDGIIFEG  V+VRRP+DYNP+LAA LGP QP+PNLNL AVGL  G+
Sbjct: 309 EMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLGAVGLTPGS 368

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDR+TGNSKGY FCVYQD +
Sbjct: 369 AGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLS 428

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q K EQE++L  AQQ IA+Q++ LQ      
Sbjct: 429 VTDIACAALNGIKMGDKTLTVRRANQGANQPKPEQETVLLHAQQQIALQRLMLQP----- 483

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
                        KV+CLT+ +TAD L DD+EYE+ILEDMR E GK+
Sbjct: 484 ----------VPTKVVCLTQVVTADELKDDDEYEDILEDMRTEGGKF 520


>gi|413920212|gb|AFW60144.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
          Length = 502

 Score =  479 bits (1232), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 257/379 (67%), Positives = 291/379 (76%), Gaps = 24/379 (6%)

Query: 113 GFDMAPPAA--AMLPGAAVPGQLPGVPSAVPE---MAQNMLPFG-ATQLGAFPLMPVQVM 166
           GFD  PP A  +  P    P QLPG  S++P    M  NMLPFG A Q     + P Q M
Sbjct: 140 GFDAPPPQAMGSTFPVIPTPSQLPG--SSLPNIGGMFPNMLPFGVAGQFNPLVIQP-QAM 196

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
           TQQATRHARRVYVGGLPP ANEQ +A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAF
Sbjct: 197 TQQATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAF 256

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           VEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL +G
Sbjct: 257 VEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAG 316

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
           + GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD 
Sbjct: 317 SNGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDL 376

Query: 347 AVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
            VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q   + 
Sbjct: 377 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGALP 436

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
           T             KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYG LV V+IPRPD
Sbjct: 437 T-------------KVVCLTQVVTADELKDDEEYEDIMEDMRLEAGKYGNLVKVIIPRPD 483

Query: 466 QNGGETPGVGKVFLE-YYD 483
            +G    GVGKV LE YYD
Sbjct: 484 PSGQPVVGVGKVSLELYYD 502


>gi|224125466|ref|XP_002329812.1| predicted protein [Populus trichocarpa]
 gi|222870874|gb|EEF08005.1| predicted protein [Populus trichocarpa]
          Length = 321

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 235/294 (79%), Positives = 249/294 (84%), Gaps = 3/294 (1%)

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           MRTVEEASNAM LDGIIFEGVAVRVRRPTDYNP+LAA LGP QPSP LNLAAVGL  G I
Sbjct: 1   MRTVEEASNAMTLDGIIFEGVAVRVRRPTDYNPSLAATLGPSQPSPLLNLAAVGLVPGTI 60

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            GAEGPDRVFVGGLPYYFTETQI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDPAV
Sbjct: 61  SGAEGPDRVFVGGLPYYFTETQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAV 120

Query: 349 TDIACAALNGLKMGDKTLTVRRATAS-GQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           TDIACAALNGLKMGDKTLTVRR T S GQS++EQE+ILAQAQQHIAIQKMALQ   MN  
Sbjct: 121 TDIACAALNGLKMGDKTLTVRRGTESGGQSRSEQENILAQAQQHIAIQKMALQAGVMNLP 180

Query: 408 GGGMSLF--GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
           G G+ L     + +KVLCLTEAI  + LADDEEYEEILEDMREEC K+GTL+NVVIPRP 
Sbjct: 181 GVGIPLAESSHSPSKVLCLTEAIAMEVLADDEEYEEILEDMREECCKFGTLINVVIPRPS 240

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           Q   +  G GKVFLEY D   CA A+NAL+GRKFGGNTVNA YYPEDKY N DY
Sbjct: 241 QTEEQISGAGKVFLEYSDTSSCANARNALNGRKFGGNTVNASYYPEDKYHNGDY 294


>gi|302813365|ref|XP_002988368.1| hypothetical protein SELMODRAFT_235524 [Selaginella moellendorffii]
 gi|300143770|gb|EFJ10458.1| hypothetical protein SELMODRAFT_235524 [Selaginella moellendorffii]
          Length = 353

 Score =  470 bits (1209), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 237/366 (64%), Positives = 279/366 (76%), Gaps = 17/366 (4%)

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
           L AF  MP Q MTQQATRHAR+VYVGGLP L NEQ IATFF+QVM  +GGN+AGPGD VV
Sbjct: 4   LAAFA-MPPQTMTQQATRHARQVYVGGLPGLVNEQTIATFFNQVMVNVGGNTAGPGDVVV 62

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
           NVYIN EKKFAFVEMRTVEEASNAMALDGI F+GV+VRVRRP+DYNP++AA LGP QPSP
Sbjct: 63  NVYINQEKKFAFVEMRTVEEASNAMALDGISFQGVSVRVRRPSDYNPSVAANLGPSQPSP 122

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           +LNLAAVGL  GA GG +GPDR+FVGGLPYY TE QI+ELLESFG L GFDLVKDR++GN
Sbjct: 123 SLNLAAVGLTPGAGGGVDGPDRIFVGGLPYYLTEPQIRELLESFGPLRGFDLVKDRESGN 182

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKGYGFCVYQDP VTD+ACAALNGLKMGD+TLTVRRATA+GQ          Q   HI  
Sbjct: 183 SKGYGFCVYQDPNVTDVACAALNGLKMGDRTLTVRRATANGQQA-------GQDHAHILS 235

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
              +L  +G+           E   +VLCL EA+    L +DE+++EILEDMR+ECGK+G
Sbjct: 236 LAKSLTMNGV--------FPDEGATRVLCLKEAVLEAELIEDEQFDEILEDMRDECGKFG 287

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           T++++VIPRP Q   E  GVGKVF+ + D      A+ +L+GRKF G  V A YYPE+++
Sbjct: 288 TVLHLVIPRPSQ-AAEVDGVGKVFVHFEDTGAATRARISLNGRKFDGRAVVATYYPEEQF 346

Query: 515 FNKDYS 520
              D+S
Sbjct: 347 MVGDFS 352


>gi|302795921|ref|XP_002979723.1| hypothetical protein SELMODRAFT_13030 [Selaginella moellendorffii]
 gi|300152483|gb|EFJ19125.1| hypothetical protein SELMODRAFT_13030 [Selaginella moellendorffii]
          Length = 360

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 237/366 (64%), Positives = 279/366 (76%), Gaps = 17/366 (4%)

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
           L AF  MP Q MTQQATRHAR+VYVGGLP L NEQ IATFF+QVM  +GGN+AGPGD VV
Sbjct: 11  LAAFA-MPPQTMTQQATRHARQVYVGGLPGLVNEQTIATFFNQVMVNVGGNTAGPGDVVV 69

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
           NVYIN EKKFAFVEMRTVEEASNAMALDGI F+GV+VRVRRP+DYNP++AA LGP QPSP
Sbjct: 70  NVYINQEKKFAFVEMRTVEEASNAMALDGISFQGVSVRVRRPSDYNPSVAANLGPSQPSP 129

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           +LNLAAVGL  GA GG +GPDR+FVGGLPYY TE QI+ELLESFG L GFDLVKDR++GN
Sbjct: 130 SLNLAAVGLTPGAGGGVDGPDRIFVGGLPYYLTEPQIRELLESFGPLRGFDLVKDRESGN 189

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKGYGFCVYQDP VTD+ACAALNGLKMGD+TLTVRRATA+GQ          Q   HI  
Sbjct: 190 SKGYGFCVYQDPNVTDVACAALNGLKMGDRTLTVRRATANGQQA-------GQDHAHILS 242

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
              +L  +G+           E   +VLCL EA+    L +DE+++EILEDMR+ECGK+G
Sbjct: 243 LAKSLTMNGV--------FPDEGATRVLCLKEAVLEAELIEDEQFDEILEDMRDECGKFG 294

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           T++++VIPRP Q   E  GVGKVF+ + D      A+ +L+GRKF G  V A YYPE+++
Sbjct: 295 TVLHLVIPRPSQ-AAEVDGVGKVFVHFEDTGAATRARISLNGRKFDGRAVVATYYPEEQF 353

Query: 515 FNKDYS 520
              D+S
Sbjct: 354 MVGDFS 359


>gi|30690730|ref|NP_849509.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
 gi|19310597|gb|AAL85029.1| putative splicing factor [Arabidopsis thaliana]
 gi|332661289|gb|AEE86689.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
          Length = 542

 Score =  466 bits (1200), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 238/351 (67%), Positives = 274/351 (78%), Gaps = 18/351 (5%)

Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
           +R SGFDMAPPA+AML   A V GQ+P  P  +P   M  NM P    Q  G   +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
            MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           AFVEMR+VEEASNAM+LDGIIFEG  V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL 
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLT 348

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
            GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 349 PGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQ 408

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
           D +VTDIACAALNG+KMGDKTLTVRRA   +   K EQE++L  AQQ IA Q++ LQ   
Sbjct: 409 DLSVTDIACAALNGIKMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGA 468

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
           + T              V+CLT+ +T D L DDEEY +I+EDMR+E GK+G
Sbjct: 469 VAT-------------TVVCLTQVVTEDELRDDEEYGDIMEDMRQEGGKFG 506


>gi|42573197|ref|NP_974695.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
 gi|332661288|gb|AEE86688.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
          Length = 565

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 237/350 (67%), Positives = 273/350 (78%), Gaps = 18/350 (5%)

Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
           +R SGFDMAPPA+AML   A V GQ+P  P  +P   M  NM P    Q  G   +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
            MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           AFVEMR+VEEASNAM+LDGIIFEG  V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL 
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLT 348

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
            GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 349 PGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQ 408

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
           D +VTDIACAALNG+KMGDKTLTVRRA   +   K EQE++L  AQQ IA Q++ LQ   
Sbjct: 409 DLSVTDIACAALNGIKMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGA 468

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           + T              V+CLT+ +T D L DDEEY +I+EDMR+E GK+
Sbjct: 469 VAT-------------TVVCLTQVVTEDELRDDEEYGDIMEDMRQEGGKF 505


>gi|222423510|dbj|BAH19725.1| AT4G36690 [Arabidopsis thaliana]
          Length = 565

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 236/350 (67%), Positives = 273/350 (78%), Gaps = 18/350 (5%)

Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
           +R SGFDMAPPA+AML   A V GQ+P  P  +P   M  NM P    Q  G   +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
            MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           AFVEMR+VEEASNAM+LDGIIFEG  V+VRRP+DYNP+LAA LGP QPSP+LNLAAVGL 
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLNLAAVGLT 348

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
            GA GG EGPDR+FVGGLPYYFTE+Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQ
Sbjct: 349 PGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKGFDLVKDRETGNSKGYAFCVYQ 408

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
           D +VTDIACAALNG+KMGDKTLTVRRA   +   K EQE++L  AQQ IA Q++ LQ   
Sbjct: 409 DLSVTDIACAALNGIKMGDKTLTVRRANQGTMLQKPEQENVLLHAQQQIAFQRVMLQPGA 468

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           + T              V+CLT+ +T D L DDEEY +I+EDMR+E G++
Sbjct: 469 VAT-------------TVVCLTQVVTEDELRDDEEYGDIMEDMRQEGGRF 505


>gi|29367529|gb|AAO72620.1| putative U2 snRNP auxiliary factor [Oryza sativa Japonica Group]
          Length = 331

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 216/316 (68%), Positives = 250/316 (79%), Gaps = 12/316 (3%)

Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
           Q MTQQATRHARRVYVGGLPP ANE  +A +F+QVM A+GGN+AGPGDAV+NVYINH+KK
Sbjct: 22  QAMTQQATRHARRVYVGGLPPTANEHTVAVYFNQVMAAVGGNTAGPGDAVLNVYINHDKK 81

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           FAFVEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL
Sbjct: 82  FAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGL 141

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
             G+ GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVY
Sbjct: 142 TPGSAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVY 201

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           QD  VTDIACAALNG+KMGDKTLTVRRA   + Q + EQES+L   QQ   +QK+  Q  
Sbjct: 202 QDLNVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESMLLHVQQQAQMQKLMFQVG 261

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
           G           G    KV+CLT+ ++ D L DDEEYE+I++DMREE  +YG LV V+ P
Sbjct: 262 G-----------GALPTKVVCLTQVVSPDELRDDEEYEDIVQDMREEGCRYGNLVKVLNP 310

Query: 463 RPDQNGGETPGVGKVF 478
           RPD +G    G G+ F
Sbjct: 311 RPDPSGAPVAGFGRCF 326


>gi|413920348|gb|AFW60280.1| hypothetical protein ZEAMMB73_339264 [Zea mays]
          Length = 549

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 235/361 (65%), Positives = 273/361 (75%), Gaps = 21/361 (5%)

Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD APP  A  ++   A+PGQLPG+ + +P +    N+    A Q     + P Q MT
Sbjct: 194 SGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQP-QAMT 252

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 253 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 312

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+LAAALGP QP+PNLNLAAVGL  G+
Sbjct: 313 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 372

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  
Sbjct: 373 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 432

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK+  Q      
Sbjct: 433 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKLVYQ------ 486

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY---GTLVNVVIPR 463
           +GG +        KV+CLT+ +TAD L DDEEY +I+EDMREE  KY     +    I R
Sbjct: 487 VGGALP------TKVVCLTQVVTADELRDDEEYNDIVEDMREEGRKYVPHNAIAECFIVR 540

Query: 464 P 464
           P
Sbjct: 541 P 541


>gi|122245119|sp|Q2QKB3.1|U2A2A_WHEAT RecName: Full=Splicing factor U2af large subunit A; AltName:
           Full=U2 auxiliary factor 65 kDa subunit A; AltName:
           Full=U2 small nuclear ribonucleoprotein auxiliary factor
           large subunit A; Short=U2 snRNP auxiliary factor large
           subunit A
 gi|68036924|gb|AAY84881.1| U2AF large subunit [Triticum aestivum]
          Length = 591

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 249/443 (56%), Positives = 296/443 (66%), Gaps = 28/443 (6%)

Query: 29  GERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHS 88
           G R R+RHHRD + G  DR R   +  D     +        D    +    R   R   
Sbjct: 131 GSRDRERHHRDHREGSRDRER---HHRDHRERSERREHRDRSDDRDYRRSCDRDAERRDR 187

Query: 89  SDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAA-AMLPGAAVPGQLPGVPSAVPEMAQNM 147
                 R +S SP RS S+SKR SGFD  P  A  +L   A P QLP +P+A P M  NM
Sbjct: 188 DRDGHRRHRSRSPLRSESQSKRMSGFDQRPSEAIPILAPDATPSQLPELPAANPGMFPNM 247

Query: 148 LP--FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN 205
           LP       LG  PL     MTQQATRHARRVYVGGLPP+ANEQ +A FF+QVM AIGGN
Sbjct: 248 LPNLVNVPALGQ-PL----AMTQQATRHARRVYVGGLPPIANEQTVAVFFNQVMAAIGGN 302

Query: 206 SAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA 265
           +   G AVVNVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG  V+VRRPTDYNP+ AA
Sbjct: 303 TFALGHAVVNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSQAA 362

Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
           ALGP QP+PNLNLAAVGL  GA GG EGPDR+FVGGLPYYFTE Q++ELLE+FG L GFD
Sbjct: 363 ALGPSQPNPNLNLAAVGLTPGAGGGLEGPDRIFVGGLPYYFTEAQVRELLETFGPLRGFD 422

Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
           +VKD++TGNSKGY FC+Y+D  VTDIACAALNG+++GD+TLTVRRA    + + EQE+IL
Sbjct: 423 IVKDKETGNSKGYAFCLYKDGTVTDIACAALNGIQLGDRTLTVRRANQGAEPRPEQENIL 482

Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL-AKVLCLTEAITADALADDEEYEEILE 444
            QAQQ   ++++  +              G TL  KV+CLT+ ++AD L DDEEY +ILE
Sbjct: 483 LQAQQEAQMKRLVYEV-------------GRTLTTKVVCLTQVVSADDLRDDEEYNDILE 529

Query: 445 DMREECGKY---GTLVNVVIPRP 464
           DM  E  KY    T+    I RP
Sbjct: 530 DMTLEGHKYVPHSTIAESFIIRP 552


>gi|414591746|tpg|DAA42317.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
          Length = 538

 Score =  437 bits (1124), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 227/348 (65%), Positives = 265/348 (76%), Gaps = 18/348 (5%)

Query: 112 SGFDMAPPAAAM--LPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD AP   A+  +    +PGQLPGV + +P +    N+    A Q     + P Q MT
Sbjct: 190 SGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPLAIQP-QAMT 248

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 249 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 308

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAM LDGI+FEG  V++RRPTDYNP+LAAALGP QP+PNLNL+AVGL  G+
Sbjct: 309 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 368

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGL YYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  
Sbjct: 369 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 428

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK   Q      
Sbjct: 429 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKFVYQ------ 482

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
           +GG +        KV+CLT+ +T D L DDEEY++I+EDMREE  KYG
Sbjct: 483 VGGALP------TKVVCLTQVVTEDELRDDEEYDDIVEDMREEGHKYG 524


>gi|108864608|gb|ABG22562.1| U2 snRNP auxilliary factor, large subunit, splicing factor family
           protein, expressed [Oryza sativa Japonica Group]
          Length = 550

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 213/323 (65%), Positives = 247/323 (76%), Gaps = 13/323 (4%)

Query: 132 QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
           Q+P V  A+  M  NM     T      + P Q MTQQATRHARRVYVGGLPP ANE  +
Sbjct: 196 QVPVVAPAISGMLPNMFNLTQTPFTPLVIQP-QAMTQQATRHARRVYVGGLPPTANEHTV 254

Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAV 251
           A +F+QVM A+GGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAMALDGI+FEG  V
Sbjct: 255 AVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMALDGIMFEGAPV 314

Query: 252 RVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI 311
           +VRRPTDYNP+LAAALGP QP+PNLNLAAVGL  G+ GG EGPDR+FVGGLPYYFTE Q+
Sbjct: 315 KVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQV 374

Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
           +ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  VTDIACAALNG+KMGDKTLTVRRA
Sbjct: 375 RELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRA 434

Query: 372 T-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
              + Q + EQES+L   QQ   +QK+  Q  G           G    KV+CLT+ ++ 
Sbjct: 435 NQGASQPRPEQESMLLHVQQQAQMQKLMFQVGG-----------GALPTKVVCLTQVVSP 483

Query: 431 DALADDEEYEEILEDMREECGKY 453
           D L DDEEYE+I++DMREE  +Y
Sbjct: 484 DELRDDEEYEDIVQDMREEGCRY 506


>gi|414591745|tpg|DAA42316.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
          Length = 538

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 227/361 (62%), Positives = 267/361 (73%), Gaps = 28/361 (7%)

Query: 112 SGFDMAPPAAAM--LPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD AP   A+  +    +PGQLPGV + +P +    N+    A Q         Q MT
Sbjct: 190 SGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQ--------PQAMT 241

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           QQATRHARRVYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV+NVYINH+KKFAFV
Sbjct: 242 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 301

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           EMR+VEEASNAM LDGI+FEG  V++RRPTDYNP+LAAALGP QP+PNLNL+AVGL  G+
Sbjct: 302 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 361

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            GG EGPDR+FVGGL YYFTE Q++ELLESFG L GFDLVKDR+TGNSKGY FCVYQD  
Sbjct: 362 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRGFDLVKDRETGNSKGYAFCVYQDLN 421

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           VTDIACAALNG+KMGDKTLTVRRA   + Q + EQESIL QAQQ + +QK   Q      
Sbjct: 422 VTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQMQKFVYQ------ 475

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY---GTLVNVVIPR 463
           +GG +        KV+CLT+ +T D L DDEEY++I+EDMREE  KY     + +  + R
Sbjct: 476 VGGALP------TKVVCLTQVVTEDELRDDEEYDDIVEDMREEGHKYVPHNAIADCFVVR 529

Query: 464 P 464
           P
Sbjct: 530 P 530


>gi|224030681|gb|ACN34416.1| unknown [Zea mays]
          Length = 425

 Score =  417 bits (1072), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 242/379 (63%), Positives = 279/379 (73%), Gaps = 28/379 (7%)

Query: 80  HRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAM--LPGAAVPGQLPGVP 137
           HR RS S S DR R        SRS SKSKR SGFD AP   A+  +    +PGQLPGV 
Sbjct: 48  HRSRSPSMSRDRDRRSRSR---SRSRSKSKRVSGFDQAPTQQALPIVAAGVIPGQLPGVT 104

Query: 138 SAVPEMA--QNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
           + +P +    N+    A Q         Q MTQQATRHARRVYVGGLPP ANEQ +A FF
Sbjct: 105 APIPGVGVLPNLYNLAAGQ--------PQAMTQQATRHARRVYVGGLPPTANEQTVAIFF 156

Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
           + VM AIGGN+AGPGDAV+NVYINH+KKFAFVEMR+VEEASNAM LDGI+FEG  V++RR
Sbjct: 157 NGVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEEASNAMVLDGIMFEGAPVKIRR 216

Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
           PTDYNP+LAAALGP QP+PNLNL+AVGL  G+ GG EGPDR+FVGGL YYFTE Q++ELL
Sbjct: 217 PTDYNPSLAAALGPSQPNPNLNLSAVGLTPGSAGGLEGPDRIFVGGLQYYFTEAQVRELL 276

Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-AS 374
           ESFG L GFDLVKDR+TGNSKGY FCVYQD  VTDIACAALNG+KMGDKTLTVRRA   +
Sbjct: 277 ESFGPLRGFDLVKDRETGNSKGYAFCVYQDLNVTDIACAALNGIKMGDKTLTVRRANQGA 336

Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
            Q + EQESIL QAQQ + +QK   Q      +GG +        KV+CLT+ +T D L 
Sbjct: 337 SQPRPEQESILLQAQQQVQMQKFVYQ------VGGALP------TKVVCLTQVVTEDELR 384

Query: 435 DDEEYEEILEDMREECGKY 453
           DDEEY++I+EDMREE  KY
Sbjct: 385 DDEEYDDIVEDMREEGHKY 403


>gi|242069431|ref|XP_002449992.1| hypothetical protein SORBIDRAFT_05g026786 [Sorghum bicolor]
 gi|241935835|gb|EES08980.1| hypothetical protein SORBIDRAFT_05g026786 [Sorghum bicolor]
          Length = 296

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 202/292 (69%), Positives = 233/292 (79%), Gaps = 16/292 (5%)

Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
           Q MTQ ATRHARRVYVGGLPP ANEQ +A +F+Q+M AIGGN+AGPGDAV+NVYINH+KK
Sbjct: 9   QAMTQHATRHARRVYVGGLPPDANEQTVAVYFNQIMAAIGGNTAGPGDAVLNVYINHDKK 68

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           FA VEMR+VEEASNAMALDGI+FEGV V+VRRPTDYNP+LAAALGP QPSPNLNLAAVGL
Sbjct: 69  FASVEMRSVEEASNAMALDGIMFEGVPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGL 128

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
            +G+ GG E PDR+FVGGLPYYFTE Q++ELLESFG L GFDLVKD++TGNSKGY FC Y
Sbjct: 129 TAGS-GGLEDPDRIFVGGLPYYFTEAQVRELLESFGPLRGFDLVKDKETGNSKGYAFCDY 187

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           QD  VTDIACAALNG+KMGDK LTVRRA   + Q   EQESIL QAQQ + +QK+A    
Sbjct: 188 QDLTVTDIACAALNGIKMGDKILTVRRANQGASQPTPEQESILLQAQQQVQMQKLAHPVG 247

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
              T             KV+CL   ++AD L +DE YE+I++DMREE  +YG
Sbjct: 248 AAPT-------------KVVCLVHVVSADEL-EDEVYEDIMDDMREEARRYG 285


>gi|384249807|gb|EIE23288.1| hypothetical protein COCSUDRAFT_23864, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 464

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 224/429 (52%), Positives = 274/429 (63%), Gaps = 51/429 (11%)

Query: 111 RSGFDMAPPAA-----AMLPGAAVPGQLPGV----------PSAVPEMAQN-----MLPF 150
           +SGFD  PP         LP    PG +PGV           +A P    N       P 
Sbjct: 58  KSGFDQPPPGGIPPVFGGLPAGLPPG-MPGVEAIAAVAAPLAAAAPTGFSNGGFSGAPPM 116

Query: 151 GATQLGAFPLMP-VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
             TQ+G   +MP VQ  +QQATRHARRVYVGGLPP  NEQ IATFFS  + AIGG +AGP
Sbjct: 117 IGTQMGG--MMPGVQPPSQQATRHARRVYVGGLPPTGNEQNIATFFSNALAAIGGTTAGP 174

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G +VVNVYIN+EKKFAFVE RTVEE SNAMALDGI+FEGV+VRVRRP DYNP  A+ALGP
Sbjct: 175 GASVVNVYINYEKKFAFVEFRTVEETSNAMALDGIMFEGVSVRVRRPNDYNPAAASALGP 234

Query: 270 GQPSPNLNLAAVGLASG---AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
             P+PNLNLAA+GL +G   A+   +  +RVFVGGLPYY  E Q +ELL SFG +  FDL
Sbjct: 235 SVPNPNLNLAAIGLQAGGMNAVAMIDAAERVFVGGLPYYLNEEQCRELLGSFGGIKSFDL 294

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
           VKDR+TGNSKGYGF VY DP VTDIACA LNG++MG++TLTVRRA       TE +   A
Sbjct: 295 VKDRETGNSKGYGFVVYTDPNVTDIACAGLNGMRMGERTLTVRRA-------TENQGGAA 347

Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
            A    A+      T+                 ++L L EA++ D LA+DEEY +I++DM
Sbjct: 348 GAATAAALSADPFPTA----------------TRILALQEAVSLDELANDEEYVDIVQDM 391

Query: 447 REECGKYGTLVNVVIPRPDQNGGETP-GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           R+E  K+GT+++V+IPRP   G   P G+GKVF+ + +  G   +   + GR+FGG TV 
Sbjct: 392 RDEASKFGTVIDVLIPRPAPEGQPPPSGLGKVFINFAEKEGAVNSFRVMHGRRFGGRTVV 451

Query: 506 AFYYPEDKY 514
           A Y  E  Y
Sbjct: 452 ASYVQEADY 460


>gi|359497129|ref|XP_003635431.1| PREDICTED: splicing factor U2af large subunit B-like, partial
           [Vitis vinifera]
          Length = 238

 Score =  367 bits (942), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 181/238 (76%), Positives = 199/238 (83%), Gaps = 3/238 (1%)

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
            IGGAEGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP
Sbjct: 1   VIGGAEGPDRIFVGGLPYYFTEEQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDP 60

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATA-SGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
           AVTDIACAALNGLKMGDKTLTVRRAT  SGQ+K+EQ++ILAQAQQHIAIQK+ALQ  G+N
Sbjct: 61  AVTDIACAALNGLKMGDKTLTVRRATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLN 120

Query: 406 TLGGGMSL--FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
             G GM+     ET  KVLCLTE I  D L DDE YEEILEDMR+E GK+G LV+VVIPR
Sbjct: 121 LPGAGMAFTAIAETPTKVLCLTEVINIDELRDDEAYEEILEDMRDEGGKFGALVHVVIPR 180

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           P  NG   PGVGKVFLEY D  G ++A+NALSGRKFGGN V+A YYPEDKY++ DY A
Sbjct: 181 PSPNGDLIPGVGKVFLEYSDTAGSSSARNALSGRKFGGNVVSAVYYPEDKYYDGDYGA 238


>gi|297736736|emb|CBI25913.3| unnamed protein product [Vitis vinifera]
          Length = 6467

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 181/238 (76%), Positives = 199/238 (83%), Gaps = 3/238 (1%)

Query: 287  AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
             IGGAEGPDR+FVGGLPYYFTE QI+ELLESFG L GFDLVKDRDTGNSKGYGFCVYQDP
Sbjct: 6230 VIGGAEGPDRIFVGGLPYYFTEEQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDP 6289

Query: 347  AVTDIACAALNGLKMGDKTLTVRRATA-SGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
            AVTDIACAALNGLKMGDKTLTVRRAT  SGQ+K+EQ++ILAQAQQHIAIQK+ALQ  G+N
Sbjct: 6290 AVTDIACAALNGLKMGDKTLTVRRATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLN 6349

Query: 406  TLGGGMSL--FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
              G GM+     ET  KVLCLTE I  D L DDE YEEILEDMR+E GK+G LV+VVIPR
Sbjct: 6350 LPGAGMAFTAIAETPTKVLCLTEVINIDELRDDEAYEEILEDMRDEGGKFGALVHVVIPR 6409

Query: 464  PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
            P  NG   PGVGKVFLEY D  G ++A+NALSGRKFGGN V+A YYPEDKY++ DY A
Sbjct: 6410 PSPNGDLIPGVGKVFLEYSDTAGSSSARNALSGRKFGGNVVSAVYYPEDKYYDGDYGA 6467


>gi|159473054|ref|XP_001694654.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
           reinhardtii]
 gi|158276466|gb|EDP02238.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
           reinhardtii]
          Length = 306

 Score =  358 bits (919), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 178/302 (58%), Positives = 222/302 (73%), Gaps = 12/302 (3%)

Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFA 225
           ++QQATRHARR+YVGGLPP A EQ+I++FFS  + AIGGN+AGPG+AVVNVYIN EK FA
Sbjct: 4   VSQQATRHARRIYVGGLPPTATEQSISSFFSHALAAIGGNTAGPGNAVVNVYINREKNFA 63

Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLAS 285
           FVE+RTVEE SN+MALDGI+FEGV+VRVRRP DYNP  A +LGP  P+P LNLAA+GL  
Sbjct: 64  FVELRTVEETSNSMALDGIMFEGVSVRVRRPNDYNPAAAVSLGPSTPNPALNLAAIGLNP 123

Query: 286 GAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
                 + PDR+FVGGLPYY TE Q +ELL SFG +  FDLVKDRDTGNSKGYGF VYQD
Sbjct: 124 N-----DNPDRIFVGGLPYYLTEDQCRELLGSFGAIKSFDLVKDRDTGNSKGYGFVVYQD 178

Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
            +VTDIACA LNGLKMGD+TLTVRRAT          ++       +A        +G+ 
Sbjct: 179 TSVTDIACAGLNGLKMGDRTLTVRRATEGAPGGGAAPAMGPAGLGGLAGLGGLNPLAGV- 237

Query: 406 TLGGGM---SLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
              GG+    L   T  +++ LT+A++A+ + DD+EY++ILEDM++E  ++G   NV+IP
Sbjct: 238 ---GGVVVNPLGLATATRIVVLTDAVSAEEIIDDQEYQDILEDMKDEASRHGLCNNVLIP 294

Query: 463 RP 464
           RP
Sbjct: 295 RP 296


>gi|302846543|ref|XP_002954808.1| splicing factor U2AF, large subunit [Volvox carteri f. nagariensis]
 gi|300259991|gb|EFJ44214.1| splicing factor U2AF, large subunit [Volvox carteri f. nagariensis]
          Length = 532

 Score =  349 bits (895), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 184/356 (51%), Positives = 235/356 (66%), Gaps = 14/356 (3%)

Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGD-AVVNVYINHEKKF 224
           ++QQATRHARR+YVGGLPP A EQ+I++FFS  + AIGGN+AGPG     +  I   +  
Sbjct: 187 VSQQATRHARRIYVGGLPPTATEQSISSFFSHALAAIGGNTAGPGGFPFHSTSITSPQSI 246

Query: 225 AFVEMRT-VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
               +R  +EE SNAMALDGI+FEGV+VRVRRP DYNP  AA+LGP  P+PNLNLAA+GL
Sbjct: 247 RSSILREFIEETSNAMALDGIMFEGVSVRVRRPNDYNPAAAASLGPSTPNPNLNLAAIGL 306

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
           ++ A GGA+  DR+FVGGLPYY TE Q +ELL SFG +  FDLVKDR+TGNSKGYGF VY
Sbjct: 307 SNAAGGGADQADRIFVGGLPYYLTEEQCRELLGSFGPIKSFDLVKDRETGNSKGYGFVVY 366

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
           QD AVTDIACA LNGL+MGD+TLTVRRAT    S +   +  + A     +   AL    
Sbjct: 367 QDSAVTDIACAGLNGLRMGDRTLTVRRATEGAPSASGAGAAASTALGPPGLVPAALAN-- 424

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
                      G  +   + L   ++A+ + D+ EYE+IL DM++E  ++G   NV+IPR
Sbjct: 425 ----------LGVGVGVGVGLNPLVSAEEIVDNTEYEDILADMKDEASRHGLCNNVLIPR 474

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           P       PG+ KV +E+ D      A+NA+ GR+F G  VNA Y  E+ YF+  Y
Sbjct: 475 PTAENPNPPGMCKVIMEFNDVNSAVKARNAMHGRRFAGRVVNATYLTEEAYFSGRY 530


>gi|308801273|ref|XP_003077950.1| U2 snRNP auxiliary factor, large subunit (ISS) [Ostreococcus tauri]
 gi|116056401|emb|CAL52690.1| U2 snRNP auxiliary factor, large subunit (ISS) [Ostreococcus tauri]
          Length = 388

 Score =  345 bits (885), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 175/362 (48%), Positives = 239/362 (66%), Gaps = 30/362 (8%)

Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFA 225
           +T Q TRHARR+Y+GG P +ANEQ +++FF+  + A+GG ++     VVNVYIN EKKFA
Sbjct: 49  ITAQTTRHARRIYLGGCPTMANEQELSSFFNDALVAVGGTTSEEA-PVVNVYINLEKKFA 107

Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLAS 285
           FVE R+VEE SNA+ALDG++ +G  VR+RRP DYNP +A  LGP  P+P LNL A+GL  
Sbjct: 108 FVEFRSVEECSNALALDGVMIQGEPVRIRRPNDYNPQIAQGLGPSTPNPKLNLQAIGLDP 167

Query: 286 GAIGGA-------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            A+  +       E P+R+F+GGLPYY  E Q++ELLE+FG +  FDLV+D++ GNSKGY
Sbjct: 168 SALARSATTNILQEDPNRIFIGGLPYYLEEPQVRELLEAFGPIARFDLVRDKENGNSKGY 227

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
           GF VYQD AVTDIAC  LNG++MG+KTLTVRRA    Q +T+    L   Q  +      
Sbjct: 228 GFVVYQDAAVTDIACQGLNGMQMGEKTLTVRRAE---QGRTD----LIGGQVSVPPPPAI 280

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTE-AITADALADDEEYEEILEDMREECGKYGTLV 457
              +                ++V+  T   IT + LADDEE+E I+EDM EECGKYG ++
Sbjct: 281 APAN--------------PPSEVVSFTNMGITEEELADDEEFENIMEDMNEECGKYGKII 326

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
           +VVIPRP ++G    G+GKVF+ Y        A++AL+GR+FGGN+V A +   + + ++
Sbjct: 327 SVVIPRPSKSGESVTGIGKVFVRYESVEDATKARDALNGRRFGGNSVVADFIDIESFASQ 386

Query: 518 DY 519
            +
Sbjct: 387 TF 388


>gi|302813497|ref|XP_002988434.1| hypothetical protein SELMODRAFT_24180 [Selaginella moellendorffii]
 gi|300143836|gb|EFJ10524.1| hypothetical protein SELMODRAFT_24180 [Selaginella moellendorffii]
          Length = 339

 Score =  337 bits (864), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 189/374 (50%), Positives = 234/374 (62%), Gaps = 38/374 (10%)

Query: 147 MLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
           +LP G  QL     MP      Q T+ ARRVYVGGLP + +E  IATFF+  M  I GN+
Sbjct: 3   VLPAGIAQLPVVLRMP---QMPQITKPARRVYVGGLPAVVDEARIATFFNHAMAVIEGNT 59

Query: 207 AGPG-DAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA 265
            G G DAVV+V+I+H K +AFVEMR+VEEASNAMALDGIIFEG  VR+RRP++YNP  A 
Sbjct: 60  YGQGGDAVVSVFIDHAKNYAFVEMRSVEEASNAMALDGIIFEGSQVRIRRPSNYNPEHAM 119

Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
             G  QPSP+L L  VGL   A   A+GPDR+F+GGLPY + + ++++LLE FG L   D
Sbjct: 120 LFGSSQPSPSLRLDKVGLVYRA--HADGPDRIFIGGLPYEWGDAEVRQLLEPFGALRALD 177

Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
           +VKD  T  SKGYGF VY++PA TD ACAALN   +  K L V RAT S  S      +L
Sbjct: 178 IVKDSYTRKSKGYGFAVYENPASTDAACAALNQKPLEGKILRVHRATNS--SGNPALVLL 235

Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILED 445
            Q+ +                LG           +V+CL  A++ + L D++EY EI+ED
Sbjct: 236 PQSSE----------------LG----------TRVVCLCNAVSEEMLRDEKEYAEIIED 269

Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           M+EECGKYG LV+V IPR    G   PG+GKVFLEY D V    A++ L GR F   TV 
Sbjct: 270 MKEECGKYGPLVSVEIPR----GDGAPGLGKVFLEYKDLVSALKARHGLQGRSFDKRTVQ 325

Query: 506 AFYYPEDKYFNKDY 519
           A YYPEDK+  KDY
Sbjct: 326 ATYYPEDKFSAKDY 339


>gi|255073589|ref|XP_002500469.1| RNA binding protein [Micromonas sp. RCC299]
 gi|226515732|gb|ACO61727.1| RNA binding protein [Micromonas sp. RCC299]
          Length = 489

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 188/376 (50%), Positives = 246/376 (65%), Gaps = 22/376 (5%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG-PGDAVVNVYINHEKK 223
           V  QQATRHARRVYVG LP    E  +A FF+  M AIGG  A  PGD V+NVYIN+EKK
Sbjct: 115 VPNQQATRHARRVYVGNLPGTVTEPKVAAFFNNAMHAIGGTVAALPGDPVLNVYINYEKK 174

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           FAFVE RTVEE SN MALDG + EG+A+RVRRP DYN   A++LGP QP   LNL A+GL
Sbjct: 175 FAFVEFRTVEETSNCMALDGAVLEGIAMRVRRPNDYNVMAASSLGPSQPKDGLNLEAIGL 234

Query: 284 ------------ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
                       A+ ++   +   R+F+GGLPY+ TET +KEL+E+FG    F LV DR+
Sbjct: 235 NPAAAGGGGAGAANASLTEEDLQHRLFIGGLPYFLTETMVKELVEAFGPTKQFQLVVDRE 294

Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
           TGNSKGYGF VYQD +VTD+AC  L+G+KMG+K+LTV+RA   G +   + +  +    H
Sbjct: 295 TGNSKGYGFFVYQDHSVTDVACQGLHGMKMGEKSLTVQRAMQGG-AGAPKPTAASVGPGH 353

Query: 392 IAI---QKMALQTSGMNTLGGGMSL----FGETLAKVLCLTEAITADALADDEEYEEILE 444
            A+    ++A   +G +    G+S+         ++V+ LTE +  + L DD EY EI+E
Sbjct: 354 TALPGADEVAAHLAGASGAPAGLSVPPPPSEHPASRVVSLTEMLDVEELRDDVEYGEIME 413

Query: 445 DMREECGKYGTLVNVVIPRP-DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNT 503
           DMREECGK+G + ++VIPRP D +G   PG+GKVF+ Y D  G A A+NAL GRKFGGN 
Sbjct: 414 DMREECGKFGRIESIVIPRPGDADGAAVPGLGKVFVRYEDDAGAAAARNALHGRKFGGNV 473

Query: 504 VNAFYYPEDKYFNKDY 519
           V A +  E  + ++ +
Sbjct: 474 VKADFIDETVFASRAF 489


>gi|302796203|ref|XP_002979864.1| hypothetical protein SELMODRAFT_153568 [Selaginella moellendorffii]
 gi|300152624|gb|EFJ19266.1| hypothetical protein SELMODRAFT_153568 [Selaginella moellendorffii]
          Length = 325

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 185/363 (50%), Positives = 229/363 (63%), Gaps = 41/363 (11%)

Query: 158 FPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPG-DAVVNV 216
            P MP      Q T+ ARRVYVGGLP + +E  IATFF+  M  I GN+ G G DAVV+V
Sbjct: 1   MPQMP------QITKPARRVYVGGLPAVVDEARIATFFNHAMAVIEGNTYGQGGDAVVSV 54

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
           +I+H K +AFVEMR+VEEASNAMALDGIIFEG  VR+RRP++YNP  A   G  QPSP+L
Sbjct: 55  FIDHAKNYAFVEMRSVEEASNAMALDGIIFEGSQVRIRRPSNYNPEHAMLFGSSQPSPSL 114

Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
            L  VGL   A   A+GPDR+F+GGLPY + + ++++LLE FG L   D+VKD  T  SK
Sbjct: 115 RLDKVGLVYRA--HADGPDRIFIGGLPYEWGDAEVRQLLEPFGALRALDIVKDSYTRKSK 172

Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
           GYGF VY++PA TD ACAALN   +  K L V RAT S  S      +L Q+ +      
Sbjct: 173 GYGFAVYENPASTDAACAALNQKPLEGKILRVHRATNS--SGNPALVLLPQSSE------ 224

Query: 397 MALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
                     LG           +V+CL  A++ + L D++EY EI+EDM+EECGKYG L
Sbjct: 225 ----------LG----------TRVVCLCNAVSEEMLRDEKEYAEIIEDMKEECGKYGPL 264

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           V+V IPR    G   PG+GKVFLEY D V    A++ L GR F   TV A YYPEDK+  
Sbjct: 265 VSVEIPR----GDGAPGLGKVFLEYKDLVSALKARHGLQGRSFDKRTVQATYYPEDKFSA 320

Query: 517 KDY 519
           KDY
Sbjct: 321 KDY 323


>gi|303273844|ref|XP_003056274.1| RNA binding protein [Micromonas pusilla CCMP1545]
 gi|226462358|gb|EEH59650.1| RNA binding protein [Micromonas pusilla CCMP1545]
          Length = 564

 Score =  325 bits (832), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 192/409 (46%), Positives = 245/409 (59%), Gaps = 55/409 (13%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG---NSAGPG-DAVVNVYINHEK 222
           + QATRHARR+YVGGLP  ANE + ATFFS  + AIGG    +A  G + V+NVY+NHEK
Sbjct: 156 STQATRHARRIYVGGLPATANEASTATFFSNALAAIGGVVQTAAAAGVEPVLNVYMNHEK 215

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
           KFAFVE RTVEE SNA+ALDG++F+GV++RVRRP DYN  +AA LGP  PS +L+LAA+G
Sbjct: 216 KFAFVEFRTVEETSNAIALDGVVFDGVSLRVRRPNDYNAAIAATLGPSTPSTDLDLAAIG 275

Query: 283 LASGA------------------IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
           L  GA                  +   +  +R+FVGGLPY+ TE  +KEL+E+FG    F
Sbjct: 276 LVPGAGGAAGGAGAGGAAGGQNNLSPEDTANRLFVGGLPYFLTEPMVKELVEAFGPTKHF 335

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
            LV DR+TGNSKGYGF VYQD AVTD+AC  L+G+KMG+KTLTVRRAT +G       + 
Sbjct: 336 MLVMDRETGNSKGYGFFVYQDHAVTDVACQGLHGMKMGEKTLTVRRATGAGGGAAGATNA 395

Query: 385 --------------LAQAQQHIA-IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA-- 427
                         +A+A+      ++ A    G    G   SL    L+   CLT+   
Sbjct: 396 SGLAGGAGGAPSTRVAEAEDATTGGERAARAPRGAMPRGAXASLTARFLSCRCCLTDPPP 455

Query: 428 --------------ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE--T 471
                         +  + L DD EY EI EDMREECGK+G ++ V IPRP   GG+   
Sbjct: 456 PSNPPSRVLSLNDMLDVEDLRDDVEYGEITEDMREECGKHGVVLEVRIPRPAAAGGDEIV 515

Query: 472 PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           PG+GKVF++Y +  G   A+ AL GRKFGG  V A +  E  +    YS
Sbjct: 516 PGLGKVFVQYEEVAGAEAARKALHGRKFGGQIVVADFVDEAAFAEGRYS 564


>gi|145344032|ref|XP_001416543.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576769|gb|ABO94836.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 291

 Score =  315 bits (806), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 165/306 (53%), Positives = 201/306 (65%), Gaps = 23/306 (7%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
           T QATRHARR+YVGG+P   NE  +  FF+  + A+GG +   G  VVNVYIN EKKFAF
Sbjct: 1   TAQATRHARRIYVGGIPLTTNEADVNAFFNNALLAVGGTNGAEGQPVVNVYINVEKKFAF 60

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           VE R+VEEASNA+ALDGI+ +GV VR+RRP DYNP+LA  LGP  P+P LNLAA+GL   
Sbjct: 61  VEFRSVEEASNALALDGIVLDGVPVRIRRPNDYNPSLAHDLGPSMPNPALNLAAIGLDPS 120

Query: 287 A-----IGGA---EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
           A     +GG    E  DR+F+GGLPY+  E QI+ELLE+FG +  FDLV+D++TGNSKGY
Sbjct: 121 ALQRAGVGGNLLHEHEDRIFIGGLPYFLDEAQIRELLEAFGPIRQFDLVRDKETGNSKGY 180

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
           GF VY+D +VTDIAC  LNG+ MGDKTLTVRRA  S      Q  ++       AI    
Sbjct: 181 GFVVYEDVSVTDIACQGLNGMTMGDKTLTVRRAEQSNAPGGVQPGMMNVPPPPPAIAAPP 240

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
                       M L               T + LADDEEYE I+EDM+EECGK+G +V+
Sbjct: 241 TNPPSTVVSFDNMGL---------------TEEELADDEEYENIMEDMQEECGKHGEIVS 285

Query: 459 VVIPRP 464
           VVIPRP
Sbjct: 286 VVIPRP 291


>gi|242069429|ref|XP_002449991.1| hypothetical protein SORBIDRAFT_05g026783 [Sorghum bicolor]
 gi|241935834|gb|EES08979.1| hypothetical protein SORBIDRAFT_05g026783 [Sorghum bicolor]
          Length = 249

 Score =  311 bits (796), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 153/228 (67%), Positives = 178/228 (78%), Gaps = 2/228 (0%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P    Q+MT++AT   RRVYVG LPP ANEQ I  FF+QVM  IGGN+AGPGDAV  + +
Sbjct: 11  PHTTTQLMTREATLFTRRVYVGDLPPSANEQTIGVFFNQVMAVIGGNTAGPGDAVCGICM 70

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           NHE++FA VE R  EEASNAMALDGI+FEGV V+VRRP DYN + AAA+GP QPS  LNL
Sbjct: 71  NHEQRFALVEFRMAEEASNAMALDGILFEGVPVKVRRPADYNLSQAAAMGPTQPSRKLNL 130

Query: 279 AAVGLASG-AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           AAVGL +G A GG+E PDR+FVGGLPYY++E Q+++LLE  G L GF+LVKDR+TGNSKG
Sbjct: 131 AAVGLTAGSAGGGSEDPDRIFVGGLPYYYSEAQVRDLLECIGPLRGFELVKDRETGNSKG 190

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG-QSKTEQESI 384
           Y FCVY D   TDIACA LNG+KMGDK LTVRRA  S  Q + EQESI
Sbjct: 191 YAFCVYMDTTATDIACADLNGIKMGDKILTVRRANQSASQPRPEQESI 238


>gi|348510223|ref|XP_003442645.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
           [Oreochromis niloticus]
          Length = 467

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 183/484 (37%), Positives = 260/484 (53%), Gaps = 44/484 (9%)

Query: 46  DRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSP 105
           DR R+++ K  R G     R D+HR +++D+  R R +       + R+R  S    +  
Sbjct: 18  DRERERHKKRSRSG--SPGRGDKHRSWSKDRGSRSREKRSRSRDRKSRDRRSSSRDHKKH 75

Query: 106 SKSKRRSG-------FDMAPPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPFGATQLG 156
           S S RR+        +D+ PP    +     P Q   + +A  +P +A  +L    T   
Sbjct: 76  SHSPRRTRKKRTCKYWDVPPPGFEHI----TPMQYKAMQAAGQIPTIA--LLATSTTTGV 129

Query: 157 AFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
           A     V ++  Q TR ARR+YVG +P    E+++A FF+  M  + G S  P + V+ V
Sbjct: 130 AAAPTQVPIVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAV 188

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
            IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P    +  P    P  
Sbjct: 189 QINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHVP-- 246

Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
                G+ S  +   + P ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SK
Sbjct: 247 -----GVVSTVV--PDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSK 299

Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQ 395
           GY FC Y D + TD A A LNG+++GDK L V+RA+   ++      I       +  +Q
Sbjct: 300 GYAFCEYVDISATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPTSIIETPVTLQVPGLQ 359

Query: 396 KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
           +  LQ SGM T             +VLCL   +  + L DDE+YEEILED+REEC KYGT
Sbjct: 360 R--LQNSGMPT-------------EVLCLLNMVMPEELVDDEDYEEILEDIREECCKYGT 404

Query: 456 LVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           + ++ IPRP  +G E PG GK+F+EY  A  C  A  AL+GRKF    V   YY  D Y 
Sbjct: 405 VRSIEIPRP-VDGVEVPGCGKIFVEYVSAADCQKAMQALTGRKFANRVVVTKYYDPDMYH 463

Query: 516 NKDY 519
             ++
Sbjct: 464 RHEF 467


>gi|412990165|emb|CCO19483.1| predicted protein [Bathycoccus prasinos]
          Length = 495

 Score =  283 bits (725), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 171/388 (44%), Positives = 221/388 (56%), Gaps = 52/388 (13%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG----NSAGPGDAVVNVYINHEKKF 224
           QATRHARRVYVGG PP  +E  +A FF+  + A+GG     + G  + VVNVY+NHEK F
Sbjct: 121 QATRHARRVYVGGFPPNVSEVRVADFFNNALMAVGGIAETQTEGNANPVVNVYMNHEKHF 180

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           AFVE R  EE SN MALD I F+   +RVRRP DYN   A  LGP  P+  +NL A+GL+
Sbjct: 181 AFVEFRNAEETSNCMALDSISFDSSQLRVRRPNDYNQPAAMKLGPIVPNIKMNLEAIGLS 240

Query: 285 SGAI-------------GGAEGP-------DRVFVGGLPYYFTETQIKELLESFGTLHGF 324
           +  +             G A G        DRVFVGGLPY+ TE QI+ELLE+FG +  F
Sbjct: 241 NEVLQRMQSGVASGQNNGNANGSNVADPNEDRVFVGGLPYFLTEAQIRELLEAFGPITRF 300

Query: 325 DLVKDRDTGNSKGYGFCVYQD-PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
           DLV+DRDTG SKGYGF VY+D PA+TDIA   L+G++MGDK LTVRRA A+       E 
Sbjct: 301 DLVRDRDTGGSKGYGFVVYRDGPAITDIAIQGLHGMQMGDKQLTVRRANAT------LER 354

Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLF------------GETLAKVLCLTE-AITA 430
           +  + +  +  Q+   Q  G    GG  + F             E   + L L    I  
Sbjct: 355 MQQEQRAALQQQQQQHQLLGN---GGAAAQFLPQSAAPAAPEVDENATECLVLKNMGIKD 411

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
           + L D EEYE I+ED +EEC K+G ++ + IP+P      +   G VF+ +  A     A
Sbjct: 412 EELNDPEEYEIIVEDTQEECEKFGKVLGMKIPKP-----PSKSAGVVFVRFETAESARKA 466

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + +L+GRKF GN V+A Y   + Y   D
Sbjct: 467 RKSLNGRKFAGNIVSAQYDSIETYEKHD 494


>gi|452823555|gb|EME30564.1| U2 snRNP auxiliary factor large subunit, putative isoform 1
           [Galdieria sulphuraria]
          Length = 522

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 156/371 (42%), Positives = 220/371 (59%), Gaps = 34/371 (9%)

Query: 148 LPFGATQLGAFPLMPV-QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
           L F A      P+ P  Q  TQQAT+HARR+YVG LP    E  +A FF+  +    G  
Sbjct: 181 LDFSALSQYMIPVAPTTQPNTQQATKHARRLYVGNLPSDVTESEVADFFNSALYLAKGVD 240

Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
             PGD V +VY+N +K+FAF+E+ +  EA+ A+ +DG++F G+++R+RRP DYNP + A 
Sbjct: 241 V-PGDPVQSVYLNLDKRFAFIELNSAAEAAAAIQMDGVLFRGMSLRMRRPNDYNPNIHA- 298

Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
             P  P    + +A+G+ S  +   +GPD+VF+GGLPY+ TE QIKE+L S+G L+ F+L
Sbjct: 299 --PVYPPIGFDPSALGVVSTQV--PDGPDKVFIGGLPYHLTEDQIKEILSSYGPLNAFNL 354

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
           VKD +TG SKGY F  Y+DP++ + A   LNG+ MGDKTLTVRRA+              
Sbjct: 355 VKDPNTGLSKGYAFFQYKDPSIVEAAIKGLNGMTMGDKTLTVRRASQV------------ 402

Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
                         +SG   LG   S       ++L L   +  + L DDEEYE+I+ED+
Sbjct: 403 --------------SSGSVELGQSFSPTVRYPTRILELRNMVEPEELVDDEEYEDIIEDV 448

Query: 447 REECGKYGTLVNVVIPRPDQ-NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           REE  KYG +  V IPRP + +    PG+GKVF+ +        A  AL+GR+FGG +V 
Sbjct: 449 REESSKYGEVTEVKIPRPSKTDEANPPGLGKVFVSFKTVSDAEKAFAALTGRRFGGKSVI 508

Query: 506 AFYYPEDKYFN 516
           A YY E++Y++
Sbjct: 509 ANYYDEERYYS 519


>gi|348510219|ref|XP_003442643.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
           [Oreochromis niloticus]
          Length = 466

 Score =  281 bits (718), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 177/465 (38%), Positives = 251/465 (53%), Gaps = 42/465 (9%)

Query: 65  RTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG-------FDMA 117
           R D+HR +++D+  R R +       + R+R  S    +  S S RR+        +D+ 
Sbjct: 34  RGDKHRSWSKDRGSRSREKRSRSRDRKSRDRRSSSRDHKKHSHSPRRTRKKRTCKYWDVP 93

Query: 118 PPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHAR 175
           PP    +     P Q   + +A  +P +A  +L    T   A     V ++  Q TR AR
Sbjct: 94  PPGFEHI----TPMQYKAMQAAGQIPTIA--LLATSTTTGVAAAPTQVPIVGSQMTRQAR 147

Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEA 235
           R+YVG +P    E+++A FF+  M  + G S  P + V+ V IN +K FAF+E R+V+E 
Sbjct: 148 RLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDKNFAFLEFRSVDET 206

Query: 236 SNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
           + AMA DGIIF+G ++++RRP DY P    +  P    P       G+ S  +   + P 
Sbjct: 207 TQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHVP-------GVVSTVV--PDSPH 257

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY FC Y D + TD A A 
Sbjct: 258 KLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCEYVDISATDQAVAG 317

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQTSGMNTLGGGMSLF 414
           LNG+++GDK L V+RA+   ++      I       +  +Q+  LQ SGM T        
Sbjct: 318 LNGMQLGDKKLIVQRASVGAKNANPTSIIETPVTLQVPGLQR--LQNSGMPT-------- 367

Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
                +VLCL   +  + L DDE+YEEILED+REEC KYGT+ ++ IPRP  +G E PG 
Sbjct: 368 -----EVLCLLNMVMPEELVDDEDYEEILEDIREECCKYGTVRSIEIPRP-VDGVEVPGC 421

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           GK+F+EY  A  C  A  AL+GRKF    V   YY  D Y   ++
Sbjct: 422 GKIFVEYVSAADCQKAMQALTGRKFANRVVVTKYYDPDMYHRHEF 466


>gi|147902896|ref|NP_001080595.1| U2 small nuclear RNA auxiliary factor 2 [Xenopus laevis]
 gi|111185517|gb|AAH44032.2| U2af2 protein [Xenopus laevis]
          Length = 456

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 165/418 (39%), Positives = 234/418 (55%), Gaps = 37/418 (8%)

Query: 107 KSKRRSGFDMAPPAAAML-----PGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
           K K R  +D+ PP    +           GQ+P   + +P M  + L    T        
Sbjct: 70  KKKIRKYWDIPPPGFEHITPLQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT-------- 120

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQD 179

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVP 232

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC
Sbjct: 233 GVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFC 290

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L +
Sbjct: 291 EYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMS 346

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           S +  +GG  +       +VLCL   +  + L DD+EYEEI+ED+R+ECGKYG + ++ I
Sbjct: 347 SQVQ-MGGHPT-------EVLCLMNMVVPEELIDDDEYEEIVEDVRDECGKYGAVKSIEI 398

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 399 PRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDGYHRRDF 455


>gi|348510221|ref|XP_003442644.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
           [Oreochromis niloticus]
          Length = 467

 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 176/467 (37%), Positives = 250/467 (53%), Gaps = 45/467 (9%)

Query: 65  RTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG-------FDMA 117
           R D+HR +++D+  R R +       + R+R  S    +  S S RR+        +D+ 
Sbjct: 34  RGDKHRSWSKDRGSRSREKRSRSRDRKSRDRRSSSRDHKKHSHSPRRTRKKRTCKYWDVP 93

Query: 118 PPAAAMLPGAAVPGQLPGVPSA--VPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHAR 175
           PP    +     P Q   + +A  +P +A  +L    T   A     V ++  Q TR AR
Sbjct: 94  PPGFEHI----TPMQYKAMQAAGQIPTIA--LLATSTTTGVAAAPTQVPIVGSQMTRQAR 147

Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEA 235
           R+YVG +P    E+++A FF+  M  + G S  P + V+ V IN +K FAF+E R+V+E 
Sbjct: 148 RLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDKNFAFLEFRSVDET 206

Query: 236 SNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
           + AMA DGIIF+G ++++RRP DY P    +  P    P       G+ S  +   + P 
Sbjct: 207 TQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHVP-------GVVSTVV--PDSPH 257

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY FC Y D + TD A A 
Sbjct: 258 KLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCEYVDISATDQAVAG 317

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK---MALQTSGMNTLGGGMS 412
           LNG+++GDK L V+RA+   ++      +       + +Q      LQ SGM T      
Sbjct: 318 LNGMQLGDKKLIVQRASVGAKNAN---PVSTSGNTPVTLQVPGLQRLQNSGMPT------ 368

Query: 413 LFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                  +VLCL   +  + L DDE+YEEILED+REEC KYGT+ ++ IPRP  +G E P
Sbjct: 369 -------EVLCLLNMVMPEELVDDEDYEEILEDIREECCKYGTVRSIEIPRP-VDGVEVP 420

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           G GK+F+EY  A  C  A  AL+GRKF    V   YY  D Y   ++
Sbjct: 421 GCGKIFVEYVSAADCQKAMQALTGRKFANRVVVTKYYDPDMYHRHEF 467


>gi|47575746|ref|NP_001001217.1| U2 small nuclear RNA auxiliary factor 2 isoform 2 [Xenopus
           (Silurana) tropicalis]
 gi|45709722|gb|AAH67966.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2
           [Xenopus (Silurana) tropicalis]
          Length = 456

 Score =  278 bits (710), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 165/413 (39%), Positives = 233/413 (56%), Gaps = 27/413 (6%)

Query: 107 KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVM 166
           K K R  +D+ PP    +     P Q   + +A    A  +LP       A    PV V+
Sbjct: 70  KKKIRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVV 125

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
             Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K FAF
Sbjct: 126 GSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAF 184

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           +E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G+ S 
Sbjct: 185 LEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVST 237

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
            +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D 
Sbjct: 238 VV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDI 295

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
            VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L +S +  
Sbjct: 296 NVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMSSQVQ- 350

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
           +GG  +       +VLCL   +  + L DD+EYEEI+ED+R+ECGKYG + ++ IPRP  
Sbjct: 351 MGGHPT-------EVLCLMNMVLPEELLDDDEYEEIVEDVRDECGKYGAVKSIEIPRP-V 402

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 403 DGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDGYHRRDF 455


>gi|159487587|ref|XP_001701804.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
           reinhardtii]
 gi|158281023|gb|EDP06779.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
           reinhardtii]
          Length = 309

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 145/328 (44%), Positives = 208/328 (63%), Gaps = 20/328 (6%)

Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
           FF+Q+M A G  +  PG  V++ ++N++K+FAF+EMR VEE SNAMA DGI  +G  ++V
Sbjct: 1   FFNQIMMASGATTQ-PGPPVMSCFMNNDKRFAFLEMRCVEETSNAMAFDGIQCQGEVLKV 59

Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
           RRP DYNP  A  LGP  PSP +NLA +G+ +  +   +GP++V++GGLP   +E Q+++
Sbjct: 60  RRPHDYNPAAAKLLGPTDPSPKVNLALLGVINTLV--EDGPNKVYIGGLPACLSEEQVRQ 117

Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           +L++FGTL  F+LV DR+TGNSKGYGFC Y DP+VTD A   L+ L +  K LT RRA  
Sbjct: 118 ILQAFGTLKAFNLVLDRETGNSKGYGFCEYADPSVTDSAIQGLSALIIQGKPLTARRANT 177

Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADAL 433
           S ++    ++++ Q Q  +         S  +  GGG +        V+ L++ ++ D L
Sbjct: 178 SAETSLTLQTLIEQQQAAL--------VSTTSPAGGGHT--------VVRLSKMVSRDDL 221

Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG-GETPGVGKVFLEYYDAVGCATAKN 492
            DD EY ++L+D+ EE GKYG LV V IPRP   G  + PGVG VFL Y D VG   A+ 
Sbjct: 222 LDDGEYADLLDDITEEVGKYGKLVGVEIPRPGAAGAADPPGVGLVFLCYEDTVGAKRAQV 281

Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           AL GR+FG N   A +Y   ++  +D++
Sbjct: 282 ALKGRQFGANVAEATFYDRARFDARDFA 309


>gi|452823554|gb|EME30563.1| U2 snRNP auxiliary factor large subunit, putative isoform 2
           [Galdieria sulphuraria]
          Length = 538

 Score =  275 bits (702), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 157/386 (40%), Positives = 219/386 (56%), Gaps = 48/386 (12%)

Query: 148 LPFGATQLGAFPLMPV-QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
           L F A      P+ P  Q  TQQAT+HARR+YVG LP    E  +A FF+  +    G  
Sbjct: 181 LDFSALSQYMIPVAPTTQPNTQQATKHARRLYVGNLPSDVTESEVADFFNSALYLAKGVD 240

Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
             PGD V +VY+N +K+FAF+E+ +  EA+ A+ +DG++F G+++R+RRP DYNP + A 
Sbjct: 241 V-PGDPVQSVYLNLDKRFAFIELNSAAEAAAAIQMDGVLFRGMSLRMRRPNDYNPNIHAP 299

Query: 267 LGPGQPSPNLNLAA----------VGLASGAIGGA-----EGPDRVFVGGLPYYFTETQI 311
           + P    P   L            +G    A+G       +GPD+VF+GGLPY+ TE QI
Sbjct: 300 VYP----PVCQLLTCFLGYIEKFQIGFDPSALGVVSTQVPDGPDKVFIGGLPYHLTEDQI 355

Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
           KE+L S+G L+ F+LVKD +TG SKGY F  Y+DP++ + A   LNG+ MGDKTLTVRRA
Sbjct: 356 KEILSSYGPLNAFNLVKDPNTGLSKGYAFFQYKDPSIVEAAIKGLNGMTMGDKTLTVRRA 415

Query: 372 TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITAD 431
           +                            +SG   LG   S       ++L L   +  +
Sbjct: 416 SQV--------------------------SSGSVELGQSFSPTVRYPTRILELRNMVEPE 449

Query: 432 ALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ-NGGETPGVGKVFLEYYDAVGCATA 490
            L DDEEYE+I+ED+REE  KYG +  V IPRP + +    PG+GKVF+ +        A
Sbjct: 450 ELVDDEEYEDIIEDVREESSKYGEVTEVKIPRPSKTDEANPPGLGKVFVSFKTVSDAEKA 509

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFN 516
             AL+GR+FGG +V A YY E++Y++
Sbjct: 510 FAALTGRRFGGKSVIANYYDEERYYS 535


>gi|45387787|ref|NP_991252.1| U2 small nuclear RNA auxiliary factor 2b [Danio rerio]
 gi|41389016|gb|AAH65869.1| U2 small nuclear RNA auxiliary factor 2b [Danio rerio]
          Length = 475

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 171/447 (38%), Positives = 243/447 (54%), Gaps = 36/447 (8%)

Query: 76  KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
           KERRHR   RS  +      + S SP R   K K +  +D+ PP    +     P Q   
Sbjct: 61  KERRHR---RSDHTQNHPQENVSRSPHRE-KKKKIKKYWDVPPPGFEHI----TPMQYKA 112

Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
           + +A    A  +LP       A    PV V+  Q TR ARR+YVG +P    E+++  FF
Sbjct: 113 MQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFF 172

Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
           +  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+  ++++RR
Sbjct: 173 NAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQAQSLKIRR 231

Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIK 312
           P DY P       PG  S N ++   G+ S  +     PD   ++F+GGLP Y  + Q+K
Sbjct: 232 PHDYQPL------PGM-SENPSVYVPGVVSTVV-----PDSIHKLFIGGLPNYLNDDQVK 279

Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
           ELL SFG L  F+LVKD  TG SKGY FC Y D  V D A A LNG+++ DK L V+RA+
Sbjct: 280 ELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNVNDQAIAGLNGMQLADKKLLVQRAS 339

Query: 373 ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
              ++ T    + +  +  + +Q   L ++ M  +GG          +VLCL   +  + 
Sbjct: 340 VGAKNAT----MTSINETPVTLQVPGLTSNPMIQMGG-------IPTEVLCLMNMVAPEE 388

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
           L DDEEYEEI+ED++EEC KYG + ++ IPRP  +G + PG GK+F+E+        A  
Sbjct: 389 LIDDEEYEEIVEDVKEECSKYGQVKSIEIPRP-VDGLDIPGTGKIFVEFTSVYDSQKAMQ 447

Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            L+GRKF    V   Y   D Y  +D+
Sbjct: 448 GLTGRKFANRVVVTKYCDPDAYHRRDF 474


>gi|410903107|ref|XP_003965035.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
           [Takifugu rubripes]
          Length = 461

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/358 (42%), Positives = 209/358 (58%), Gaps = 27/358 (7%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+  Q TR ARR+YVG +P    E+++A FF+  M  + G S  P + V+ V IN +K
Sbjct: 130 VPVVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDK 188

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P    +  P    P       G
Sbjct: 189 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVP-------G 241

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +   + P ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY FC 
Sbjct: 242 VVSTVV--PDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 299

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQT 401
           Y D + TD A A LNG+++GDK L V+RA+   ++      I A     +  +Q+  LQ 
Sbjct: 300 YVDVSATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPSAIIEAPVTLQVPGLQR--LQN 357

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           SGM T             +VLCL   +  + L DD++YEEILED+REEC KYG++ ++ I
Sbjct: 358 SGMPT-------------EVLCLLNMVMPEELVDDDDYEEILEDVREECCKYGSVRSIEI 404

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PRP  +G + PG GK+F+EY  A  C  A  AL+GRKF    V   YY  D Y   ++
Sbjct: 405 PRP-VDGVDVPGCGKIFVEYVSASDCQKAMQALTGRKFANRVVVTKYYDLDLYHRHEF 461


>gi|410903105|ref|XP_003965034.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
           [Takifugu rubripes]
          Length = 454

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/358 (42%), Positives = 209/358 (58%), Gaps = 27/358 (7%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+  Q TR ARR+YVG +P    E+++A FF+  M  + G S  P + V+ V IN +K
Sbjct: 123 VPVVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDK 181

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P    +  P    P       G
Sbjct: 182 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVP-------G 234

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +   + P ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY FC 
Sbjct: 235 VVSTVV--PDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 292

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQT 401
           Y D + TD A A LNG+++GDK L V+RA+   ++      I A     +  +Q+  LQ 
Sbjct: 293 YVDVSATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPSAIIEAPVTLQVPGLQR--LQN 350

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           SGM T             +VLCL   +  + L DD++YEEILED+REEC KYG++ ++ I
Sbjct: 351 SGMPT-------------EVLCLLNMVMPEELVDDDDYEEILEDVREECCKYGSVRSIEI 397

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PRP  +G + PG GK+F+EY  A  C  A  AL+GRKF    V   YY  D Y   ++
Sbjct: 398 PRP-VDGVDVPGCGKIFVEYVSASDCQKAMQALTGRKFANRVVVTKYYDLDLYHRHEF 454


>gi|410903103|ref|XP_003965033.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
           [Takifugu rubripes]
          Length = 446

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/358 (42%), Positives = 209/358 (58%), Gaps = 27/358 (7%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+  Q TR ARR+YVG +P    E+++A FF+  M  + G S  P + V+ V IN +K
Sbjct: 115 VPVVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDK 173

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P    +  P    P       G
Sbjct: 174 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVP-------G 226

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +   + P ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY FC 
Sbjct: 227 VVSTVV--PDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 284

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQT 401
           Y D + TD A A LNG+++GDK L V+RA+   ++      I A     +  +Q+  LQ 
Sbjct: 285 YVDVSATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPSAIIEAPVTLQVPGLQR--LQN 342

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           SGM T             +VLCL   +  + L DD++YEEILED+REEC KYG++ ++ I
Sbjct: 343 SGMPT-------------EVLCLLNMVMPEELVDDDDYEEILEDVREECCKYGSVRSIEI 389

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PRP  +G + PG GK+F+EY  A  C  A  AL+GRKF    V   YY  D Y   ++
Sbjct: 390 PRP-VDGVDVPGCGKIFVEYVSASDCQKAMQALTGRKFANRVVVTKYYDLDLYHRHEF 446


>gi|389610875|dbj|BAM19048.1| U2 small nuclear riboprotein auxiliary factor 50 [Papilio polytes]
          Length = 422

 Score =  272 bits (695), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 150/361 (41%), Positives = 212/361 (58%), Gaps = 21/361 (5%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+    FF+Q M  + G +   G+ V+   I
Sbjct: 83  PQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQM-HLSGLAQAAGNPVLACQI 141

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG  +P +N+
Sbjct: 142 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGTENPAINV 195

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            A G+ S  +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKGY
Sbjct: 196 PA-GVISTVV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGY 252

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            F  Y D ++TD A A LNG+++GDK L V+RA+   ++ T    I    Q       + 
Sbjct: 253 AFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTLGVYI----QSMTGAAPVT 308

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           LQ +G+   G G +       +VLCL   +T D L D+EEYE+ILED++EEC KYG + +
Sbjct: 309 LQVAGLTLAGAGPA------TEVLCLLNMVTPDELRDEEEYEDILEDIKEECNKYGCVRS 362

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP + G + PG GKVF+E+     C  A+  L+GRKF    V   YY  DKY  ++
Sbjct: 363 IEIPRPLE-GVDVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYYDPDKYHRRE 421

Query: 519 Y 519
           +
Sbjct: 422 F 422


>gi|325191172|emb|CCA25959.1| splicing factor U2af large subunit putative [Albugo laibachii Nc14]
          Length = 553

 Score =  272 bits (695), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 153/359 (42%), Positives = 218/359 (60%), Gaps = 16/359 (4%)

Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFA 225
           M QQ TRHARR+YVGG+  + NE  I+ FF+ V+    G     G AVV+VYIN E+ FA
Sbjct: 207 MAQQ-TRHARRIYVGGIGEV-NETEISAFFNDVIDRALGERQ-EGGAVVSVYINRERHFA 263

Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA-LGPGQPSPNLNLAAVGLA 284
           FVE++++E  +  M LDGI F G  ++VRRP DYNP L    LGP    P LNLAA+G+ 
Sbjct: 264 FVELKSIELTTACMNLDGIAFRGQPLKVRRPNDYNPGLVPKDLGP---IPALNLAALGIV 320

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
           S  +   +GP +VF+GG+PY+ TE QIKELL++FG L  F LVKD  T  SKGY FC Y 
Sbjct: 321 STTV--QDGPGKVFIGGIPYHLTEEQIKELLQAFGPLKSFHLVKDLTTNLSKGYAFCEYM 378

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE---SILAQAQQHIAIQKMALQT 401
           D  VTD AC  LN +K+GD+TLTVRRA +   +K       ++ A  +  +   + A+QT
Sbjct: 379 DSGVTDAACIGLNDMKLGDRTLTVRRALSQESAKVIANAAGTVNAGVEMGLDPSRAAMQT 438

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
             M   G  +   G + ++VL L   +T + L D++EY +I++D+R EC +YG +  +++
Sbjct: 439 ISM--AGVHLGPIG-SPSRVLVLRNMVTPEELEDEDEYRDIMDDIRSECERYGRVTTIIL 495

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           PR  +  G+   +GKV++E+ D      A N L GR F    V+A Y  E ++  ++ +
Sbjct: 496 PRAKEGYGDE-ALGKVYIEFGDISTSQAAANELHGRGFANRVVSAQYMEEAQFERRELT 553


>gi|114052735|ref|NP_001040494.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 [Bombyx mori]
 gi|95103122|gb|ABF51502.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 1
           [Bombyx mori]
          Length = 417

 Score =  271 bits (692), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 149/361 (41%), Positives = 211/361 (58%), Gaps = 26/361 (7%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+    FF+Q M  + G +   G+ V+   I
Sbjct: 83  PQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQM-HLSGLAQAAGNPVLACQI 141

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG  +P +N+
Sbjct: 142 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGTENPAINV 195

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            A G+ S  +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKGY
Sbjct: 196 PA-GVISTVV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGY 252

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            F  Y D ++TD A A LNG+++GDK L V+RA+   ++ T   +  A  Q         
Sbjct: 253 AFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTLALTGAAPVQ--------- 303

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           +Q +G+   G G         +VLCL   +T D L D+EEYE+ILED++EEC KYG + +
Sbjct: 304 IQVAGLTLAGAGPP------TEVLCLLNMVTPDELRDEEEYEDILEDIKEECNKYGVVRS 357

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP + G E PG GKVF+E+     C  A+  L+GRKF    V   Y+  DKY  ++
Sbjct: 358 IEIPRPIE-GVEVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYFDPDKYHRRE 416

Query: 519 Y 519
           +
Sbjct: 417 F 417


>gi|357623461|gb|EHJ74600.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 [Danaus
           plexippus]
          Length = 350

 Score =  270 bits (691), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 150/361 (41%), Positives = 212/361 (58%), Gaps = 26/361 (7%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+    FF+Q M  + G +   G+ V+   I
Sbjct: 16  PQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQM-HLSGLAQAAGNPVLACQI 74

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG  +P +N+
Sbjct: 75  NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGTENPAINV 128

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            A G+ S  +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKGY
Sbjct: 129 PA-GVISTVV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGY 185

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            F  Y D ++TD A A LNG+++GDK L V+RA+   ++ T     LA          + 
Sbjct: 186 AFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNST-----LAMT----GAAPVT 236

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           LQ +G+   G G +       +VLCL   +T D L D+EEYE+ILED++EEC KYG + +
Sbjct: 237 LQVAGLTLAGAGPA------TEVLCLLNMVTPDELRDEEEYEDILEDIKEECNKYGCVRS 290

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP + G E PG GKVF+E+     C  A+  L+GRKF    V   Y+  DKY  ++
Sbjct: 291 IEIPRPIE-GVEVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYFDPDKYHRRE 349

Query: 519 Y 519
           +
Sbjct: 350 F 350


>gi|62859443|ref|NP_001016998.1| U2 small nuclear RNA auxiliary factor 2 isoform 1 [Xenopus
           (Silurana) tropicalis]
 gi|89269799|emb|CAJ83531.1| U2 (RNU2) small nuclear RNA auxiliary factor 2 [Xenopus (Silurana)
           tropicalis]
 gi|115292148|gb|AAI22001.1| U2 small nuclear RNA auxiliary factor 2 [Xenopus (Silurana)
           tropicalis]
          Length = 465

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 165/422 (39%), Positives = 233/422 (55%), Gaps = 36/422 (8%)

Query: 107 KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVM 166
           K K R  +D+ PP    +     P Q   + +A    A  +LP       A    PV V+
Sbjct: 70  KKKIRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVV 125

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
             Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K FAF
Sbjct: 126 GSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAF 184

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           +E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G+ S 
Sbjct: 185 LEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVST 237

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQI---------KELLESFGTLHGFDLVKDRDTGNSKG 337
            +   +   ++F+GGLP Y  + Q+         KELL SFG L  F+LVKD  TG SKG
Sbjct: 238 VV--PDSAHKLFIGGLPNYLNDDQVTMESLSLWVKELLTSFGPLKAFNLVKDSATGLSKG 295

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q  
Sbjct: 296 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVP 351

Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
            L +S +  +GG  +       +VLCL   +  + L DD+EYEEI+ED+R+ECGKYG + 
Sbjct: 352 GLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDDEYEEIVEDVRDECGKYGAVK 403

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
           ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +
Sbjct: 404 SIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDGYHRR 462

Query: 518 DY 519
           D+
Sbjct: 463 DF 464


>gi|289741197|gb|ADD19346.1| splicing factor U2AF large subunit [Glossina morsitans morsitans]
          Length = 423

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 155/391 (39%), Positives = 217/391 (55%), Gaps = 29/391 (7%)

Query: 134 PGVPSAVPEMAQNMLPFGATQLGAFPLMP---VQVMTQQATRHARRVYVGGLPPLANEQA 190
           PG     P   + M   G     A P +P   V V+    TR ARR+YVG +P    E  
Sbjct: 57  PGFEHITPLQYKAMQAAGQIPANALPEIPQAAVPVVGSTITRQARRLYVGNIPFGVTEDE 116

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M   G   A  G+ V+   IN +K FAF+E R+ +E + AMA DGI F+G +
Sbjct: 117 MMEFFNQQMHLTGLAQAA-GNPVLACQINLDKNFAFLEFRSTDETTQAMAFDGISFKGQS 175

Query: 251 VRVRRPTDYNPTLAAALGPG--QPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTE 308
           +++RRP DY P       PG  + +P     A G+ S  +   + P ++F+GGLP Y  E
Sbjct: 176 LKIRRPHDYQPM------PGVVESTPVAQPVANGVISAVV--PDSPHKIFIGGLPNYLNE 227

Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
            Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD A A LNG+++GDK L V
Sbjct: 228 DQVKELLLSFGQLRAFNLVKDAATGLSKGYAFCEYIDHSITDQAIAGLNGMQLGDKKLIV 287

Query: 369 RRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
           +RA+   ++             H     + +Q  G++ +G           +VLCL   +
Sbjct: 288 QRASVGAKNAQ---------NNHTTAAPVMIQVPGLSMVG-----ISGPPTEVLCLLNMV 333

Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA 488
           T D L D+EEYE+ILED++EEC KYG + +V IPRP + G + PG GKVF+E+   + C 
Sbjct: 334 TPDELRDEEEYEDILEDIKEECNKYGVVRSVEIPRPIE-GVDVPGCGKVFVEFNSVMDCQ 392

Query: 489 TAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 393 KAQQALTGRKFSDRVVVTSYFDPDKYHRREF 423


>gi|195167317|ref|XP_002024480.1| GL15893 [Drosophila persimilis]
 gi|198469588|ref|XP_001355063.2| GA22177 [Drosophila pseudoobscura pseudoobscura]
 gi|194107878|gb|EDW29921.1| GL15893 [Drosophila persimilis]
 gi|198146942|gb|EAL32119.2| GA22177 [Drosophila pseudoobscura pseudoobscura]
          Length = 418

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 155/392 (39%), Positives = 222/392 (56%), Gaps = 44/392 (11%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 68  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 110

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G +
Sbjct: 111 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 169

Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
           +++RRP DY P       PG   +P +  A V  +SG I     + P ++F+GGLP Y  
Sbjct: 170 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 221

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L 
Sbjct: 222 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 281

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
           V+RA+   ++     +         + Q + LQ  G++T+     +      +VLCL   
Sbjct: 282 VQRASVGAKNAQNASN---------SSQSVMLQVPGLSTV-----VSSGPPTEVLCLLNM 327

Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
           +T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + C
Sbjct: 328 VTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDC 386

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 387 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 418


>gi|194770152|ref|XP_001967161.1| GF19596 [Drosophila ananassae]
 gi|190619281|gb|EDV34805.1| GF19596 [Drosophila ananassae]
          Length = 416

 Score =  268 bits (686), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 155/392 (39%), Positives = 221/392 (56%), Gaps = 44/392 (11%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 66  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G +
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 167

Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
           +++RRP DY P       PG   +P +  A V  +SG I     + P ++F+GGLP Y  
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L 
Sbjct: 220 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
           V+RA+   ++     +           Q + LQ  G++T+     +      +VLCL   
Sbjct: 280 VQRASVGAKNAQNASN---------TTQSVMLQVPGLSTV-----VTSGPPTEVLCLLNM 325

Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
           +T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + C
Sbjct: 326 VTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDC 384

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 385 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416


>gi|195448282|ref|XP_002071589.1| GK10063 [Drosophila willistoni]
 gi|194167674|gb|EDW82575.1| GK10063 [Drosophila willistoni]
          Length = 416

 Score =  268 bits (685), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 155/392 (39%), Positives = 221/392 (56%), Gaps = 44/392 (11%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 66  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G +
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 167

Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
           +++RRP DY P       PG   +P +  A V  +SG I     + P ++F+GGLP Y  
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L 
Sbjct: 220 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
           V+RA+   ++     +           Q + LQ  G++T+     +      +VLCL   
Sbjct: 280 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSTV-----VTSGPPTEVLCLLNM 325

Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
           +T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + C
Sbjct: 326 VTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDC 384

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 385 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416


>gi|291241059|ref|XP_002740425.1| PREDICTED: U2 (RNU2) small nuclear RNA auxiliary factor 2-like
           [Saccoglossus kowalevskii]
          Length = 466

 Score =  268 bits (684), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 168/421 (39%), Positives = 240/421 (57%), Gaps = 27/421 (6%)

Query: 101 PSRS--PSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
           PSRS  P + K    +D+ PP    +     P Q   +  A  ++ Q  L     Q  A 
Sbjct: 71  PSRSARPKRKKPFMYWDIPPPGFEHI----APLQYKAMQGA-GQIPQTALENQMAQAAAN 125

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
             MP+  +  Q TR ARR+YVG +P    E+A+  FF++ M +     A  G+ V+ V I
Sbjct: 126 SNMPI--VGSQMTRQARRLYVGNIPFGVTEEAMMDFFNRQMKSFRITQAQ-GNPVLAVQI 182

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           +  K FAF+E R+V+E + AMA DGI+F+G ++++RRP DY P    A  P    P+   
Sbjct: 183 DLNKNFAFLEFRSVDETTQAMAFDGILFQGQSLKIRRPKDYQPVPGMAEMPSVHVPDYLF 242

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
           +  G+ S  +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  T  SKGY
Sbjct: 243 SPTGVVSTVV--PDSPHKIFIGGLPNYLNEDQVKELLTSFGELKAFNLVKDSATSLSKGY 300

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            FC Y D  +TD A A LNG+++GDK L V+RA+   ++        AQ  Q IA  ++ 
Sbjct: 301 AFCEYIDEKITDQAIAGLNGMQLGDKKLIVQRASVGAKN--------AQTAQMIA--QLN 350

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           +Q  G+N    G  L G T  +VLCL   +T D L D+EEYEEIL+D+R+ECGKYG + +
Sbjct: 351 IQVPGVNI---GQGLVGPT-TEVLCLMNMVTPDELQDEEEYEEILDDVRQECGKYGQVRS 406

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP + G E PG GK+F+E+ + +    A+ AL+GRKF    V   +   DKY  ++
Sbjct: 407 LEIPRPIE-GVEVPGCGKIFVEFTNVLESQKAQTALAGRKFNNRIVVTSFCDPDKYHRRE 465

Query: 519 Y 519
           +
Sbjct: 466 F 466


>gi|255089803|ref|XP_002506823.1| predicted protein [Micromonas sp. RCC299]
 gi|226522096|gb|ACO68081.1| predicted protein [Micromonas sp. RCC299]
          Length = 554

 Score =  268 bits (684), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 163/358 (45%), Positives = 209/358 (58%), Gaps = 23/358 (6%)

Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG--PGD---AVVNVYINH 220
           +  QATRHARRVYVGG P   NE  +A+F +  + AIGG S    P +    V++VYIN 
Sbjct: 209 INVQATRHARRVYVGGFPDNTNEPELASFIANALVAIGGASGAYDPDNGMTCVLSVYINR 268

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           +K FAFVE RTVEEASNAMALDG++  G  +RVRRP DY P  AA +GP  P+ +LNLAA
Sbjct: 269 DKLFAFVEFRTVEEASNAMALDGVVMAGSQLRVRRPNDYQPQQAALIGPTTPADSLNLAA 328

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
           VGL  G  G + G  +++VG LP Y TE Q+ ELL+SFG +  F+LV D+DTG  KGYGF
Sbjct: 329 VGLIPGVNGQSSG-RKLYVGNLPPYLTELQVLELLQSFGAVQAFNLVVDKDTGTLKGYGF 387

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
             Y D A  + A   L G+++GDK L V+RA   G            A           Q
Sbjct: 388 FEYADAAADEAAMEGLTGMRLGDKVLNVKRAAYDGGVGQGVGQASGSA-----------Q 436

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
             G     G +   GE+ ++ + LT  +T + L D  E  EILED +EEC  +G L  VV
Sbjct: 437 APGFAP--GSLPANGESASECVRLTNMVTREELTDPTEAREILEDTQEECAGFGELTRVV 494

Query: 461 IPRPDQNGGETP-GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE---DKY 514
           +P P +   E P GVG+VFL + DA G A A  +L+GRKF    V+A +      DKY
Sbjct: 495 MPLPRRTRLEDPAGVGEVFLLFADAEGAARAVRSLNGRKFADRVVSAGFITRAEFDKY 552


>gi|386764548|ref|NP_001245708.1| U2 small nuclear riboprotein auxiliary factor 50, isoform B
           [Drosophila melanogaster]
 gi|383293438|gb|AFH07421.1| U2 small nuclear riboprotein auxiliary factor 50, isoform B
           [Drosophila melanogaster]
          Length = 427

 Score =  268 bits (684), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 156/393 (39%), Positives = 220/393 (55%), Gaps = 46/393 (11%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 77  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 119

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G +
Sbjct: 120 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 178

Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
           +++RRP DY P       PG   +P +  A V  +SG I     + P ++F+GGLP Y  
Sbjct: 179 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 230

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L 
Sbjct: 231 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 290

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM-NTLGGGMSLFGETLAKVLCLTE 426
           V+RA+   ++     +           Q + LQ  G+ N +  G         +VLCL  
Sbjct: 291 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSNVVTSGPP------TEVLCLLN 335

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
            +T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + 
Sbjct: 336 MVTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLD 394

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           C  A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 395 CQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 427


>gi|17136764|ref|NP_476891.1| U2 small nuclear riboprotein auxiliary factor 50, isoform A
           [Drosophila melanogaster]
 gi|386764552|ref|NP_001245710.1| U2 small nuclear riboprotein auxiliary factor 50, isoform D
           [Drosophila melanogaster]
 gi|195351420|ref|XP_002042232.1| GM13406 [Drosophila sechellia]
 gi|195479195|ref|XP_002100800.1| GE15975 [Drosophila yakuba]
 gi|195555160|ref|XP_002077042.1| GD24494 [Drosophila simulans]
 gi|4033485|sp|Q24562.1|U2AF2_DROME RecName: Full=Splicing factor U2AF 50 kDa subunit; AltName: Full=U2
           auxiliary factor 50 kDa subunit; AltName: Full=U2 snRNP
           auxiliary factor large subunit
 gi|349761|gb|AAA03548.1| RNA binding protein [Drosophila melanogaster]
 gi|7293214|gb|AAF48596.1| U2 small nuclear riboprotein auxiliary factor 50, isoform A
           [Drosophila melanogaster]
 gi|17861976|gb|AAL39465.1| LD03714p [Drosophila melanogaster]
 gi|194124075|gb|EDW46118.1| GM13406 [Drosophila sechellia]
 gi|194188324|gb|EDX01908.1| GE15975 [Drosophila yakuba]
 gi|194203060|gb|EDX16636.1| GD24494 [Drosophila simulans]
 gi|220943258|gb|ACL84172.1| U2af50-PA [synthetic construct]
 gi|220953438|gb|ACL89262.1| U2af50-PA [synthetic construct]
 gi|383293440|gb|AFH07423.1| U2 small nuclear riboprotein auxiliary factor 50, isoform D
           [Drosophila melanogaster]
          Length = 416

 Score =  267 bits (683), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 156/393 (39%), Positives = 220/393 (55%), Gaps = 46/393 (11%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 66  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G +
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 167

Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
           +++RRP DY P       PG   +P +  A V  +SG I     + P ++F+GGLP Y  
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L 
Sbjct: 220 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM-NTLGGGMSLFGETLAKVLCLTE 426
           V+RA+   ++     +           Q + LQ  G+ N +  G         +VLCL  
Sbjct: 280 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSNVVTSGPP------TEVLCLLN 324

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
            +T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + 
Sbjct: 325 MVTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLD 383

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           C  A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 384 CQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416


>gi|194893848|ref|XP_001977952.1| GG19328 [Drosophila erecta]
 gi|190649601|gb|EDV46879.1| GG19328 [Drosophila erecta]
          Length = 416

 Score =  267 bits (683), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 156/393 (39%), Positives = 220/393 (55%), Gaps = 46/393 (11%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 66  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G +
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 167

Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
           +++RRP DY P       PG   +P +  A V  +SG I     + P ++F+GGLP Y  
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L 
Sbjct: 220 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM-NTLGGGMSLFGETLAKVLCLTE 426
           V+RA+   ++     +           Q + LQ  G+ N +  G         +VLCL  
Sbjct: 280 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSNVVTSGPP------TEVLCLLN 324

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
            +T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + 
Sbjct: 325 MVTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLD 383

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           C  A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 384 CQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416


>gi|386764550|ref|NP_001245709.1| U2 small nuclear riboprotein auxiliary factor 50, isoform C
           [Drosophila melanogaster]
 gi|383293439|gb|AFH07422.1| U2 small nuclear riboprotein auxiliary factor 50, isoform C
           [Drosophila melanogaster]
          Length = 360

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 156/393 (39%), Positives = 220/393 (55%), Gaps = 46/393 (11%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 10  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 52

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G +
Sbjct: 53  MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQS 111

Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
           +++RRP DY P       PG   +P +  A V  +SG I     + P ++F+GGLP Y  
Sbjct: 112 LKIRRPHDYQPM------PGITDTPAIKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 163

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L 
Sbjct: 164 DDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 223

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM-NTLGGGMSLFGETLAKVLCLTE 426
           V+RA+   ++     +           Q + LQ  G+ N +  G         +VLCL  
Sbjct: 224 VQRASVGAKNAQNAAN---------TTQSVMLQVPGLSNVVTSGPP------TEVLCLLN 268

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
            +T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + 
Sbjct: 269 MVTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLD 327

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           C  A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 328 CQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 360


>gi|328721670|ref|XP_001951521.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 1
           [Acyrthosiphon pisum]
          Length = 416

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 160/430 (37%), Positives = 227/430 (52%), Gaps = 50/430 (11%)

Query: 95  RSKSLSPSRSPSKSKRRSGFDMAPP-----AAAMLPGAAVPGQLPGVPSAVPEMAQNMLP 149
           RS+S SP     + K    +D+ PP     A          GQ+P   + +P+  Q  +P
Sbjct: 32  RSRSKSPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIPA--NTMPDTPQTAVP 89

Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
                          V+    TR ARR+YVG +P    E  +  FF+Q M  + G +   
Sbjct: 90  ---------------VVGSTITRQARRLYVGNIPFGVTEDEMMEFFNQQM-HLSGLAQAA 133

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G+ V+   IN +K FAF+E R+++E + AMA DGI F+G ++++RRP DY PT    +  
Sbjct: 134 GNPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPT--PGMTE 191

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
             P  N N       SG       P ++F+GGLP Y  + Q+KELL SFG L  F+LVKD
Sbjct: 192 SNPVTNYN-------SGMTLDMNSPHKIFIGGLPAYLNDEQVKELLTSFGQLKAFNLVKD 244

Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
             TG SKGY FC Y D  +TD A A LNG+++G+K L V+RA+   ++       L Q  
Sbjct: 245 AATGLSKGYAFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASIGAKNPG-----LGQV- 298

Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
                  + +Q  G+  +G           +VLCL   +T D L D+EEYE+ILED+REE
Sbjct: 299 ------PVTIQVPGLTVVGT-----AGPPTEVLCLLNMVTPDELKDEEEYEDILEDIREE 347

Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
           C KYG + ++ IPRP + G + PG GKVF+E+   V C  A+ AL+GRKF    V   + 
Sbjct: 348 CNKYGVVRSLEIPRPIE-GIDVPGCGKVFIEFNAIVDCQKAQQALAGRKFNNRVVVTSFM 406

Query: 510 PEDKYFNKDY 519
             DKY  +++
Sbjct: 407 EPDKYHRREF 416


>gi|195042782|ref|XP_001991497.1| GH12033 [Drosophila grimshawi]
 gi|195134983|ref|XP_002011915.1| GI14308 [Drosophila mojavensis]
 gi|193901255|gb|EDW00122.1| GH12033 [Drosophila grimshawi]
 gi|193909169|gb|EDW08036.1| GI14308 [Drosophila mojavensis]
          Length = 416

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 157/392 (40%), Positives = 223/392 (56%), Gaps = 44/392 (11%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 66  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G  
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQD 167

Query: 251 VRVRRPTDYNPTLAAALGPG-QPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFT 307
           +++RRP DY P       PG   +P +  A V  +SG I     + P ++F+GGLP Y  
Sbjct: 168 LKIRRPHDYQPM------PGITDTPAVKPAVV--SSGVISTVVPDSPHKIFIGGLPNYLN 219

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L 
Sbjct: 220 DEQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLI 279

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
           V+RA+   ++        AQ   + + Q + LQ  G++T+     +      +VLCL   
Sbjct: 280 VQRASVGAKN--------AQNAANTS-QSVMLQVPGLSTV-----VTSGPPTEVLCLLNM 325

Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
           +T D L D+EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + C
Sbjct: 326 VTPDELRDEEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDC 384

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 385 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 416


>gi|156404394|ref|XP_001640392.1| predicted protein [Nematostella vectensis]
 gi|156227526|gb|EDO48329.1| predicted protein [Nematostella vectensis]
          Length = 332

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 149/350 (42%), Positives = 204/350 (58%), Gaps = 21/350 (6%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TR ARR+YVG +P    E  +  FF+  M     N+A PG+ V+   IN E+ FAF+E+R
Sbjct: 2   TRQARRLYVGNIPFGVTENLMIEFFNAKMKEAKLNTA-PGNPVIAAQINTEQNFAFIELR 60

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +VEE + AMA DGII +G A+++RRP DY P       PG  S N ++   G+ S  +  
Sbjct: 61  SVEETTQAMAFDGIILQGQALKIRRPKDYQPI------PGM-SENASVHVPGVVSTVV-- 111

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  +TD
Sbjct: 112 PDSPHKIFIGGLPNYLNEDQVKELLSSFGELRAFNLVKDSATGLSKGYAFCEYVDLGITD 171

Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
           +A   LNG+++GDK L V+RA+   +          QA   +  Q   LQ  G++     
Sbjct: 172 VAIQGLNGMQLGDKKLIVQRASVGAKQNLNN----PQAMNMVPAQ---LQIPGLDI---S 221

Query: 411 MSLFGETLA-KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
           M++ G   A +VL L   +T D L DDEE+EEI +D+REEC KYG + ++ IPRP +   
Sbjct: 222 MAVPGAVAATEVLALMNMVTPDELGDDEEFEEIYDDVREECSKYGRVKSMEIPRPMEGLM 281

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           E PGVGK+F+E+        A  AL GRKF    V   YYP ++Y  + +
Sbjct: 282 EPPGVGKIFVEFSSIDDAKKAAAALGGRKFANRVVVTSYYPPEEYHRRIF 331


>gi|328721668|ref|XP_003247369.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 2
           [Acyrthosiphon pisum]
          Length = 451

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 162/437 (37%), Positives = 231/437 (52%), Gaps = 50/437 (11%)

Query: 95  RSKSLSPSRSPSKSKRRSGFDMAPP-----AAAMLPGAAVPGQLPGVPSAVPEMAQNMLP 149
           RS+S SP     + K    +D+ PP     A          GQ+P   + +P+  Q  +P
Sbjct: 53  RSRSKSPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIPA--NTMPDTPQTAVP 110

Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
                          V+    TR ARR+YVG +P    E  +  FF+Q M  + G +   
Sbjct: 111 ---------------VVGSTITRQARRLYVGNIPFGVTEDEMMEFFNQQM-HLSGLAQAA 154

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G+ V+   IN +K FAF+E R+++E + AMA DGI F+G ++++RRP DY PT    +  
Sbjct: 155 GNPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPT--PGMTE 212

Query: 270 GQPSPNLN----LAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLH 322
             P  N N    L  +   S + G    PD   ++F+GGLP Y  + Q+KELL SFG L 
Sbjct: 213 SNPVTNYNSGMTLDMMKYDSSSFGLGTVPDSPHKIFIGGLPAYLNDEQVKELLTSFGQLK 272

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F+LVKD  TG SKGY FC Y D  +TD A A LNG+++G+K L V+RA+   ++     
Sbjct: 273 AFNLVKDAATGLSKGYAFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASIGAKNPG--- 329

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
             L Q         + +Q  G+  +G           +VLCL   +T D L D+EEYE+I
Sbjct: 330 --LGQV-------PVTIQVPGLTVVGT-----AGPPTEVLCLLNMVTPDELKDEEEYEDI 375

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           LED+REEC KYG + ++ IPRP + G + PG GKVF+E+   V C  A+ AL+GRKF   
Sbjct: 376 LEDIREECNKYGVVRSLEIPRPIE-GIDVPGCGKVFIEFNAIVDCQKAQQALAGRKFNNR 434

Query: 503 TVNAFYYPEDKYFNKDY 519
            V   +   DKY  +++
Sbjct: 435 VVVTSFMEPDKYHRREF 451


>gi|270011684|gb|EFA08132.1| hypothetical protein TcasGA2_TC005736 [Tribolium castaneum]
          Length = 432

 Score =  265 bits (678), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 143/363 (39%), Positives = 208/363 (57%), Gaps = 33/363 (9%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 101 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 159

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY           QP P +  
Sbjct: 160 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMAE 208

Query: 279 AAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
           +++ + +G I     + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD   G SK
Sbjct: 209 SSISVPAGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSK 268

Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
           GY F  Y D  +TD A A LNG+++GDK L V+RA+   ++ T              I  
Sbjct: 269 GYAFAEYIDITMTDQAIAGLNGMQLGDKRLIVQRASVGAKNAT-------------VIPA 315

Query: 397 MALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           + +Q  G++ +G           +VLCL   +T D L D+EEYE+ILED++EEC KYG +
Sbjct: 316 VQIQVPGLSLVGA-----SGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEECNKYGVV 370

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            ++ IPRP  +G E PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 371 RSIEIPRP-IDGVEVPGCGKVFVEFNSVLDCQKAQQTLTGRKFSNRVVVTSYFDPDKYHR 429

Query: 517 KDY 519
           +++
Sbjct: 430 REF 432


>gi|410903109|ref|XP_003965036.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 4
           [Takifugu rubripes]
          Length = 455

 Score =  265 bits (677), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 151/367 (41%), Positives = 209/367 (56%), Gaps = 36/367 (9%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+  Q TR ARR+YVG +P    E+++A FF+  M  + G S  P + V+ V IN +K
Sbjct: 115 VPVVGSQMTRQARRLYVGNIPFGVTEESMAEFFNAQMR-LAGLSQAPSNPVLAVQINQDK 173

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P    +  P    P       G
Sbjct: 174 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVP-------G 226

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQI---------KELLESFGTLHGFDLVKDRDTG 333
           + S  +   + P ++F+GGLP Y  + Q+         KELL SFG L  F+LVKD  T 
Sbjct: 227 VVSTVV--PDSPHKLFIGGLPNYLNDDQVLIRRLGWRVKELLTSFGPLKAFNLVKDSATS 284

Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI- 392
            SKGY FC Y D + TD A A LNG+++GDK L V+RA+   ++      I A     + 
Sbjct: 285 LSKGYAFCEYVDVSATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPSAIIEAPVTLQVP 344

Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
            +Q+  LQ SGM T             +VLCL   +  + L DD++YEEILED+REEC K
Sbjct: 345 GLQR--LQNSGMPT-------------EVLCLLNMVMPEELVDDDDYEEILEDVREECCK 389

Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
           YG++ ++ IPRP  +G + PG GK+F+EY  A  C  A  AL+GRKF    V   YY  D
Sbjct: 390 YGSVRSIEIPRP-VDGVDVPGCGKIFVEYVSASDCQKAMQALTGRKFANRVVVTKYYDLD 448

Query: 513 KYFNKDY 519
            Y   ++
Sbjct: 449 LYHRHEF 455


>gi|91088649|ref|XP_974465.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
           factor 2 [Tribolium castaneum]
          Length = 450

 Score =  265 bits (676), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 143/363 (39%), Positives = 208/363 (57%), Gaps = 33/363 (9%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 119 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 177

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY           QP P +  
Sbjct: 178 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMAE 226

Query: 279 AAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
           +++ + +G I     + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD   G SK
Sbjct: 227 SSISVPAGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSK 286

Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
           GY F  Y D  +TD A A LNG+++GDK L V+RA+   ++ T              I  
Sbjct: 287 GYAFAEYIDITMTDQAIAGLNGMQLGDKRLIVQRASVGAKNAT-------------VIPA 333

Query: 397 MALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           + +Q  G++ +G           +VLCL   +T D L D+EEYE+ILED++EEC KYG +
Sbjct: 334 VQIQVPGLSLVGA-----SGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEECNKYGVV 388

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            ++ IPRP  +G E PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 389 RSIEIPRP-IDGVEVPGCGKVFVEFNSVLDCQKAQQTLTGRKFSNRVVVTSYFDPDKYHR 447

Query: 517 KDY 519
           +++
Sbjct: 448 REF 450


>gi|115496604|ref|NP_001068804.1| splicing factor U2AF 65 kDa subunit [Bos taurus]
 gi|89994093|gb|AAI14161.1| U2 small nuclear RNA auxiliary factor 2 [Bos taurus]
 gi|296477253|tpg|DAA19368.1| TPA: U2 (RNU2) small nuclear RNA auxiliary factor 2 [Bos taurus]
          Length = 475

 Score =  263 bits (673), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 169/420 (40%), Positives = 236/420 (56%), Gaps = 26/420 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPLSTINQTPVTLQVPGL 363

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+ECGKYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECGKYGLVKSI 415

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474


>gi|193629757|ref|XP_001950852.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Acyrthosiphon
           pisum]
          Length = 446

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 162/437 (37%), Positives = 231/437 (52%), Gaps = 50/437 (11%)

Query: 95  RSKSLSPSRSPSKSKRRSGFDMAPP-----AAAMLPGAAVPGQLPGVPSAVPEMAQNMLP 149
           RS+S SP     + K    +D+ PP     A          GQ+P   + +P+  Q  +P
Sbjct: 48  RSRSKSPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIPA--NTMPDTPQTAVP 105

Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
                          V+    TR ARR+YVG +P    E  +  FF+Q M  + G +   
Sbjct: 106 ---------------VVGSTITRQARRLYVGNIPFGVTEDEMMEFFNQQM-HLSGLAQAA 149

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G+ V+   IN +K FAF+E R+++E + AMA DGI F+G ++++RRP DY PT    +  
Sbjct: 150 GNPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPT--PGMTE 207

Query: 270 GQPSPNLN----LAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLH 322
             P  N N    L  +   S + G    PD   ++F+GGLP Y  + Q+KELL SFG L 
Sbjct: 208 SNPVTNYNSGMTLDMMKYDSSSFGLGTVPDSPHKIFIGGLPAYLNDEQVKELLTSFGQLK 267

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F+LVKD  TG SKGY FC Y D  +TD A A LNG+++G+K L V+RA+   ++     
Sbjct: 268 AFNLVKDAATGLSKGYAFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASIGAKNPG--- 324

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
             L QA        + +Q  G+  +G           +VLCL   +T D L D+EEYE+I
Sbjct: 325 --LGQA-------PVTIQVPGLTVVGT-----AGPPTEVLCLLNMVTPDELKDEEEYEDI 370

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           LED+REEC KYG + ++ IPRP + G + PG GKVF+E+     C  A+ AL+GRKF   
Sbjct: 371 LEDIREECNKYGVVRSLEIPRPIE-GIDVPGCGKVFIEFNAIPDCQKAQQALAGRKFNNR 429

Query: 503 TVNAFYYPEDKYFNKDY 519
            V   +   DKY  +++
Sbjct: 430 VVVTSFMEPDKYHRREF 446


>gi|432908697|ref|XP_004077989.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
           [Oryzias latipes]
          Length = 474

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 175/445 (39%), Positives = 244/445 (54%), Gaps = 29/445 (6%)

Query: 76  KERRHRHRS-RSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLP 134
           KERRHR +S   H S   +      SP R   K K +  +D+ PP    +     P Q  
Sbjct: 57  KERRHRRKSVHLHQSSCLKTSCYVRSPHRE-KKKKIKKYWDVPPPGFEHI----TPMQYK 111

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
            + +A    A  +LP       A    PV V+  Q TR ARR+YVG +P    E+++  F
Sbjct: 112 AMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDF 171

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
           F+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++R
Sbjct: 172 FNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIR 230

Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
           RP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q+KEL
Sbjct: 231 RPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKEL 281

Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
           L SFG L  F+LVKD  TG SKGY FC Y D  + D A A LNG+++GDK L V+RA+  
Sbjct: 282 LTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVG 341

Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
            ++ T     L    Q      + LQ  G+N+    ++  G    +VLCL   +  + L 
Sbjct: 342 SKNAT-----LTSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAPEELL 389

Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
           DDEEYEEI+ED+REECGKYG + ++ IPRP  +G E PG GK+F+E+        A   L
Sbjct: 390 DDEEYEEIVEDVREECGKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDAQKAMQGL 448

Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
           +GRKF    V   Y   D Y  +D+
Sbjct: 449 TGRKFANRVVVTKYCDPDAYHRRDF 473


>gi|291190480|ref|NP_001167275.1| Splicing factor U2AF 65 kDa subunit [Salmo salar]
 gi|223648990|gb|ACN11253.1| Splicing factor U2AF 65 kDa subunit [Salmo salar]
          Length = 474

 Score =  261 bits (668), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 173/465 (37%), Positives = 243/465 (52%), Gaps = 67/465 (14%)

Query: 76  KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKR-RSGFDM-AP--------------- 118
           KERR RHR    +  + +    S SP+    K K+ R  +D+ AP               
Sbjct: 55  KERR-RHRRSIQTQKQSQETVVSRSPALHREKKKKVRKYWDVPAPGFEHITPLQYKAMQA 113

Query: 119 ----PAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHA 174
               PA A+LP    P  LP  P++VP                       V+  Q TR A
Sbjct: 114 AGQIPATALLPTMITPEGLPPAPTSVP-----------------------VVGSQMTRQA 150

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
           RR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E
Sbjct: 151 RRLYVGNIPFGITEEAMMDFFNAQM-CLGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDE 209

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
            + AMA DGIIF+G ++++RRP DY P    +  P    P       G+ S  +   +  
Sbjct: 210 TTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSESPSVYVP-------GVVSTVV--PDSA 260

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
            ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY FC Y D  + D A A
Sbjct: 261 HKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATALSKGYAFCEYVDVNLNDQAIA 320

Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
            LNG+++GDK L V+RA+   ++     ++    Q  + +Q   L  + M +LGG     
Sbjct: 321 GLNGMQLGDKKLLVQRASVGAKNA----ALTGMNQTPVTLQVPGLMPTSMASLGG----- 371

Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
                +VLCL   +  + L DDEEYEEI+ED+R+ECGKYG + ++ IPRP  +G E PG 
Sbjct: 372 --LPTEVLCLMNMVAVEELLDDEEYEEIVEDVRDECGKYGQVKSIEIPRP-VDGLEVPGT 428

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           GK+F+E+        A  AL+GRKF    V   Y   D Y  +D+
Sbjct: 429 GKIFVEFMTLFDSQKAMQALTGRKFANRVVVTKYCDPDAYHRRDF 473


>gi|197692223|dbj|BAG70075.1| U2 small nuclear RNA auxiliary factor 2 isoform b [Homo sapiens]
 gi|197692475|dbj|BAG70201.1| U2 small nuclear RNA auxiliary factor 2 isoform b [Homo sapiens]
          Length = 471

 Score =  261 bits (666), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 169/420 (40%), Positives = 236/420 (56%), Gaps = 30/420 (7%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S NL++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENLSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470


>gi|195999450|ref|XP_002109593.1| hypothetical protein TRIADDRAFT_21652 [Trichoplax adhaerens]
 gi|190587717|gb|EDV27759.1| hypothetical protein TRIADDRAFT_21652 [Trichoplax adhaerens]
          Length = 476

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/351 (40%), Positives = 207/351 (58%), Gaps = 22/351 (6%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           Q TR +RR+YVG +P    EQA+  FF++ M   G   A  GD V+ V IN +K FAF+E
Sbjct: 148 QITRQSRRLYVGNIPFGITEQAMMDFFNEKMVTTGLTQAN-GDPVLAVQINFDKNFAFLE 206

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R++EE +NAMA DGI+F+  A+++RRP DY P        G P+ + N+   G+ S  +
Sbjct: 207 FRSIEETTNAMAFDGIMFQNQALKIRRPKDYQPPT------GDPNSSANIHVPGVISTVV 260

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
              + P ++F+GGLP Y TE Q+KELL+SFG L  F+LVKD  TG SKGY FC Y    V
Sbjct: 261 --PDTPHKLFIGGLPNYLTEDQVKELLQSFGELKAFNLVKDSATGLSKGYAFCEYVVVEV 318

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           TD A A LN +++G+K L V+RA+   +      S+   +   + +Q   LQ S  N LG
Sbjct: 319 TDQAIAGLNNMQLGEKKLVVQRASVGAKHNY---SVRCLSGIPVTVQVPGLQISN-NALG 374

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
                    + ++L L   +T + L DDEEYE+I+ED+R E  K   + ++ IPRP + G
Sbjct: 375 --------EVTEILQLMNMVTEEELVDDEEYEDIIEDVRAEVSKIAPVKSLEIPRPIE-G 425

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +  GVGK+++E+++   C  A+ +L+GRKF    V   +YP + Y  + +
Sbjct: 426 VDVAGVGKIYIEFHNLDDCLKAQQSLAGRKFANRVVMTSFYPPESYHMRQF 476


>gi|242019185|ref|XP_002430045.1| Splicing factor U2AF 50 kDa subunit, putative [Pediculus humanus
           corporis]
 gi|212515110|gb|EEB17307.1| Splicing factor U2AF 50 kDa subunit, putative [Pediculus humanus
           corporis]
          Length = 445

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 148/362 (40%), Positives = 214/362 (59%), Gaps = 28/362 (7%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 111 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 169

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 170 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMSENPSVN 223

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           + A G+ S  +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LV D  TG SKG
Sbjct: 224 VPA-GVISTVV--PDSPHKIFIGGLPNYLNEDQLKELLMSFGQLRAFNLVMDSTTGLSKG 280

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC++ D  VTD A A LNG+++GDK L V+RA+   ++     + L Q Q  + IQ  
Sbjct: 281 YAFCLFVDINVTDQAIAGLNGMQLGDKKLIVQRASVGAKN-----TALGQ-QAPVQIQVP 334

Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
            L + GM+              +VLCL   +T   L D+EEYE+ILED++EEC K+G + 
Sbjct: 335 GLTSVGMSG----------PPTEVLCLLNMVTPSELNDEEEYEDILEDIKEECNKHGVVK 384

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
           ++ IPRP   G + PG GKVF+E+   + C  A+ AL+GRKF    V   Y+  DKY  +
Sbjct: 385 SLEIPRPIL-GVDVPGCGKVFVEFNSVLDCQKAQQALTGRKFNHRVVVTSYFDPDKYHRR 443

Query: 518 DY 519
           ++
Sbjct: 444 EF 445


>gi|410351437|gb|JAA42322.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
          Length = 475

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 168/420 (40%), Positives = 234/420 (55%), Gaps = 26/420 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T         Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGL 363

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 415

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474


>gi|347968827|ref|XP_311994.4| AGAP002908-PA [Anopheles gambiae str. PEST]
 gi|333467820|gb|EAA08228.4| AGAP002908-PA [Anopheles gambiae str. PEST]
          Length = 446

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 151/368 (41%), Positives = 210/368 (57%), Gaps = 37/368 (10%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 109 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQMH-LSGLAQAAGNPVLACQI 167

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA D I F+G ++++RRP DY P       PG      + 
Sbjct: 168 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPM------PGM----TDS 217

Query: 279 AAVGLA---SGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
           AAV +    SG I     + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG
Sbjct: 218 AAVNVPEKFSGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAATG 277

Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
             KGY F  Y +  VTD A A LNG+++GDK L V+RA+    +K    +++A  Q  + 
Sbjct: 278 LGKGYAFAEYVEYTVTDQAIAGLNGMQLGDKKLIVQRASVG--AKNSNAAVVAPVQIQVP 335

Query: 394 IQKMALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECG 451
                           G+SL G +    +VLCL   +T D L D+EEYE+ILED+REEC 
Sbjct: 336 ----------------GLSLVGSSGPPTEVLCLLNMVTPDELKDEEEYEDILEDIREECN 379

Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           KYG + +V IPRP + G + PG GKVF+E+   V C  A+ AL+GRKF    V   Y+  
Sbjct: 380 KYGVVRSVEIPRPIE-GVDVPGCGKVFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFDP 438

Query: 512 DKYFNKDY 519
           DKY  +++
Sbjct: 439 DKYHRREF 446


>gi|1334149|emb|CAA45875.1| unnamed protein product [Mus musculus]
          Length = 492

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 169/420 (40%), Positives = 232/420 (55%), Gaps = 26/420 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 95  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 150

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 151 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 209

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 210 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 262

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 263 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 320

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T     L      I    + L
Sbjct: 321 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT-----LVSLPSTINQTPVTL 375

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
           Q  G+ +    M   G    +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 376 QVPGLMSSQVQM---GGHPTEVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 432

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 433 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 491


>gi|417410850|gb|JAA51891.1| Putative splicing factor u2af large subunit rrm superfamily,
           partial [Desmodus rotundus]
          Length = 455

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 167/421 (39%), Positives = 235/421 (55%), Gaps = 27/421 (6%)

Query: 99  LSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
           + P R   K K R  +D+ PP    +     P Q   + +A    A  +LP       A 
Sbjct: 61  IRPPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAV 116

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
              PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V I
Sbjct: 117 TPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQI 175

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++
Sbjct: 176 NQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSV 228

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
              G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY
Sbjct: 229 YVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGY 286

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   
Sbjct: 287 AFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPG 342

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + +
Sbjct: 343 LMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKS 394

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D
Sbjct: 395 IEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRD 453

Query: 519 Y 519
           +
Sbjct: 454 F 454


>gi|6005926|ref|NP_009210.1| splicing factor U2AF 65 kDa subunit isoform a [Homo sapiens]
 gi|267188|sp|P26368.4|U2AF2_HUMAN RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
           auxiliary factor 65 kDa subunit; Short=hU2AF(65);
           Short=hU2AF65; AltName: Full=U2 snRNP auxiliary factor
           large subunit
 gi|37545|emb|CAA45409.1| splicing factor U2AF [Homo sapiens]
 gi|380783065|gb|AFE63408.1| splicing factor U2AF 65 kDa subunit isoform a [Macaca mulatta]
 gi|410212804|gb|JAA03621.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
 gi|410260574|gb|JAA18253.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
 gi|410291504|gb|JAA24352.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
          Length = 475

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 168/420 (40%), Positives = 234/420 (55%), Gaps = 26/420 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T         Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGL 363

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 415

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474


>gi|228543|prf||1805352A splicing factor U2AF:SUBUNIT=large
          Length = 475

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 168/420 (40%), Positives = 234/420 (55%), Gaps = 26/420 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPLHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T         Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGL 363

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 415

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474


>gi|297277970|ref|XP_001091568.2| PREDICTED: splicing factor U2AF 65 kDa subunit [Macaca mulatta]
          Length = 471

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 168/425 (39%), Positives = 235/425 (55%), Gaps = 36/425 (8%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           RSP    K K R  +D+ PP    +           GQ+P   + +P M  + L    T 
Sbjct: 74  RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 131

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
                  PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+
Sbjct: 132 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 183

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
            V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S 
Sbjct: 184 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 236

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           N ++   G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG 
Sbjct: 237 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 294

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKGY FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T         Q  + +
Sbjct: 295 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTL 354

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
           Q   L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG
Sbjct: 355 QVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYG 406

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y
Sbjct: 407 LVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSY 465

Query: 515 FNKDY 519
             +D+
Sbjct: 466 HRRDF 470


>gi|348526424|ref|XP_003450719.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
           [Oreochromis niloticus]
          Length = 475

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 172/447 (38%), Positives = 247/447 (55%), Gaps = 32/447 (7%)

Query: 76  KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG---FDMAPPAAAMLPGAAVPGQ 132
           KERRHR RS    +  + ++   L   RSP + K++     +D+ PP    +     P Q
Sbjct: 57  KERRHRRRSVPVCNYIWASKQSKLL--RSPHREKKKKVKKYWDVPPPGFEHI----TPMQ 110

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
              + +A    A  +LP       A    PV V+  Q TR ARR+YVG +P    E+++ 
Sbjct: 111 YKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMM 170

Query: 193 TFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVR 252
            FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G +++
Sbjct: 171 DFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLK 229

Query: 253 VRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIK 312
           +RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q+K
Sbjct: 230 IRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVK 280

Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
           ELL SFG L  F+LVKD  TG SKGY FC Y D  + D A A LNG+++GDK L V+RA+
Sbjct: 281 ELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRAS 340

Query: 373 ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
              ++ T     L+   Q      + LQ  G+N+    ++  G    +VLCL   +  + 
Sbjct: 341 VGSKNAT-----LSSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAPEE 388

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
           L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+E+        A  
Sbjct: 389 LLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDSQKAMQ 447

Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            L+GRKF    V   Y   D Y  +D+
Sbjct: 448 GLTGRKFANRVVVTKYCDPDAYHRRDF 474


>gi|327365322|ref|NP_001192160.1| splicing factor U2AF 65 kDa subunit isoform 1 [Mus musculus]
 gi|348551789|ref|XP_003461711.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 2 [Cavia
           porcellus]
 gi|392343893|ref|XP_003748811.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Rattus norvegicus]
 gi|136628|sp|P26369.3|U2AF2_MOUSE RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
           auxiliary factor 65 kDa subunit; AltName: Full=U2 snRNP
           auxiliary factor large subunit
 gi|55101|emb|CAA45874.1| splicing factor U2AF [Mus musculus]
 gi|26347321|dbj|BAC37309.1| unnamed protein product [Mus musculus]
          Length = 475

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 169/420 (40%), Positives = 232/420 (55%), Gaps = 26/420 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T     L      I    + L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT-----LVSLPSTINQTPVTL 358

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
           Q  G+ +    M   G    +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 359 QVPGLMSSQVQM---GGHPTEVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 415

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474


>gi|410351435|gb|JAA42321.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
          Length = 471

 Score =  258 bits (660), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 168/420 (40%), Positives = 235/420 (55%), Gaps = 30/420 (7%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470


>gi|410982179|ref|XP_003997437.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Felis catus]
          Length = 471

 Score =  258 bits (659), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 167/421 (39%), Positives = 235/421 (55%), Gaps = 27/421 (6%)

Query: 99  LSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
           +S  R   K K R  +D+ PP    +     P Q   + +A    A  +LP       A 
Sbjct: 77  ISLPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAV 132

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
              PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V I
Sbjct: 133 TPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQI 191

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++
Sbjct: 192 NQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSV 244

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
              G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY
Sbjct: 245 YVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGY 302

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   
Sbjct: 303 AFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPG 358

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + +
Sbjct: 359 LMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKS 410

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D
Sbjct: 411 IEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRD 469

Query: 519 Y 519
           +
Sbjct: 470 F 470


>gi|432908699|ref|XP_004077990.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
           [Oryzias latipes]
          Length = 479

 Score =  258 bits (659), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 174/444 (39%), Positives = 243/444 (54%), Gaps = 31/444 (6%)

Query: 76  KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
           KERRHR  S        +  + S SP R   K K +  +D+ PP    +     P Q   
Sbjct: 66  KERRHRRNSPPAYP---QENTASRSPHRE-KKKKIKKYWDVPPPGFEHI----TPMQYKA 117

Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
           + +A    A  +LP       A    PV V+  Q TR ARR+YVG +P    E+++  FF
Sbjct: 118 MQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFF 177

Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
           +  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RR
Sbjct: 178 NAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRR 236

Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
           P DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q+KELL
Sbjct: 237 PHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELL 287

Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
            SFG L  F+LVKD  TG SKGY FC Y D  + D A A LNG+++GDK L V+RA+   
Sbjct: 288 TSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVGS 347

Query: 376 QSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALAD 435
           ++ T     L    Q      + LQ  G+N+    ++  G    +VLCL   +  + L D
Sbjct: 348 KNAT-----LTSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAPEELLD 395

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
           DEEYEEI+ED+REECGKYG + ++ IPRP  +G E PG GK+F+E+        A   L+
Sbjct: 396 DEEYEEIVEDVREECGKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDAQKAMQGLT 454

Query: 496 GRKFGGNTVNAFYYPEDKYFNKDY 519
           GRKF    V   Y   D Y  +D+
Sbjct: 455 GRKFANRVVVTKYCDPDAYHRRDF 478


>gi|60279268|ref|NP_001012496.1| splicing factor U2AF 65 kDa subunit isoform b [Homo sapiens]
 gi|164565377|ref|NP_598432.2| splicing factor U2AF 65 kDa subunit isoform 2 [Mus musculus]
 gi|109461136|ref|XP_001060115.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 6 [Rattus
           norvegicus]
 gi|338709958|ref|XP_001496159.3| PREDICTED: splicing factor U2AF 65 kDa subunit [Equus caballus]
 gi|348551787|ref|XP_003461710.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 1 [Cavia
           porcellus]
 gi|359318549|ref|XP_003638845.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Canis lupus
           familiaris]
 gi|395861318|ref|XP_003802936.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Otolemur garnettii]
 gi|397471087|ref|XP_003807136.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Pan paniscus]
 gi|403308602|ref|XP_003944746.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Saimiri boliviensis
           boliviensis]
 gi|14250571|gb|AAH08740.1| U2 small nuclear RNA auxiliary factor 2 [Homo sapiens]
 gi|27695339|gb|AAH43071.1| U2af2 protein [Mus musculus]
 gi|39644972|gb|AAH30574.1| U2 small nuclear RNA auxiliary factor 2 [Homo sapiens]
 gi|119592810|gb|EAW72404.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_c [Homo
           sapiens]
 gi|148699339|gb|EDL31286.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2,
           isoform CRA_a [Mus musculus]
 gi|149016700|gb|EDL75886.1| similar to U2 (RNU2) small nuclear RNA auxiliary factor 2 isoform b
           [Rattus norvegicus]
 gi|261858294|dbj|BAI45669.1| U2 small nuclear RNA auxiliary factor 2 [synthetic construct]
 gi|325463253|gb|ADZ15397.1| U2 small nuclear RNA auxiliary factor 2 [synthetic construct]
 gi|380783067|gb|AFE63409.1| splicing factor U2AF 65 kDa subunit isoform b [Macaca mulatta]
 gi|389618965|gb|AFK92990.1| U2 small nuclear RNA auxiliary factor 2 [Sus scrofa]
 gi|410212802|gb|JAA03620.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
 gi|410260572|gb|JAA18252.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
 gi|410291502|gb|JAA24351.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
 gi|431902970|gb|ELK09152.1| Splicing factor U2AF 65 kDa subunit [Pteropus alecto]
          Length = 471

 Score =  258 bits (658), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 168/420 (40%), Positives = 235/420 (55%), Gaps = 30/420 (7%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470


>gi|344270173|ref|XP_003406920.1| PREDICTED: LOW QUALITY PROTEIN: splicing factor U2AF 65 kDa
           subunit-like [Loxodonta africana]
          Length = 471

 Score =  258 bits (658), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 168/420 (40%), Positives = 235/420 (55%), Gaps = 30/420 (7%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470


>gi|157132061|ref|XP_001662443.1| splicing factor u2af large subunit [Aedes aegypti]
 gi|108881728|gb|EAT45953.1| AAEL002818-PA [Aedes aegypti]
          Length = 418

 Score =  257 bits (657), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 150/368 (40%), Positives = 208/368 (56%), Gaps = 37/368 (10%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 81  PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 139

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG      + 
Sbjct: 140 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGM----TDS 189

Query: 279 AAVGLA---SGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
           AAV +    SG I     + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG
Sbjct: 190 AAVSVPEKFSGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAATG 249

Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
            SKGY F  Y +  +TD A A LNG+++GDK L V+RA+   ++         Q Q    
Sbjct: 250 LSKGYAFAEYVEYTITDQAIAGLNGMQLGDKKLIVQRASVGAKNANVAAVAPVQIQVP-- 307

Query: 394 IQKMALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECG 451
                           G+SL G +    +VLCL   +T D L D+EEYE+ILED++EEC 
Sbjct: 308 ----------------GLSLVGSSGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEECN 351

Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           KYG + +V IPRP + G + PG GKVF+E+   V C  A+ AL+GRKF    V   Y+  
Sbjct: 352 KYGVVRSVEIPRPIE-GVDVPGCGKVFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFDP 410

Query: 512 DKYFNKDY 519
           DKY  +++
Sbjct: 411 DKYHRREF 418


>gi|170054347|ref|XP_001863087.1| splicing factor u2af large subunit [Culex quinquefasciatus]
 gi|167874693|gb|EDS38076.1| splicing factor u2af large subunit [Culex quinquefasciatus]
          Length = 438

 Score =  257 bits (657), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 146/369 (39%), Positives = 205/369 (55%), Gaps = 39/369 (10%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 101 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQMH-LSGLAQAAGNPVLACQI 159

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY           QP P +  
Sbjct: 160 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMTD 208

Query: 279 AAVGLASGAIGGA------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
           +AV        G       + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  T
Sbjct: 209 SAVAPVQEKFSGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAAT 268

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
           G SKGY F  Y + ++TD A A LNG+++GDK L V+RA+   ++         Q Q   
Sbjct: 269 GLSKGYAFAEYVEYSITDQAIAGLNGMQLGDKKLIVQRASVGAKNANVAAVAPVQIQVP- 327

Query: 393 AIQKMALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREEC 450
                            G+SL G +    +VLCL   +T D L D+EEYE+ILED++EEC
Sbjct: 328 -----------------GLSLVGSSGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEEC 370

Query: 451 GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYP 510
            KYG + +  IPRP + G + PG GKVF+E+   V C  A+ AL+GRKF    V   Y+ 
Sbjct: 371 NKYGVVRSAEIPRPIE-GVDVPGCGKVFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFD 429

Query: 511 EDKYFNKDY 519
            DKY  +++
Sbjct: 430 PDKYHRREF 438


>gi|444724150|gb|ELW64768.1| Splicing factor U2AF 65 kDa subunit [Tupaia chinensis]
          Length = 447

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 166/417 (39%), Positives = 233/417 (55%), Gaps = 27/417 (6%)

Query: 103 RSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           R   K K R  +D+ PP    +     P Q   + +A    A  +LP       A    P
Sbjct: 57  RHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTP 112

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K
Sbjct: 113 VPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDK 171

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G
Sbjct: 172 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPG 224

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC 
Sbjct: 225 VVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCE 282

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L +S
Sbjct: 283 YVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMSS 338

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
            +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IP
Sbjct: 339 QVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIP 390

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           RP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 391 RP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 446


>gi|301782083|ref|XP_002926459.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Ailuropoda
           melanoleuca]
          Length = 496

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 168/425 (39%), Positives = 236/425 (55%), Gaps = 40/425 (9%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           RSP    K K R  +D+ PP    +           GQ+P   + +P M  + L    T 
Sbjct: 103 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 160

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
                  PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+
Sbjct: 161 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 212

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
            V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S 
Sbjct: 213 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 265

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           N ++   G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG 
Sbjct: 266 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 323

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKGY FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +
Sbjct: 324 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTL 379

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
           Q   L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG
Sbjct: 380 QVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYG 431

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y
Sbjct: 432 LVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSY 490

Query: 515 FNKDY 519
             +D+
Sbjct: 491 HRRDF 495


>gi|327280715|ref|XP_003225097.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
           [Anolis carolinensis]
          Length = 456

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 165/418 (39%), Positives = 233/418 (55%), Gaps = 37/418 (8%)

Query: 107 KSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
           K K R  +D+ PP    +           GQ+P   + +P M  + L    T        
Sbjct: 70  KKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT-------- 120

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQD 179

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVP 232

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC
Sbjct: 233 GVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFC 290

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L +
Sbjct: 291 EYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMS 346

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ I
Sbjct: 347 SQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGVVKSIEI 398

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 399 PRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 455


>gi|344277364|ref|XP_003410472.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Loxodonta
           africana]
          Length = 471

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 177/495 (35%), Positives = 257/495 (51%), Gaps = 37/495 (7%)

Query: 25  RSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRS 84
           +S+ G    +RH +  +S    R RD+  +      R+HD+        R+  R  R RS
Sbjct: 13  KSKHGREEENRHRK--RSHSHSRNRDRKRRSQSRDRRNHDQ--------RNDPRDQRRRS 62

Query: 85  RSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMA 144
           +  S D    R   +  +R   K K    +D+ PP    +     P Q   + +A    A
Sbjct: 63  KPWSRDAEEERGGLIPSARHDRKRKVHKYWDVPPPGFEHI----TPMQYKAMQAAGQIPA 118

Query: 145 QNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG 204
              LP       A    PV +   Q TR ARR+YVG +P    E+A+  FF+ +   +G 
Sbjct: 119 TAFLPTMTPDGLAMIPTPVPMGGSQMTRKARRLYVGNIPFGITEEAMMDFFN-IQMRLGV 177

Query: 205 NSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
            +  PG+ ++ V IN +K FAF+E R+V+E + A ALDGIIF+G ++++RRP DY P  +
Sbjct: 178 LTQAPGNPILAVQINQDKNFAFLEFRSVDETTQATALDGIIFQGQSLKIRRPHDYQPLPS 237

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
                   S NL+    G+AS  +  ++   ++F+ GLP Y  + Q+KELL SFG L  F
Sbjct: 238 M-------SENLSAYMAGVASTVVPDSD--HKLFIEGLPTYLNDDQVKELLTSFGPLKAF 288

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
            LVKD  TG SKGY  C Y D   TD A A LNG+++GDK L V R +   ++ T    +
Sbjct: 289 SLVKDSATGLSKGYAVCEYVDINDTDQATAGLNGMQLGDKKLLVLRGSVGAKNGT----L 344

Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
               Q  +  Q   L++S +  +GG  +       +VLCL   +  + L DDEEYEEI+E
Sbjct: 345 STINQVPVTPQVPGLRSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIME 396

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D+REEC KYG + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V
Sbjct: 397 DVREECSKYGLVKSMEIPRP-VDGVEVPGCGKIFVEFTTVFDCQKAMQGLTGRKFANRVV 455

Query: 505 NAFYYPEDKYFNKDY 519
              Y   D Y ++D+
Sbjct: 456 VTKYCDLDSYHHRDF 470


>gi|63101571|gb|AAH94451.1| U2af2 protein, partial [Mus musculus]
          Length = 403

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 168/425 (39%), Positives = 236/425 (55%), Gaps = 40/425 (9%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           RSP    K K R  +D+ PP    +           GQ+P   + +P M  + L    T 
Sbjct: 10  RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 67

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
                  PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+
Sbjct: 68  -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 119

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
            V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S 
Sbjct: 120 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 172

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           N ++   G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG 
Sbjct: 173 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 230

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKGY FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +
Sbjct: 231 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTL 286

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
           Q   L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG
Sbjct: 287 QVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYG 338

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y
Sbjct: 339 LVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSY 397

Query: 515 FNKDY 519
             +D+
Sbjct: 398 HRRDF 402


>gi|348526426|ref|XP_003450720.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
           [Oreochromis niloticus]
          Length = 487

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 174/457 (38%), Positives = 249/457 (54%), Gaps = 40/457 (8%)

Query: 76  KERRHRHRSRSHSSDRFRNRSKSL-----SPS-----RSPSKSKRRSG---FDMAPPAAA 122
           KERRHR RS    +  + ++   L     SP      RSP + K++     +D+ PP   
Sbjct: 57  KERRHRRRSVPVCNYIWASKQSKLLLRQESPHYTGMYRSPHREKKKKVKKYWDVPPPGFE 116

Query: 123 MLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGL 182
            +     P Q   + +A    A  +LP       A    PV V+  Q TR ARR+YVG +
Sbjct: 117 HI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNI 172

Query: 183 PPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALD 242
           P    E+++  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA D
Sbjct: 173 PFGITEESMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFD 231

Query: 243 GIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGL 302
           GIIF+G ++++RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGL
Sbjct: 232 GIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGL 282

Query: 303 PYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMG 362
           P Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  + D A A LNG+++G
Sbjct: 283 PNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLG 342

Query: 363 DKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVL 422
           DK L V+RA+   ++ T     L+   Q      + LQ  G+N+    ++  G    +VL
Sbjct: 343 DKKLLVQRASVGSKNAT-----LSSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVL 390

Query: 423 CLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYY 482
           CL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+E+ 
Sbjct: 391 CLMNMVAPEELLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFT 449

Query: 483 DAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
                  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 450 SVFDSQKAMQGLTGRKFANRVVVTKYCDPDAYHRRDF 486


>gi|432908695|ref|XP_004077988.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
           [Oryzias latipes]
          Length = 458

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 175/444 (39%), Positives = 242/444 (54%), Gaps = 43/444 (9%)

Query: 76  KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
           KERRHR           RNRS    P R   K K +  +D+ PP    +     P Q   
Sbjct: 57  KERRHR-----------RNRS----PHRE-KKKKIKKYWDVPPPGFEHI----TPMQYKA 96

Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
           + +A    A  +LP       A    PV V+  Q TR ARR+YVG +P    E+++  FF
Sbjct: 97  MQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFF 156

Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
           +  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RR
Sbjct: 157 NAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRR 215

Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
           P DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q+KELL
Sbjct: 216 PHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELL 266

Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
            SFG L  F+LVKD  TG SKGY FC Y D  + D A A LNG+++GDK L V+RA+   
Sbjct: 267 TSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVGS 326

Query: 376 QSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALAD 435
           ++ T     L    Q      + LQ  G+N+    ++  G    +VLCL   +  + L D
Sbjct: 327 KNAT-----LTSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAPEELLD 374

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
           DEEYEEI+ED+REECGKYG + ++ IPRP  +G E PG GK+F+E+        A   L+
Sbjct: 375 DEEYEEIVEDVREECGKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDAQKAMQGLT 433

Query: 496 GRKFGGNTVNAFYYPEDKYFNKDY 519
           GRKF    V   Y   D Y  +D+
Sbjct: 434 GRKFANRVVVTKYCDPDAYHRRDF 457


>gi|260800970|ref|XP_002595369.1| hypothetical protein BRAFLDRAFT_113856 [Branchiostoma floridae]
 gi|229280615|gb|EEN51381.1| hypothetical protein BRAFLDRAFT_113856 [Branchiostoma floridae]
          Length = 524

 Score =  254 bits (650), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 151/372 (40%), Positives = 210/372 (56%), Gaps = 26/372 (6%)

Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
           +P  A    A   MP+     Q TR ARR+YVG +P    E+A+  FF+  M      + 
Sbjct: 178 IPSAALLANAGTAMPI---GSQMTRQARRLYVGNIPFGVTEEAMIDFFNTQMHR-ASLAQ 233

Query: 208 GPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAAL 267
            PG+ V+   +N +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P    A 
Sbjct: 234 APGNPVLACQVNLDKNFAFLEFRSVDETTLAMAFDGIIFQGQSLKLRRPHDYQPVPGMAE 293

Query: 268 GPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
            P    P       G+ S  +   + P ++F+GGLP Y  + Q+KELL SFG L  F+LV
Sbjct: 294 NPDIHVPGGFPVIPGVVSTVV--QDSPHKIFIGGLPNYLNDDQVKELLLSFGQLKAFNLV 351

Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           KD  T  SKGY FC Y DP VTD A A LNG+++GDK L V+RA+   ++          
Sbjct: 352 KDSSTALSKGYAFCEYVDPNVTDQAIAGLNGMQLGDKKLIVQRASVGAKNAQN------- 404

Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMR 447
                  Q + LQ  G+   G           +VLCL   +  + L D+EEYE+ILED+R
Sbjct: 405 -------QPVQLQIPGLTLTGN-----AGPPTEVLCLMNMVMPEELMDEEEYEDILEDVR 452

Query: 448 EECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAF 507
           EECGKYG +++V IPRP + G + PG GK+++E+   + C  A+ AL+GRKF    V   
Sbjct: 453 EECGKYGAVLSVEIPRPIE-GVDVPGCGKIYVEFRSIMDCQKAQQALTGRKFAQRIVVTS 511

Query: 508 YYPEDKYFNKDY 519
           YY  DKY  +D+
Sbjct: 512 YYDPDKYHRRDF 523


>gi|226478958|emb|CAX72974.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
          Length = 520

 Score =  254 bits (650), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 176/458 (38%), Positives = 247/458 (53%), Gaps = 30/458 (6%)

Query: 74  RDKERRHRHRSRSHSSDRFRNRS------KSLSPSRSPSKS-KRRSGFDMAPPA-AAMLP 125
           R+++R H H  R HS  R R+ S      +  S  RSPS S      +D+ PP    + P
Sbjct: 81  RERKRSHSHGHRRHSKSRHRDYSGGHKSRRHQSHHRSPSNSVSAHKYWDVPPPGFEHVTP 140

Query: 126 GAAVPGQLPG-VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPP 184
                 Q  G VP  V    Q  +P  A          V        R ARR+YVG +P 
Sbjct: 141 AQYKALQTSGQVPVNVYAAGQVPMPVHAPNAPLTLTTNVPFAGSAVCRQARRLYVGNIPF 200

Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
            A E+ +  FF++ M A G   A  G+ ++ V IN EK FAF+E R+V+E +  +ALDG+
Sbjct: 201 TATEENMMEFFNKQMRAQGLIQA-EGNPIIAVQINMEKNFAFLEFRSVDETTQGLALDGV 259

Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
           +F+  A+++RRP DY P    +  P    P       G+ S  +   + P ++FVGGLP 
Sbjct: 260 LFQNQALKLRRPRDYAPLPGVSEQPSVIVP-------GVVSTVV--QDSPHKIFVGGLPN 310

Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
           Y  E Q+KELL SFG L GF+LVKD  TG SKGY FC Y D  VTD ACA LNG+++GDK
Sbjct: 311 YLNEDQVKELLLSFGPLKGFNLVKDGSTGLSKGYAFCEYVDSNVTDHACAGLNGMQLGDK 370

Query: 365 TLTVRRATASGQSKTEQESILAQAQQHIA-IQKMALQTSGMNTLGGGMSLF--GETLAKV 421
            L V+RA+   +  T    +L Q    ++ +++ A+Q    NT G G      G    +V
Sbjct: 371 KLIVQRASVGAKHTT---GVLPQCLLQMSGLEEGAVQ----NTTGSGNLTVRSGGPPTEV 423

Query: 422 LCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEY 481
           LCL   I    L DDEEYE+I+ED+R EC KYG + ++ IPRP + G + PGVGK+++E+
Sbjct: 424 LCLMNMIETSELEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIR-GIDVPGVGKIYVEF 482

Query: 482 YDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
              + C  A  AL+GRKF    V   ++  + Y  +++
Sbjct: 483 ASLIDCQKAATALTGRKFNQRLVVTSFFSPNSYHRREF 520


>gi|355703931|gb|EHH30422.1| hypothetical protein EGK_11092, partial [Macaca mulatta]
          Length = 453

 Score =  254 bits (648), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 175/466 (37%), Positives = 247/466 (53%), Gaps = 45/466 (9%)

Query: 63  HDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPS-----------------RSP 105
           H      R++ R +E R+     +  S  FR+  +S+ P                  RSP
Sbjct: 10  HAHILHMRNFWRSRESRYGVFETNLKS--FRSAPRSVFPKQYKPLTRGAKEEHGGLIRSP 67

Query: 106 ---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
               K K R  +D+ PP    +     P Q   + +A    A  +LP       A    P
Sbjct: 68  RHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTP 123

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K
Sbjct: 124 VPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDK 182

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G
Sbjct: 183 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPG 235

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC 
Sbjct: 236 VVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCE 293

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y D  VTD A A LNG+++GDK L V+RA+   ++ T         Q  + +Q   L +S
Sbjct: 294 YVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGLMSS 353

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
            +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IP
Sbjct: 354 QVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIP 405

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           RP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y
Sbjct: 406 RP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKY 450


>gi|390341852|ref|XP_792919.3| PREDICTED: splicing factor U2AF 50 kDa subunit-like
           [Strongylocentrotus purpuratus]
          Length = 386

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 142/359 (39%), Positives = 207/359 (57%), Gaps = 37/359 (10%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           Q TR ARR+YVG +P    E A+  FF+  M  +G   A PG  V+ V +NH+K FAF+E
Sbjct: 57  QMTRQARRLYVGNIPFGVTEDAMVEFFNGKMHNVGLAQA-PGPPVLAVQVNHDKNFAFLE 115

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R+VEE + AMA DGI+F+  A+++RRP DY      +  P    P       G+ S  +
Sbjct: 116 FRSVEETTQAMAFDGILFQNQALKIRRPKDYQAIPGMSATPTVHVP-------GVVSTVV 168

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
              + P+++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY FC Y +  +
Sbjct: 169 --QDSPNKIFIGGLPNYLNDDQVKELLSSFGPLKAFNLVKDSATSLSKGYAFCEYVETNL 226

Query: 349 TDI-------ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ- 400
           TD+       A A LNG+++G+K L V+RA+   ++   Q       Q  I I  ++L  
Sbjct: 227 TDLGWETTDKAIAGLNGMQLGEKKLIVQRASVGAKNAMNQGQ-----QVQINIPGLSLPG 281

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
           T+G NT             ++LCL   +T + L DDEEY++I+ED++EEC KYG + ++ 
Sbjct: 282 TTGPNT-------------EILCLMNMVTPEELKDDEEYDDIVEDVKEECQKYGQVRSLE 328

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           IPRP   G + PG GK+++E+   +    A+ AL+GRKF   TV   +Y  DKY  +++
Sbjct: 329 IPRPIP-GLDVPGCGKIYVEFMTVMDAQAAQRALAGRKFANRTVVTSFYDVDKYHRREF 386


>gi|351710523|gb|EHB13442.1| Splicing factor U2AF 65 kDa subunit [Heterocephalus glaber]
          Length = 904

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 164/412 (39%), Positives = 231/412 (56%), Gaps = 27/412 (6%)

Query: 93  RNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGA 152
           R  ++S    R   K K R  +D+ PP    +     P Q   + +A    A  +LP   
Sbjct: 144 RTAAQSCRSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMT 199

Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA 212
               A    PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ 
Sbjct: 200 PDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNP 258

Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP 272
           V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  
Sbjct: 259 VLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM- 311

Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
           S N ++   G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T
Sbjct: 312 SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSAT 369

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
           G SKGY FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  +
Sbjct: 370 GLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPV 425

Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
            +Q   L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC K
Sbjct: 426 TLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSK 477

Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           YG + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V
Sbjct: 478 YGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVV 528


>gi|303283510|ref|XP_003061046.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457397|gb|EEH54696.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 378

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 150/368 (40%), Positives = 214/368 (58%), Gaps = 30/368 (8%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVG L    ++  +  FF ++M A G      G  VV+ YIN EK FAF+E
Sbjct: 19  QATRHARRVYVGALTADVDDAHLTQFFEEIMLATGATKRVDGGCVVSTYINREKLFAFIE 78

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL----- 283
            +TVEEASNA+  DG+++ G  +R+RRP DYN   A+ LGP QP+PNLN +A+G+     
Sbjct: 79  FQTVEEASNALGFDGVVYGGQQLRLRRPNDYNIAQASLLGPQQPNPNLNYSAIGINHTPT 138

Query: 284 -ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
               +   +  P ++FVGGLP Y TE Q+KEL+ SFG +  F+LV D+DTG SKGY F  
Sbjct: 139 PMVASTENSTSPYKLFVGGLPNYITENQVKELVCSFGEIKAFNLVFDKDTGLSKGYAFWE 198

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRA----------TASGQSKTEQESILAQAQQHI 392
           + DP+V++ A   L+G+++G+K + V+ A           A+G+  T   S +A  QQ  
Sbjct: 199 FLDPSVSEAAIKGLDGMRLGEKLINVKFANGNPPPIGGYNAAGEDGT---STVAAQQQLG 255

Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
            +  + L T+    L GG+          + L   ++ + LAD  E  EILED  EEC  
Sbjct: 256 YVANVPLATA--TALTGGVE------TTCVRLKGMVSREELADPTEAAEILEDTEEECKG 307

Query: 453 YGTLVNVVIPRPDQNGGETP-GVGKVFLEYYDAVGCA-TAKNALSGRKFGGNTVNAFYYP 510
           +G+LV V++PRP  +    P GVG+V L++   V CA  A+ +L+GRKF    V A +  
Sbjct: 308 FGSLVKVLMPRPGPHPDLDPVGVGEVMLKFA-TVECARRAQRSLNGRKFADRLVGAVFVK 366

Query: 511 EDKYFNKD 518
           E  +  +D
Sbjct: 367 ESVFDERD 374


>gi|388856534|emb|CCF49840.1| related to pre-mRNA splicing factor U2AF large chain [Ustilago
           hordei]
          Length = 718

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 145/377 (38%), Positives = 214/377 (56%), Gaps = 39/377 (10%)

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           P +V  Q   R ARR+YVG +   ANE  +  FF++ M  +   +  PG+  V+  +N +
Sbjct: 330 PEEVAQQNNNRQARRLYVGNITHSANEPNMVAFFNEQMLKLKLGTE-PGEPAVSAQVNVD 388

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K +AFVE R  EEA+NAM+ DGI+F+G ++++RRP DY        GP    P  N+   
Sbjct: 389 KGYAFVEFRHPEEATNAMSFDGIVFQGQSLKIRRPKDYT-------GPDV-RPASNIHVP 440

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           G+ S  +   + P ++FVGGLP Y T+ Q+ ELL++FG L  F+LVKD   G SKG+ FC
Sbjct: 441 GVISTNV--PDSPHKIFVGGLPTYLTDDQVIELLQAFGELRAFNLVKDTANGASKGFAFC 498

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            Y D A+TD+AC  LNG+++GD+ L V+RA+   + K   ++I A      A+    + +
Sbjct: 499 EYVDTALTDLACQGLNGMELGDRNLVVQRASVGSEKKA--QAIAATGANAGALGDAGMPS 556

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           S     G G    GE  + ++ L   +T + L DDEEY +I+ED+REEC KYGT+ +V +
Sbjct: 557 SVQQFAGEGGDA-GEPRSCMVMLN-MVTPEELQDDEEYADIVEDIREECTKYGTVTDVRV 614

Query: 462 PRPDQ-------------------NGGETP-----GVGKVFLEYYDAVGCATAKNALSGR 497
           PRP +                   +GGE P     GVG+V++ Y +   CA A  A++GR
Sbjct: 615 PRPAKESKGAAAHQWKRTQDESAASGGEKPATEREGVGRVYVRYAETGDCAQALRAIAGR 674

Query: 498 KFGGNTVNAFYYPEDKY 514
           +FGG TV   +  ED +
Sbjct: 675 QFGGRTVICAFLKEDNW 691


>gi|71022561|ref|XP_761510.1| hypothetical protein UM05363.1 [Ustilago maydis 521]
 gi|46101379|gb|EAK86612.1| hypothetical protein UM05363.1 [Ustilago maydis 521]
          Length = 727

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 144/379 (37%), Positives = 211/379 (55%), Gaps = 41/379 (10%)

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           P ++  Q A R ARR+YVG +   ANEQ I  FF++ M  +   +  PG+  V+  +N +
Sbjct: 340 PAELAAQNANRQARRLYVGNITHQANEQNIVAFFNEQMLKLKLGTE-PGEPAVSAQVNVD 398

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K +AFVE R  EEA+NAM+ DGI+F+  ++++RRP DY        GP    P+ N+   
Sbjct: 399 KGYAFVEFRHPEEATNAMSFDGIVFQAQSLKIRRPKDYT-------GPDIRPPS-NIHVP 450

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           G+ S  +   + P ++FVGGLP Y  + Q+ ELL++FG L  F+LVKD  TG SKG+ FC
Sbjct: 451 GVISTNV--PDSPHKIFVGGLPTYLNDDQVIELLQAFGELRAFNLVKDTGTGASKGFAFC 508

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            Y DPA+TD+AC  LNG+++GD+ L V+RA+   + K +    LA    ++     A   
Sbjct: 509 EYVDPALTDLACQGLNGMELGDRNLVVQRASVGSEKKAQ---ALAATGANMGALGGAAVP 565

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           S +    G     GE    ++ L   +T + L DDEEY +I+ED+R+EC KYG + +V I
Sbjct: 566 SSVQKFAGDGGDAGEPTTCMVMLN-MVTPEELQDDEEYADIVEDIRDECNKYGAVSDVRI 624

Query: 462 PRP--------------DQNGG------------ETPGVGKVFLEYYDAVGCATAKNALS 495
           PRP               Q+ G            E  GVG+V++ Y +   CA A  A++
Sbjct: 625 PRPAKESKGAAAHQWKRSQDEGATTVDGEKATSAEREGVGRVYVRYGETEHCAQALRAIA 684

Query: 496 GRKFGGNTVNAFYYPEDKY 514
           GR+FGG TV   +  ED +
Sbjct: 685 GRQFGGRTVICAFLREDDW 703


>gi|308477324|ref|XP_003100876.1| CRE-UAF-1 protein [Caenorhabditis remanei]
 gi|308264450|gb|EFP08403.1| CRE-UAF-1 protein [Caenorhabditis remanei]
          Length = 496

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 155/385 (40%), Positives = 219/385 (56%), Gaps = 33/385 (8%)

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
           G  +  P   +NM   GA   G+     V V+    T  +RR+YVG +P   NE+A+  F
Sbjct: 145 GFENITPMEYKNMQASGAVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 203

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
           F+Q M  + G +  PG+ ++   IN +K FAF+E R+++E +  MA DGI F G  ++VR
Sbjct: 204 FNQQM-HLCGLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 262

Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
           RP DY P+        Q + ++N A + ++S  +   + P+++F+GGLP Y TE Q+KEL
Sbjct: 263 RPRDYQPS--------QNTFDMN-ARMPVSSIVV---DSPNKIFIGGLPNYLTEDQVKEL 310

Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
           L SFG L  F L  D   GNSKGY F  Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 311 LCSFGPLKAFSLNMD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 369

Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
              +T   + L  +   IA   ++ Q +G  T             ++LCL   +T D L 
Sbjct: 370 ---QTRHNTHLPNSASAIAGIDLS-QGAGRAT-------------EILCLMNMVTEDELR 412

Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
            DE+YEEILED+REEC KYG + ++ IPRP  +    PGVGKVF+E+     C  A+ AL
Sbjct: 413 SDEDYEEILEDVREECSKYGIVRSLEIPRP-YDEQPVPGVGKVFVEFATTSDCQRAQAAL 471

Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
           +GRKF   TV   YY  DKY N+ +
Sbjct: 472 TGRKFANRTVVTSYYDVDKYHNRQF 496


>gi|339243511|ref|XP_003377681.1| splicing factor U2AFsubunit [Trichinella spiralis]
 gi|316973494|gb|EFV57074.1| splicing factor U2AFsubunit [Trichinella spiralis]
          Length = 402

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 140/357 (39%), Positives = 206/357 (57%), Gaps = 30/357 (8%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+    T  +RR+YVG +P    E+A+  FF+Q M  + G +   G+ ++   IN +K
Sbjct: 76  VPVVGPSVTCQSRRLYVGNIPFGCTEEAMMDFFNQQMH-LCGLAQALGNPILACQINLDK 134

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R++ E + AMA DGI ++G ++++RRP DY           QP P  N    G
Sbjct: 135 NFAFIEFRSIAETTAAMAFDGINYQGQSLKIRRPRDY-----------QPLPGQNDTLAG 183

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           L S  +  A+ P ++F+GGLP Y +E Q+KELL SFG L  F+L+KD  T  SKGY F  
Sbjct: 184 LVSSVV--ADSPYKLFIGGLPNYLSEEQVKELLISFGQLKAFNLIKDPATQISKGYAFAE 241

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y D  +TD A A LNG+++GDK L V+ A           S+ A+     A   +A+Q  
Sbjct: 242 YSDSTLTDQAIAGLNGMQLGDKKLVVQLA-----------SVGAKNNMFSAAAPVAIQVP 290

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
           GMN +    +       ++LCL   + A+ L D+EEY++I+ED++EEC KYG++ +V IP
Sbjct: 291 GMNVVNPAAT----PATEILCLMNMVVAEELVDNEEYDDIVEDIKEECCKYGSVKSVEIP 346

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           RP + G + PGVGKVF+E+   + C  A+ AL+GRKF    V   Y+  D Y  + +
Sbjct: 347 RPIE-GLDVPGVGKVFVEFGTVMECQKAQQALTGRKFANRVVVTSYFDPDLYHRRQF 402


>gi|355756173|gb|EHH59920.1| hypothetical protein EGM_10153, partial [Macaca fascicularis]
          Length = 442

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 173/458 (37%), Positives = 245/458 (53%), Gaps = 45/458 (9%)

Query: 63  HDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPS-----------------RSP 105
           H      R++ R +E R+     +  S  FR+  +S+ P                  RSP
Sbjct: 10  HAHILHMRNFWRSRESRYGVFETNLKS--FRSAPRSVFPKQYKPLTRGAKEEHGGLIRSP 67

Query: 106 ---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
               K K R  +D+ PP    +     P Q   + +A    A  +LP       A    P
Sbjct: 68  RHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTP 123

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K
Sbjct: 124 VPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDK 182

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G
Sbjct: 183 NFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPG 235

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC 
Sbjct: 236 VVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCE 293

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y D  VTD A A LNG+++GDK L V+RA+   ++ T         Q  + +Q   L +S
Sbjct: 294 YVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGLMSS 353

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
            +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IP
Sbjct: 354 QVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIP 405

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
           RP  +G E PG GK+F+E+     C  A   L+GRKF 
Sbjct: 406 RP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFA 442


>gi|354486866|ref|XP_003505598.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Cricetulus
           griseus]
 gi|344242983|gb|EGV99086.1| Splicing factor U2AF 65 kDa subunit [Cricetulus griseus]
          Length = 469

 Score =  251 bits (641), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 163/414 (39%), Positives = 234/414 (56%), Gaps = 29/414 (7%)

Query: 107 KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ-LGAFPLMPVQV 165
           K K R  +D+ PP    +     P Q   + +A    A  +LP   +  L A P MPV V
Sbjct: 83  KKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTVTSDGLVASP-MPVPV 137

Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFA 225
           +  Q TR ARR+YVG +P    E+A+  FF+  M  +G  +  PG+ V+ V IN EK FA
Sbjct: 138 VGSQMTRQARRLYVGNIPFGITEEAMKDFFNAQM-QLGVLTQVPGNPVLAVQINQEKNFA 196

Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLAS 285
           F+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G+ S
Sbjct: 197 FLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVS 249

Query: 286 GAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
             +   +   ++F+GG+P Y  + ++KELL SFGTL  F+LVKD  TG SKGY FC Y D
Sbjct: 250 TVV--PDSAHKLFIGGMPSYLNDDKVKELLTSFGTLKAFNLVKDSATGLSKGYAFCEYLD 307

Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
              TD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L +S + 
Sbjct: 308 INATDQAIAGLNGMQLGDKKLIVQRASVGSKNAT----LSTINQTPVTVQVPGLMSSQVQ 363

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
            +GG  +       +VLCL   +    L DDEEYEEI+ED+R+EC KYG + ++ IPRP 
Sbjct: 364 -MGGHPT-------EVLCLMNMVLPKELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRPV 415

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           + G E PG G +F+E+     C  A   L+GR++    V   Y   D Y ++D+
Sbjct: 416 E-GVEVPGCGNIFVEFTSVFDCQKAMQGLTGRRYANKVVVTKYCDPDSYHSRDF 468


>gi|449669310|ref|XP_004206989.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Hydra
           magnipapillata]
          Length = 480

 Score =  251 bits (641), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 148/367 (40%), Positives = 201/367 (54%), Gaps = 35/367 (9%)

Query: 160 LMPVQVMTQ--QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
           ++P   + Q  Q T  ARR+Y G LP    E  +  FF+  M      +  PG+ V+   
Sbjct: 143 VIPATALPQGAQMTMQARRLYCGNLPFGITEDLMVDFFNAKMRE-SDMARQPGNPVLACQ 201

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
           IN EK FAF+E R+VEE + AMA DGII +G A+++RRP DY P +    G   P+    
Sbjct: 202 INLEKNFAFLEFRSVEETTLAMAFDGIILQGQALKIRRPKDYQP-IPGINGMAYPTLFAE 260

Query: 278 LAAV---GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
             A    G+ S  +  ++  ++VFVGGLP Y  E Q+KELL +FG L  F+LVKD  TG 
Sbjct: 261 SQATHIPGVVSTVV--SDTINKVFVGGLPNYLNEDQVKELLSTFGELRAFNLVKDSATGL 318

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKGY FC Y D  +TD+A A +NG+++GDK L V+RA+   ++ T Q +I          
Sbjct: 319 SKGYAFCEYVDIGITDVAIAGMNGMQLGDKKLIVQRASVGSKTMTAQLNI---------- 368

Query: 395 QKMALQTSGMNTLGGGMSLFGE-TLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
                          G  L  E T   +LCL   + A+ L DDE+Y+EI ED+REEC KY
Sbjct: 369 --------------PGFDLNKEITATNILCLMNMVVAEELMDDEDYDEIFEDIREECSKY 414

Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
           G + ++ IPRP+ N     G+GKVF+EY  +     A  AL+GRKF    V   YY  D 
Sbjct: 415 GRIRSMQIPRPN-NEFLVSGIGKVFIEYATSGESKVASEALAGRKFANRVVVTAYYDPDS 473

Query: 514 YFNKDYS 520
           Y   D+S
Sbjct: 474 YHRHDFS 480


>gi|348688506|gb|EGZ28320.1| hypothetical protein PHYSODRAFT_309221 [Phytophthora sojae]
          Length = 694

 Score =  251 bits (640), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 148/357 (41%), Positives = 218/357 (61%), Gaps = 18/357 (5%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMT-AIGGNSAGPGDAVVNVYINHEKKFAFV 227
           Q TRHARR+YVGG+  ++ E  I  FF+ V+  A+G    G   +VV+VYIN E+ FAFV
Sbjct: 351 QQTRHARRLYVGGIGEIS-EPEITAFFNDVIDRALGEKQEG--GSVVSVYINRERHFAFV 407

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGLASG 286
           E+RT+E  +  M LDG+ + G  +++RRP DYNP T+   LGP    P LNLAA+G+ S 
Sbjct: 408 ELRTIELTTACMNLDGVSYNGQPLKIRRPNDYNPATVPKDLGP---IPQLNLAALGIVST 464

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
            +  ++GP ++F+GGLPY+  E Q+KELL++FG L  F LVK+  +  SKGYGFC Y D 
Sbjct: 465 TV--SDGPGKIFIGGLPYHLNEEQVKELLQAFGPLRSFHLVKELSSNLSKGYGFCEYMDI 522

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSK---TEQESILAQAQQHIAIQKMALQTSG 403
            VTD AC  LN +++GDKTLTVRRA +   +K   +   ++    +  +   + A+Q   
Sbjct: 523 NVTDAACLGLNDMRLGDKTLTVRRAMSQENAKAVASAAGTVNTGLEMGLDPSRAAMQAM- 581

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
             +L G  SL   T ++V+ L   +T + L D+EEY +IL+D++ EC ++G + ++++PR
Sbjct: 582 --SLAGIPSLPLGTPSRVIVLLNMVTPEELEDEEEYADILDDIKGECERFGAVPSLLLPR 639

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           P    G    VGKVF+E+ D     +A   L GR F   TV   +  E KY  ++ +
Sbjct: 640 P--RDGIPSAVGKVFVEFGDVQSAQSAATELHGRGFSNRTVAVEFMDEGKYARRELA 694


>gi|1710361|gb|AAB38280.1| splicing factor U2AF65 [Caenorhabditis briggsae]
          Length = 488

 Score =  251 bits (640), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 152/387 (39%), Positives = 218/387 (56%), Gaps = 35/387 (9%)

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
           G  +  P   +NM   GA   G+     V V+    T  +RR+YVG +P   NE+A+  F
Sbjct: 136 GFENITPMEYKNMQASGAVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 194

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
           F+Q M  +   +  PG+ ++   IN +K FAF+E R+++E +  MA DGI F G  ++VR
Sbjct: 195 FNQQM-HLCNLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 253

Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
           RP DY P+        Q + ++N A + ++S  +  A   +++F+GGLP Y TE Q+KEL
Sbjct: 254 RPRDYQPS--------QNTFDMN-ARMPVSSIVVDSA---NKIFIGGLPNYLTEDQVKEL 301

Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
           L SFG L  F L  D   GNSKGY F  Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 302 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 360

Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMN-TLGGGMSLFGETLAKVLCLTEAITADAL 433
                       Q + +  +   A   +G++ + G G +       ++LCL   +T D L
Sbjct: 361 ------------QTRHNTHLPNSASAIAGIDLSQGAGRA------TEILCLMNMVTEDEL 402

Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
             DE+YEEILED+REEC KYG + ++ IPRP  +    PGVGKVF+E+     C  A+ A
Sbjct: 403 RSDEDYEEILEDVREECSKYGIVRSLEIPRP-YDDHPVPGVGKVFVEFATTSDCQRAQAA 461

Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           L+GRKF   TV   YY  DKY N+ ++
Sbjct: 462 LTGRKFANRTVVTSYYDVDKYHNRQFN 488


>gi|384939340|gb|AFI33275.1| splicing factor U2AF 65 kDa subunit isoform a [Macaca mulatta]
          Length = 475

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 167/420 (39%), Positives = 234/420 (55%), Gaps = 26/420 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T         Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPPSTINQTPVTLQVPGL 363

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+E++R+EC KYG + ++
Sbjct: 364 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEEVRDECSKYGLVKSI 415

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 416 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 474


>gi|268575804|ref|XP_002642882.1| C. briggsae CBR-UAF-1 protein [Caenorhabditis briggsae]
 gi|60415989|sp|P90727.2|U2AF2_CAEBR RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
           auxiliary factor 65 kDa subunit; Short=U2AF65; AltName:
           Full=U2 snRNP auxiliary factor large subunit
          Length = 488

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 152/387 (39%), Positives = 218/387 (56%), Gaps = 35/387 (9%)

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
           G  +  P   +NM   GA   G+     V V+    T  +RR+YVG +P   NE+A+  F
Sbjct: 136 GFENITPMEYKNMQASGAVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 194

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
           F+Q M  +   +  PG+ ++   IN +K FAF+E R+++E +  MA DGI F G  ++VR
Sbjct: 195 FNQQM-HLCNLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 253

Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
           RP DY P+        Q + ++N A + ++S  +  A   +++F+GGLP Y TE Q+KEL
Sbjct: 254 RPRDYQPS--------QNTFDMN-ARMPVSSIVVDSA---NKIFIGGLPNYLTEDQVKEL 301

Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
           L SFG L  F L  D   GNSKGY F  Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 302 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 360

Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMN-TLGGGMSLFGETLAKVLCLTEAITADAL 433
                       Q + +  +   A   +G++ + G G +       ++LCL   +T D L
Sbjct: 361 ------------QTRHNTHLPNSASAIAGIDLSQGAGRA------TEILCLMNMVTEDEL 402

Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
             DE+YEEILED+REEC KYG + ++ IPRP  +    PGVGKVF+E+     C  A+ A
Sbjct: 403 RSDEDYEEILEDVREECSKYGIVRSLEIPRP-YDDHPVPGVGKVFVEFATTSDCQRAQAA 461

Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           L+GRKF   TV   YY  DKY N+ ++
Sbjct: 462 LTGRKFANRTVVTSYYDVDKYHNRQFN 488


>gi|410905623|ref|XP_003966291.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Takifugu
           rubripes]
          Length = 458

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 156/389 (40%), Positives = 219/389 (56%), Gaps = 32/389 (8%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   + +P M  + L    T        PV V+  Q TR ARR+YVG +P    E++
Sbjct: 101 GQIPAT-ALLPTMTPDGLAVTPT--------PVPVVGSQMTRQARRLYVGNIPFGITEES 151

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G +
Sbjct: 152 MMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQS 210

Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
           +++RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q
Sbjct: 211 LKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQ 261

Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
           +KELL SFG L  F+LVKD  TG SKGY FC Y D  + D A A LNG+++GDK L V+R
Sbjct: 262 VKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQR 321

Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
           A+                Q  I    + LQ  G+N+    ++  G    +VLCL   +  
Sbjct: 322 ASXXXXXS---------FQTSINQTPVTLQVPGLNS---SVTQMGGVPTEVLCLMNMVAP 369

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
           + L DDEEYEEI+ED+R+ECGKYG + ++ IPRP  +G E PG GK+F+E+        A
Sbjct: 370 EELLDDEEYEEIVEDVRDECGKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFMSVFDSQKA 428

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
              L+GRKF    V   Y   D Y  +D+
Sbjct: 429 MQGLTGRKFANRVVVTKYCDPDAYHRRDF 457


>gi|343426615|emb|CBQ70144.1| related to pre-mRNA splicing factor U2AF large chain [Sporisorium
           reilianum SRZ2]
          Length = 710

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 151/413 (36%), Positives = 225/413 (54%), Gaps = 45/413 (10%)

Query: 132 QLPGVPSAVPEMAQNMLP---FGATQLGAFPLM----PVQVMTQQATRHARRVYVGGLPP 184
           Q P   S  P   Q   P   FG    G++P      P ++  Q A R ARR+YVG +  
Sbjct: 289 QHPYSRSQQPPHGQPYSPQAGFGGADNGSYPSSQGPSPAELAAQNANRQARRLYVGNITH 348

Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
             NE  I  FF++ M  +   +  PG+  V+  +N +K +AFVE R  +EA+NAM+ DGI
Sbjct: 349 QTNEHNIVAFFNEQMLKLKLGTE-PGEPAVSAQVNVDKGYAFVEFRHPDEATNAMSFDGI 407

Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
           +F+  ++++RRP DY        GP    P+ N+   G+ S  +   + P ++FVGGLP 
Sbjct: 408 VFQAQSLKIRRPKDYT-------GPDVRPPS-NIHVPGVISTNV--PDSPFKIFVGGLPT 457

Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
           Y T+ Q+ ELL++FG L  F+LVKD  TG SKG+ FC Y D A+TD+AC  LNG+++GD+
Sbjct: 458 YLTDDQVIELLQAFGELRAFNLVKDTGTGASKGFAFCEYVDTALTDLACQGLNGMELGDR 517

Query: 365 TLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCL 424
            L V+RA+   + K +    LA    +     +A   S +    G     GE  + ++ L
Sbjct: 518 NLVVQRASVGSEKKAQ---ALAATGANSGALGIAAVPSSVQQSAGEDGDAGEPTSCMVML 574

Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP----------------DQNG 468
              +T + L DDEEY +I+ED+R+EC KYG + +V +PRP                D++G
Sbjct: 575 N-MVTPEELQDDEEYADIVEDIRDECTKYGAVTDVRVPRPAKESKGAAAHQWKRSQDESG 633

Query: 469 --GETP-----GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
             G+ P     GVG+V++ Y +   CA A  A++GR+FGG TV   +  ED +
Sbjct: 634 AEGDKPDAEREGVGRVYVRYGETEHCAQALRAIAGRQFGGRTVICAFLKEDDW 686


>gi|327280717|ref|XP_003225098.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
           [Anolis carolinensis]
          Length = 467

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 166/429 (38%), Positives = 233/429 (54%), Gaps = 48/429 (11%)

Query: 107 KSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
           K K R  +D+ PP    +           GQ+P   + +P M  + L    T        
Sbjct: 70  KKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT-------- 120

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQD 179

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVP 232

Query: 282 GLASGAIGGAEGPDRVFVGGLPYY-----------FTETQIKELLESFGTLHGFDLVKDR 330
           G+ S  +   +   ++F+GGLP Y           F   Q+KELL SFG L  F+LVKD 
Sbjct: 233 GVVSTVV--PDSAHKLFIGGLPNYLNDDQVMFLPPFLSCQVKELLTSFGPLKAFNLVKDS 290

Query: 331 DTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQ 390
            TG SKGY FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q 
Sbjct: 291 ATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQT 346

Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC 450
            + +Q   L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC
Sbjct: 347 PVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDEC 398

Query: 451 GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYP 510
            KYG + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y  
Sbjct: 399 SKYGVVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCD 457

Query: 511 EDKYFNKDY 519
            D Y  +D+
Sbjct: 458 PDSYHRRDF 466


>gi|341891946|gb|EGT47881.1| hypothetical protein CAEBREN_25972 [Caenorhabditis brenneri]
          Length = 491

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 146/358 (40%), Positives = 208/358 (58%), Gaps = 34/358 (9%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+    T  +RR+YVG +P   NE+A+  FF+Q M  + G +  PG+ ++   IN +K
Sbjct: 167 VPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDFFNQQM-HLCGLAQAPGNPILLCQINLDK 225

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+++E +  MA DGI F G  ++VRRP DY P+        Q + ++N + + 
Sbjct: 226 NFAFIEFRSIDETTAGMAFDGINFMGQQLKVRRPRDYQPS--------QNTFDMN-SRMP 276

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           ++S  +  A   +++F+GGLP Y TE Q+KELL SFG L  F L  D   GNSKGY F  
Sbjct: 277 VSSIVVDSA---NKIFIGGLPNYLTEDQVKELLCSFGPLKAFSLNVD-SQGNSKGYAFAE 332

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y DP +TD A A LNG+++GDK L V+ A A+            Q + +  +   A   +
Sbjct: 333 YLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN------------QTRHNTHLPNSASAIA 380

Query: 403 GMN-TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           G++ + G G +       +VLCL   +T D L  DE+YEEILED+REEC KYG + ++ I
Sbjct: 381 GIDLSQGAGRA------TEVLCLMNMVTEDELKSDEDYEEILEDVREECSKYGIVRSLEI 434

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PRP  +    PGVGKVF+E+     C  A+ AL+GRKF   TV   YY  DKY N+ +
Sbjct: 435 PRP-YDEHPVPGVGKVFVEFASTSDCQRAQAALTGRKFANRTVVTSYYDVDKYHNRQF 491


>gi|47221657|emb|CAF97922.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 458

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 156/389 (40%), Positives = 221/389 (56%), Gaps = 32/389 (8%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   + +P M  + L    T        PV V+  Q TR ARR+YVG +P    E++
Sbjct: 101 GQIPAT-ALLPTMTPDGLAVTPT--------PVPVVGSQMTRQARRLYVGNIPFGITEES 151

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G +
Sbjct: 152 MMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQS 210

Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
           +++RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q
Sbjct: 211 LKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQ 261

Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
           +KELL SFG L  F+LVKD  TG SKGY FC Y D  + D A A LNG+++GDK L V+R
Sbjct: 262 VKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQR 321

Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
           A+   ++ T     L    Q      + LQ  G+N+    ++  G    +VLCL   +  
Sbjct: 322 ASVGSKNAT-----LTSINQ----TPVTLQVPGLNS---SVTQMGGVPTEVLCLMNMVAP 369

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
           + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+E+        A
Sbjct: 370 EELLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFMSVFDSQKA 428

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
              L+GRKF    V   Y   D Y  +D+
Sbjct: 429 MQGLTGRKFANRVVVTKYCDPDAYHRRDF 457


>gi|348526428|ref|XP_003450721.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
           [Oreochromis niloticus]
          Length = 458

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 156/389 (40%), Positives = 222/389 (57%), Gaps = 32/389 (8%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   + +P M  + L    T        PV V+  Q TR ARR+YVG +P    E++
Sbjct: 101 GQIPAT-ALLPTMTPDGLAVTPT--------PVPVVGSQMTRQARRLYVGNIPFGITEES 151

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G +
Sbjct: 152 MMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQS 210

Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
           +++RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q
Sbjct: 211 LKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQ 261

Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
           +KELL SFG L  F+LVKD  TG SKGY FC Y D  + D A A LNG+++GDK L V+R
Sbjct: 262 VKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQR 321

Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
           A+   ++ T     L+   Q      + LQ  G+N+    ++  G    +VLCL   +  
Sbjct: 322 ASVGSKNAT-----LSSINQ----TPVTLQVPGLNS---SVTQMGGLPTEVLCLMNMVAP 369

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
           + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+E+        A
Sbjct: 370 EELLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLEVPGTGKIFVEFTSVFDSQKA 428

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
              L+GRKF    V   Y   D Y  +D+
Sbjct: 429 MQGLTGRKFANRVVVTKYCDPDAYHRRDF 457


>gi|71480064|ref|NP_001025127.1| U2 small nuclear RNA auxiliary factor 2a [Danio rerio]
 gi|68533572|gb|AAH98548.1| U2 small nuclear RNA auxiliary factor 2a [Danio rerio]
          Length = 465

 Score =  248 bits (634), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 157/406 (38%), Positives = 227/406 (55%), Gaps = 26/406 (6%)

Query: 114 FDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRH 173
           +D+ PP    +     P Q   + +A    A  +LP    +  A    PV V+  Q TR 
Sbjct: 85  WDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPEGLAVTPTPVPVVGSQMTRQ 140

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           ARR+YVG +P    E+++  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+
Sbjct: 141 ARRLYVGNIPFGITEESMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 199

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
           E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G+ S  +   + 
Sbjct: 200 ETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDS 250

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  ++D A 
Sbjct: 251 AHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNISDQAI 310

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
           A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L  S +N +GG    
Sbjct: 311 AGLNGMQLGDKKLLVQRASVGSKNTT----LTGINQTPVTLQVPGLMNSSVNQMGG---- 362

Query: 414 FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
                 +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G + PG
Sbjct: 363 ---IPTEVLCLMNMVAPEELLDDEEYEEIVEDVRDECSKYGQVKSIEIPRP-VDGLDIPG 418

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            GK+F+E+        A   L+GRK     V   Y   D Y  +D+
Sbjct: 419 TGKIFVEFMSVFDSQKAMQGLTGRKSANRVVVTKYCDPDAYHRRDF 464


>gi|384939342|gb|AFI33276.1| splicing factor U2AF 65 kDa subunit isoform b [Macaca mulatta]
          Length = 471

 Score =  248 bits (634), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 167/420 (39%), Positives = 235/420 (55%), Gaps = 30/420 (7%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+E++R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEEVRDECSKYGLVKSI 411

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 412 EIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 470


>gi|301117646|ref|XP_002906551.1| splicing factor U2af large subunit, putative [Phytophthora
           infestans T30-4]
 gi|262107900|gb|EEY65952.1| splicing factor U2af large subunit, putative [Phytophthora
           infestans T30-4]
          Length = 569

 Score =  248 bits (633), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 147/357 (41%), Positives = 215/357 (60%), Gaps = 18/357 (5%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMT-AIGGNSAGPGDAVVNVYINHEKKFAFV 227
           Q TRHARR+YVGG+  ++ E  I  FF+ V+  A+G    G   +VV+VYIN E+ FAFV
Sbjct: 226 QQTRHARRLYVGGIGEIS-EPEITAFFNDVIDRALGEKQEG--GSVVSVYINRERHFAFV 282

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGLASG 286
           E+R++E  +  M LDG+ + G  +++RRP DYNP T+   LGP    P LNLAA+G+ S 
Sbjct: 283 ELRSIELTTACMNLDGVSYNGQPLKIRRPNDYNPATVPKDLGP---IPQLNLAALGIVST 339

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
            +  ++GP ++F+GGLPY+  E Q+KELL++FG L  F LVK+  +  SKGYGFC Y D 
Sbjct: 340 TV--SDGPGKIFIGGLPYHLNEEQVKELLQAFGPLRSFHLVKELSSNLSKGYGFCEYMDI 397

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSK---TEQESILAQAQQHIAIQKMALQTSG 403
            VTD AC  LN +++GDKTLTVRRA +   +K   +   ++    +  +     A+Q   
Sbjct: 398 NVTDAACIGLNDMQLGDKTLTVRRAMSQENAKAVASAAGTVNTGLEMGLDPSLAAMQAMS 457

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
           M    G  S+   T ++V+ L   +T + L D+EEY +IL+D++ EC ++G + ++++PR
Sbjct: 458 M---AGIPSVPLGTPSRVIVLLNMVTPEELEDEEEYADILDDIKGECERFGAVPSLLLPR 514

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           P    G    VGKVF+E+ D      A   L GR F   TV   +  E KY  ++ S
Sbjct: 515 PRD--GVLSAVGKVFVEFADVQSAQAAATELHGRGFSNRTVAVEFMDEGKYARREIS 569


>gi|345480698|ref|XP_001604333.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Nasonia
           vitripennis]
          Length = 455

 Score =  248 bits (633), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 148/363 (40%), Positives = 212/363 (58%), Gaps = 37/363 (10%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 128 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 186

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 187 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 240

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +             + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 241 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 288

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+    +K     I AQA   I +  +
Sbjct: 289 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVG--AKNPMPMIGAQAPVQIQVPGL 346

Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++  TSG  T             +VLCL   +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 347 SMVGTSGPPT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 393

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 394 RSVEIPRPIE-GVDVPGCGKVFVEFNSVLDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 452

Query: 517 KDY 519
           +++
Sbjct: 453 REF 455


>gi|198432988|ref|XP_002130386.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
           factor (U2AF) 2 isoform 1 [Ciona intestinalis]
          Length = 482

 Score =  248 bits (633), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 146/360 (40%), Positives = 211/360 (58%), Gaps = 29/360 (8%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V   Q TR ARR+YVG +P    E A+  FF+  M  I G +  PG  ++ V IN +K
Sbjct: 150 VPVAGSQMTRQARRLYVGNIPFGVTEDAMMDFFNNQMQ-IAGLAQAPGQPILAVQINLDK 208

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + A+A DGI F   ++++RRP+DY P L  +L   QP+ +L     G
Sbjct: 209 NFAFLEFRSVDETTQALAFDGINFMNQSLKIRRPSDYKP-LPGSLE--QPAIHLP----G 261

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +  ++   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY F  
Sbjct: 262 VISTVVQDSQ--HKMFIGGLPNYLNDDQVKELLTSFGPLRAFNLVKDSATALSKGYAFAE 319

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK-MALQT 401
           + D ++TD A A LNG+++GDK L V+RA           SI A+   H AI   + LQ 
Sbjct: 320 FADYSLTDQAIAGLNGMQLGDKKLIVQRA-----------SIGAKNNPHGAIMAPVTLQI 368

Query: 402 SGM-NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
            GM +  G G +        VLCL   +  + L DDEEYEEI+ED+++ECGK G++V++ 
Sbjct: 369 PGMAHATGAGPA------TTVLCLMNMVLPEELTDDEEYEEIMEDVKDECGKLGSVVSLE 422

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           IPRP     E  GVGK+++E+ + +    A  ALSGRKF    V   ++  ++Y ++ +S
Sbjct: 423 IPRPGPGLTEADGVGKIYVEFANHLDTQKAAQALSGRKFSNRVVVTSFFDPERYHHRQFS 482


>gi|50882018|gb|AAT85577.1| U2 small nuclear ribonucleoprotein auxiliary factor large subunit
           [Alvinella pompejana]
          Length = 479

 Score =  248 bits (632), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 159/416 (38%), Positives = 234/416 (56%), Gaps = 40/416 (9%)

Query: 113 GFD-MAPPAAAMLPGAAVPGQL--PGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMT-- 167
           GF+ M P     L G   PG L  PG+P   P +A N        + A P  P+ + T  
Sbjct: 95  GFEHMTPMQYKALHGLGPPGGLAAPGMPVVAPVIAAN-------NVVASPTAPMALNTTI 147

Query: 168 ----QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
                  +R ARR+YVG +P    E+ +  +F+  M  + G +   G+ V+  ++N +K 
Sbjct: 148 PFAGSAISRQARRLYVGNIPFGVTEEMMMDYFNTQMK-MAGLAQAEGNPVIACHVNLDKN 206

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           FAF+E R+V+E + AMA DGI F+G ++++RRP DY P    A  P       ++A  G+
Sbjct: 207 FAFLEFRSVDETTQAMAFDGINFQGQSLKIRRPKDYQPLPGMAEVP-------SVAVPGV 259

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
            S  +   +   ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKGY FC Y
Sbjct: 260 VSTVV--QDSAHKIFIGGLPNYLNEDQVKELLTSFGLLKAFNLVKDSATGLSKGYAFCEY 317

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
            DP++TD ACA LNG+++GD+ L V+RA+   ++     ++L    Q   +    +Q  G
Sbjct: 318 LDPSITDQACAGLNGMQLGDEKLIVQRASVGAKNAQGGPNVLPVQLQIPGLNMAQVQGPG 377

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
             T             +VLCL   +T + L D+EEYEEILED++EEC KYG + ++ IPR
Sbjct: 378 PTT-------------EVLCLMNMVTPEDLEDEEEYEEILEDVKEECSKYGYVKSIEIPR 424

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           P + G E PGVGK+F+E+   + C  A+ AL+GRKF    V + Y+  D+Y  +++
Sbjct: 425 PIK-GVEVPGVGKIFVEFNSVIDCQKAQQALTGRKFSNRVVVSSYFDPDRYHRREF 479


>gi|328766440|gb|EGF76494.1| hypothetical protein BATDEDRAFT_21058 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 551

 Score =  247 bits (631), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 136/352 (38%), Positives = 205/352 (58%), Gaps = 27/352 (7%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           Q   R  +R+Y G +P    E+ I +F S     +G     PG+A VN YIN E+ +AFV
Sbjct: 227 QPIARQFKRLYFGNIPVDCIEERILSFASSSYEKLG-LPKDPGNAAVNAYINRERNYAFV 285

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           E R+ EEA+ AMALDG +F+G  ++VRRP DYNP  AA  G  QPS           + A
Sbjct: 286 EFRSPEEATRAMALDGSLFDGNILKVRRPKDYNPE-AAPDGATQPS----------IAPA 334

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
               E  D++FVG +P Y ++ Q++ELL++FG L  F L++D  TG SKG+ FC Y D  
Sbjct: 335 TSAQESLDKIFVGAIPTYLSDDQVQELLKTFGELKTFSLIRDSATGLSKGFAFCEYVDGQ 394

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           +TD AC  LNG+++G+K L V+RA+      T    I A  Q  +   ++ L T   +  
Sbjct: 395 ITDAACQGLNGMELGEKKLIVQRASVGSNKNT----ISAVGQSQLLPMEI-LATIAKDPC 449

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
                     + +VL L   + ++ L  DE+Y++IL D++EEC K+GT++++ IPRP  +
Sbjct: 450 ---------KVTRVLLLLNMVVSEDLVSDEDYQDILLDIQEECEKFGTILDIAIPRP-VS 499

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           G    GVGK+F+++ +    A+A++AL+GRKF    V A ++ EDK+  +D+
Sbjct: 500 GQSNAGVGKIFVKFDNVKQSASAQHALAGRKFADRVVIASFFDEDKFDQQDF 551


>gi|309271453|ref|XP_003085312.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Mus musculus]
          Length = 730

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 161/419 (38%), Positives = 227/419 (54%), Gaps = 29/419 (6%)

Query: 103 RSPS--KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPL 160
           RSP   K K R  +D+ PP    +     P Q   + +A   +A  +LP       A   
Sbjct: 338 RSPCHEKKKVRKYWDVPPPGFEHV----TPMQYKAMQAAGQILATALLPTMTPGGLAVTP 393

Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
           MPV V+  Q TR ARR+YVG +P    E+++  F +  M  + G +  PG+ ++ V IN 
Sbjct: 394 MPVPVVGSQMTRQARRLYVGTIPFGITEESMLDFLNTQM-HLRGLTQAPGNPILAVQINL 452

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           +K FAF+E R+V+E + A+A DGIIF+G ++++RRP DY P    +  P    P      
Sbjct: 453 DKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMSGSPSVYVP------ 506

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
            G+ S  +   +   ++F+GGLP Y  + Q+KELL S G L  F+LVKD  TG SKGY F
Sbjct: 507 -GVVSTIV--PDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYAF 563

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
           C Y D  VTD A A LNG+ +GDK L V+RA+    +K    SI+ Q         + LQ
Sbjct: 564 CEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVG--AKNVALSIINQT-------PVTLQ 614

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
             G+ +    M   G     VLCL   +    L DDEEYEEI++D+R+EC KYG + ++ 
Sbjct: 615 VPGLTSSQVQM---GGHPTTVLCLMNMVLPKELLDDEEYEEIVDDVRDECNKYGLVKSIE 671

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           IPRP  +G E PG GK+F+E+   + C  A   L+GRKF    V   Y   D Y  +++
Sbjct: 672 IPRP-MDGVEVPGCGKIFVEFTSVIDCQKAMQGLTGRKFANRVVVTKYCDPDSYHGRNF 729


>gi|71996475|ref|NP_001022967.1| Protein UAF-1, isoform a [Caenorhabditis elegans]
 gi|6226906|sp|P90978.2|U2AF2_CAEEL RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
           auxiliary factor 65 kDa subunit; Short=U2AF65; AltName:
           Full=U2 snRNP auxiliary factor large subunit
 gi|3334906|gb|AAC26982.1| splicing factor U2AF65 [Caenorhabditis elegans]
 gi|351018334|emb|CCD62278.1| Protein UAF-1, isoform a [Caenorhabditis elegans]
          Length = 496

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 149/385 (38%), Positives = 211/385 (54%), Gaps = 33/385 (8%)

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
           G  +  P   +NM   G    G+     V V+    T  +RR+YVG +P   NE+A+  F
Sbjct: 145 GFETTTPMEYKNMQAAGQVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 203

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
           F+Q M  + G +  PG+ ++   IN +K FAF+E R+++E +  MA DGI F G  ++VR
Sbjct: 204 FNQQM-HLCGLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 262

Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
           RP DY P+        Q + ++N + + +++  +  A   +++F+GGLP Y TE Q+KEL
Sbjct: 263 RPRDYQPS--------QNTFDMN-SRMPVSTIVVDSA---NKIFIGGLPNYLTEDQVKEL 310

Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
           L SFG L  F L  D   GNSKGY F  Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 311 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 369

Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
            Q                      L  S     G  +S       ++LCL   +T D L 
Sbjct: 370 QQR-----------------HNTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELK 412

Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
            D+EYEEILED+R+EC KYG + ++ IPRP ++    PGVGKVF+E+     C  A+ AL
Sbjct: 413 ADDEYEEILEDVRDECSKYGIVRSLEIPRPYED-HPVPGVGKVFVEFASTSDCQRAQAAL 471

Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
           +GRKF   TV   YY  DKY N+ +
Sbjct: 472 TGRKFANRTVVTSYYDVDKYHNRQF 496


>gi|71996485|ref|NP_001022969.1| Protein UAF-1, isoform c [Caenorhabditis elegans]
 gi|351018336|emb|CCD62280.1| Protein UAF-1, isoform c [Caenorhabditis elegans]
          Length = 474

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 149/385 (38%), Positives = 211/385 (54%), Gaps = 33/385 (8%)

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
           G  +  P   +NM   G    G+     V V+    T  +RR+YVG +P   NE+A+  F
Sbjct: 123 GFETTTPMEYKNMQAAGQVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 181

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
           F+Q M  + G +  PG+ ++   IN +K FAF+E R+++E +  MA DGI F G  ++VR
Sbjct: 182 FNQQM-HLCGLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 240

Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
           RP DY P+        Q + ++N + + +++  +  A   +++F+GGLP Y TE Q+KEL
Sbjct: 241 RPRDYQPS--------QNTFDMN-SRMPVSTIVVDSA---NKIFIGGLPNYLTEDQVKEL 288

Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
           L SFG L  F L  D   GNSKGY F  Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 289 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 347

Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
            Q                      L  S     G  +S       ++LCL   +T D L 
Sbjct: 348 QQR-----------------HNTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELK 390

Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
            D+EYEEILED+R+EC KYG + ++ IPRP ++    PGVGKVF+E+     C  A+ AL
Sbjct: 391 ADDEYEEILEDVRDECSKYGIVRSLEIPRPYED-HPVPGVGKVFVEFASTSDCQRAQAAL 449

Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
           +GRKF   TV   YY  DKY N+ +
Sbjct: 450 TGRKFANRTVVTSYYDVDKYHNRQF 474


>gi|71996490|ref|NP_001022970.1| Protein UAF-1, isoform d [Caenorhabditis elegans]
 gi|351018337|emb|CCD62281.1| Protein UAF-1, isoform d [Caenorhabditis elegans]
          Length = 471

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 149/385 (38%), Positives = 211/385 (54%), Gaps = 33/385 (8%)

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
           G  +  P   +NM   G    G+     V V+    T  +RR+YVG +P   NE+A+  F
Sbjct: 120 GFETTTPMEYKNMQAAGQVPRGSVQ-SAVPVVGPSVTCQSRRLYVGNIPFGCNEEAMLDF 178

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
           F+Q M  + G +  PG+ ++   IN +K FAF+E R+++E +  MA DGI F G  ++VR
Sbjct: 179 FNQQM-HLCGLAQAPGNPILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVR 237

Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
           RP DY P+        Q + ++N + + +++  +  A   +++F+GGLP Y TE Q+KEL
Sbjct: 238 RPRDYQPS--------QNTFDMN-SRMPVSTIVVDSA---NKIFIGGLPNYLTEDQVKEL 285

Query: 315 LESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATAS 374
           L SFG L  F L  D   GNSKGY F  Y DP +TD A A LNG+++GDK L V+ A A+
Sbjct: 286 LCSFGPLKAFSLNVD-SQGNSKGYAFAEYLDPTLTDQAIAGLNGMQLGDKQLVVQLACAN 344

Query: 375 GQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA 434
            Q                      L  S     G  +S       ++LCL   +T D L 
Sbjct: 345 QQR-----------------HNTNLPNSASAIAGIDLSQGAGRATEILCLMNMVTEDELK 387

Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
            D+EYEEILED+R+EC KYG + ++ IPRP ++    PGVGKVF+E+     C  A+ AL
Sbjct: 388 ADDEYEEILEDVRDECSKYGIVRSLEIPRPYED-HPVPGVGKVFVEFASTSDCQRAQAAL 446

Query: 495 SGRKFGGNTVNAFYYPEDKYFNKDY 519
           +GRKF   TV   YY  DKY N+ +
Sbjct: 447 TGRKFANRTVVTSYYDVDKYHNRQF 471


>gi|309266895|ref|XP_003086891.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial [Mus
           musculus]
          Length = 493

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 162/419 (38%), Positives = 229/419 (54%), Gaps = 29/419 (6%)

Query: 103 RSPS--KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPL 160
           RSP   K K R  +D+ PP    +     P Q   + +A   +A  +LP       A   
Sbjct: 101 RSPCHEKKKVRKYWDVPPPGFEHV----TPMQYKAMQAAGQILATALLPTMTPGGLAVTP 156

Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
           MPV V+  Q TR ARR+YVG +P    E+++  F +  M  + G +  PG+ ++ V IN 
Sbjct: 157 MPVPVVGSQMTRQARRLYVGTIPFGITEESMLDFLNTQM-HLRGLTQAPGNPILAVQINL 215

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           +K FAF+E R+V+E + A+A DGIIF+G ++++RRP DY P    +   G PS    +  
Sbjct: 216 DKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMS---GSPS----VYV 268

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
            G+ S  +   +   ++F+GGLP Y  + Q+KELL S G L  F+LVKD  TG SKGY F
Sbjct: 269 PGVVSTIV--PDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYAF 326

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
           C Y D  VTD A A LNG+ +GDK L V+RA+    +K    SI+ Q         + LQ
Sbjct: 327 CEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVG--AKNVALSIINQT-------PVTLQ 377

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
             G+ +    M   G     VLCL   +    L DDEEYEEI++D+R+EC KYG + ++ 
Sbjct: 378 VPGLTSSQVQM---GGHPTTVLCLMNMVLPKELLDDEEYEEIVDDVRDECNKYGLVKSIE 434

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           IPRP  +G E PG GK+F+E+   + C  A   L+GRKF    V   Y   D Y  +++
Sbjct: 435 IPRP-MDGVEVPGCGKIFVEFTSVIDCQKAMQGLTGRKFANRVVVTKYCDPDSYHGRNF 492


>gi|322792032|gb|EFZ16137.1| hypothetical protein SINV_12499 [Solenopsis invicta]
          Length = 344

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 146/364 (40%), Positives = 211/364 (57%), Gaps = 41/364 (11%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 19  PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 77

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 78  NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 131

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +             + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 132 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 179

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I AQA   I +   
Sbjct: 180 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVP-- 233

Query: 398 ALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
                       G+S+ G +    +VLCL   +T + L ++EEYE+ILED++EEC KYG 
Sbjct: 234 ------------GLSMVGTSGPATEVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGV 281

Query: 456 LVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           + +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY 
Sbjct: 282 VRSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYH 340

Query: 516 NKDY 519
            +++
Sbjct: 341 RREF 344


>gi|148697816|gb|EDL29763.1| mCG68163 [Mus musculus]
          Length = 472

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 162/419 (38%), Positives = 229/419 (54%), Gaps = 29/419 (6%)

Query: 103 RSPS--KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPL 160
           RSP   K K R  +D+ PP    +     P Q   + +A   +A  +LP       A   
Sbjct: 80  RSPCHEKKKVRKYWDVPPPGFEHV----TPMQYKAMQAAGQILATALLPTMTPGGLAVTP 135

Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
           MPV V+  Q TR ARR+YVG +P    E+++  F +  M  + G +  PG+ ++ V IN 
Sbjct: 136 MPVPVVGSQMTRQARRLYVGTIPFGITEESMLDFLNTQM-HLRGLTQAPGNPILAVQINL 194

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           +K FAF+E R+V+E + A+A DGIIF+G ++++RRP DY P    +   G PS    +  
Sbjct: 195 DKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMS---GSPS----VYV 247

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
            G+ S  +   +   ++F+GGLP Y  + Q+KELL S G L  F+LVKD  TG SKGY F
Sbjct: 248 PGVVSTIV--PDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYAF 305

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
           C Y D  VTD A A LNG+ +GDK L V+RA+    +K    SI+ Q         + LQ
Sbjct: 306 CEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVG--AKNVALSIINQT-------PVTLQ 356

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
             G+ +    M   G     VLCL   +    L DDEEYEEI++D+R+EC KYG + ++ 
Sbjct: 357 VPGLTSSQVQM---GGHPTTVLCLMNMVLPKELLDDEEYEEIVDDVRDECNKYGLVKSIE 413

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           IPRP  +G E PG GK+F+E+   + C  A   L+GRKF    V   Y   D Y  +++
Sbjct: 414 IPRP-MDGVEVPGCGKIFVEFTSVIDCQKAMQGLTGRKFANRVVVTKYCDPDSYHGRNF 471


>gi|350417886|ref|XP_003491628.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 2
           [Bombus impatiens]
          Length = 428

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 147/363 (40%), Positives = 212/363 (58%), Gaps = 39/363 (10%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +             + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 216 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 263

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I AQA   I +  +
Sbjct: 264 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 319

Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++  TSG  T             +VLCL   +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 320 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 366

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 367 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 425

Query: 517 KDY 519
           +++
Sbjct: 426 REF 428


>gi|383854116|ref|XP_003702568.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Megachile
           rotundata]
          Length = 432

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 148/363 (40%), Positives = 214/363 (58%), Gaps = 35/363 (9%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +       G +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 216 V------PGTV--PDSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 267

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I AQA   I +  +
Sbjct: 268 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 323

Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++  TSG  T             +VLCL   +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 324 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 370

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 371 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 429

Query: 517 KDY 519
           +++
Sbjct: 430 REF 432


>gi|340715832|ref|XP_003396412.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Bombus
           terrestris]
 gi|350417884|ref|XP_003491627.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 1
           [Bombus impatiens]
          Length = 432

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 148/363 (40%), Positives = 214/363 (58%), Gaps = 35/363 (9%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +       G +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 216 V------PGTV--PDSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 267

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I AQA   I +  +
Sbjct: 268 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 323

Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++  TSG  T             +VLCL   +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 324 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 370

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 371 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 429

Query: 517 KDY 519
           +++
Sbjct: 430 REF 432


>gi|307176032|gb|EFN65791.1| Splicing factor U2AF 50 kDa subunit [Camponotus floridanus]
          Length = 432

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 148/363 (40%), Positives = 214/363 (58%), Gaps = 35/363 (9%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +       G +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 216 V------PGTV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSATGLSKG 267

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I AQA   I +  +
Sbjct: 268 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 323

Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++  TSG  T             +VLCL   +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 324 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 370

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 371 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 429

Query: 517 KDY 519
           +++
Sbjct: 430 REF 432


>gi|332026432|gb|EGI66560.1| Splicing factor U2AF 50 kDa subunit [Acromyrmex echinatior]
          Length = 435

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 147/363 (40%), Positives = 212/363 (58%), Gaps = 39/363 (10%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 110 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 168

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 169 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 222

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +             + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 223 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 270

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I AQA   I +  +
Sbjct: 271 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 326

Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++  TSG  T             +VLCL   +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 327 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 373

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 374 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 432

Query: 517 KDY 519
           +++
Sbjct: 433 REF 435


>gi|402590758|gb|EJW84688.1| hypothetical protein WUBG_04401 [Wuchereria bancrofti]
          Length = 477

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 139/358 (38%), Positives = 203/358 (56%), Gaps = 34/358 (9%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+    T  +RR+YVG +P   +E A+  FF+Q M  + G +  PG+ V+   +N +K
Sbjct: 153 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQM-HLCGLAQAPGNPVLACQMNLDK 211

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+++E +  MA DGI F G  +++RRP DY P           S + +L  + 
Sbjct: 212 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPM----------STSYDLGNM- 260

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +   + P ++F+GGLP Y    Q+KELL SFG L  F+LV ++ TG SKGY F  
Sbjct: 261 MVSNIV--PDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAE 318

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y DP++TD A A LNG+++GDK L V+ + A+ ++   Q +               +Q +
Sbjct: 319 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVA 366

Query: 403 GMN-TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           G++ + G G         +VLCL   +T D L DDEEYE+ILED+REEC KYG + ++ I
Sbjct: 367 GIDLSHGAGPP------TEVLCLMNMVTEDELKDDEEYEDILEDIREECAKYGIVKSLEI 420

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PR    G +  GVGKVF+E+     C  A+ AL+GRKF   TV   YY  D Y  + +
Sbjct: 421 PR-SVPGVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYHRRQF 477


>gi|307195151|gb|EFN77144.1| Splicing factor U2AF 50 kDa subunit [Harpegnathos saltator]
          Length = 402

 Score =  245 bits (625), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 145/364 (39%), Positives = 210/364 (57%), Gaps = 41/364 (11%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 77  PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 135

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG   +P++N
Sbjct: 136 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 189

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +             + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 190 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 237

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I  QA   I +   
Sbjct: 238 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGTQAPVQIQVP-- 291

Query: 398 ALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
                       G+S+ G +    +VLCL   +T + L ++EEYE+ILED++EEC KYG 
Sbjct: 292 ------------GLSMVGSSGPATEVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGV 339

Query: 456 LVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           + +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY 
Sbjct: 340 VRSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYH 398

Query: 516 NKDY 519
            +++
Sbjct: 399 RREF 402


>gi|170575889|ref|XP_001893425.1| U2 auxiliary factor 65 kDa subunit [Brugia malayi]
 gi|158600599|gb|EDP37742.1| U2 auxiliary factor 65 kDa subunit, putative [Brugia malayi]
          Length = 502

 Score =  244 bits (624), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 138/358 (38%), Positives = 203/358 (56%), Gaps = 34/358 (9%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+    T  +RR+YVG +P   +E A+  FF+Q M  + G +  PG+ V+   +N +K
Sbjct: 178 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQM-HLCGLAQAPGNPVLACQMNLDK 236

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+++E +  MA DGI F G  +++RRP DY P           S + +L  + 
Sbjct: 237 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPM----------STSYDLGNMM 286

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           +++      + P ++F+GGLP Y    Q+KELL SFG L  F+LV ++ TG SKGY F  
Sbjct: 287 VSNIV---PDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAE 343

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y DP++TD A A LNG+++GDK L V+ + A+ ++   Q +               +Q +
Sbjct: 344 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVA 391

Query: 403 GMN-TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           G++ + G G         +VLCL   +T D L DDEEYE+ILED+REEC KYG + ++ I
Sbjct: 392 GIDLSHGAGPP------TEVLCLMNMVTEDELKDDEEYEDILEDIREECAKYGIVKSLEI 445

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PR    G +  GVGKVF+E+     C  A+ AL+GRKF   TV   YY  D Y  + +
Sbjct: 446 PR-SVPGVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYHRRQF 502


>gi|393909510|gb|EJD75480.1| hypothetical protein LOAG_17389 [Loa loa]
          Length = 502

 Score =  244 bits (623), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 138/358 (38%), Positives = 203/358 (56%), Gaps = 34/358 (9%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+    T  +RR+YVG +P   +E A+  FF+Q M  + G +  PG+ V+   +N +K
Sbjct: 178 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQM-HLCGLAQAPGNPVLACQMNLDK 236

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+++E +  MA DGI F G  +++RRP DY P           S + +L  + 
Sbjct: 237 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPM----------STSYDLGNMM 286

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           +++      + P ++F+GGLP Y    Q+KELL SFG L  F+LV ++ TG SKGY F  
Sbjct: 287 VSNIV---PDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAE 343

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y DP++TD A A LNG+++GDK L V+ + A+ ++   Q +               +Q +
Sbjct: 344 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVA 391

Query: 403 GMN-TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           G++ + G G         +VLCL   +T D L DDEEYE+ILED+REEC KYG + ++ I
Sbjct: 392 GIDLSHGAGPP------TEVLCLMNMVTEDELKDDEEYEDILEDIREECAKYGIVKSLEI 445

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           PR    G +  GVGKVF+E+     C  A+ AL+GRKF   TV   YY  D Y  + +
Sbjct: 446 PR-SVPGVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYHRRQF 502


>gi|380016747|ref|XP_003692335.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Apis florea]
          Length = 428

 Score =  244 bits (623), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 146/363 (40%), Positives = 211/363 (58%), Gaps = 39/363 (10%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA D I F+G ++++RRP DY P       PG   +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +             + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 216 VP------------DSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 263

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I AQA   I +  +
Sbjct: 264 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 319

Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++  TSG  T             +VLCL   +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 320 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 366

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 367 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 425

Query: 517 KDY 519
           +++
Sbjct: 426 REF 428


>gi|66520699|ref|XP_623055.1| PREDICTED: splicing factor U2AF 50 kDa subunit [Apis mellifera]
          Length = 432

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 147/363 (40%), Positives = 213/363 (58%), Gaps = 35/363 (9%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 103 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 161

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-SPNLN 277
           N +K FAF+E R+++E + AMA D I F+G ++++RRP DY P       PG   +P++N
Sbjct: 162 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPM------PGMTDNPSMN 215

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +       G +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKG
Sbjct: 216 V------PGTV--PDSPHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKG 267

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D ++TD A A LNG+++GDK L V+RA+   ++      I AQA   I +  +
Sbjct: 268 YAFCEYVDVSMTDQAIAGLNGMQLGDKKLIVQRASVGAKNPM----IGAQAPVQIQVPGL 323

Query: 398 ALQ-TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++  TSG  T             +VLCL   +T + L ++EEYE+ILED++EEC KYG +
Sbjct: 324 SMVGTSGPAT-------------EVLCLLNMVTPEELMEEEEYEDILEDIKEECNKYGVV 370

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +V IPRP + G + PG GKVF+E+   + C  A+  L+GRKF    V   Y+  DKY  
Sbjct: 371 RSVEIPRPIE-GVDVPGCGKVFVEFNSVIDCQKAQQTLTGRKFNNRVVVTSYFDPDKYHR 429

Query: 517 KDY 519
           +++
Sbjct: 430 REF 432


>gi|405976087|gb|EKC40607.1| Splicing factor U2AF 50 kDa subunit [Crassostrea gigas]
          Length = 428

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 148/349 (42%), Positives = 205/349 (58%), Gaps = 31/349 (8%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R ARR+YVG +P    E+A+  FF+  M   G   A  G  V+ V IN +K FAF+E R
Sbjct: 111 SRQARRLYVGNIPFGVTEEAMMDFFNHQMKMTGLAQA-EGSPVIAVQINLDKNFAFLEFR 169

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +V+E + AMA DGI F+G ++++RRP DY P    A      +P++N+   G+ S  +  
Sbjct: 170 SVDETTQAMAFDGINFQGQSLKIRRPRDYQPLPGMA-----ETPSVNVP--GVVSTVV-- 220

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKGY FC Y DP VTD
Sbjct: 221 QDSPHKIFIGGLPNYLNEDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDPNVTD 280

Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
             CA LNG+++GDK L V+RA+            L      + +Q   LQ  G+N   G 
Sbjct: 281 QGCAGLNGMQLGDKKLIVQRAS------------LGAKNSQVPVQ---LQIPGLNLNQG- 324

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                    +VLCL   I  + L D+EEYE+ILED++EEC KYG + ++ IPRP + G +
Sbjct: 325 ----AGPPTEVLCLMNMIVPEELEDEEEYEDILEDVKEECSKYGVVRSIEIPRPIK-GVD 379

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            PG GK+F+E+   + C  A+ AL+GRKF    V   YY  DKY  +++
Sbjct: 380 VPGCGKIFVEFNSIIDCQKAQQALTGRKFSNRVVVTSYYDPDKYHRREF 428


>gi|198432986|ref|XP_002130494.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
           factor (U2AF) 2 isoform 2 [Ciona intestinalis]
          Length = 472

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 145/360 (40%), Positives = 208/360 (57%), Gaps = 35/360 (9%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V   Q TR ARR+YVG +P    E A+  FF+  M  I G +  PG  ++ V IN +K
Sbjct: 146 VPVAGSQMTRQARRLYVGNIPFGVTEDAMMDFFNNQMQ-IAGLAQAPGQPILAVQINLDK 204

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+V+E + A+A DGI F   ++++RRP+DY P L  +L   QP+ +L     G
Sbjct: 205 NFAFLEFRSVDETTQALAFDGINFMNQSLKIRRPSDYKP-LPGSLE--QPAIHLP----G 257

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           + S  +  ++   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  T  SKGY F  
Sbjct: 258 VISTVVQDSQ--HKMFIGGLPNYLNDDQVKELLTSFGPLRAFNLVKDSATALSKGYAFAE 315

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH--IAIQKMALQ 400
           + D ++TD A A LNG+++GDK L V+RA           SI A+   H  I I  MA  
Sbjct: 316 FADYSLTDQAIAGLNGMQLGDKKLIVQRA-----------SIGAKNNPHGAIMIPGMAHA 364

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
           T      G G +        VLCL   +  + L DDEEYEEI+ED+++ECGK G++V++ 
Sbjct: 365 T------GAGPA------TTVLCLMNMVLPEELTDDEEYEEIMEDVKDECGKLGSVVSLE 412

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           IPRP     E  GVGK+++E+ + +    A  ALSGRKF    V   ++  ++Y ++ +S
Sbjct: 413 IPRPGPGLTEADGVGKIYVEFANHLDTQKAAQALSGRKFSNRVVVTSFFDPERYHHRQFS 472


>gi|403175591|ref|XP_003888994.1| hypothetical protein PGTG_22302 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375171670|gb|EHS64431.1| hypothetical protein PGTG_22302 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 713

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 145/369 (39%), Positives = 201/369 (54%), Gaps = 35/369 (9%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIG----GNSAGPG-----DAVVNVY 217
           TQ   R ARR+YVG +   ANE  +A FF+  M  +G     N  G       + VV V 
Sbjct: 329 TQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLARNNEDGMAISISENPVVAVQ 388

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
           +NHEK +AFVE R  EEA++ M+ DGIIF+  A+++RRP DY        GP    P   
Sbjct: 389 VNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYT-------GPDHAGPT-- 439

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN--S 335
               G+ S  +   + P+++F+GGLP Y T+ Q+ ELL+SFG L  F+LVKD  +G   S
Sbjct: 440 -HIPGVVSTNV--PDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGHVS 496

Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL-AQAQQHIAI 394
           KG+ FC Y DP +TDIAC  LNG+++GD+ L V+RA     +K E+E+    Q   +   
Sbjct: 497 KGFAFCEYVDPDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKENNPDGQRNNYNQF 556

Query: 395 QKMA-LQTSGMNTLGGGMSLFGE-TLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
              A  Q +   +        GE    +VL +   +  + L DD+EY EILED+R+ECGK
Sbjct: 557 NNFAGGQATAAASSVLAAVKSGEGEKTRVLQMLNMVNQEELVDDQEYGEILEDIRDECGK 616

Query: 453 YGTLVNVVIPRPDQN---------GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNT 503
           YG +  V IPRP +N              G+GKVF+ +     CA A  A++GR+F G  
Sbjct: 617 YGKIEGVRIPRPIKNEKGRIDLKASESVDGLGKVFVMFEKVDECAAALLAIAGRQFAGRV 676

Query: 504 VNAFYYPED 512
           +   Y PED
Sbjct: 677 IICAYAPED 685


>gi|427789501|gb|JAA60202.1| Putative splicing factor u2af large subunit rrm superfamily
           [Rhipicephalus pulchellus]
          Length = 462

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 146/361 (40%), Positives = 210/361 (58%), Gaps = 28/361 (7%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V ++    TR ARR+YVG +P   +E+ +  +F+  M A G + A PG+ V+   I
Sbjct: 130 PPAAVPIVGGTITRQARRLYVGNIPFGCSEEEMMDYFNAQMHACGFSQA-PGNPVLACQI 188

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG  S   ++
Sbjct: 189 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGM-SETPSV 241

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
           A  G+ S  +   + P ++F+GGLP Y  E Q++ELL SFG L  F+LVKD  TG SKGY
Sbjct: 242 AVPGVISTVV--QDSPHKIFIGGLPNYLNEDQVRELLMSFGQLRAFNLVKDSATGLSKGY 299

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            FC Y + A TD A   LNG+++GDK L V+RA+    +K  Q +     Q  + IQ   
Sbjct: 300 AFCEYVEVATTDQAIMGLNGMQLGDKKLIVQRASVG--AKNSQMN-----QAPVQIQVPG 352

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           LQ  G    G G         +VLCL   +  + L D+EEYE+ILED+ EEC KYG + +
Sbjct: 353 LQLQG----GAGPP------TEVLCLMNLVCPEELKDEEEYEDILEDIHEECNKYGVVKS 402

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP  +G E PG GK ++E+   + C  A+ +L+GRKF    V   Y+  DKY  ++
Sbjct: 403 IEIPRPI-DGVEVPGCGKAYVEFNSVIDCQKAQQSLTGRKFSNRVVVTSYFDPDKYHRRE 461

Query: 519 Y 519
           +
Sbjct: 462 F 462


>gi|226483519|emb|CAX74060.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
 gi|226483521|emb|CAX74061.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
          Length = 518

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 170/447 (38%), Positives = 234/447 (52%), Gaps = 36/447 (8%)

Query: 76  KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
           K +RH  R RS SS     +   + P           GF+   PA      A+  GQ   
Sbjct: 105 KSKRHHSRHRSASSPTLVYKYWDVPPP----------GFEHVTPAQYKALQAS--GQ--- 149

Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
           +P  V    Q  +P  A          V        R ARR+YVG +P  A E+ +  FF
Sbjct: 150 IPVNVYAAGQVPMPVHAPNAPLTLTTNVPFAGSAVCRQARRLYVGNIPFTATEENMMEFF 209

Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
           ++ M A G   A  G+ ++ V IN EK FAF+E R+V+E +  +ALDG++F+  A+++RR
Sbjct: 210 NKQMRAQGLIQA-EGNPIIAVQINMEKNFAFLEFRSVDETTQGLALDGVLFQNQALKLRR 268

Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
           P DY P    +  P    P       G+ S  +   + P ++FVGGLP Y  E Q+KELL
Sbjct: 269 PRDYAPLPGVSEQPSVIVP-------GVVSTVV--QDSPHKIFVGGLPNYLNEDQVKELL 319

Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
            SFG L GF+LVKD  TG SKGY FC Y D  VTD ACA LNG+++GDK L V+RA+   
Sbjct: 320 LSFGPLKGFNLVKDGSTGLSKGYAFCEYVDANVTDHACAGLNGMQLGDKKLIVQRASVGA 379

Query: 376 QSKTEQESILAQAQQHI-AIQKMALQTSGMNTLGGGMSLF--GETLAKVLCLTEAITADA 432
           +  T    +L Q    +  ++   +Q    NT G G      G    +VLCL   I    
Sbjct: 380 KHTT---GVLPQTLLSLPGLEDGTVQ----NTTGSGNITIRSGGPPTEVLCLMNMIETSE 432

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
           L DDEEYE+I+ED+R EC KYG + ++ IPRP   G E PGVGK+++E+   + C  A  
Sbjct: 433 LEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIP-GVEVPGVGKIYVEFASLIDCQKAAT 491

Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           AL+GRKF    V   ++  D Y  +++
Sbjct: 492 ALTGRKFNQRLVVTSFFSPDNYHRREF 518


>gi|328862941|gb|EGG12041.1| hypothetical protein MELLADRAFT_41759 [Melampsora larici-populina
           98AG31]
          Length = 397

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 141/368 (38%), Positives = 198/368 (53%), Gaps = 46/368 (12%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA---------VVNVY 217
           TQ   R ARR+YVG +   ANE  +A FF+  M  +G       D          VV V 
Sbjct: 26  TQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLVRNGEDGSMISISENPVVAVQ 85

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALG--PGQPSPN 275
           +NHEK +AFVE R  EEA++ M+ DGIIF+  A+++RRP DY  T   +    PG  S N
Sbjct: 86  VNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYTGTEHTSTNHIPGVVSTN 145

Query: 276 LNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN- 334
           +               + P+++F+GGLP Y T+ Q+ ELL+SFG L  F+LVKD  +G  
Sbjct: 146 V--------------PDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGQ 191

Query: 335 -SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
            SKG+ FC Y D  +TDIAC  LNG+++GD+ L V+RA     +K E+E+        + 
Sbjct: 192 VSKGFAFCEYVDSDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKENAEGAGGAGMG 251

Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           +       +G N    G         KVL +   +  + L DDEEY+EILED++EEC KY
Sbjct: 252 VGGF----NGTNRASEG------ERTKVLQMLNMVNPEELVDDEEYKEILEDIKEECSKY 301

Query: 454 GTLVNVVIPRPDQN---------GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           G + +V IPRP +N              G+GKVF+++     CA A +A++GR+F G  +
Sbjct: 302 GQIEDVKIPRPAKNEKGRMDLKSSESIEGLGKVFIKFERIEDCAQALSAIAGRQFAGRVI 361

Query: 505 NAFYYPED 512
              Y  ED
Sbjct: 362 ICAYASED 369


>gi|440900150|gb|ELR51345.1| Splicing factor U2AF 65 kDa subunit, partial [Bos grunniens mutus]
          Length = 411

 Score =  241 bits (615), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 162/425 (38%), Positives = 230/425 (54%), Gaps = 39/425 (9%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           RSP    K K R  +D+ PP    +           GQ+P   + +P M  + L    T 
Sbjct: 17  RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 74

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
                  PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+
Sbjct: 75  -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 126

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
            V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S 
Sbjct: 127 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 179

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           N ++   G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG 
Sbjct: 180 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 237

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKGY FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +
Sbjct: 238 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTL 293

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
           Q   L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+ECGKYG
Sbjct: 294 QVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECGKYG 345

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            + ++ IPR            ++F+E+     C  A   L+GRKF    V   Y   D Y
Sbjct: 346 LVKSIEIPRQAWVFTSLILFLQIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSY 405

Query: 515 FNKDY 519
             +D+
Sbjct: 406 HRRDF 410


>gi|321479007|gb|EFX89963.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 1
           [Daphnia pulex]
          Length = 487

 Score =  241 bits (615), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 154/422 (36%), Positives = 224/422 (53%), Gaps = 56/422 (13%)

Query: 134 PGVPSAVPEMA-QNMLPFGATQLGAFPLMPVQVMTQQA------------TRHARRVYVG 180
           P +   +P +  +++ P     + A   +P  +MT+ A            TR ARR+YVG
Sbjct: 86  PSIYWDIPPIGFEHITPMQYKAMQAAGQIPANIMTEVAPQVQVPVVGNTITRQARRLYVG 145

Query: 181 GLP-----PLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEA 235
            +P       A E+ +  FF+Q M  + G S  PG  ++   IN +K FAF+E R+++E 
Sbjct: 146 NIPFGVSDVRAAEEEMMDFFNQQM-HLSGLSQAPGHPILACQINLDKNFAFLEFRSIDET 204

Query: 236 SNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA---- 291
           S AMA DGI F+G  +++RRP DY+PT     G G           G++SG I       
Sbjct: 205 SQAMAFDGINFKGQTLKIRRPHDYHPTPGGGGGGGGGG-GGPETTPGMSSGGITEKRAGG 263

Query: 292 --------------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
                         + P ++F+GGLP Y  E Q  ELL SFG L  F+LVKD  TG SKG
Sbjct: 264 GGGGGGGTMSTIVPDTPHKLFIGGLPNYLNEEQ--ELLMSFGQLRAFNLVKDTATGLSKG 321

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y FC Y D  VTD A + LNG+++GDK + ++RA+   ++        A A   + +Q  
Sbjct: 322 YAFCEYADVTVTDQAISGLNGMQLGDKKIIIQRASVGAKN--------AAAYAQMPVQ-- 371

Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
            +Q  G N   G     G+   +VLCL   +T + L DDEEYEEI++D+REEC ++G + 
Sbjct: 372 -IQVPGFNLAAGP----GQP-TEVLCLLNMVTPEELRDDEEYEEIVDDIREECNRHGAVR 425

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
           +V IPRP +   + PGVGKVF+E+     C  A+ AL+GRKF    V   +Y  ++Y  +
Sbjct: 426 SVEIPRPLEGVDDVPGVGKVFVEFISVSDCVKAQQALTGRKFANRIVVTSFYEPERYHRR 485

Query: 518 DY 519
           D+
Sbjct: 486 DF 487


>gi|256074204|ref|XP_002573416.1| splicing factor u2af large subunit [Schistosoma mansoni]
 gi|238658595|emb|CAZ29648.1| splicing factor u2af large subunit, putative [Schistosoma mansoni]
          Length = 521

 Score =  241 bits (614), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 170/447 (38%), Positives = 234/447 (52%), Gaps = 36/447 (8%)

Query: 76  KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPG 135
           K +RH  R RS SS     +   + P           GF+   PA      A+  GQ   
Sbjct: 108 KSKRHHSRHRSASSPTLVYKYWDVPPP----------GFEHVTPAQYKALQAS--GQ--- 152

Query: 136 VPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFF 195
           +P  V    Q  +P  A          V        R ARR+YVG +P  A E+ +  FF
Sbjct: 153 IPVNVYAAGQVPMPVHAPNAPLTLTTNVPFAGSAVCRQARRLYVGNIPFTATEENMMEFF 212

Query: 196 SQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR 255
           ++ M A G   A  G+ ++ V IN EK FAF+E R+V+E +  +ALDG++F+  A+++RR
Sbjct: 213 NKQMRAQGLIQA-EGNPIIAVQINMEKNFAFLEFRSVDETTQGLALDGVLFQNQALKLRR 271

Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
           P DY P    +  P    P       G+ S  +   + P ++FVGGLP Y  E Q+KELL
Sbjct: 272 PRDYAPLPGVSEQPSVIVP-------GVVSTVV--QDSPHKIFVGGLPNYLNEDQVKELL 322

Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
            SFG L GF+LVKD  TG SKGY FC Y D  VTD ACA LNG+++GDK L V+RA+   
Sbjct: 323 LSFGPLKGFNLVKDGSTGLSKGYAFCEYVDANVTDHACAGLNGMQLGDKKLIVQRASVGA 382

Query: 376 QSKTEQESILAQAQQHI-AIQKMALQTSGMNTLGGGMSLF--GETLAKVLCLTEAITADA 432
           +  T    +L Q    +  ++   +Q    NT G G      G    +VLCL   I    
Sbjct: 383 KHTT---GVLPQTLLSLPGLEDGTVQ----NTTGSGNITIRSGGPPTEVLCLMNMIETSE 435

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
           L DDEEYE+I+ED+R EC KYG + ++ IPRP   G E PGVGK+++E+   + C  A  
Sbjct: 436 LEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIP-GVEVPGVGKIYVEFASLIDCQKAAT 494

Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           AL+GRKF    V   ++  D Y  +++
Sbjct: 495 ALTGRKFNQRLVVTSFFSPDNYHRREF 521


>gi|299470773|emb|CBN79819.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 636

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 143/404 (35%), Positives = 222/404 (54%), Gaps = 58/404 (14%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
           + Q TRHARR+Y+GG P    E+ +++FF++V+          G AV +VY++ EK FAF
Sbjct: 239 SNQQTRHARRLYIGGCPK-TTEEEMSSFFNEVINR-ALEYPIDGGAVASVYVSQEKAFAF 296

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           +E++T+E A++ + LDGI+++   +++RRP+DYNP L  A     P P LNL+ +G+ S 
Sbjct: 297 LELKTMELATSVLELDGIVYKETQLKMRRPSDYNPQLVPAAS--GPIPKLNLSVLGIISS 354

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
            +  ++  ++VF+GGLPY  TE Q++ELL +FG L  F LV+D  +  SKGYGFC Y + 
Sbjct: 355 TVPDSD--NKVFIGGLPYNLTEDQVRELLSAFGPLKSFHLVRDPGSPTSKGYGFCEYLNA 412

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQS----------------------------- 377
            VT IAC  L+G+ +GDKTLTVR AT  G S                             
Sbjct: 413 GVTAIACEGLHGMTLGDKTLTVRPATDRGSSGGQQQQQQQQQQQQHYQQQAMPGMQMGGG 472

Query: 378 --------KTEQESI-LAQAQQHI-------AIQKMALQTSGMNTLGGGMSLFGETLAK- 420
                    T+QE + LAQA   +       A+  +A   +G        S     L + 
Sbjct: 473 QQMGLTAPPTQQEQMQLAQAMAALDPNASQAAMASLAQGGAGAVPSAAAESAVAANLIRA 532

Query: 421 -----VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
                VL +   +T   L D +EYE+I++D+++EC  +GT++++++PRP +    +  VG
Sbjct: 533 IPPTRVLVMLNMVTEAELRDPQEYEDIVDDIQQECSSHGTVLSIIVPRPGE-ADASRAVG 591

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           KVF+EY        A  +L+GR+FG N V   Y  E+K+  +++
Sbjct: 592 KVFVEYDTKDSAQKAALSLAGRQFGANIVKVEYLNEEKFARREF 635


>gi|443731660|gb|ELU16702.1| hypothetical protein CAPTEDRAFT_155651 [Capitella teleta]
          Length = 480

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 145/349 (41%), Positives = 209/349 (59%), Gaps = 25/349 (7%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R ARR+YVG +P    E+ +  FF+  M  +   S   G+ V+   +N +K FAF+E R
Sbjct: 157 SRQARRLYVGNIPFGVTEEMMMDFFNTQMK-MAALSQAEGNPVIACQVNLDKNFAFLEYR 215

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           +V+E ++AMALDGI F+G ++++RRP DY P    A  P       N+A  G+ S  +  
Sbjct: 216 SVDETTHAMALDGINFQGQSLKIRRPKDYQPLPGIAETP-------NVAVPGVVSTVV-- 266

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            +   ++F+GGLP Y  E Q+KELL SFG L  F LVKD  TG SKGY FC Y D ++TD
Sbjct: 267 QDSAHKIFIGGLPNYLNEDQVKELLTSFGPLKAFSLVKDSATGLSKGYAFCEYLDISITD 326

Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
            ACA LNG+++GDK L V+RA+   ++     +I++ A   + I  +A+           
Sbjct: 327 QACAGLNGMQLGDKKLIVQRASVGAKNA----AIISSAPMQMQIPGLAVNP--------- 373

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
           M+  G    +VLCL   +  + L D+EEYE+ILED++EEC KYG + ++ IPRP + G E
Sbjct: 374 MAAAGPA-TEVLCLMNMVLPEELEDEEEYEDILEDVKEECSKYGFVKSLEIPRPIK-GVE 431

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            PGVGK+F+E++       A+NALSGRKF   TV   Y   DKY  +++
Sbjct: 432 VPGVGKIFVEFHSITDSQKAQNALSGRKFANRTVITSYCDPDKYHRREF 480


>gi|340373805|ref|XP_003385430.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Amphimedon
           queenslandica]
          Length = 529

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 141/370 (38%), Positives = 195/370 (52%), Gaps = 56/370 (15%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           Q TR ARR+Y+G +P    E+ +  FF++ M      SA PG  V+ V IN +K FAF+E
Sbjct: 198 QLTRQARRLYIGNIPFGIAEEVMVNFFNEKMLEAKLCSA-PGIPVLAVQINMDKNFAFIE 256

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R+VEE +NAMA DGI+ +G ++++RRP DY P     + P            G+ S  +
Sbjct: 257 FRSVEETTNAMAFDGIVLQGQSLKIRRPKDYAPIPGVDIMPKH--------VPGVISTVV 308

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
              +GP +VF GGLP Y ++ Q+KELL SFG L  F+LVKD  T  SKGY F  Y D  V
Sbjct: 309 --PDGPHKVFCGGLPTYLSDDQVKELLSSFGDLKAFNLVKDSGTSFSKGYCFFEYLDTDV 366

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQE-----------------SILAQAQQH 391
           TD A   LNG+ +GDK L V+RA+   +   E +                  + + AQ  
Sbjct: 367 TDGAIQGLNGMALGDKKLVVQRASVGAKVMEEYDISTDITSMAMPISIPGLQMPSTAQ-- 424

Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG 451
                                    T   VLCL    T + L DD+EYE ILED+REEC 
Sbjct: 425 -------------------------TATTVLCLMNMTTEEELRDDDEYEGILEDVREECS 459

Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
            YG +++V  PRP + G   PG+GK+F+E+     C  A+ AL+GRKF    V   ++  
Sbjct: 460 NYGQVLSVAAPRPVE-GTLVPGLGKIFVEFAATDQCQRAQTALAGRKFANRVVVTSFFDL 518

Query: 512 DKYFNKDYSA 521
           +KY  K+++A
Sbjct: 519 EKYRQKNFAA 528


>gi|331243454|ref|XP_003334370.1| splicing factor U2AF subunit [Puccinia graminis f. sp. tritici CRL
           75-36-700-3]
          Length = 600

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 147/379 (38%), Positives = 203/379 (53%), Gaps = 55/379 (14%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIG----GNSAGPG-----DAVVNVY 217
           TQ   R ARR+YVG +   ANE  +A FF+  M  +G     N  G       + VV V 
Sbjct: 216 TQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLARNNEDGMAISISENPVVAVQ 275

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
           +NHEK +AFVE R  EEA++ M+ DGIIF+  A+++RRP DY        GP    P   
Sbjct: 276 VNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYT-------GPDHAGPT-- 326

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN--S 335
               G+ S  +   + P+++F+GGLP Y T+ Q+ ELL+SFG L  F+LVKD  +G   S
Sbjct: 327 -HIPGVVSTNV--PDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGHVS 383

Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQ 395
           KG+ FC Y DP +TDIAC  LNG+++GD+ L V+RA     +K E+E+     Q++   Q
Sbjct: 384 KGFAFCEYVDPDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKEN-NPDGQRNNYNQ 442

Query: 396 KMALQTSGMNTLGGGMSLF-------------GETLAKVLCLTEAITADALADDEEYEEI 442
                    N   GG +               GE   +VL +   +  + L DD+EY EI
Sbjct: 443 --------FNNFAGGQATAAASSVLAAVKSGEGEK-TRVLQMLNMVNQEELVDDQEYGEI 493

Query: 443 LEDMREECGKYGTLVNVVIPRPDQN---------GGETPGVGKVFLEYYDAVGCATAKNA 493
           LED+R+ECGKYG +  V IPRP +N              G+GKVF+ +     CA A  A
Sbjct: 494 LEDIRDECGKYGKIEGVRIPRPIKNEKGRIDLKASESVDGLGKVFVMFEKVDECAAALLA 553

Query: 494 LSGRKFGGNTVNAFYYPED 512
           ++GR+F G  +   Y PED
Sbjct: 554 IAGRQFAGRVIICAYAPED 572


>gi|195380577|ref|XP_002049047.1| GJ20975 [Drosophila virilis]
 gi|194143844|gb|EDW60240.1| GJ20975 [Drosophila virilis]
          Length = 428

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 128/354 (36%), Positives = 200/354 (56%), Gaps = 26/354 (7%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
            TR ARR+YVG +P    ++ +  FF++ +  + G     G+AV+    N +K FAF+E 
Sbjct: 96  VTRQARRLYVGNIPFSTTDEDMMAFFNEQIHRLNGTQGHDGNAVLTCQTNLDKNFAFLEF 155

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNP----TLAAALGPGQPSPNLNLAAVGLAS 285
           R+++EA+ A+  DGI + G  +++RRP DY+P    T A  +   + +  ++ + + ++ 
Sbjct: 156 RSMDEATQAITFDGISYRGQTLKIRRPHDYHPVASITTAEIVEIAKGATQIHASNLPISP 215

Query: 286 GAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
                 + P ++++GGLP    E Q+KELL +FG L GF+LVKD  TG SKG+ FC Y D
Sbjct: 216 VV---PDSPHKIYIGGLPTCLNEEQVKELLVTFGQLRGFNLVKDTITGQSKGFAFCEYLD 272

Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
           P++T+ A A LNG+++GD+ L V+R+             +A  +  +A Q   LQ  G  
Sbjct: 273 PSITEQAIAGLNGMQLGDRKLVVQRS-------------IAGVRNMVASQLPVLQVPGF- 318

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
                + +      +VLCL   +    L D++EYE+I  D+++EC KYG + ++ IPRP 
Sbjct: 319 ----PIDVSTCKATEVLCLLNMVLPSELLDNDEYEDIRTDIKQECAKYGKVKSLKIPRPI 374

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +  +T G GKVF+ +     C  A NALSGRKF G  V   +Y  DKY  KD+
Sbjct: 375 GDPPQT-GCGKVFVRFESIEDCKKALNALSGRKFSGRIVMTSFYDLDKYKRKDF 427


>gi|358338608|dbj|GAA57078.1| splicing factor U2AF large subunit [Clonorchis sinensis]
          Length = 518

 Score =  237 bits (605), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 166/422 (39%), Positives = 228/422 (54%), Gaps = 50/422 (11%)

Query: 113 GFDMAPPA--AAMLPGAAVP------GQLPGVPSAVPEMAQNM---LPFGATQLGAFPLM 161
           GF+   PA   AM     +P      GQ+P +P+  P     +   +PF  + +      
Sbjct: 132 GFEHVTPAQYKAMQASGQIPVNVYAAGQIP-MPAHAPNAPLTLTTNIPFAGSAV------ 184

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
                     R ARR+YVG +P  A E+ +  FF++ M A G   A  G  ++ V IN E
Sbjct: 185 ---------CRQARRLYVGNIPFTATEENMMEFFNKQMRAQGLVQAE-GSPIIAVQINME 234

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K FAF+E R+V+E +  +ALDGI+F   A+++RRP DY P    +  P    P       
Sbjct: 235 KNFAFLEFRSVDETTQGLALDGILFHNQALKLRRPRDYAPLPGVSETPSVIVP------- 287

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           G+ S  +   + P +VFVGGLP Y  E Q+KELL SFG L GF+LVKD  TG SKGY FC
Sbjct: 288 GVVSTVV--QDSPHKVFVGGLPNYLNEDQVKELLLSFGPLKGFNLVKDGSTGLSKGYAFC 345

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT--ASGQSKTEQESILAQAQQHIAIQKMAL 399
            Y DP VTD ACA LNG+++GDK L V+RA+  A   + T   ++L    Q   +    +
Sbjct: 346 EYVDPNVTDHACAGLNGMQLGDKKLIVQRASVGAKHNATTAAPALL----QLPGLTDTLV 401

Query: 400 QTSGMNTLGGGMSLF--GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
           Q    NT G G      G    +VLCL   I    L DDEEYE+I+ED+R EC KYG + 
Sbjct: 402 Q----NTTGTGNITIRSGGPPTEVLCLMNMIDPAELEDDEEYEDIVEDVRAECSKYGVVR 457

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
           ++ IPRP + G E PGVGK+++E+   + C  A  AL+GRKF    V   ++  D Y  +
Sbjct: 458 SLEIPRPIR-GVEVPGVGKIYVEFASLIDCQKAATALTGRKFNQRLVVTSFFNPDNYHRR 516

Query: 518 DY 519
           ++
Sbjct: 517 EF 518


>gi|195489053|ref|XP_002092574.1| GE11595 [Drosophila yakuba]
 gi|194178675|gb|EDW92286.1| GE11595 [Drosophila yakuba]
          Length = 437

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 133/362 (36%), Positives = 205/362 (56%), Gaps = 38/362 (10%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIG--GNSAGPGDAVVNVYINHEKKFAFVE 228
           TR ARR+YVG +P    E+ +  FF+Q + A+G  G     G AV+    N EK FAF+E
Sbjct: 98  TRQARRLYVGNIPFGVTEEEMMEFFNQQLMALGLEGAQYLDGKAVLTCQTNLEKNFAFLE 157

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALG----------PGQPSPNLNL 278
            R+++EA+ A+  DGIIF G  +++RRP DY P  +  +           P   + N  +
Sbjct: 158 FRSMDEATQALNFDGIIFRGQILKIRRPHDYQPVPSIRVSNMESYRSFRLPATTTTNPPI 217

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
           + + ++S      + P++++VGGLP    + QIK+LL+SFG L G +LVKD +T  SKG+
Sbjct: 218 STIAVSSIV---PDSPNKIYVGGLPTCLDQDQIKDLLQSFGELKGLNLVKDINTSLSKGF 274

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            F  Y DP+VTD A A L+G+++GD+ L V+R+   G++              +++Q+  
Sbjct: 275 AFFEYIDPSVTDHAIAGLHGMQLGDRRLVVQRSIPGGKN-------------GLSVQQPI 321

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           +Q  G++TL     L   +  ++LCL   +  + L D+EE+E+I  D+++EC KYG + +
Sbjct: 322 VQVPGISTL-----LDPGSPTEILCLLNMVLPEELLDNEEFEDIRSDIKQECAKYGDVRS 376

Query: 459 VVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           + IPRP    G+ P  G GKVF+++        A  ALSGRKF G  V   +Y  DKY  
Sbjct: 377 IKIPRP---VGQFPKRGCGKVFVQFESVEDSQKALKALSGRKFSGRIVMTSFYDPDKYLQ 433

Query: 517 KD 518
            D
Sbjct: 434 DD 435


>gi|297837403|ref|XP_002886583.1| hypothetical protein ARALYDRAFT_338287 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332424|gb|EFH62842.1| hypothetical protein ARALYDRAFT_338287 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 233

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 132/246 (53%), Positives = 169/246 (68%), Gaps = 20/246 (8%)

Query: 276 LNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNS 335
           + L+  G  +G +   EGPDR+FVGGLP+YFT+ QI+E+LE  G L GF+L+KDR TG+S
Sbjct: 4   VELSTTGSTTGDL--VEGPDRIFVGGLPHYFTDAQIREILECLGPLRGFNLLKDRQTGDS 61

Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESI-LAQAQQHIA 393
           KGY FCVYQDP+VTDIACAALNG+K+GDKTL VRRA   + Q K EQE +    AQQ IA
Sbjct: 62  KGYAFCVYQDPSVTDIACAALNGIKIGDKTLAVRRAMQGTIQPKPEQEEVLQQIAQQQIA 121

Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           +Q++ L+  G+ T             K++CL++ +T D L + EEY +I   MR+E GK+
Sbjct: 122 LQRLMLEPGGIPT-------------KIVCLSQLVTIDNLRNYEEYADI---MRQEGGKF 165

Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
           G LVNVVIPRP+ + G TPGVG VFLEY D  G + A+  ++GR  GG  V A YYPEDK
Sbjct: 166 GNLVNVVIPRPNPDHGPTPGVGNVFLEYADVDGSSKARLEMNGRIVGGYQVVAVYYPEDK 225

Query: 514 YFNKDY 519
           Y   DY
Sbjct: 226 YAQGDY 231


>gi|443898020|dbj|GAC75358.1| splicing factor U2AF, large subunit [Pseudozyma antarctica T-34]
          Length = 699

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 145/407 (35%), Positives = 222/407 (54%), Gaps = 42/407 (10%)

Query: 132 QLPGV-PSAVPEMAQNMLPFGATQLGAFPLMPVQVM-TQQATRHARRVYVGGLPPLANEQ 189
           Q PG+ P    +  Q   PF        P    Q + T  + R ARR+YVG +   ANE 
Sbjct: 284 QHPGMYPQDGQQYTQGQPPFAGQYQQGHPASHNQALATADSGRQARRLYVGNITHQANEP 343

Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
           ++  FF++ M  +   +  PG+  V+  +N +K +AFVE R  +EA+NAM+ DGI+F+G 
Sbjct: 344 SMVAFFNEQMLKLKLGTE-PGEPAVSAQVNVDKGYAFVEFRHPDEATNAMSFDGIVFQGQ 402

Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
           ++++RRP DY        GP    P+ ++   G+ S  +   + P ++FVGGLP Y T+ 
Sbjct: 403 SLKIRRPKDYT-------GPDVRPPS-SIHVPGVISTNV--PDSPFKIFVGGLPTYLTDD 452

Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
           Q+ ELL++FG L  F+LVKD  T  SKG+ FC Y D A+TD+AC  LNG+++GD+ L V+
Sbjct: 453 QVIELLQAFGELRSFNLVKDPATNASKGFAFCEYVDTALTDLACQGLNGMELGDRNLVVQ 512

Query: 370 RATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAIT 429
           RA+   + K +     A A     +  + + +S     G G    GE  + ++ L   +T
Sbjct: 513 RASVGSEKKAQ-----AIAAYGANVGALGVPSSVQQFAGAGGDA-GEPTSCMVMLN-MVT 565

Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRP--------------DQNG------- 468
            + L DDEEY +I+ED+R+EC K+GT+ +V +PRP               QN        
Sbjct: 566 PEELQDDEEYADIVEDIRDECTKFGTVNDVRVPRPAKESKGAAAHQWKRSQNDEAADAGK 625

Query: 469 -GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
             E  GVG+V++ Y +   CA A  +++GR+FGG TV   +  ED +
Sbjct: 626 PSEREGVGRVYVRYAETDQCAQALKSIAGRQFGGRTVICAFLKEDDW 672


>gi|195124159|ref|XP_002006561.1| GI21125 [Drosophila mojavensis]
 gi|193911629|gb|EDW10496.1| GI21125 [Drosophila mojavensis]
          Length = 427

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 128/352 (36%), Positives = 199/352 (56%), Gaps = 23/352 (6%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
            TR ARR+YVG +P    ++ +  FF++ +  + G +   G+AV+    N +K FAF+E 
Sbjct: 96  VTRQARRLYVGNIPFSTTDEDMMAFFNEQINRLNGTNGVDGNAVLTCQTNLDKNFAFLEF 155

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R+++EA+ A+  DGI++ G  +++RRP DY+P   A++   + +     +A  + S  I 
Sbjct: 156 RSMDEATQAINFDGILYRGQTLKIRRPHDYHPM--ASVSSSEAADAAKGSATHVNSVPIS 213

Query: 290 GA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
               + P +++VGGLP    E Q+KELL +FG L GF+LVK+  TG SKG+ FC Y DP 
Sbjct: 214 PMVPDSPHKIYVGGLPTCLNEEQVKELLVTFGKLRGFNLVKEAVTGQSKGFAFCEYVDPC 273

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           +T+ A A LNG+++GD+ L V+R+             +A  +  +A Q   LQ  G    
Sbjct: 274 ITEQAIAGLNGMQLGDRKLIVQRS-------------IAGVRNLVANQLPVLQVPGF--- 317

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
              + +      +VLCL   +    L DD+EY++I  D+++EC KYG + ++ IPRP  +
Sbjct: 318 --PVDVSTGKATEVLCLLNMVLPSELTDDDEYDDIRTDIKQECAKYGKVKSLKIPRPGDD 375

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             +  G GKVF+ +     C  A NALSGRKF G  V   +Y  +KY  KD+
Sbjct: 376 SIQG-GCGKVFVRFESIDDCKKALNALSGRKFSGRIVMTSFYDLEKYKRKDF 426


>gi|392572624|gb|EIW65769.1| hypothetical protein TREMEDRAFT_41238 [Tremella mesenterica DSM
           1558]
          Length = 596

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 147/415 (35%), Positives = 210/415 (50%), Gaps = 74/415 (17%)

Query: 125 PGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPP 184
           PG   P    G+PSA+  +A +  P GA  L                R A+R+YVGG+  
Sbjct: 201 PGRVPPPPELGIPSAL--IAGSFPPPGANGL----------------RQAKRIYVGGITE 242

Query: 185 LANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
              + ++  FF+  M+  G     PGD +  V +NHEK FAF+E R+ EEAS+A+ LD +
Sbjct: 243 SMTDASLLEFFNTTMSERGFTLEIPGDPIGAVQVNHEKAFAFLEFRSAEEASSALKLDNV 302

Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
           +FE V +RV+RP DY       L P Q +            GA   ++ P+++F+GGLP 
Sbjct: 303 MFEDVPLRVKRPKDYT-----GLDPLQHT----------MGGAQAMSDSPNKLFIGGLPT 347

Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
           Y  E Q+ ELL+SFG L  F+LVKD D+  +KG+ F  Y DP+ TD+A + LN   +GD+
Sbjct: 348 YLDEAQVMELLKSFGELRSFNLVKDPDSSENKGFAFAEYTDPSNTDMAISGLNNFSLGDR 407

Query: 365 TLTVRRATASGQSKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
            L V+RA     S T       ES LA++    AI     Q SG               +
Sbjct: 408 ILVVQRAAVGRASGTTDAIPGSESFLAKS----AIFAQENQQSG-------------PTS 450

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE--------- 470
           +V+ L   +TAD L DD+EY+EILED+  EC ++G +  V +PRP     +         
Sbjct: 451 RVMLLLNMVTADELYDDQEYQEILEDITSECSRFGEIEGVRVPRPVPKSKKWEPSESAVV 510

Query: 471 ----------TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
                       GVG+VF+ Y D      A N+L GR+FGG T+     PE+++ 
Sbjct: 511 TQERARRADLAAGVGRVFVMYKDLASTEKAMNSLGGRQFGGRTIVVANVPEEEFL 565


>gi|430813569|emb|CCJ29085.1| unnamed protein product, partial [Pneumocystis jirovecii]
          Length = 545

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 137/397 (34%), Positives = 209/397 (52%), Gaps = 48/397 (12%)

Query: 129 VPG--QLPGVPSAVPEMAQNMLPFGATQL-----GAFPLMPVQVMTQQATRHARRVYVGG 181
           VPG   LPG P       Q+M+            GA  +   Q +    +R +RR++VG 
Sbjct: 182 VPGLFPLPGAPR------QSMMDLSKLSTVHKGPGAMNIPNPQALQPLQSRQSRRIHVGN 235

Query: 182 LPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMAL 241
           +P   +E+ +  FF+  M+ +   ++G  + V++  +NHEK +AF+E R  E+A+ A+  
Sbjct: 236 IPQPIDEEHLVNFFNDTMSCLNVTTSG-DNPVISAQVNHEKGYAFLEFRQPEDATVAIGF 294

Query: 242 DGIIFEGVAVRVRRPTDY----NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRV 297
           DGI +   ++++RRP DY     PT   +  PG  S N                + P+++
Sbjct: 295 DGISYMNNSLKIRRPMDYIVPQMPTDDGSYVPGVISTNF--------------TDTPNKI 340

Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALN 357
            +GGLP Y  + Q+ ELL+SFG L  F+L+KD  T  SKG+ FC Y DP VTDIAC  LN
Sbjct: 341 HIGGLPTYLDDEQVIELLKSFGELKAFNLIKDAATNESKGFAFCEYVDPDVTDIACEGLN 400

Query: 358 GLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGET 417
           G+++GDK L V+RA+      T+Q+ I        +I  +A +   +             
Sbjct: 401 GMELGDKILVVKRASIG----TKQKPISTSGGGIASITMLAEEEGQLRP----------- 445

Query: 418 LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKV 477
             +VL +   +T + L DD+EYEEI ED+R+EC KYG ++++ IPR         GVGKV
Sbjct: 446 -TRVLQMFNMVTPEELQDDDEYEEISEDIRDECSKYGKVLDLKIPRGIGGSRSNFGVGKV 504

Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++ +   + C  A   L+GRKF   TV   +YPE+ Y
Sbjct: 505 YVRFETEMSCLKAMKDLAGRKFSDRTVLTSFYPEENY 541


>gi|389745686|gb|EIM86867.1| hypothetical protein STEHIDRAFT_57258 [Stereum hirsutum FP-91666
           SS1]
          Length = 417

 Score =  235 bits (599), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 137/364 (37%), Positives = 199/364 (54%), Gaps = 27/364 (7%)

Query: 156 GAFPLMPVQV------MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
           G  P MPVQ       +    +R +RR+Y+G + P   EQ +  FF+  M  +   +  P
Sbjct: 39  GLPPPMPVQTFGMGMGVNPNLSRQSRRLYIGSITPDITEQNLTDFFNSKMIEMNIGTGAP 98

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G  V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY        G 
Sbjct: 99  GPPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFVNGPLKIRRPKDYG-------GM 151

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
             P+P L++   G+ S  +   + P ++FVGGLP Y  E Q+ ELL+SFG L  F+LV++
Sbjct: 152 EMPAPPLHVP--GVVSTNV--PDSPHKIFVGGLPSYLNEEQVMELLKSFGDLKAFNLVRE 207

Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
              G SKG+ F  Y DP VTD+A  +L+G+++GD+ L V+RA+   ++       L    
Sbjct: 208 NGNGPSKGFAFFEYVDPEVTDVAIQSLSGMELGDRYLVVQRASVGAKAGQPGMPNLPY-D 266

Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
           Q   I +          L  G S      +++L +   +T + L DD EY ++LED+REE
Sbjct: 267 QFPEIPR--------PILPAGASDLSSANSRILLMLNMVTPEDLIDDSEYADLLEDIREE 318

Query: 450 CGKYGTLVNVVIPRPD-QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
              YG + +V IPRPD Q   E  GVG+V++ Y DA G A    AL+GR F G ++ A  
Sbjct: 319 VANYGDVDDVRIPRPDAQRADEAAGVGRVYVRYKDAEGAAKGMQALAGRSFAGRSIIATV 378

Query: 509 YPED 512
             ED
Sbjct: 379 LSED 382


>gi|384496094|gb|EIE86585.1| hypothetical protein RO3G_11296 [Rhizopus delemar RA 99-880]
          Length = 502

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 132/358 (36%), Positives = 192/358 (53%), Gaps = 37/358 (10%)

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           P Q M     + ++++YVG +P   +E  +  FF+  +     +       VV V INHE
Sbjct: 182 PPQRMEDATPKQSKKLYVGQIPSTTDEVTLCDFFNATIR----HELQDKTPVVGVQINHE 237

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K FAF+E  T ++A+  M LDGI F+G  +++RRP  Y P       P +  P       
Sbjct: 238 KNFAFIEFHTSQQATACMVLDGISFQGNTLKIRRPNHYQP-------PEEQVP------- 283

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           GL++      + P++VFVGGLP Y T+ Q+ ELL SFG L  F+LVKD  TG +KG+ FC
Sbjct: 284 GLSTNV---PDTPNKVFVGGLPVYLTDNQVMELLTSFGELRAFNLVKDTATGANKGFAFC 340

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            Y DP+VTD+AC  LNG+++GDK L V+RA+   +              HI    M+   
Sbjct: 341 EYADPSVTDLACQGLNGMELGDKKLIVQRASVGAK--------------HIPPDYMSGPI 386

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
              N +    S   E   +VL L   ++ + L DDEEY++I ED+ EEC K+G +V++ I
Sbjct: 387 LPANYVPV-TSAKEEDATRVLQLMNMVSPEELEDDEEYQDIWEDIAEECAKFGNIVDMKI 445

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           P+P Q   + PG G +F+ Y        A  AL+GRKF   TV A +  E  Y + ++
Sbjct: 446 PKP-QKDQQVPGCGLIFVRYETTDETLAALRALAGRKFADRTVVATFVDEQNYLSDNF 502


>gi|395331854|gb|EJF64234.1| hypothetical protein DICSQDRAFT_144911 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 587

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 154/468 (32%), Positives = 239/468 (51%), Gaps = 56/468 (11%)

Query: 71  DYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR--SGFDMAPPAAAMLPGAA 128
           D   D+  R +HR    + +R   RS +  P  +P   ++R  SG+D+  P        A
Sbjct: 126 DERGDRRGRGKHREGLGTPER---RSPT-PPDAAPLSQRKRKASGWDVHAPGYEQY--TA 179

Query: 129 VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQV------MTQQATRHARRVYVGGL 182
           +  +  G+   +P   +  +P      G  P MPV        +    +R +RR+Y+G +
Sbjct: 180 MQAKQTGL-FNLPGANRTQIPPILAIPGLPPPMPVSTFGMGTGVNPNLSRQSRRLYIGSI 238

Query: 183 PPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALD 242
            P  NEQ +  FF+  M  +   +  PG+ V+ V  N+EK +AFVE R+ E+A+ AMA D
Sbjct: 239 TPDINEQNLTDFFNSKMKEMNLGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFD 298

Query: 243 GIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGL 302
           GIIF    +++RRP DY        GP   +PN+++   G+ S  +   +  +++FVGGL
Sbjct: 299 GIIFLNGPLKIRRPKDYG-------GPDVIAPNMHVP--GVVSTNV--PDSANKIFVGGL 347

Query: 303 PYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMG 362
           P Y  E Q+ ELL SFG L  F+LV++   G SKG+ F  Y DP+VTD+A  +L+G+++G
Sbjct: 348 PTYLNEEQVMELLSSFGELKAFNLVRENGNGPSKGFAFFEYVDPSVTDVAIQSLSGMELG 407

Query: 363 DKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVL 422
           DK L V+RA+    +K  Q  I      +  I K  L    ++          E+  ++L
Sbjct: 408 DKYLVVQRASVG--AKPGQSPIPGMYDLNPEIPKPILPVGDLS----------ESQDRIL 455

Query: 423 CLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP------------------ 464
            +   +  + L DD+EY +ILED++EECGKYG + ++ IPRP                  
Sbjct: 456 LMLNMVVPEELQDDQEYADILEDVKEECGKYGEVEDLRIPRPVKKDKAKWGEGGRDSALA 515

Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
            Q   E  GVG+V+++Y      A A  AL+GR F G ++ A    +D
Sbjct: 516 QQRADEAAGVGRVYVKYASPRSAANALKALAGRSFAGRSIIATLLSDD 563


>gi|391337926|ref|XP_003743315.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Metaseiulus
           occidentalis]
          Length = 430

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 138/371 (37%), Positives = 194/371 (52%), Gaps = 59/371 (15%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
           V+    TR ARR+YVG +P    EQ +  +F+  M A     A  G+ V+   IN +K F
Sbjct: 103 VIGSTITRQARRLYVGNIPFGCTEQEMIDYFNVQMHACAFAQAQ-GNPVLACQINMDKNF 161

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           AF+E R+++E S AM+ DGI F+G ++++RRP DY P    +   G   P       G+ 
Sbjct: 162 AFLEFRSIDETSAAMSFDGINFKGQSLKIRRPHDYQPMPGMSESQGSVIP-------GVV 214

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
           S  +   + P +VF+GGLP Y  E Q++ELL SFG L  F+LVKD  TG SKGY FC Y 
Sbjct: 215 STVV--QDSPHKVFIGGLPNYLNEDQVRELLMSFGQLKAFNLVKDTATGLSKGYAFCEYA 272

Query: 345 DPAVTDIACAALNGLKMGD----------------KTLTVRRATASGQSKTEQESILAQA 388
           +  +TD A A LNG+++GD                  + V+                   
Sbjct: 273 EVTITDDAIAGLNGMQLGDKKLIVQRASVGAKNSNMAVPVQ------------------- 313

Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMRE 448
                     +Q  GM  +  G S       +VLCL   +T D L D+EEY++ILED+++
Sbjct: 314 ----------IQVPGMPNVPIGSS---GPATEVLCLMNLVTPDELRDEEEYDDILEDIQD 360

Query: 449 ECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           EC KYG + ++ IPRP Q G + PGVGKVF+E+     C  A+ AL+GRKF    V   Y
Sbjct: 361 ECNKYGHVKSIEIPRPIQ-GVDVPGVGKVFVEFNSVADCQKAQQALTGRKFSNRVVVTSY 419

Query: 509 YPEDKYFNKDY 519
           +  DKY  + +
Sbjct: 420 FEPDKYHRRQF 430


>gi|302854386|ref|XP_002958701.1| hypothetical protein VOLCADRAFT_121741 [Volvox carteri f.
           nagariensis]
 gi|300255941|gb|EFJ40221.1| hypothetical protein VOLCADRAFT_121741 [Volvox carteri f.
           nagariensis]
          Length = 294

 Score =  233 bits (593), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 195/335 (58%), Gaps = 43/335 (12%)

Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
           +E ++   F+ VM A G  +  PG  V++ Y+N+EK+FAF+E R+VEE SNAMA DG+  
Sbjct: 2   SEVSLTQLFNNVMMAAGATTQ-PGGPVISCYMNNEKRFAFLEFRSVEETSNAMAFDGLQC 60

Query: 247 EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYF 306
           +G  ++VRRP DYNP  A  LGP +PS  +NLA +G+ +  +   +GP++VFVGGLP Y 
Sbjct: 61  QGETLKVRRPHDYNPAAAKLLGPTEPSAKINLALLGVVNTLV--EDGPNKVFVGGLPGYL 118

Query: 307 TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTL 366
           +E Q++++L++FG L  F+LV DRDTG SKGYGFC Y DP +TD+A   L+ L +G K L
Sbjct: 119 SEEQVRQILQAFGPLRAFNLVTDRDTGASKGYGFCEYADPNITDVAIQGLSALIVGGKPL 178

Query: 367 TVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTE 426
           TVRRA  +G++    ++++ Q Q  +                                  
Sbjct: 179 TVRRANTAGEASATLQTLIQQQQAAL---------------------------------- 204

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVI-PRPDQNGGETPGVGKVFLEYYDAV 485
                 L DD EY +++ED+ +E GKYG LV V I   P   G + PGVG V+L Y D  
Sbjct: 205 -----DLVDDGEYMDLMEDVTQEVGKYGKLVGVEIPRPPPDGGPDPPGVGFVYLCYEDPR 259

Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           G   A+ AL+GRKFG N   A +Y   ++  KD S
Sbjct: 260 GAERAQVALNGRKFGDNLAEATFYDRSRFDAKDLS 294


>gi|426244214|ref|XP_004015921.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Ovis aries]
          Length = 471

 Score =  233 bits (593), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 162/421 (38%), Positives = 227/421 (53%), Gaps = 33/421 (7%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 79  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 134

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 135 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 193

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 194 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 246

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  + G+    ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 247 VPGVVSTVVPGSA--HKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 304

Query: 340 FCVYQDPAVTD-IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
           FC Y D  VTD ++ A  +   +G      R A +S    +         Q  + +Q   
Sbjct: 305 FCEYVDINVTDQVSPAPAHPALLGSPLRAGRGACSSSPFASTIN------QTPVTLQVPG 358

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+ECGKYG + +
Sbjct: 359 LMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECGKYGLVKS 410

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D Y  +D
Sbjct: 411 IEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRD 469

Query: 519 Y 519
           +
Sbjct: 470 F 470


>gi|302685922|ref|XP_003032641.1| hypothetical protein SCHCODRAFT_75908 [Schizophyllum commune H4-8]
 gi|300106335|gb|EFI97738.1| hypothetical protein SCHCODRAFT_75908 [Schizophyllum commune H4-8]
          Length = 556

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 131/356 (36%), Positives = 203/356 (57%), Gaps = 43/356 (12%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R +RR+Y+G + P  NE  +A FF+  MT +   + GPG+ V+ V  N+EK +AFVE R
Sbjct: 190 SRQSRRLYIGSITPEINEHNLAEFFNSKMTEMNIGTGGPGNPVLAVQCNYEKNYAFVEFR 249

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           + ++A+ AMA DGIIF    +++RRP DY+ ++A+A       P +++   G+ S  +  
Sbjct: 250 SADDATAAMAFDGIIFLNGPLKIRRPKDYDISVASA-------PMIHVP--GIISTNV-- 298

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK-GYGFCVYQDPAVT 349
            +  +++FVGGLP Y  E Q++ELL SFG L  F+LV++  TG SK GY F  Y DP VT
Sbjct: 299 PDSANKIFVGGLPAYLNEEQVQELLTSFGELKAFNLVRETGTGASKQGYAFFEYVDPNVT 358

Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
           D+A  +LNG+++GD+ L V+RA+   +  T        A+   AI K  +     +T G 
Sbjct: 359 DVAIQSLNGMELGDRFLVVQRASVGAKDGTIPN---LPAELMPAIPKPIMPAGQTDTSGD 415

Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP----- 464
                    A+VL +   +T D L DD+EY ++LED++EEC K+G + ++ +PRP     
Sbjct: 416 ---------ARVLLMLNMVTPDDLVDDDEYGDLLEDIKEECSKFGPVEDLRVPRPVKKEK 466

Query: 465 --------------DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
                          Q   E  GVG+V++++ DA   A A  +L+GR F G ++ A
Sbjct: 467 KWAPGEGGREAAVEAQRADEAAGVGRVYVKFVDAKDAAVALKSLAGRSFAGRSIIA 522


>gi|390479436|ref|XP_002762565.2| PREDICTED: splicing factor U2AF 65 kDa subunit [Callithrix jacchus]
          Length = 453

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 151/377 (40%), Positives = 214/377 (56%), Gaps = 37/377 (9%)

Query: 107 KSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
           K K R  +D+ PP    +           GQ+P   + +P M  + L    T        
Sbjct: 70  KKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT-------- 120

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQD 179

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVP 232

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC
Sbjct: 233 GVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFC 290

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L +
Sbjct: 291 EYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMS 346

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ I
Sbjct: 347 SQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEI 398

Query: 462 PRPDQNGGETPGVGKVF 478
           PRP  +G E PG GK +
Sbjct: 399 PRP-VDGVEVPGCGKAW 414


>gi|170088030|ref|XP_001875238.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164650438|gb|EDR14679.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 370

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 132/367 (35%), Positives = 198/367 (53%), Gaps = 34/367 (9%)

Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
           G  P MPVQ            +R +RR+Y+G + P  NEQ +A FF+  M  +   + GP
Sbjct: 4   GLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITPEVNEQNLADFFNSKMIEMSIGTGGP 63

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G+ V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY          
Sbjct: 64  GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG--------- 114

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
                 + +A+ G+    +     PD   ++FVGGLP Y  E Q+ ELL+SFG L  F+L
Sbjct: 115 -----GMEIASPGVHVPGVVSTNVPDSINKIFVGGLPTYLNEEQVMELLKSFGDLKAFNL 169

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
           V++   G SKG+ F  Y D  VTD+A  +LNG+++GD+ L V+RA+   +  T       
Sbjct: 170 VRENGNGPSKGFAFFEYVDVGVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNL 229

Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
              Q   I +  +  +G +           T +++L +   +T D L DD+EY ++ ED+
Sbjct: 230 PYDQFPEIPR-PIMPAGKDP---------ATDSRILLMLNMVTPDDLTDDQEYGDLYEDV 279

Query: 447 REECGKYGTLVNVVIPRPDQ-NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           +EEC  YG + ++ IPRPD     E  GVG+V+++Y D+     A N L+GR F G ++ 
Sbjct: 280 KEECSNYGAVEDLRIPRPDAVRLDEASGVGRVYVKYKDSESATAALNNLAGRSFAGRSII 339

Query: 506 AFYYPED 512
           A    ED
Sbjct: 340 ATLLSED 346


>gi|194756144|ref|XP_001960339.1| GF13310 [Drosophila ananassae]
 gi|190621637|gb|EDV37161.1| GF13310 [Drosophila ananassae]
          Length = 434

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 140/362 (38%), Positives = 205/362 (56%), Gaps = 33/362 (9%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP-GDAVVNVYINHEKKFAFVE 228
            TR ARR+YVG +P    E+ +  FF+Q + A+G +S    G AV+    N EK FAF+E
Sbjct: 93  VTRQARRLYVGNIPFGVTEEEMMGFFNQQLIALGSSSLKTDGKAVLTCQTNLEKNFAFLE 152

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALG-PG-----QPSPNLNLAAVG 282
            R+++EA+ A+  DGI+F G  +++RRP DY+P  + +   PG       SP + ++  G
Sbjct: 153 FRSMDEATQAINFDGIVFRGQTLKIRRPHDYHPVASISCSEPGFATTTMTSPQIVVSTTG 212

Query: 283 ---LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
              + S  +   + P ++++GGLP    ETQIKELL SFG L GF+LVKD +T  SKG+ 
Sbjct: 213 PNHVISTLV--PDSPQKIYIGGLPTCLNETQIKELLLSFGQLKGFNLVKDANTSLSKGFA 270

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           F  Y DP VT+ A A LNG+++GD+ L V+R+ A G++                +    L
Sbjct: 271 FFEYVDPLVTEQAIAGLNGMQLGDRKLVVQRSIAGGRNSG-------------GVPATVL 317

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
           Q  G+  +        E+  +VLCL   +  + L DDEEYE+I  D+++EC KYG + ++
Sbjct: 318 QVPGLTAIPN-----TESPTEVLCLLNMVLPEELLDDEEYEDIRTDIQQECAKYGDVRSL 372

Query: 460 VIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
            IPRP    G++P  G GKVF+++     C  A  ALSGRKF G  V   +   DKY   
Sbjct: 373 KIPRPIPK-GDSPKRGCGKVFVQFESVDDCQKAMRALSGRKFSGRIVMTSFSDPDKYLAD 431

Query: 518 DY 519
           D+
Sbjct: 432 DF 433


>gi|124360614|gb|ABN08613.1| RNA-binding region RNP-1 (RNA recognition motif) [Medicago
           truncatula]
          Length = 257

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 128/199 (64%), Positives = 151/199 (75%), Gaps = 6/199 (3%)

Query: 49  RDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKS 108
           R K   Y+R+  RD+DR   H DY+RD++ R+R+ + S  S R  +RS+S S S S S+ 
Sbjct: 58  RGKYDSYNRQRGRDYDR---HNDYDRDRDTRNRYGAHSKRSRR-ESRSRSRSRSPSQSEG 113

Query: 109 KRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
           KR SGFDMAPPA  + P   V GQ+PG+   +    QN  P+G +Q+GA  LM VQ MTQ
Sbjct: 114 KRTSGFDMAPPATGVTP--TVSGQMPGIAHMIQGATQNFSPYGISQIGALSLMQVQPMTQ 171

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGLPP ANEQ+IA+FFSQVM AIGGNSAG GD+VVNVYINHEKKFAFVE
Sbjct: 172 QATRHARRVYVGGLPPFANEQSIASFFSQVMIAIGGNSAGSGDSVVNVYINHEKKFAFVE 231

Query: 229 MRTVEEASNAMALDGIIFE 247
           MRTVEEASNAMALDGI+FE
Sbjct: 232 MRTVEEASNAMALDGIVFE 250


>gi|410054709|ref|XP_003954504.1| PREDICTED: LOW QUALITY PROTEIN: splicing factor U2AF 65 kDa subunit
           [Pan troglodytes]
          Length = 394

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 138/331 (41%), Positives = 196/331 (59%), Gaps = 23/331 (6%)

Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG 248
           +A+  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G
Sbjct: 86  EAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQG 144

Query: 249 VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTE 308
            ++++RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  +
Sbjct: 145 QSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLND 195

Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
            Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  VTD A A LNG+++GDK L V
Sbjct: 196 DQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLV 255

Query: 369 RRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
           +RA+   ++ T    +    Q  + +Q   L +S +  +GG  +       +VLCL   +
Sbjct: 256 QRASVGAKNAT----LSTINQTPVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMV 303

Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA 488
             + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+E+     C 
Sbjct: 304 LPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQ 362

Query: 489 TAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            A   L+GRKF    V   Y   D Y  +D+
Sbjct: 363 KAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 393


>gi|195429288|ref|XP_002062695.1| GK19586 [Drosophila willistoni]
 gi|194158780|gb|EDW73681.1| GK19586 [Drosophila willistoni]
          Length = 466

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 136/368 (36%), Positives = 195/368 (52%), Gaps = 48/368 (13%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
            TR ARR+YVG +P    ++ +  FF+  + ++G      G  V+    N EK FAF+E 
Sbjct: 129 VTRQARRLYVGNIPFGVTDKEMMNFFNVQLQSLGLKQFHDGTPVLTCQTNLEKNFAFLEF 188

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R++ E + A+A DG+ F G  +++RRP DY+P  + +          +L  VGL+   + 
Sbjct: 189 RSMGETTQAIAFDGVNFRGQTLKIRRPHDYHPVTSLS----------SLETVGLSDTIVT 238

Query: 290 GA---------------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
            A               + P ++++G LP    E QIKELL SFG L GF+LVKD +TG 
Sbjct: 239 SAHTPVPMKDLVSTLVPDSPQKIYIGSLPPCLDEAQIKELLLSFGRLRGFNLVKDANTGM 298

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKGY F  Y D AVT+ A A LNG+ +GD+ L V+R+ A G++          A  H   
Sbjct: 299 SKGYAFFEYVDSAVTEQAIAGLNGMLLGDRRLVVQRSIAGGRN----------ASNHSPA 348

Query: 395 QKMALQTSGMNTLGGGMSLFGETLA-KVLCLTEAITADALADDEEYEEILEDMREECGKY 453
               LQ  G        S+F    A ++LCL   +  + L DDEEYE+I  D+++EC K+
Sbjct: 349 S--VLQVPGFP------SVFSTGAATEILCLLNMVQPEDLLDDEEYEDICVDIKQECDKH 400

Query: 454 GTLVNVVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           G +  + IPRP    G+TP  G GKVF+ +     C  A NALSGRKF G  V   ++  
Sbjct: 401 GKVKGLKIPRPLV--GKTPRAGCGKVFVRFESMEDCQKALNALSGRKFNGRIVVTSFFNL 458

Query: 512 DKYFNKDY 519
           DKY   ++
Sbjct: 459 DKYEKNEF 466


>gi|297302956|ref|XP_001119590.2| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial
           [Macaca mulatta]
          Length = 432

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 154/378 (40%), Positives = 215/378 (56%), Gaps = 30/378 (7%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGL 359

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
            +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++
Sbjct: 360 MSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSI 411

Query: 460 VIPRPDQNGGETPGVGKV 477
            IPR   +G E PG GKV
Sbjct: 412 EIPR-LVDGVEVPGCGKV 428


>gi|387193280|gb|AFJ68695.1| splicing factor U2AF 65 kDa subunit [Nannochloropsis gaditana
           CCMP526]
          Length = 424

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 145/363 (39%), Positives = 204/363 (56%), Gaps = 41/363 (11%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAF 226
           TQQ+TRHARRVYVGG    A++  +  FF+Q++         P   VV + +N +K FAF
Sbjct: 82  TQQSTRHARRVYVGGNFGDASDFEVLAFFNQIINE-SLERPSPAGPVVAIQVNRQKHFAF 140

Query: 227 VEMRTVEEASNA-MALDGIIFEGVAVRVRRPTDYNPTL------------AAALGPGQPS 273
           +E+ +V   ++  M LDG+ F G  ++V+RPTDY+P L             A     Q S
Sbjct: 141 LELNSVPLTTSVIMQLDGVPFRGNPLKVKRPTDYHPELLPLDTPPPPTLKVANFRALQAS 200

Query: 274 PNLNLAAVGL-ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
             L +A+ GL A GA    + P ++FVGGLPY+ T+ Q++ELL +FG L GFDL KD  T
Sbjct: 201 GALPMASTGLTAPGANSVPDSPYKIFVGGLPYHVTDDQVRELLSAFGPLRGFDLKKDPAT 260

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
           G SKGYGFC Y D AV D+A   L+G+ +G KTLTV+ A AS Q + +Q           
Sbjct: 261 GMSKGYGFCEYIDHAVGDVAIQGLHGMDLGGKTLTVKYALASQQLQQQQSMQQMLLSTTP 320

Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGK 452
           A                          KVL L   +T D L DD+EY+EI+ED+REE  K
Sbjct: 321 A-------------------------TKVLVLANMVTPDELKDDQEYQEIVEDVREEVAK 355

Query: 453 YGTLVNVVIPR-PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           +G ++++VIPR  + +   +P VGK+F+EY ++     A  +L GR+F G  V A +Y E
Sbjct: 356 FGEVLSLVIPRPEEPSAPPSPAVGKIFVEYAESSQTKAAAQSLQGRRFAGRIVQASFYDE 415

Query: 512 DKY 514
           +K+
Sbjct: 416 EKF 418


>gi|198456623|ref|XP_001360392.2| GA16338 [Drosophila pseudoobscura pseudoobscura]
 gi|198135682|gb|EAL24967.2| GA16338 [Drosophila pseudoobscura pseudoobscura]
          Length = 491

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 135/384 (35%), Positives = 196/384 (51%), Gaps = 47/384 (12%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFA 225
            TR ARR+YVG +P    E  I  FF+Q    +G N  G     G AV++   N +K FA
Sbjct: 121 VTRQARRLYVGNIPFGVTEDDIMAFFNQQFLLLGDNCGGQLCLDGKAVLSCQANLDKNFA 180

Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT-----------LAAALGPGQPSP 274
           F+E R+++EA+ A   DGI F G  +++RRP DY+P            +  A+G    S 
Sbjct: 181 FIEFRSMQEATQATTFDGISFRGQVLKIRRPHDYHPVGSVGAAAGAGSIPDAVGGCASSA 240

Query: 275 NLNLAAVGLASGAIGGA-------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
                +    +G++G         + P ++++GGLP    ETQIKELL SFG L GF+LV
Sbjct: 241 AAKSRSSSAETGSLGSQAISNLVPDSPHKIYIGGLPTCLNETQIKELLLSFGQLRGFNLV 300

Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           KD  T  SKGY F  Y DP +T+   A LNG+++GD+ L V+R+  SG+           
Sbjct: 301 KDPSTTLSKGYAFFEYVDPLLTEQVIANLNGMQLGDRRLIVQRSIPSGRYA--------- 351

Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMR 447
             Q I IQ   L  + +     G++       +VLCL   +  + L D+EEYE+I  D+ 
Sbjct: 352 GNQQIPIQVPGLVATSLTGSTAGLN----NATQVLCLLNMVLPEELLDNEEYEDIRADIE 407

Query: 448 EECGKYGTLVNVVIPRPD------------QNGGETPGVGKVFLEYYDAVGCATAKNALS 495
           +EC KYG ++++ IPRP              +     G GKV++ +        A  ALS
Sbjct: 408 QECSKYGEVLSLKIPRPQVSGGEGEGEGGGDSATRPKGCGKVYVHFGSIEDSEKALGALS 467

Query: 496 GRKFGGNTVNAFYYPEDKYFNKDY 519
           GRKF G  V   ++  DKY ++D+
Sbjct: 468 GRKFSGRIVIGSFFDRDKYLSEDF 491


>gi|449547880|gb|EMD38847.1| hypothetical protein CERSUDRAFT_81656 [Ceriporiopsis subvermispora
           B]
          Length = 476

 Score =  224 bits (571), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 134/381 (35%), Positives = 206/381 (54%), Gaps = 47/381 (12%)

Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
           G  P MPVQ            +R +RR+Y+G + P  NEQ +A FF+  M  +   +  P
Sbjct: 95  GLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITPDINEQNLAEFFNGKMKEMDIGTGAP 154

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G+ V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY        GP
Sbjct: 155 GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLNGPLKIRRPKDYG-------GP 207

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
              +P +++   G+ S  +   +  ++VFVGGLP Y  E Q+ ELL+SFG L  F+LV++
Sbjct: 208 DVLAPMMHVP--GVVSTNV--PDSANKVFVGGLPMYLNEEQVMELLKSFGELKAFNLVRE 263

Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
              G SKG+ F  Y DP+VTD+A  +L+G+++GDK L V+RA+    +K  Q  I     
Sbjct: 264 NGNGPSKGFAFFEYVDPSVTDVAIQSLSGMELGDKYLVVQRASVG--AKPGQSPIDEMYG 321

Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
               I +  +  + +++          T +++L +   +  + L DD+EY +I ED+ EE
Sbjct: 322 SPAPIPRPIMPATDIDS----------TQSRILLMLNMVVPEELQDDQEYADIYEDITEE 371

Query: 450 CGKYGTLVNVVIPRP--------DQNG----------GETPGVGKVFLEYYDAVGCATAK 491
           CG+YG + ++ IPRP         +NG           E  GVG+V+++Y      A A 
Sbjct: 372 CGRYGAVEDLRIPRPVKRDKAKWGENGMDSARAAQLADEAAGVGRVYVKYAQPNSAANAL 431

Query: 492 NALSGRKFGGNTVNAFYYPED 512
            AL+GR F G ++ A    +D
Sbjct: 432 KALAGRSFAGRSIIATLLSDD 452


>gi|336368252|gb|EGN96595.1| hypothetical protein SERLA73DRAFT_93106 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 396

 Score =  224 bits (571), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 133/367 (36%), Positives = 201/367 (54%), Gaps = 31/367 (8%)

Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
           G  P MPVQ            +R +RR+Y+G + P  NEQ +A FF+  M  +   +  P
Sbjct: 27  GLPPPMPVQTFGMGIGSNPNLSRQSRRLYIGSITPDVNEQNLADFFNSKMIEMSIGTGAP 86

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G+ V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY        G 
Sbjct: 87  GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG-------GV 139

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
              +P++++   G+ S  +   +  ++VFVGGLP Y  E Q+ ELL+SFG L  F+LV++
Sbjct: 140 DMSAPSVHVP--GVVSTNV--PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFNLVRE 195

Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
              G SKG+ F  Y D +VTD+A  +LNG+++GD+ L V+RA+   +  T          
Sbjct: 196 NGNGPSKGFAFFEYVDISVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNLPYD 255

Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
           Q   I +  +  +G N+            A++L +   +T D L DD+EY ++ ED++EE
Sbjct: 256 QFPEIPR-PIMPAGENSSAD---------ARILLMLNMVTPDDLTDDQEYGDLYEDVKEE 305

Query: 450 CGKYGTLVNVVIPRPDQNGG----ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           C  YG + ++ IPRP         E  GVG+V+++Y DA     A  AL+GR F G ++ 
Sbjct: 306 CSVYGAVEDLRIPRPSAMDAIRQDEAAGVGRVYVKYIDADSANNALKALAGRSFAGRSII 365

Query: 506 AFYYPED 512
           A    ED
Sbjct: 366 ATLLSED 372


>gi|76779874|gb|AAI06135.1| U2af2 protein [Mus musculus]
          Length = 307

 Score =  224 bits (571), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 137/326 (42%), Positives = 193/326 (59%), Gaps = 23/326 (7%)

Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
           FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++
Sbjct: 4   FFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKI 62

Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
           RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q+KE
Sbjct: 63  RRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKE 113

Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           LL SFG L  F+LVKD  TG SKGY FC Y D  VTD A A LNG+++GDK L V+RA+ 
Sbjct: 114 LLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASV 173

Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADAL 433
             ++ T    +    Q  + +Q   L +S +  +GG  +       +VLCL   +  + L
Sbjct: 174 GAKNAT----LSTINQTPVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEEL 221

Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
            DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+E+     C  A   
Sbjct: 222 LDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQG 280

Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDY 519
           L+GRKF    V   Y   D Y  +D+
Sbjct: 281 LTGRKFANRVVVTKYCDPDSYHRRDF 306


>gi|13938661|gb|AAH07487.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2 [Mus
           musculus]
          Length = 306

 Score =  224 bits (571), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 137/326 (42%), Positives = 193/326 (59%), Gaps = 23/326 (7%)

Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
           FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+G ++++
Sbjct: 3   FFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKI 61

Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
           RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  + Q+KE
Sbjct: 62  RRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKE 112

Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           LL SFG L  F+LVKD  TG SKGY FC Y D  VTD A A LNG+++GDK L V+RA+ 
Sbjct: 113 LLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASV 172

Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADAL 433
             ++ T    +    Q  + +Q   L +S +  +GG  +       +VLCL   +  + L
Sbjct: 173 GAKNAT----LSTINQTPVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEEL 220

Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
            DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+E+     C  A   
Sbjct: 221 LDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQG 279

Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDY 519
           L+GRKF    V   Y   D Y  +D+
Sbjct: 280 LTGRKFANRVVVTKYCDPDSYHRRDF 305


>gi|403413555|emb|CCM00255.1| predicted protein [Fibroporia radiculosa]
          Length = 582

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 135/381 (35%), Positives = 204/381 (53%), Gaps = 48/381 (12%)

Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
           G  P MPVQ            +R +RR+Y+G + P  NEQ +A FF+  M  +   + GP
Sbjct: 202 GLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITPDINEQNLADFFNSKMKEMSIGTGGP 261

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G+ V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY          
Sbjct: 262 GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG--------- 312

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
           G  S   ++   G+ S  +   +  +++FVGGLP Y  E Q+ ELL+SFG L  F+LV++
Sbjct: 313 GMDSIAPSMHVPGVVSTNV--PDSINKIFVGGLPTYLNEEQVMELLKSFGDLKAFNLVRE 370

Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
              G SKG+ F  Y DP VTD+A  +L+G+++GDK L V+RA+    +K  Q  I     
Sbjct: 371 NGNGPSKGFAFFEYVDPGVTDVAIQSLSGMELGDKFLVVQRASVG--AKPGQPPIPGLYD 428

Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
           Q + I +  L    +            T A++L +   +  + L DD+EY ++ ED++EE
Sbjct: 429 Q-VEIPRPILPAGDVEG----------TDARILLMLNMVVPEDLTDDQEYADVYEDVKEE 477

Query: 450 CGKYGTLVNVVIPRPDQ-------NGG-----------ETPGVGKVFLEYYDAVGCATAK 491
           C KYG + ++ IPRP +        GG           E  GVG+V++++ ++   A A 
Sbjct: 478 CSKYGLVEDLRIPRPVKRDKAKWGEGGHESAITAQRIDEAAGVGRVYVKFTESYSAAQAL 537

Query: 492 NALSGRKFGGNTVNAFYYPED 512
            AL+GR F G ++ A    ED
Sbjct: 538 KALAGRSFAGRSIIATLLSED 558


>gi|195149862|ref|XP_002015874.1| GL11290 [Drosophila persimilis]
 gi|194109721|gb|EDW31764.1| GL11290 [Drosophila persimilis]
          Length = 487

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 136/389 (34%), Positives = 199/389 (51%), Gaps = 57/389 (14%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFA 225
            TR ARR+YVG +P    E  I  FF+Q    +G +  G     G AV++   N +K FA
Sbjct: 117 VTRQARRLYVGNIPFGVTEDDIMAFFNQQFLLLGDDCGGQLCLDGKAVLSCQANLDKNFA 176

Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT-----------LAAALGPGQPSP 274
           F+E R+++EA+ A   DGI F G  +++RRP DY+P            +  A+G    S 
Sbjct: 177 FIEFRSMQEATQATTFDGISFRGQVLKIRRPHDYHPVGSVGAAAGAGSIPDAVGGCASSA 236

Query: 275 NLNLAAVGLASGAIGGA-------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
                +    +G++G         + P ++++GGLP    ETQIKELL SFG L GF+LV
Sbjct: 237 AAKSRSSSADTGSLGSQAISNLVPDSPHKIYIGGLPTCLNETQIKELLLSFGQLRGFNLV 296

Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           KD  T  SKGY F  Y DP +T+   A LNG+++GD+ L V+R+  SG            
Sbjct: 297 KDPSTTLSKGYAFFEYVDPLLTEQVIANLNGMQLGDRRLIVQRSIPSG------------ 344

Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGET-----LAKVLCLTEAITADALADDEEYEEI 442
             ++  IQ++ +Q  G+       SL G T       +VLCL   +  + L D+EEYE+I
Sbjct: 345 --RYAGIQQIPIQVPGLV----ATSLTGSTAGLNNATQVLCLLNMVLPEELLDNEEYEDI 398

Query: 443 LEDMREECGKYGTLVNVVIPRPD------------QNGGETPGVGKVFLEYYDAVGCATA 490
             D+ +EC KYG ++++ IPRP              +     G GKV++ +        A
Sbjct: 399 RADIEQECSKYGEVLSLKIPRPQASGGEGEGGGGGDSATRPKGCGKVYVHFGTIEDSEKA 458

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             ALSGRKF G  V   ++  DKY ++D+
Sbjct: 459 LGALSGRKFSGRIVIGSFFDRDKYLSEDF 487


>gi|409040470|gb|EKM49957.1| hypothetical protein PHACADRAFT_264412 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 575

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 150/450 (33%), Positives = 233/450 (51%), Gaps = 55/450 (12%)

Query: 81  RHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPGAAVPGQLPGVPSA 139
           +HR    + +R   RS + S + S S+ KR+ SG+D+  P        A+  +  G+   
Sbjct: 127 KHREGLGTPER---RSPTPSDAVSLSQRKRKASGWDIHAPGYEQY--TAMQAKQTGL-FN 180

Query: 140 VPEMAQNMLPFGATQLGAFPLMPVQV------MTQQATRHARRVYVGGLPPLANEQAIAT 193
           +P   +  +P      G  P MPVQ       +    +R +RR+Y+G + P  NEQ +A 
Sbjct: 181 LPGANRTQIPPILAVPGLPPPMPVQSFGMGMGVNPNLSRQSRRLYIGSITPEINEQNLAD 240

Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
           FF++ M  +   +  PG+ V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++
Sbjct: 241 FFNEKMKEMSIGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLSGPLKI 300

Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
           RRP DY        G    +P++++   G+ S  +   +  ++VFVGGLP Y  E Q+ E
Sbjct: 301 RRPKDYG-------GSENLAPSMHVP--GVVSTNV--PDSINKVFVGGLPPYLNEEQVME 349

Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           LL SFG L  F+LV++   G SKG+ F  Y DPAVTD+A  +LN +++GDK L V+RA+ 
Sbjct: 350 LLTSFGDLKAFNLVRENGNGPSKGFAFFEYVDPAVTDVAIQSLNEMELGDKYLVVQRASV 409

Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADAL 433
             ++      I  QA     I K  L    +  +           A++L +   +  + L
Sbjct: 410 GAKNG----QIPPQALYPTEIPKPILPAGDLEGVE----------ARILLMLNMVVPEDL 455

Query: 434 ADDEEYEEILEDMREECGKYGTLVNVVIPRP-----------------DQNGGETPGVGK 476
            DD+EY +I ED+++EC K+G + ++ IPRP                  Q   E  GVG+
Sbjct: 456 NDDQEYADIYEDVKDECEKHGPIEDLRIPRPVKKDKAKWGESGLDPLSAQRVDEAAGVGR 515

Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           V++ +  A G   A  AL+GR F G ++ A
Sbjct: 516 VYVRFVGADGAKRALKALAGRSFAGRSIIA 545


>gi|194884971|ref|XP_001976363.1| GG20057 [Drosophila erecta]
 gi|190659550|gb|EDV56763.1| GG20057 [Drosophila erecta]
          Length = 440

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 199/362 (54%), Gaps = 38/362 (10%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIG--GNSAGPGDAVVNVYINHEKKFAFVE 228
           TR ARR+YVG +P    E+ +  FF+Q + A+G  G     G AV+    N EK FAF+E
Sbjct: 101 TRQARRLYVGNIPFGVTEEEMMKFFNQQLLALGLAGAQYMDGKAVLTCQTNLEKNFAFLE 160

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALG----------PGQPSPNLNL 278
            R+++EA+ A+  DGI+F G  +++RRP DY P  +  +           P     +  L
Sbjct: 161 FRSMDEATQALNFDGILFRGQVLKIRRPHDYQPVPSIRVSAMESYRSFRLPDNTVTHPPL 220

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
           A + L+S      + P+++FVGGLP    + QI++LL+SFG L   +LVKD +T  SKG+
Sbjct: 221 ATIPLSSIV---PDSPNKIFVGGLPTCLGQDQIRDLLQSFGELKRLNLVKDTNTCLSKGF 277

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            F  Y DP VTD A A L+G+++G++ L V+R+   G             +  ++ Q+  
Sbjct: 278 AFFEYFDPTVTDHAIAGLHGMQLGNRRLVVQRSIPGG-------------KHAVSGQQPL 324

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           +Q  G++TL     L   +  +++CL   +  + L D+EE+E+I  D+ +EC KYG + +
Sbjct: 325 VQVPGISTL-----LDPGSPTEIICLLNMVLPEELLDNEEFEDIRTDIEQECAKYGEVRS 379

Query: 459 VVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           + IPRP    G+ P  G GKVF+++        A  ALSGRKF G  V   +Y  +KY  
Sbjct: 380 IKIPRPI---GQAPKRGCGKVFVQFESVEDSQRALKALSGRKFSGRIVMTSFYDPEKYLL 436

Query: 517 KD 518
            D
Sbjct: 437 DD 438


>gi|324503285|gb|ADY41429.1| Splicing factor U2AF 65 kDa subunit [Ascaris suum]
          Length = 522

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 138/357 (38%), Positives = 201/357 (56%), Gaps = 32/357 (8%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+    T  +RR+YVG +P   +E A+  FF+Q M  + G +  PG+ V+   +N +K
Sbjct: 198 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQMH-LCGLAQAPGNPVLACQMNLDK 256

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+++E +  MA DGI F G  +++RRP DY P ++ +   G      N+    
Sbjct: 257 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQP-MSTSYDMG------NMMVSN 309

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           +       A+ P ++F+GGLP Y    Q+KELL SFG L  F+LV D  TG SKGY F  
Sbjct: 310 IV------ADSPYKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTDVSTGVSKGYAFAE 363

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y DP++TD A A LNG+++GDK L V+ + A+ ++     S  A  Q         +Q +
Sbjct: 364 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARAAM---STTAFPQ---------IQVA 411

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
           G++   G          +VLCL   +T + L +DEEYE+ILED+REEC KYG + ++ +P
Sbjct: 412 GIDLSHG-----AGPPTEVLCLMNMVTEEELKEDEEYEDILEDIREECAKYGFVKSIEVP 466

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           R    G +  GVGKVF+E+     C  A+ AL+GRKF   TV   YY  D Y  + +
Sbjct: 467 R-SIPGVDVTGVGKVFVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDLYHRRQF 522


>gi|392589921|gb|EIW79251.1| hypothetical protein CONPUDRAFT_83522 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 411

 Score =  218 bits (554), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 133/384 (34%), Positives = 201/384 (52%), Gaps = 51/384 (13%)

Query: 156 GAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
           G  P +PVQ            +R +RR+Y+G + P  NEQ +A FF+  M  +   + GP
Sbjct: 27  GLPPPIPVQTFGMGIGSNPNLSRQSRRLYIGSITPDVNEQNLADFFNGKMIEMSIGTGGP 86

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           G+ V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY        G 
Sbjct: 87  GNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG-------GD 139

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
              +PN ++   G+ S  +     PD   ++FVGGLP Y  E Q+ ELL+SFG L  F+L
Sbjct: 140 AIMAPNFHVP--GVVSTNV-----PDSIHKIFVGGLPPYLNEEQVMELLKSFGELKAFNL 192

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
           V++   G SKG+ F  Y D +VTD+A  +LNG+++GD+ L V+RA+   +  T       
Sbjct: 193 VRENGNGPSKGFAFFEYVDSSVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNL 252

Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
              Q   I +  +     +T            A++L +   +T D L DD+EY +I ED+
Sbjct: 253 PYDQFPEIPRPIMPAGDGST----------EDARILLMLNMVTVDDLQDDDEYGDIYEDV 302

Query: 447 REECGKYGTLVNVVIPRPDQN------------------GGETPGVGKVFLEYYDAVGCA 488
           +EEC K+G + ++ IPRP +                     E  GVG+V++++ D  G  
Sbjct: 303 KEECSKHGAVEDLRIPRPIKKDKSKWGDAGQQSQTDAARADEAAGVGRVYVKFVDGDGAQ 362

Query: 489 TAKNALSGRKFGGNTVNAFYYPED 512
            A  +L+GR F G ++ A    ED
Sbjct: 363 RAMKSLAGRSFAGRSIIATVLSED 386


>gi|336381013|gb|EGO22165.1| hypothetical protein SERLADRAFT_371639 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 381

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 131/378 (34%), Positives = 200/378 (52%), Gaps = 47/378 (12%)

Query: 161 MPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
           MPVQ            +R +RR+Y+G + P  NEQ +A FF+  M  +   +  PG+ V+
Sbjct: 1   MPVQTFGMGIGSNPNLSRQSRRLYIGSITPDVNEQNLADFFNSKMIEMSIGTGAPGNPVL 60

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
            V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY        G    +P
Sbjct: 61  AVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYG-------GVDMSAP 113

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           ++++   G+ S  +   +  ++VFVGGLP Y  E Q+ ELL+SFG L  F+LV++   G 
Sbjct: 114 SVHVP--GVVSTNV--PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFNLVRENGNGP 169

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKG+ F  Y D +VTD+A  +LNG+++GD+ L V+RA+   +  T          Q   I
Sbjct: 170 SKGFAFFEYVDISVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNLPYDQFPEI 229

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
            +  +  +G N+            A++L +   +T D L DD+EY ++ ED++EEC  YG
Sbjct: 230 PR-PIMPAGENSSAD---------ARILLMLNMVTPDDLTDDQEYGDLYEDVKEECSVYG 279

Query: 455 TLVNVVIPRPDQNG--------------------GETPGVGKVFLEYYDAVGCATAKNAL 494
            + ++ IPRP +                       E  GVG+V+++Y DA     A  AL
Sbjct: 280 AVEDLRIPRPVKKDKSKWAPGEVGHQSAMDAIRQDEAAGVGRVYVKYIDADSANNALKAL 339

Query: 495 SGRKFGGNTVNAFYYPED 512
           +GR F G ++ A    ED
Sbjct: 340 AGRSFAGRSIIATLLSED 357


>gi|195393580|ref|XP_002055432.1| GJ19364 [Drosophila virilis]
 gi|194149942|gb|EDW65633.1| GJ19364 [Drosophila virilis]
          Length = 476

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 150/444 (33%), Positives = 217/444 (48%), Gaps = 88/444 (19%)

Query: 131 GQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQA 190
           GQ+P   S VP+  Q  +P               V+    TR ARR+YVG +P    E+ 
Sbjct: 66  GQIPA--SVVPDTPQTAVP---------------VVGSTITRQARRLYVGNIPFGVTEEE 108

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+Q M  +G   A  G  V+   IN +K FAF+E R+++E + AMA DGI  +G  
Sbjct: 109 MMEFFNQQMHLVGLAQAA-GSPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINLKGQD 167

Query: 251 VRVR-----------------RPTDYNPTLAAALGPGQPS-------PN----------- 275
           +++R                 +P   +  + + + P  P        PN           
Sbjct: 168 LKIRRPHDYQPMPGITDTPAVKPAVVSSGVISTVVPDSPHKIFIGGLPNYLNDEQKEFTL 227

Query: 276 ---LNLAAVGLAS-----GAIGGAEGPDRVF------------VGGLPYYFTETQIKELL 315
              L++ A    +     GAI   E   R+                L    T   +KELL
Sbjct: 228 NAFLDIGACKKVTPHTNTGAIASLEVDPRIVNLIDELLIRRTVKASLGSSATSRFVKELL 287

Query: 316 ESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
            SFG L  F+LVKD  TG SKGY FC Y D ++TD + A LNG+++GDK L V+RA+   
Sbjct: 288 LSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKLIVQRASVGA 347

Query: 376 QSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALAD 435
           ++        AQ   + + Q + LQ  G++T+     +      +VLCL   +T D L D
Sbjct: 348 KN--------AQNAANTS-QSVMLQVPGLSTV-----VTSGPPTEVLCLLNMVTPDELRD 393

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
           +EEYE+ILED++EEC KYG + +V IPRP + G E PG GKVF+E+   + C  A+ AL+
Sbjct: 394 EEEYEDILEDIKEECTKYGVVRSVEIPRPIE-GVEVPGCGKVFVEFNSVLDCQKAQQALT 452

Query: 496 GRKFGGNTVNAFYYPEDKYFNKDY 519
           GRKF    V   Y+  DKY  +++
Sbjct: 453 GRKFSDRVVVTSYFDPDKYHRREF 476


>gi|195057468|ref|XP_001995263.1| GH23055 [Drosophila grimshawi]
 gi|193899469|gb|EDV98335.1| GH23055 [Drosophila grimshawi]
          Length = 453

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 125/363 (34%), Positives = 198/363 (54%), Gaps = 46/363 (12%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
            TR ARR+YVG +P    +  +  FF+  +  + G     G AV+    N EK FAF+E+
Sbjct: 123 VTRQARRLYVGNIPFNTTDDEMRAFFNVQIQRMCGALENDGKAVLTCQTNLEKNFAFLEL 182

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNP--TLAAALGPGQPSPNLNLAAVGLASGA 287
           R+++E + A++ DGI + G ++++RRP DY+   T  + +G           A G  SGA
Sbjct: 183 RSMDETTLAISFDGINYRGQSLKIRRPHDYHAGGTTGSFVG-----------ATGYVSGA 231

Query: 288 IGGA---------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
           +  +         + P ++++GGLP    + Q+KELL +FG L GF++VKD + G+ KGY
Sbjct: 232 VVQSNAAIATVVPDTPHKIYIGGLPTCLNDDQVKELLMTFGHLRGFNMVKD-ELGHGKGY 290

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            FC Y D ++T+ A A LNG+++G++ L V+R+ A  ++    +  + Q     A  K+ 
Sbjct: 291 AFCEYMDASITEQAIAGLNGMQLGERKLIVQRSLAGVRNLVTHQLPVLQVPGFPADVKVG 350

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
             T                  +VLCL   +  D L DD EYE+I +D++EEC K+G +++
Sbjct: 351 KAT------------------EVLCLLNMVMPDELLDDAEYEDIRKDIKEECAKFGKVIS 392

Query: 459 VVIPRPDQNGGET--PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           + IPRP    GE+  PG GKV++ +     C  A  ALSGR+F G  V   +Y  +K+  
Sbjct: 393 IKIPRP---FGESPQPGCGKVYVRFETTDVCKKALKALSGRRFSGRIVMTSFYDPNKFKR 449

Query: 517 KDY 519
           KD+
Sbjct: 450 KDF 452


>gi|358059688|dbj|GAA94557.1| hypothetical protein E5Q_01209 [Mixia osmundae IAM 14324]
          Length = 564

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 155/458 (33%), Positives = 228/458 (49%), Gaps = 61/458 (13%)

Query: 93  RNRSKSLSPSRSPSKSKRRS----------GFDMAPPAAAMLPGA-AVPGQLPGVPSAVP 141
           R   +  +P  +   SKRR           GF+      A + G   +PGQ  GV    P
Sbjct: 133 REEVREKTPENTIPISKRRRAQTAWDVRPIGFETVSAETARMSGHFLLPGQ-NGVVRFPP 191

Query: 142 EMAQNMLPFGATQL-----GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
              +    FG   +      A P+  VQ +   A R  RR+YVG + P A+EQ +  FF+
Sbjct: 192 GFHEGRGAFGGLNMSGAGSAAAPMGGVQPIISFA-RQQRRLYVGNIMPTADEQNVTEFFN 250

Query: 197 QVMTAIGGN------SAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
             M   G +           D VV+V +NHEK +AFVE R+ EEAS+AM+ DGI+F+   
Sbjct: 251 AKMRENGLSLDDKKVDVQTADPVVSVQVNHEKSYAFVEFRSPEEASSAMSFDGIVFQDQQ 310

Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ 310
           +++RRP DY          G  S   +L  V ++S      + P++VFVGGLP Y  + Q
Sbjct: 311 LKIRRPKDYT---------GDESGGTHLPGV-ISSNV---PDTPNKVFVGGLPSYLDDEQ 357

Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
           + ELL SFG L  F+LVK+     SKG+ FC Y DP VTD ACA LNG+++GD+ L V+R
Sbjct: 358 VLELLSSFGELRSFNLVKEGPQNASKGFAFCEYADPNVTDAACAGLNGMEIGDRYLVVQR 417

Query: 371 ATASGQSKTEQESILAQAQQHI--AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLT--E 426
           A   G +  +       +   +  A+ ++A    G +          ET     CL    
Sbjct: 418 AQV-GANVYKHPGGYGGSNPALPPALARVAPTIFGQD----------ETAPATRCLQMLN 466

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP--DQNG-------GETPGVGKV 477
            +T + L DD++Y +I ED+++EC KYG +++V IPRP   +NG            +GK+
Sbjct: 467 MVTPEELVDDQDYADINEDIKDECSKYGEVIDVKIPRPIKTENGRMDVKASESVEHLGKI 526

Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           F+ +        A +A++GR+FGG  V   Y  E+ + 
Sbjct: 527 FVMFDSTESSKKAIDAIAGRQFGGRLVICAYEKEETFL 564


>gi|219116422|ref|XP_002179006.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409773|gb|EEC49704.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 325

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 186/348 (53%), Gaps = 32/348 (9%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           Q TRHARR+YVG LPP   E AI   F + +  I   +    D V++ YINHE++F FVE
Sbjct: 2   QQTRHARRLYVGNLPPHITEDAIHVEFRRAI-EIASPTPLSEDPVLSTYINHERRFCFVE 60

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +TVE A+  M LDG+  +GV V+V+RP DYN  +A  + P    P L+++ +G+ SG +
Sbjct: 61  FKTVEMATACMNLDGLHVQGVPVKVKRPNDYNANMAPKIHPSA-LPPLDVSKLGIVSGTV 119

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV-KDRDTGNSKGYGFCVYQDPA 347
              +GP+++F+GGL Y+  ++Q+ ELL++FG +  F LV  D ++  SKGY F  Y DP 
Sbjct: 120 --EDGPNKIFIGGLHYHLQDSQVMELLQAFGKIKAFHLVSNDPESNMSKGYCFVEYADPN 177

Query: 348 VTDIACAALNGLKMGD-KTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
           +T IA   LNG+ +G+ K LT R A            +         +  +         
Sbjct: 178 ITPIAVQGLNGMDIGNGKALTARLAGDRTGGAGGAAFLAHAMDPQNGVPNIP-------- 229

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
                        +VL L   +T + LA D EY+ + +++++EC K+G L  + IPR   
Sbjct: 230 ------------TRVLVLHNMVTDEDLATDTEYQGLFDEVKDECAKFGRLERLEIPR--- 274

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
              + P   KVFL Y        A++ L GR+FG N V   ++PE ++
Sbjct: 275 ---QGPAARKVFLGYVTVAEAMQAQHELQGRQFGPNVVQTTFFPESEF 319


>gi|346978171|gb|EGY21623.1| splicing factor U2AF 50 kDa subunit [Verticillium dahliae VdLs.17]
          Length = 589

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 139/425 (32%), Positives = 212/425 (49%), Gaps = 61/425 (14%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   T+L AF   P
Sbjct: 173 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPTKLQAFMNQP 224

Query: 163 VQV----MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
             V    +    +R A+R+ V  LP  A E ++A+FF+  +   G N     D  V+  +
Sbjct: 225 GTVNSASLKPSNSRQAKRLLVSKLPSSATEDSVASFFNLQLN--GLNVIESTDPCVSCQL 282

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFE---------GVAVRVRRPTDYNPTLAAALGP 269
           +++K F  VE R   EA+ A+ALDGI  E         G  + +RRP DY         P
Sbjct: 283 SNDKSFCVVEFRNASEATVALALDGISMEADSATDGAAGRGLEIRRPKDYIVPAVTEELP 342

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
            +P         G+ S  +   + P+++ + G P Y TE Q+ ELL SFG L  F LV+D
Sbjct: 343 YEP---------GVVSSNV--VDTPNKLSITGFPPYLTEEQVTELLTSFGELKAFVLVRD 391

Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           R T  S+G+ FC Y D A  D+A   L+G+ +G+  L +++A+                 
Sbjct: 392 RHTDESRGFVFCEYVDSAANDVAIQGLSGMDLGNSKLKIQKAS----------------- 434

Query: 390 QHIAIQKMA---LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
             I + ++A   +  + M+ L G  +   E  ++VL L   +TAD L D+E+YEEI+ED+
Sbjct: 435 --IGVTQVAGVEMGVAAMSMLAGTTATDSEE-SRVLQLLNMVTADELMDNEDYEEIVEDV 491

Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           +EEC KYGT++ V +PRP     ++ GVGK+F++Y        A  +L+GRKF   TV  
Sbjct: 492 QEECAKYGTVLEVKVPRPVGGSRQSAGVGKIFVKYETKEATKKALQSLAGRKFADRTVVT 551

Query: 507 FYYPE 511
            Y+PE
Sbjct: 552 TYFPE 556


>gi|85111663|ref|XP_964044.1| hypothetical protein NCU03039 [Neurospora crassa OR74A]
 gi|28925805|gb|EAA34808.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 584

 Score =  214 bits (546), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 141/428 (32%), Positives = 223/428 (52%), Gaps = 59/428 (13%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   T+L AF   P
Sbjct: 191 RKRRMTQWDIKPPGYGNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMTQP 242

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +   A     +R ++R+ V  +PP A ++++  FF+  +   G N     D  V   
Sbjct: 243 GGAVNSAALKPTNSRQSKRLIVSNIPPSATDESLLGFFNLQLN--GLNVIDSADPCVQCQ 300

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------VAVRVRRPTDYNPTLAAALG 268
           I+ +  FA +E R   +A+ A+ALDGI  E            +++RRP DY   +  A+ 
Sbjct: 301 ISPDHSFAMLEFRNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDY---IVPAI- 356

Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
                PN +  +   +S  I   + P+++ V  +P Y +E QI ELL +FG L  F LVK
Sbjct: 357 --VEDPNYDPDSEVPSSIVI---DSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVLVK 411

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D+ T  S+G  FC Y D +VT +A   LN + +GD+ L V++A+                
Sbjct: 412 DKHTEESRGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKAS---------------- 455

Query: 389 QQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
                IQ++A  L  + M+ L G  SL G+ +++V+ L   +TAD L D+++YEEI +D+
Sbjct: 456 ---YGIQQVAGELSVNAMSMLAGTTSLDGD-VSRVVQLLNMVTADELMDNDDYEEIRDDV 511

Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           +EEC K+GT+V++ IPRP     ++ GVGK+F++Y ++     A  AL+GRKF   TV A
Sbjct: 512 QEECEKFGTIVSLKIPRPTGGSRQSAGVGKIFIKYENSDQATKALKALAGRKFADRTVVA 571

Query: 507 FYYPEDKY 514
            Y+PE+ +
Sbjct: 572 TYFPEENF 579


>gi|392565476|gb|EIW58653.1| hypothetical protein TRAVEDRAFT_37512 [Trametes versicolor
           FP-101664 SS1]
          Length = 548

 Score =  214 bits (545), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 151/461 (32%), Positives = 229/461 (49%), Gaps = 75/461 (16%)

Query: 91  RFRNRSKSLSPSRSPSKSKRR---SGFDMAPPA----AAM---------LPGA---AVPG 131
           R     +S +PS +   S+R+   SG+D+  P      AM         LPGA    +P 
Sbjct: 100 RVMTTRRSPTPSDATPLSQRKRKASGWDVHAPGYEQYTAMQAKQTGLFNLPGANRTQIPP 159

Query: 132 QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
            L  +P   P M  N    G    G  P           +R +RR+Y+G + P  NEQ +
Sbjct: 160 IL-AIPGLPPPMPVNTFGMGT---GVNP---------NLSRQSRRLYIGSITPDINEQNL 206

Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAV 251
             FF+  M  +   +  PG+ V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +
Sbjct: 207 TDFFNSKMKEMNLGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLNGPL 266

Query: 252 RVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI 311
           ++RRP DY          G P    N+   G+ S  +   +  +++FVGGLP Y  E Q+
Sbjct: 267 KIRRPKDY----------GGPDMLANMHVPGVVSTNV--PDSANKIFVGGLPTYLNEEQV 314

Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
            ELL SFG L  F+LV++   G SKG+ F  Y DP+VTD+A  +L+G+++GDK L V+RA
Sbjct: 315 MELLSSFGELKAFNLVRENGNGPSKGFAFFEYVDPSVTDVAIPSLSGMELGDKYLVVQRA 374

Query: 372 TA---SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
           +     GQS      +   A + I    M +    +  +            ++L +   +
Sbjct: 375 SVGAKPGQSPIPGMGMFDMAPE-IPKPIMPVGERDLEAMQD----------RILLMLNMV 423

Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-----------------DQNGGET 471
             + L+DD+EY ++ ED++EEC KYGT+ ++ IPRP                  Q   E 
Sbjct: 424 VPEELSDDQEYGDLYEDVKEECEKYGTVEDLRIPRPVKKDKAKWGEGRESAIAAQRADEA 483

Query: 472 PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
            GVG+V+++++D    A A  AL+GR F G ++ A    +D
Sbjct: 484 AGVGRVYVKFHDPRAAANALKALAGRSFAGRSIIATLLTDD 524


>gi|336465212|gb|EGO53452.1| hypothetical protein NEUTE1DRAFT_92746 [Neurospora tetrasperma FGSC
           2508]
          Length = 584

 Score =  214 bits (545), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 141/428 (32%), Positives = 223/428 (52%), Gaps = 59/428 (13%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   T+L AF   P
Sbjct: 191 RKRRMTQWDIKPPGYGNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMTQP 242

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +   A     +R ++R+ V  +PP A ++++  FF+  +   G N     D  V   
Sbjct: 243 GGAVNSTALKPTNSRQSKRLIVSNIPPSATDESLLGFFNLQLN--GLNVIDSADPCVQCQ 300

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------VAVRVRRPTDYNPTLAAALG 268
           I+ +  FA +E R   +A+ A+ALDGI  E            +++RRP DY   +  A+ 
Sbjct: 301 ISPDHSFAMLEFRNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDY---IVPAI- 356

Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
                PN +  +   +S  I   + P+++ V  +P Y +E QI ELL +FG L  F LVK
Sbjct: 357 --VEDPNYDPDSEVPSSIVI---DSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVLVK 411

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D+ T  S+G  FC Y D +VT +A   LN + +GD+ L V++A+                
Sbjct: 412 DKHTEESRGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKAS---------------- 455

Query: 389 QQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
                IQ++A  L  + M+ L G  SL G+ +++V+ L   +TAD L D+++YEEI +D+
Sbjct: 456 ---YGIQQVAGELSVNAMSMLAGTTSLDGD-VSRVVQLLNMVTADELMDNDDYEEIRDDV 511

Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           +EEC K+GT+V++ IPRP     ++ GVGK+F++Y ++     A  AL+GRKF   TV A
Sbjct: 512 QEECEKFGTIVSLKIPRPTGGSRQSAGVGKIFIKYENSDQATKALKALAGRKFADRTVVA 571

Query: 507 FYYPEDKY 514
            Y+PE+ +
Sbjct: 572 TYFPEENF 579


>gi|449680331|ref|XP_002158219.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like, partial [Hydra
           magnipapillata]
          Length = 259

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 120/293 (40%), Positives = 161/293 (54%), Gaps = 35/293 (11%)

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R+VEE + AMA DGI+ +G A+++RRP DY P    +       P       G+ S  +
Sbjct: 1   FRSVEETTLAMAFDGIMLQGQALKIRRPKDYQPIPGISEMQATHIP-------GVVSTVV 53

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
             ++  ++VFVGGLP Y  E Q+KELL +FG L  F+LVKD  TG SKGY FC Y D  +
Sbjct: 54  --SDTINKVFVGGLPNYLNEDQVKELLSTFGDLRSFNLVKDSATGLSKGYAFCEYVDIGI 111

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           TD+A A +NG+++GDK L V+RA+   ++ T Q +I                        
Sbjct: 112 TDVAIAGMNGMQLGDKKLVVQRASVGSKTMTAQLNI------------------------ 147

Query: 409 GGMSLFGE-TLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
            G  L  E T   +LCL   + AD L DDE+Y+EI ED+REEC KYG + ++ IPRP+  
Sbjct: 148 PGFDLSKEITATNILCLMNMVVADELIDDEDYDEIFEDIREECSKYGRIRSMQIPRPNHE 207

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
                GVGKVF+EY  +     A  AL+GRKF    V   YY  D Y   D+S
Sbjct: 208 -FLVSGVGKVFIEYATSEESKIASEALAGRKFANRVVVTAYYDPDAYHQHDFS 259


>gi|390596686|gb|EIN06087.1| hypothetical protein PUNSTDRAFT_145447 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 602

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 125/361 (34%), Positives = 190/361 (52%), Gaps = 53/361 (14%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R +RR+Y+G + P   EQ +A FF+  M  +   + GPG+ V+ V  N+EK +AFVE R
Sbjct: 240 SRQSRRLYIGSITPEITEQNLADFFNSKMIEMSIGTGGPGNPVLAVQCNYEKNYAFVEFR 299

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           + E+A+ AMA DGIIF    +++RRP DY          G P P       GL    +  
Sbjct: 300 SAEDATAAMAFDGIIFLNGPLKIRRPKDY----------GGPDP----MGAGLHVPGVVS 345

Query: 291 AEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
              PD   +VFVGGLP Y  E Q+ ELL SFG L  F+LV++   G SKG+ F  Y D +
Sbjct: 346 TNVPDSINKVFVGGLPAYLNEEQVMELLTSFGELKAFNLVRENGNGPSKGFAFFEYVDES 405

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           VTD+A  ALNG+++GD+ L V+RA+   +            +Q   + +  +    ++  
Sbjct: 406 VTDVAIQALNGMELGDRYLVVQRASVGAKPGMPN----LPYEQFPELPRPIMPAGDVSNR 461

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
                      A++L +   +  + L DD+EY +I ED++EEC K+G + ++ IPRP + 
Sbjct: 462 D----------ARILLMLSMVVPEDLVDDQEYADICEDVKEECEKFGAVEDLRIPRPAKR 511

Query: 468 ---------GGETP-------------GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
                    GG+ P             GVG+V++++ +A     A  AL+GR FGG ++ 
Sbjct: 512 DRTKWGEGAGGQAPMSAQDYQRMDEAAGVGRVYVKFREARAAGDALKALAGRSFGGRSII 571

Query: 506 A 506
           A
Sbjct: 572 A 572


>gi|332375140|gb|AEE62711.1| unknown [Dendroctonus ponderosae]
          Length = 374

 Score =  213 bits (543), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 119/305 (39%), Positives = 174/305 (57%), Gaps = 36/305 (11%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E  +  +F+Q M  + G +   G+ V+   I
Sbjct: 102 PQAAVPVVGSTITRQARRLYVGNIPFGVTEDEMMEYFNQQM-HLSGLAQAAGNPVLACQI 160

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY           QP P ++ 
Sbjct: 161 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMSE 209

Query: 279 AAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
            ++ + +G I     + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD   G SK
Sbjct: 210 NSISVPAGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSK 269

Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
           GY F  Y D ++TD A A LNG+++GDK L V+RA+   ++ T    +L   Q  +    
Sbjct: 270 GYAFAEYIDISMTDQAIAGLNGMQLGDKRLIVQRASVGAKNAT----VLPAVQIQVP--- 322

Query: 397 MALQTSGMNTLGGGMSLFGET--LAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
                        G+SL G +    +VLCL   +T D L D+EEYE+ILED++EEC KYG
Sbjct: 323 -------------GLSLVGASGPPTEVLCLLNMVTPDELKDEEEYEDILEDIKEECNKYG 369

Query: 455 TLVNV 459
            + ++
Sbjct: 370 VVRSI 374


>gi|350295506|gb|EGZ76483.1| hypothetical protein NEUTE2DRAFT_76972 [Neurospora tetrasperma FGSC
           2509]
          Length = 592

 Score =  213 bits (543), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 140/428 (32%), Positives = 223/428 (52%), Gaps = 59/428 (13%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   T+L AF   P
Sbjct: 199 RKRRMTQWDIKPPGYGNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMTQP 250

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +   A     +R ++R+ V  +PP A ++++  FF+  +   G N     D  V   
Sbjct: 251 GGAVNSAALKPTNSRQSKRLIVSNIPPSATDESLLGFFNLQLN--GLNVIDSADPCVQCQ 308

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------VAVRVRRPTDYNPTLAAALG 268
           I+ +  FA +E +   +A+ A+ALDGI  E            +++RRP DY   +  A+ 
Sbjct: 309 ISPDHSFAMLEFKNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDY---IVPAI- 364

Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
                PN +  +   +S  I   + P+++ V  +P Y +E QI ELL +FG L  F LVK
Sbjct: 365 --VEDPNYDPDSEVPSSIVI---DSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVLVK 419

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D+ T  S+G  FC Y D +VT +A   LN + +GD+ L V++A+                
Sbjct: 420 DKHTEESRGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKAS---------------- 463

Query: 389 QQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
                IQ++A  L  + M+ L G  SL G+ +++V+ L   +TAD L D+++YEEI +D+
Sbjct: 464 ---YGIQQVAGELSVNAMSMLAGTTSLDGD-VSRVVQLLNMVTADELMDNDDYEEIRDDV 519

Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           +EEC K+GT+V++ IPRP     ++ GVGK+F++Y ++     A  AL+GRKF   TV A
Sbjct: 520 QEECEKFGTIVSLKIPRPTGGSRQSAGVGKIFIKYENSDQATKALKALAGRKFADRTVVA 579

Query: 507 FYYPEDKY 514
            Y+PE+ +
Sbjct: 580 TYFPEENF 587


>gi|380481793|emb|CCF41637.1| U2 snRNP auxilliary factor [Colletotrichum higginsianum]
          Length = 550

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 145/459 (31%), Positives = 225/459 (49%), Gaps = 63/459 (13%)

Query: 78  RRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA-----------AAMLP 125
           RR R RS +    + R  +  L+   S  + KRR + +D+ PP            + M P
Sbjct: 128 RRERQRSATPPPKK-REPTPDLTDITSVLERKRRLTQWDIKPPGYENVTAEQAKLSGMFP 186

Query: 126 GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPL 185
               P Q P  PS +    Q ++     Q+ +  L P        +R A+R+ +  LPP 
Sbjct: 187 LPGAPRQQPIDPSKL----QAIMNQPGGQVNSAALKPSN------SRQAKRLLINNLPPS 236

Query: 186 ANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGII 245
           A E++I +FF+  +   G N     D   +  ++ +  FA VE R   EA+ A+ALDGI 
Sbjct: 237 ATEESIQSFFNLQLN--GLNIIESADPCTSCQVSKDNSFAVVEFRNASEATIALALDGIS 294

Query: 246 FEG----------VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
            E             + + RP DY         P +P         G+ S  +   + P 
Sbjct: 295 MEADDATNGEAANQGLSIHRPKDYIVPAVVDDVPYEP---------GVVSNVV--IDTPS 343

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++ +  LP Y ++ Q+ ELL SFG L  F LV+DR T  S+G  FC Y DPA TD+A   
Sbjct: 344 KLSIANLPTYLSDEQVSELLVSFGELKAFVLVRDRSTEESRGIAFCEYVDPAATDVAIQG 403

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
           LNG+ +GDK L V++A+  G ++      +A  +  +A   M   T+  ++        G
Sbjct: 404 LNGMDLGDKKLRVQKASV-GVTQ------VAGVEMGVAAMSMLAGTTSTDS--------G 448

Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
           ET  +VL L   +T + L D+++YEEI ED++EEC K+G ++++ IPRP     ++ GVG
Sbjct: 449 ET--RVLQLLNMVTPEELMDNDDYEEIKEDVQEECSKFGNVLDIKIPRPVGGSRQSAGVG 506

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           K+F+ + +      A  AL+GRKF   TV   Y+PE+ +
Sbjct: 507 KIFVRFENTESAKKALQALAGRKFADRTVVTTYFPEENF 545


>gi|393236224|gb|EJD43774.1| hypothetical protein AURDEDRAFT_137718 [Auricularia delicata
           TFB-10046 SS5]
          Length = 389

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 193/353 (54%), Gaps = 41/353 (11%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIG-GNSAGPGDAVVNVYINHEKKFAFVEMR 230
           R +RR+Y+G + P   E+ +  FF+Q M  +  G     GD V+ V +N+EK +AFVE R
Sbjct: 26  RQSRRLYIGSITPEITEENLTKFFNQKMREMNLGQQNASGDPVLAVQVNYEKNYAFVEFR 85

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           + ++A+ AMA DGIIF+   +R+RRP DY       +G    +P+  +   G+ S  +  
Sbjct: 86  SADDATAAMAFDGIIFQSGPLRIRRPKDY-------MGNEYSAPSA-MHVPGVVSTNV-- 135

Query: 291 AEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
              PD   ++FVGGLP Y  E Q+ ELL+SFG L  F+LV++ + G SKGY F  Y D  
Sbjct: 136 ---PDSLHKIFVGGLPTYLNEEQVMELLKSFGELKAFNLVRENNNGPSKGYAFFEYVDEE 192

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           VT++A   LNG+++GD+ L V+RA+   ++       +      I   +M      +  L
Sbjct: 193 VTEVAIQGLNGMELGDRVLAVQRASVGSKNG------MVVPNPDIPYDQMPEVPRPIMPL 246

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
               +      A++L +   +  + L DDEE+ E+ ED++EEC K+G + ++ IPRP + 
Sbjct: 247 NEAPT----QDARILLMLNMVVPEDLVDDEEFAELYEDVKEECAKFGAVEDLRIPRPAKR 302

Query: 468 GG--------------ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
            G              E  GVG+V+++YY +   +TA  +L+GR F G ++ A
Sbjct: 303 AGPKYGPAAVEAQRVDEAAGVGRVYVKYYKSSDASTALRSLAGRSFAGRSIIA 355


>gi|299745153|ref|XP_001831503.2| rRNA primary transcript binding protein [Coprinopsis cinerea
           okayama7#130]
 gi|298406457|gb|EAU90350.2| rRNA primary transcript binding protein [Coprinopsis cinerea
           okayama7#130]
          Length = 550

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 126/359 (35%), Positives = 194/359 (54%), Gaps = 42/359 (11%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R +RR+Y+G + P  NEQ +A FF++ M  +   +   G+ V+ V  N+EK +AFVE R
Sbjct: 192 SRQSRRLYIGSITPDVNEQNLAEFFNKKMAEMNIGTGSTGNPVLAVQCNYEKNYAFVEFR 251

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           + ++A+ AMA DGIIF    +++RRP DY   +         SP +++   G  S  +  
Sbjct: 252 SADDATAAMAFDGIIFINGPLKIRRPKDYGGEVVTG------SPGIHVP--GAVSTNV-- 301

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            +  ++VFVGGLP Y  E Q+ ELL+SFG L  F+LV++   G SKG+ F  Y D +VTD
Sbjct: 302 PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFNLVRENGNGPSKGFAFFEYVDSSVTD 361

Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
           +A  +LNG+++GD+ L V+RA+   +             Q   I +  +  +G       
Sbjct: 362 VAIQSLNGMELGDRYLVVQRASVGAKPGAPG----LPYDQFPDIPR-PIMPAGAEV---- 412

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP------ 464
                 T A++L +   +T D L DDEEY ++ ED++EEC KYG + ++ IPRP      
Sbjct: 413 ------TDARILLMLNMVTPDDLIDDEEYGDLYEDVKEECSKYGEVEDLRIPRPVKKDKA 466

Query: 465 -----------DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
                       Q   E  GVG+V+++Y D  G   A N+L+GR F G ++ A    E+
Sbjct: 467 KWGEGQISAQDAQRIDEAAGVGRVYVKYADTEGANKALNSLAGRSFAGRSIIATLLSEE 525


>gi|302411252|ref|XP_003003459.1| splicing factor U2AF 65 kDa subunit [Verticillium albo-atrum
           VaMs.102]
 gi|261357364|gb|EEY19792.1| splicing factor U2AF 65 kDa subunit [Verticillium albo-atrum
           VaMs.102]
          Length = 568

 Score =  212 bits (539), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 162/543 (29%), Positives = 252/543 (46%), Gaps = 110/543 (20%)

Query: 40  FKSGGDDRRRDKNYKYDREGIRDHDRTDRH--------RDYNRDK--------------- 76
           ++ G   +  D +Y   R   R+ +R DR+        RD++RD+               
Sbjct: 63  YERGSRRQENDDSYAASRSH-REREREDRYSGRDRRAERDWDRDRGSSRRDARRDDDDRG 121

Query: 77  -----------ERRHRHRSRSHSSDRFRNRSKSLSP---SRSPS-----------KSKRR 111
                      +RR   R R+    R   R +S SP    R P+           + +R 
Sbjct: 122 ARRGGDRDQFDDRRRGGRDRNEEFARREQRPRSASPPPKKREPTPDLTDIVSVLDRKRRL 181

Query: 112 SGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQV-- 165
           + +D+ PP    +    A + G   LPG P           P   T+L AF   P  V  
Sbjct: 182 TQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPTKLQAFMNQPGTVNS 233

Query: 166 --MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
             +    +R A+R+ V  LP  A E+++A+FF+  +   G N     D  V+  ++++K 
Sbjct: 234 ASLKPSNSRQAKRLLVSKLPSSATEESVASFFNLQLN--GLNVIESTDPCVSCQLSNDKS 291

Query: 224 FAFVEMRTVEEASNAMALDGIIFE---------GVAVRVRRPTDYNPTLAAALGPGQPSP 274
           F  VE R   EA+ A+ALDGI  E         G  + +RRP DY         P +P  
Sbjct: 292 FCVVEFRNASEATVALALDGISMEADSGTDGAAGRGMEIRRPKDYIVPAVTEELPYEP-- 349

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
                  G+ S  +   + P+++ + G P Y TE Q+ ELL SFG L  F LV+DR T  
Sbjct: 350 -------GVVSSNV--VDTPNKLSITGFPPYLTEEQVTELLTSFGELKAFVLVRDRHTDE 400

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           S+G+ FC Y D A  D+A   L+G+ +G+  L +++A+                   I +
Sbjct: 401 SRGFVFCEYVDSAANDVAIQGLSGMDLGNSKLKIQKAS-------------------IGV 441

Query: 395 QKMA---LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG 451
            ++A   +  + M+ L G  +   E  ++VL L   +TAD L D+E+YEEI+ED++EEC 
Sbjct: 442 TQVAGVEMGVAAMSMLAGTTATDSEE-SRVLQLLNMVTADELMDNEDYEEIVEDVQEECA 500

Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           KYGT++ V +PRP     ++ GVGK+F++Y        A  +L+GRKF   TV   Y+PE
Sbjct: 501 KYGTVLEVKVPRPVGGSRQSAGVGKIFVKYETKEATKKALQSLAGRKFADRTVVTTYFPE 560

Query: 512 DKY 514
           + +
Sbjct: 561 ENF 563


>gi|336274240|ref|XP_003351874.1| hypothetical protein SMAC_00421 [Sordaria macrospora k-hell]
 gi|380096157|emb|CCC06204.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 594

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 140/428 (32%), Positives = 223/428 (52%), Gaps = 59/428 (13%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   T+L AF   P
Sbjct: 201 RKRRMTQWDIKPPGYGNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMTQP 252

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +   A     +R ++R+ V  +PP A ++++  FF+  +   G N     D  V   
Sbjct: 253 GGTVNSTALKPTNSRQSKRLIVSNIPPSATDESLLGFFNLQLN--GLNVIDSVDPCVQCQ 310

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVA---------VRVRRPTDYNPTLAAALG 268
           I+ +  FA VE R   +A+ A+ALDGI  E            +++RRP DY   +  A+ 
Sbjct: 311 ISPDHSFAMVEFRNSPDATVALALDGITMEADDANDAASAGGLKIRRPKDY---IVPAI- 366

Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
                PN +  +   ++  I   + P+++ V  +P Y TE QI ELL +FG L  F LVK
Sbjct: 367 --VEDPNYDPDSEVPSNIVI---DSPNKISVTNIPAYLTEEQIMELLVAFGKLKSFVLVK 421

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D+ T  S+G  FC Y D +VT +A   LN + +GD+ L V++A+                
Sbjct: 422 DKHTEESRGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKAS---------------- 465

Query: 389 QQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
                IQ++A  L  + M+ L G  SL G+ +++V+ L   +TAD L D+++YEEI +D+
Sbjct: 466 ---YGIQQVAGELSVNAMSMLAGTTSLDGD-VSRVVQLLNMVTADELMDNDDYEEIRDDV 521

Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           +EEC K+GT+V++ IPRP     ++ GVGK++++Y ++     A  +L+GRKF   TV A
Sbjct: 522 QEECEKFGTIVSLKIPRPTGGSRQSAGVGKIYIKYENSDQATKALKSLAGRKFADRTVVA 581

Query: 507 FYYPEDKY 514
            Y+PE+ +
Sbjct: 582 TYFPEENF 589


>gi|330803435|ref|XP_003289712.1| hypothetical protein DICPUDRAFT_56283 [Dictyostelium purpureum]
 gi|325080222|gb|EGC33787.1| hypothetical protein DICPUDRAFT_56283 [Dictyostelium purpureum]
          Length = 501

 Score =  211 bits (538), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 180/348 (51%), Gaps = 37/348 (10%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           + +RR+YVG +PP   +  +  FF+  + A   N   PG  VV   IN  K FAF+E RT
Sbjct: 158 KQSRRIYVGNIPPGITDSELIEFFNAAVLAANLN-VKPGPPVVFCQINAPKCFAFIEFRT 216

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDY----NPTLAAALGPGQPSPNLNLAAVGLASGA 287
            EEA+NAM  DGI  +   +++RRP DY    +PT  +AL    P+              
Sbjct: 217 PEEATNAMRFDGITLKNYTLKIRRPKDYQQSNDPTNTSALPTIVPT-------------- 262

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
               +   ++FVGGLP    E Q+K LL ++G L  F+LVKD +TG SKGY FC Y DP 
Sbjct: 263 -NVPDSEHKIFVGGLPSNLNEEQVKTLLSAYGKLKAFNLVKDTNTGISKGYAFCEYLDPD 321

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           VTD ACA+LNG+ + DK L V+RA+   Q+     S +               +   NT 
Sbjct: 322 VTDQACASLNGISLADKNLIVQRASIVAQTL----STIRSTVPSSPTTSTTQTSIDNNT- 376

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
                      ++V+ L   +  + L DD+EY+ IL D++EEC  +G   ++ +P P +N
Sbjct: 377 ---------KPSRVIQLLNLVDKEDLYDDKEYDNILIDVKEECENFGPTQSLWLPMPSKN 427

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
             +   V ++++E+        A   L G+K+ G T+ + YYPED Y+
Sbjct: 428 PLD---VTRIYIEFQQLESSQKACLGLGGKKYNGRTIFSAYYPEDLYY 472


>gi|296420976|ref|XP_002840043.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636253|emb|CAZ84234.1| unnamed protein product [Tuber melanosporum]
          Length = 540

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 124/389 (31%), Positives = 197/389 (50%), Gaps = 45/389 (11%)

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA-----TRHARRVYVGGLPPLAN 187
           LPG P   P           T+L AF   P  V T        +R A+R+ +  +PP  +
Sbjct: 185 LPGAPRQAP--------MDPTRLHAFMTQPSNVATASTLKPTNSRQAKRLLMSNIPPGTD 236

Query: 188 EQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE 247
           E+++  FF+Q ++++   + GP D + +V ++  K    +E +   +A+  +AL GI F 
Sbjct: 237 EESLLQFFNQTLSSLNVTTGGP-DPITSVQLSGSKILGLLEFKNTNDATVCLALSGIEFN 295

Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFT 307
           G  + ++RP DY       + P  P  + +    G+ S  +   + P+++ +  +P Y  
Sbjct: 296 GGNIEIKRPRDY-------IVPIVPEDHRHQEP-GVISSDV--PDTPNKILISEIPEYLQ 345

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+ ELL+SFG L  F LVKD     SKG  FC Y DP  T+IA   LN +++ D TL 
Sbjct: 346 DEQVIELLKSFGDLKAFVLVKDVTDETSKGIAFCEYLDPGTTEIAVEGLNAMEIADNTLR 405

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG--GGMSLFGETLAKVLCLT 425
           VRRA+                   I +++ A    G+N +    G +      ++VL L 
Sbjct: 406 VRRAS-------------------IGMKQAAGVEMGVNAMAMMAGTTSVDLEASRVLQLL 446

Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
             +TAD L D EEY+EI ED+R+EC K+G+LV++ +PRP     +  GVGK+F+ +    
Sbjct: 447 NMVTADELLDPEEYDEICEDIRDECQKFGSLVDMKVPRPSGGSRQAAGVGKIFVRFETQE 506

Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKY 514
             + A  +L+GRKF   TV   Y+ E+ Y
Sbjct: 507 SASNALRSLAGRKFADRTVVCTYFSEENY 535


>gi|224003073|ref|XP_002291208.1| U2 snRNP auxillary splicing factor, U2AF subunit [Thalassiosira
           pseudonana CCMP1335]
 gi|220972984|gb|EED91315.1| U2 snRNP auxillary splicing factor, U2AF subunit, partial
           [Thalassiosira pseudonana CCMP1335]
          Length = 352

 Score =  208 bits (530), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 129/361 (35%), Positives = 195/361 (54%), Gaps = 32/361 (8%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAI--GGNSAG---------PGDAVVNVYIN 219
           TRHARR+YVG +P +++EQ    F   + ++I    NS             D +++VYIN
Sbjct: 1   TRHARRLYVGNIPDVSDEQLHHFFRDAIRSSIILDNNSEAHSSHKHQYVDNDPIISVYIN 60

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVA-VRVRRPTDYNPTLAAALGPGQPSPNLNL 278
            E++FAF+E +T+E  +  MALDG+   G   V+++RP DYNP +A  L      P L+ 
Sbjct: 61  RERRFAFLEFKTMEITTACMALDGLDVMGRGKVKIKRPNDYNPAVAPMLN-ASTMPVLDT 119

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK-DRDTGNSKG 337
             +G+ S  +   +GP+++FVGGLPY+  ++Q+ ELL +FG +  F+LVK D  +  SKG
Sbjct: 120 GKLGIISMTV--HDGPNKIFVGGLPYHLVDSQVLELLSAFGAVKAFNLVKNDPMSDTSKG 177

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y F  Y DP VT IA   LNG+ MG            G+   +Q  ++            
Sbjct: 178 YCFVEYCDPNVTQIAAMGLNGMDMG-----------GGKQPYQQPIVVKDPMAVANAAAS 226

Query: 398 ALQ----TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           AL     + G+  L    ++ G T+ ++L L   +  + LA  E+ + + E++REE GKY
Sbjct: 227 ALDQAFGSGGVPPLAPPTAMPGSTVTRILVLLNMVMDEDLATAEDRKFLEEEVREEVGKY 286

Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
           GTL+++ IP P + G     V K+FLEY        A+  L GR FG N V+A Y+ E+ 
Sbjct: 287 GTLLSMKIPMPHE-GCAPSAVKKIFLEYATPAEAMYAEKELKGRAFGPNVVDASYFSEED 345

Query: 514 Y 514
           Y
Sbjct: 346 Y 346


>gi|440633242|gb|ELR03161.1| hypothetical protein GMDG_05987 [Geomyces destructans 20631-21]
          Length = 559

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 136/421 (32%), Positives = 211/421 (50%), Gaps = 52/421 (12%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 173 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMSQP 224

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +T  A     +R A+R+ V  LP   +E+ I  FF+  +   G N     D  +   
Sbjct: 225 SGSITNAALKPSNSRQAKRLLVHNLPKTLSEEGIVEFFNLQLN--GLNVVEGSDPCLTAQ 282

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE----GVAVRVRRPTDYNPTLAAALGPGQPS 273
           ++ +K FA VE +T  +A+ A+A+DGI  E      A+ +RRP DY   +  A+      
Sbjct: 283 VSKDKSFALVEFKTTSDATVALAMDGIGIEENGGSRALSIRRPKDY---IVPAVDEAMHE 339

Query: 274 PNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
           P       G+ +  +   + P+++ +  +P Y T+ Q+ ELL SFG L  F L KD  T 
Sbjct: 340 P-------GVVTNVV--PDTPNKISISNVPPYLTDEQVTELLVSFGELKAFVLAKDSTTE 390

Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
            S+G  FC Y D A TDIA   LNG+++GDK L V+RA+  G ++T        A   + 
Sbjct: 391 ESRGIAFCEYVDAAATDIAVEGLNGMELGDKHLKVQRASI-GTTQT--------AGLEMG 441

Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           +  M++          G +  G    +VL L   +TA+ L D+E+YEEILED++EEC KY
Sbjct: 442 VNAMSML--------AGTTTDGLDEGRVLQLLNMVTAEELIDNEDYEEILEDVKEECEKY 493

Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
           G ++++ IPRP     ++ GVGK+F+++        A   L+GRKF   TV   Y+ E+ 
Sbjct: 494 GKVLDIKIPRPSGGSRQSAGVGKIFVKFDTPASAGKALRTLAGRKFADRTVVTTYFSEEN 553

Query: 514 Y 514
           +
Sbjct: 554 F 554


>gi|426196755|gb|EKV46683.1| hypothetical protein AGABI2DRAFT_206147 [Agaricus bisporus var.
           bisporus H97]
          Length = 558

 Score =  207 bits (528), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 143/441 (32%), Positives = 225/441 (51%), Gaps = 51/441 (11%)

Query: 97  KSLSPSRSPSKSKRR---SGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGAT 153
           +S +P   PS S+R+   SG+D+  P       +A+  +  G+   +P   +  +P   +
Sbjct: 120 RSPTPENCPSLSQRKRKASGWDVHAPGYEQY--SAMQAKQTGL-FNLPGANRTQIPPILS 176

Query: 154 QLGAFPLMPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
             G  P MPVQ            +R +RR+Y+G +    NEQ +A FF+  M  +   + 
Sbjct: 177 IPGLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITQEVNEQNLADFFNAKMAEMNIGTG 236

Query: 208 GPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAAL 267
             G+ V+ V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY      + 
Sbjct: 237 ITGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYGGDTIVSP 296

Query: 268 G---PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
           G   PG  S N+               +  ++VFVGGLP Y  E Q+ ELL+SFG L  F
Sbjct: 297 GVHVPGVVSTNV--------------PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAF 342

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
           +LV++  TG SKG+ F  Y D AVTD+A  +LNG+++GD+ L V+RA+   +  T     
Sbjct: 343 NLVRENGTGTSKGFAFFEYVDQAVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIP 402

Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
                Q   I +  +      T          + +++L +   +T + L +D+EY ++ +
Sbjct: 403 NLPYDQFPEIPRPIMPAGKDQT---------SSESRILLMLNMVTPEDLHEDDEYGDLYD 453

Query: 445 DMREECGKYGTLVNVVIPRP-------------DQNGGETPGVGKVFLEYYDAVGCATAK 491
           D++ EC KYG L ++ IPRP              Q   E  GVG+V+++Y ++   + A 
Sbjct: 454 DVKAECSKYGELEDLRIPRPVKKDKTSSMSAQDAQRIDEAAGVGRVYVKYTNSKSASAAL 513

Query: 492 NALSGRKFGGNTVNAFYYPED 512
           NAL+GR F G ++ A    +D
Sbjct: 514 NALAGRSFAGRSIIATLLSDD 534


>gi|154319442|ref|XP_001559038.1| hypothetical protein BC1G_02202 [Botryotinia fuckeliana B05.10]
          Length = 596

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 133/430 (30%), Positives = 211/430 (49%), Gaps = 60/430 (13%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 200 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMAQP 251

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +T  A     +R ++R+ V  +P    E+ I +FF+  +   G N     D +++  
Sbjct: 252 SGQVTNAALKPSNSRQSKRLLVHNIPADTKEETIVSFFNLQLN--GLNVIEGSDPLISAQ 309

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPTLA 264
           ++ +  FA +E +T  +A+ A+ALDGI  +G               + +RRP DY     
Sbjct: 310 VSKDGSFALLEFKTQSDATVALALDGITMDGNDHMETENGSADTRGLSIRRPKDYIVPAV 369

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
               P +P         G+ S  +  A+  +++ V  +P+Y  + Q+ ELL SFG L  F
Sbjct: 370 TDETPFEP---------GVISNVV--ADTQNKISVTNIPHYLNDEQVTELLVSFGELKAF 418

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
            LVKD +T  S+G  FC Y DPA TDIA   LNG+++GDK L V+RA+            
Sbjct: 419 VLVKDSNTDESRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRASIG---------- 468

Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
                 +  +  + +  + M+ L G  S   E   +VL L   +T + L D+E+YEEI E
Sbjct: 469 ------NTQVSGLEMSVNAMSMLAGTTSQDLEN-GRVLQLLNMVTPEELIDNEDYEEICE 521

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D++EEC KYG ++ + +PRP     ++ GVGK+F+++        A  AL+GRKF   TV
Sbjct: 522 DVKEECEKYGKVLEMKVPRPTGGSRQSTGVGKIFVKFDTPDSAGKALKALAGRKFADRTV 581

Query: 505 NAFYYPEDKY 514
              Y+PE+ +
Sbjct: 582 VTTYFPEENF 591


>gi|347842431|emb|CCD57003.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 606

 Score =  207 bits (526), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 133/430 (30%), Positives = 211/430 (49%), Gaps = 60/430 (13%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 210 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMAQP 261

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +T  A     +R ++R+ V  +P    E+ I +FF+  +   G N     D +++  
Sbjct: 262 SGQVTNAALKPSNSRQSKRLLVHNIPADTKEETIVSFFNLQLN--GLNVIEGSDPLISAQ 319

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPTLA 264
           ++ +  FA +E +T  +A+ A+ALDGI  +G               + +RRP DY     
Sbjct: 320 VSKDGSFALLEFKTQSDATVALALDGITMDGNDHMETGNGSADTRGLSIRRPKDYIVPAV 379

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
               P +P         G+ S  +  A+  +++ V  +P+Y  + Q+ ELL SFG L  F
Sbjct: 380 TDETPFEP---------GVISNVV--ADTQNKISVTNIPHYLNDEQVTELLVSFGELKAF 428

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
            LVKD +T  S+G  FC Y DPA TDIA   LNG+++GDK L V+RA+            
Sbjct: 429 VLVKDSNTDESRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRASIG---------- 478

Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
                 +  +  + +  + M+ L G  S   E   +VL L   +T + L D+E+YEEI E
Sbjct: 479 ------NTQVSGLEMSVNAMSMLAGTTSQDLEN-GRVLQLLNMVTPEELIDNEDYEEICE 531

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D++EEC KYG ++ + +PRP     ++ GVGK+F+++        A  AL+GRKF   TV
Sbjct: 532 DVKEECEKYGKVLEMKVPRPTGGSRQSTGVGKIFVKFDTPDSAGKALKALAGRKFADRTV 591

Query: 505 NAFYYPEDKY 514
              Y+PE+ +
Sbjct: 592 VTTYFPEENF 601


>gi|18446992|gb|AAL68087.1| AT16577p [Drosophila melanogaster]
          Length = 449

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 132/362 (36%), Positives = 200/362 (55%), Gaps = 35/362 (9%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFAF 226
           TR ARR+YVG +P    ++ +  FF+  + A+G  +       G+AV+    N EK FAF
Sbjct: 107 TRQARRLYVGNIPFGVTDEEMMQFFNHQIMALGFEAKSSHYMDGNAVLTCQTNLEKNFAF 166

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-----SPNLNLA-- 279
           +E R+++EAS A+  DG++F G  +++RRP DY P  + ++   +       P +N+A  
Sbjct: 167 LEFRSIDEASQALNFDGMVFRGQTLKIRRPHDYQPVPSISVSAMESYRSFRVPAINVAQQ 226

Query: 280 -AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            AV L    I   + P++++VGGLP    + Q+KELL+SFG L G +LV D +T  +KG+
Sbjct: 227 PAVTLPVTTIV-PDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDTNTNLNKGF 285

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            F  Y DP+VTD A A L+G+ +GD+ L V+R+   G++             H+      
Sbjct: 286 AFFEYCDPSVTDHAIAGLHGMLLGDRRLVVQRSIPGGKNAFP---------GHLPT---V 333

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           LQ  G++TL    S       + LCL   +  + L DDEE+E+I  D+++EC K+G + +
Sbjct: 334 LQVPGISTLQDPGS-----PTETLCLLNMVRPEELLDDEEFEDIRTDIKQECAKFGEVRS 388

Query: 459 VVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           + IPRP    G+ P  G GKVF+++        A  ALSGRKF G  V   YY  +KY  
Sbjct: 389 IKIPRPI---GQFPKRGCGKVFVQFESVEDSQKALKALSGRKFSGRIVMTSYYDPEKYLA 445

Query: 517 KD 518
            D
Sbjct: 446 DD 447


>gi|310793965|gb|EFQ29426.1| U2 snRNP auxilliary factor [Glomerella graminicola M1.001]
          Length = 549

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 146/459 (31%), Positives = 221/459 (48%), Gaps = 63/459 (13%)

Query: 78  RRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA-----------AAMLP 125
           RR R RS +    + R  +  L+   S  + KRR + +D+ PP            + M P
Sbjct: 127 RRERQRSATPPPKK-REPTPDLTDVTSVLERKRRLTQWDIKPPGYENVTAEQAKLSGMFP 185

Query: 126 GAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPL 185
               P Q P  PS +    Q ++     Q+ +  L P        +R A+R+ +  LPP 
Sbjct: 186 LPGAPRQQPMDPSKL----QAIMNQPGGQVNSAALKPSN------SRQAKRLLINNLPPS 235

Query: 186 ANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGII 245
           A E+ I  FF+  +   G N     D   +  ++ +  FA VE R   EA+ A+ALDGI 
Sbjct: 236 ATEEGIQNFFNLQLN--GLNIIESTDPCTSCQVSKDHSFAVVEFRNASEATVALALDGIS 293

Query: 246 FE------GVA----VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
            E      G A    + +RRP DY         P +P         G+ S  +   +  +
Sbjct: 294 MEAEDATNGAAADQGLVIRRPKDYIVPAVVDDVPYEP---------GVVSNVV--VDTHN 342

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++ V  +P Y +E Q+ ELL SFG L  F LV+D+ T  S+G  FC Y DPA TD+A   
Sbjct: 343 KISVANMPVYLSEEQVSELLVSFGELKAFVLVRDKSTEESRGIAFCEYVDPAATDVAIQG 402

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
           LNG+ +GDK L V++A+                     +  + +  + M+ L G  S   
Sbjct: 403 LNGMDLGDKRLKVQKASVG----------------VTQVAGVEMGVAAMSMLAGTTSTDS 446

Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
           E   +VL L   +T + L D+++YEEI ED++EEC K+GT+++V IPRP     ++ GVG
Sbjct: 447 EE-TRVLQLLNMVTPEELMDNDDYEEIKEDVQEECAKFGTVLDVKIPRPVGGSRQSAGVG 505

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           K+F+++        A  AL+GRKF   TV   Y+PE+ +
Sbjct: 506 KIFVKFETKEAAKKALQALAGRKFADRTVVTTYFPEENF 544


>gi|24659166|ref|NP_611769.2| large subunit 2 [Drosophila melanogaster]
 gi|7291545|gb|AAF46969.1| large subunit 2 [Drosophila melanogaster]
 gi|201066057|gb|ACH92438.1| FI08027p [Drosophila melanogaster]
          Length = 449

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 132/362 (36%), Positives = 200/362 (55%), Gaps = 35/362 (9%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFAF 226
           TR ARR+YVG +P    ++ +  FF+  + A+G  +       G+AV+    N EK FAF
Sbjct: 107 TRQARRLYVGNIPFGVTDEEMMQFFNHQIMALGFEAKSSHYMDGNAVLTCQTNLEKNFAF 166

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-----SPNLNLA-- 279
           +E R+++EAS A+  DG++F G  +++RRP DY P  + ++   +       P +N+A  
Sbjct: 167 LEFRSIDEASQALNFDGMVFRGQTLKIRRPHDYQPVPSISVSAMESYRSFRVPAINVAQQ 226

Query: 280 -AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            AV L    I   + P++++VGGLP    + Q+KELL+SFG L G +LV D +T  +KG+
Sbjct: 227 PAVTLPVTTIV-PDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDTNTNLNKGF 285

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            F  Y DP+VTD A A L+G+ +GD+ L V+R+   G++             H+      
Sbjct: 286 AFFEYCDPSVTDHAIAGLHGMLLGDRRLVVQRSIPGGKNAFP---------GHLPT---V 333

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           LQ  G++TL    S       + LCL   +  + L DDEE+E+I  D+++EC K+G + +
Sbjct: 334 LQVPGISTLQDPGS-----PTETLCLLNMVRPEELLDDEEFEDIRTDIKQECAKFGEVRS 388

Query: 459 VVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           + IPRP    G+ P  G GKVF+++        A  ALSGRKF G  V   YY  +KY  
Sbjct: 389 IKIPRPI---GQFPKRGCGKVFVQFESVEDSQKALKALSGRKFSGRIVMTSYYDPEKYLA 445

Query: 517 KD 518
            D
Sbjct: 446 DD 447


>gi|402223467|gb|EJU03531.1| hypothetical protein DACRYDRAFT_77158 [Dacryopinax sp. DJM-731 SS1]
          Length = 392

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 124/363 (34%), Positives = 196/363 (53%), Gaps = 46/363 (12%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           + ARR+Y+G + P   E+ +  F  + +  +G    G       V ++HEK +A++E   
Sbjct: 54  KQARRLYIGDITPDTTEENLTAFLKKTLPELGIKVEGEDVGFEEVRVSHEKNYAYIEFSN 113

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV--GLASGAIG 289
            ++A+  M LDG +F G  +++RRP DY   L+A           +LA V  G+  G + 
Sbjct: 114 PDDATKTMELDGTVFLGQPLKIRRPHDY---LSAT----------DLAVVFGGIVPGVVS 160

Query: 290 GAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
               PD   ++FVGGLP Y  E Q+ ELL++FG L  F+LVKD  TG SKG+ F  Y DP
Sbjct: 161 -TNVPDSINKIFVGGLPTYLNEAQVMELLQTFGELRAFNLVKDGSTGVSKGFAFFEYMDP 219

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
            VTD+AC  LNG+++GD+ L V+RA+  G + T+    +      +   + A+     NT
Sbjct: 220 GVTDVACQGLNGMELGDRYLVVQRASI-GANPTKPN--MPNMPGTLPPPRPAILPVD-NT 275

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
                       + +L L   +TA+ L  D++YE+ILED+REE G++G ++++ IPRP +
Sbjct: 276 ---------NPPSPILLLLNMVTAEELLQDQDYEDILEDVREEMGRFGPVIDIKIPRPQR 326

Query: 467 N-----GGETP---------GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
                  G T          GVG+V++++ D    + A NA++GR+F G ++ A Y  ED
Sbjct: 327 RENRWIPGSTSEPVKTDIELGVGRVYVKFADTDSASQALNAVAGRQFSGRSIIATYLQED 386

Query: 513 KYF 515
            + 
Sbjct: 387 PFL 389


>gi|326428095|gb|EGD73665.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 2
           [Salpingoeca sp. ATCC 50818]
          Length = 415

 Score =  204 bits (519), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 128/395 (32%), Positives = 193/395 (48%), Gaps = 77/395 (19%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
           R++ V  LP    +  + +F +  + A       PG  V+   ++ + K A +E R+V+E
Sbjct: 36  RQLVVENLPEGTRDHDLMSFMNDCI-ASNKLITQPGQPVIKCTLSEDGKSAVLEFRSVDE 94

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA--- 291
           A+N +  D   F+G  +RVRRP +Y             +P  ++  + + SGA   A   
Sbjct: 95  ATNGLVFDRERFKGAQLRVRRPDNYE------------APKGHITRIPMQSGANVSAVVQ 142

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           + P ++++GG+P Y +E Q+KELLE FG L  F LV D  TG SKG+ FC Y DPA+TD 
Sbjct: 143 DSPYKIYIGGIPPYLSEEQVKELLEPFGQLKSFHLVMDTSTGQSKGFAFCEYMDPAITDT 202

Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM-ALQTSGMN----- 405
              ALN L++G+K L V+RA+   +           AQQ+  +  + A +++GM      
Sbjct: 203 MIGALNDLRIGEKRLLVQRASIGARGG---------AQQNNPMHSIPAFKSAGMQGPGGP 253

Query: 406 -------------------------------------TLGGGMSLFGETL-------AKV 421
                                                  G G+ L   T+        ++
Sbjct: 254 MMPLPPMSAPLKPAAPGAPITPGPISQPPPMSLPLPPPSGAGLPLPAPTMPTQQLVATRI 313

Query: 422 LCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE--TPGVGKVFL 479
           L L   +T D L DDEEYE+I  D+REEC K+G ++++ IPRP +   E   PGVGK++L
Sbjct: 314 LILMNMVTVDELRDDEEYEDICADIREECEKFGEILDMKIPRPSKENPEEQVPGVGKIYL 373

Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           +Y  A     A  ALSGR F   TV   ++PEDK+
Sbjct: 374 KYASADEARIAARALSGRSFAERTVVTSFWPEDKF 408


>gi|134113282|ref|XP_774666.1| hypothetical protein CNBF3460 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50257310|gb|EAL20019.1| hypothetical protein CNBF3460 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 652

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 131/406 (32%), Positives = 198/406 (48%), Gaps = 70/406 (17%)

Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
           P  VP  A   +P      GAFP            R   R+Y+GG+     EQ I  FF+
Sbjct: 246 PGRVPPPAHLGIP-ATFVAGAFP-------PSNPVRQNNRLYIGGIKEDMQEQQIQDFFN 297

Query: 197 QVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP 256
            +M    G + G  D V    IN+++ FAF+E+ T E+A+ A+ LDG++ +G ++RVRRP
Sbjct: 298 NLMKE-KGMADGKEDPVKQCQINNDRNFAFIELHTPEQATAALELDGVVLDGASLRVRRP 356

Query: 257 TDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
            DY   +P L    G   PS                 A+ P+++F+GG+P Y  + Q+ E
Sbjct: 357 KDYAGIDPLLQTFNGVVAPS----------------VADSPNKLFIGGIPTYLNDEQVME 400

Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           LL+SFG L  F+LVK+   G SKG+ F  Y DP VTD+A   L+   +GD+ L V+RA  
Sbjct: 401 LLKSFGELKSFNLVKE-SAGVSKGFAFAEYLDPEVTDMAIQGLHNFSLGDRNLVVQRAAV 459

Query: 374 SGQSKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
              +          + L+QA  H+ +Q  A   S                ++V+ L   +
Sbjct: 460 GRNTGVNAPIPGSAAYLSQAIPHL-MQNNADAPS----------------SRVMLLLNMV 502

Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-------------------DQNGG 469
           T + L +D++Y +I+ED+ EEC KYG +  V IPRP                   ++   
Sbjct: 503 TPEELYNDDDYNDIIEDINEECSKYGEIEGVRIPRPVPKSKKWESTEAAAATAERNKRTD 562

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           +  GVG+V++ Y D      A NA+ GR+F G T+     PE+++ 
Sbjct: 563 DEAGVGRVYVMYKDVESTKKAMNAIGGRQFAGRTILVANVPEEEFL 608


>gi|58268792|ref|XP_571552.1| rRNA primary transcript binding protein [Cryptococcus neoformans
           var. neoformans JEC21]
 gi|57227787|gb|AAW44245.1| rRNA primary transcript binding protein, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 651

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 131/406 (32%), Positives = 198/406 (48%), Gaps = 70/406 (17%)

Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
           P  VP  A   +P      GAFP            R   R+Y+GG+     EQ I  FF+
Sbjct: 245 PGRVPPPAHLGIP-ATFVAGAFP-------PSNPVRQNNRLYIGGIKEDMQEQQIQDFFN 296

Query: 197 QVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP 256
            +M    G + G  D V    IN+++ FAF+E+ T E+A+ A+ LDG++ +G ++RVRRP
Sbjct: 297 NLMKE-KGMADGKEDPVKQCQINNDRNFAFIELHTPEQATAALELDGVVLDGASLRVRRP 355

Query: 257 TDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
            DY   +P L    G   PS                 A+ P+++F+GG+P Y  + Q+ E
Sbjct: 356 KDYAGIDPLLQTFNGVVAPS----------------VADSPNKLFIGGIPTYLNDEQVME 399

Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           LL+SFG L  F+LVK+   G SKG+ F  Y DP VTD+A   L+   +GD+ L V+RA  
Sbjct: 400 LLKSFGELKSFNLVKE-SAGVSKGFAFAEYLDPEVTDMAIQGLHNFSLGDRNLVVQRAAV 458

Query: 374 SGQSKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
              +          + L+QA  H+ +Q  A   S                ++V+ L   +
Sbjct: 459 GRNTGVNAPIPGSAAYLSQAIPHL-MQNNADAPS----------------SRVMLLLNMV 501

Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-------------------DQNGG 469
           T + L +D++Y +I+ED+ EEC KYG +  V IPRP                   ++   
Sbjct: 502 TPEELYNDDDYNDIIEDINEECSKYGEIEGVRIPRPVPKSKKWESTEAAAATAERNKRTD 561

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           +  GVG+V++ Y D      A NA+ GR+F G T+     PE+++ 
Sbjct: 562 DEAGVGRVYVMYKDVESTKKAMNAIGGRQFAGRTILVANVPEEEFL 607


>gi|402906865|ref|XP_003916203.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Papio anubis]
          Length = 446

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 149/426 (34%), Positives = 209/426 (49%), Gaps = 67/426 (15%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLP-----GAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           RSP    K K R  +D+ PP    +           GQ+P   + +P M  + L    T 
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 135

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
                  PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+
Sbjct: 136 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 187

Query: 215 NVYINHEKKFAFVEMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPS 273
            V IN +K FAF+E+     +   + AL  + F    V    P                 
Sbjct: 188 AVQINQDKNFAFLEVSWGNRSGPVLCALAMLTFPEWVVSTVVP----------------- 230

Query: 274 PNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
                             +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG
Sbjct: 231 ------------------DSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATG 272

Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
            SKGY FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + 
Sbjct: 273 LSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVT 328

Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           +Q   L +S +  +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KY
Sbjct: 329 LQVPGLMSSQVQ-MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKY 380

Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
           G + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V   Y   D 
Sbjct: 381 GLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDS 439

Query: 514 YFNKDY 519
           Y  +D+
Sbjct: 440 YHRRDF 445


>gi|405121398|gb|AFR96167.1| rRNA primary transcript binding protein [Cryptococcus neoformans
           var. grubii H99]
          Length = 655

 Score =  202 bits (513), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 130/406 (32%), Positives = 198/406 (48%), Gaps = 70/406 (17%)

Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
           P  VP  A   +P      GAFP            R   R+Y+GG+     EQ I  FF+
Sbjct: 249 PGRVPPPAHLGIP-ATFVAGAFP-------PSNPVRQNNRLYIGGIKEDMQEQQIQDFFN 300

Query: 197 QVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP 256
            +M    G + G  D V    IN+++ FAF+E+ T E+A+ A+ LDG++ +G ++RVRRP
Sbjct: 301 NLMKE-KGMADGKEDPVKQCQINNDRNFAFIELHTPEQATAALELDGVVLDGASLRVRRP 359

Query: 257 TDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKE 313
            DY   +P L    G   PS                 A+ P+++F+GG+P Y  + Q+ E
Sbjct: 360 KDYAGIDPLLQTFNGVVAPS----------------VADSPNKLFIGGIPTYLNDEQVME 403

Query: 314 LLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           LL+SFG L  F+LVK+   G SKG+ F  Y DP VTD+A   L+   +GD+ L V+RA  
Sbjct: 404 LLKSFGELKSFNLVKE-SAGVSKGFAFAEYLDPEVTDMAIQGLHNFALGDRNLVVQRAAV 462

Query: 374 SGQSKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
              +          + L+QA  H+ +Q  A   S                ++V+ L   +
Sbjct: 463 GRNTGVNAPIPGSAAYLSQAIPHL-MQNNADAPS----------------SRVMLLLNMV 505

Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-------------------DQNGG 469
           T + L +D++Y +I+ED+ +EC KYG +  V IPRP                   ++   
Sbjct: 506 TPEELYNDDDYNDIIEDINDECSKYGEIEGVRIPRPVPKSKKWESTEAAAATAERNKRTD 565

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           +  GVG+V++ Y D      A NA+ GR+F G T+     PE+++ 
Sbjct: 566 DEAGVGRVYVMYKDVESTKKAMNAIGGRQFAGRTILVANVPEEEFL 611


>gi|156061663|ref|XP_001596754.1| hypothetical protein SS1G_02977 [Sclerotinia sclerotiorum 1980]
 gi|154700378|gb|EDO00117.1| hypothetical protein SS1G_02977 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 518

 Score =  201 bits (511), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 132/428 (30%), Positives = 206/428 (48%), Gaps = 60/428 (14%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 125 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMAQP 176

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +T  A     +R ++R+ V  +P    E+ +  FF+  +   G N     D  ++  
Sbjct: 177 SGSVTNAALKPSNSRQSKRLLVHNIPADTKEETLVGFFNLQLN--GLNVIEGSDPCISAQ 234

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-------------VAVRVRRPTDYNPTLA 264
           ++ +  FA +E +T  +A+ A+ALDGI  E                + +RRP DY     
Sbjct: 235 VSKDGSFALLEFKTQSDATVALALDGITMENNDHMVTGNGSADTQGLSIRRPKDYIVPAV 294

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
               P +P         G+ S  +   +  +++ V  +P+Y  + Q+ ELL SFG L  F
Sbjct: 295 TDETPFEP---------GVVSNIVPDTQ--NKISVANIPHYLNDEQVTELLVSFGELKAF 343

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
            LVKD +T  S+G  FC Y DPA TDIA   LNG+++GDK L V+RA+            
Sbjct: 344 VLVKDSNTDESRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRASIG---------- 393

Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
                 +  +  + +  + M+ L G  S   E   +VL L   +T + L D+E+YEEI E
Sbjct: 394 ------NTQVSGLEMSVNAMSMLAGTTSQDLEN-GRVLQLLNMVTPEELIDNEDYEEICE 446

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D++EEC KYG ++ + +PRP     ++ GVGK+F+++        A  AL+GRKF   TV
Sbjct: 447 DVKEECEKYGKVLEMKVPRPTGGSRQSTGVGKIFVKFDTPDSAGKALKALAGRKFADRTV 506

Query: 505 NAFYYPED 512
              Y+PED
Sbjct: 507 VTTYFPED 514


>gi|321260434|ref|XP_003194937.1| splicing factor (U2 snRNP auxiliary factor large subunit)
           [Cryptococcus gattii WM276]
 gi|317461409|gb|ADV23150.1| Splicing factor (U2 snRNP auxiliary factor large subunit), putative
           [Cryptococcus gattii WM276]
          Length = 654

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 198/403 (49%), Gaps = 64/403 (15%)

Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFS 196
           P  VP  A   +P      GAFP            R   R+Y+GG+     E  I  FF+
Sbjct: 248 PGRVPPPAHLGIP-ATFVAGAFP-------PSNPVRQNNRLYIGGIKEDMQENQIQDFFN 299

Query: 197 QVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP 256
            +M    G + G  D V    IN+++ FAF+E+ T E+A+ A+ LDG++ +G ++RVRRP
Sbjct: 300 NLMKE-KGMADGKEDPVKQCQINNDRNFAFIELHTPEQATAALELDGVVLDGASLRVRRP 358

Query: 257 TDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLE 316
            DY     A + P   + N      G+ + ++  A+ P+++F+GG+P Y  + Q+ ELL+
Sbjct: 359 KDY-----AGIDPLLQTFN------GIVAPSV--ADSPNKLFIGGIPTYLNDEQVMELLK 405

Query: 317 SFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ 376
           SFG L  F+LVK+   G SKG+ F  Y DP VTD+A   L+   +GD+ L V+RA     
Sbjct: 406 SFGELKSFNLVKE-SAGVSKGFAFAEYLDPEVTDMAIQGLHNFALGDRNLVVQRAAVGRN 464

Query: 377 SKTE-----QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITAD 431
           +          + L+QA  H+        TS                 +V+ L   +T +
Sbjct: 465 TGVNAPIPGSAAYLSQAIPHLMQNNADAPTS-----------------RVMLLLNMVTPE 507

Query: 432 ALADDEEYEEILEDMREECGKYGTLVNVVIPRP-------------------DQNGGETP 472
            L +D++Y +I+ED+ +EC KYG +  V IPRP                   ++   +  
Sbjct: 508 ELYNDDDYNDIIEDINDECSKYGEIEGVRIPRPVPKSKKWESTEAAAATAERNKRTDDEA 567

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           GVG+V++ Y D      A +A+ GR+F G T+     PE+++ 
Sbjct: 568 GVGRVYVMYKDVESTKKAMDAIGGRQFAGRTILVANVPEEEFL 610


>gi|353240191|emb|CCA72072.1| related to pre-mRNA splicing factor U2AF large chain
           [Piriformospora indica DSM 11827]
          Length = 403

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 140/396 (35%), Positives = 198/396 (50%), Gaps = 76/396 (19%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMT--AIGGNSAGPGDAVVNV 216
           P+    V      R +RR+Y+G + PLA+E++IA FF+  M    +  +SA P   V+ V
Sbjct: 31  PVATGMVSNSNLARQSRRLYLGSITPLADEESIALFFNSQMRERKLTTSSAPP---VLAV 87

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
            +N EK +AFVE R+ E+A+  MALDG +F    +RVRRP DY        GP       
Sbjct: 88  QVNREKNYAFVEFRSAEDATAGMALDGTVFLDGPLRVRRPHDYA-------GPE------ 134

Query: 277 NLAAVGLASGA-IGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
             A  G+A  A +  A  PD   ++F+G LP + TE QI ELL+SFG L  F+LV++  T
Sbjct: 135 --AMTGVAGFATLLPATMPDSVNKIFIGNLPTHLTEDQIVELLKSFGELKAFNLVREHGT 192

Query: 333 GNSK----------------GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ 376
             SK                G+ F  Y DPAVTDIA  +LNG+ +GDK L V+RA     
Sbjct: 193 NVSKVFTVRITLSMNLTGSQGFAFVEYADPAVTDIATESLNGMDLGDKKLVVQRA----- 247

Query: 377 SKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADD 436
                 S+ A+    I  + M +    +      + L  E   +V+ +   + A+ L DD
Sbjct: 248 ------SVGAKGGVPIPPEAMDIPAPIV-----AVDLNKEANGRVVLMLNMVVAEDLMDD 296

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRP-----------DQN---------GGETPGVGK 476
            EYEEIL+D+R EC  +G ++ V +PRP           D N           E  GVG+
Sbjct: 297 VEYEEILDDIRSECSGFGQVLGVYVPRPLKKDKSRWDTMDPNTMSPAEIAKADEIAGVGR 356

Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
           V++++ D  G  TA   L+GR F G T+      ED
Sbjct: 357 VYVKFSDTEGATTAVRQLAGRSFAGRTIITTLMRED 392


>gi|110741990|dbj|BAE98934.1| splicing factor like protein [Arabidopsis thaliana]
          Length = 341

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 105/149 (70%), Positives = 118/149 (79%), Gaps = 4/149 (2%)

Query: 109 KRRSGFDMAPPAAAMLPGAA-VPGQLPGVPSAVPE--MAQNMLPFGATQ-LGAFPLMPVQ 164
           +R SGFDMAPPA+AML   A V GQ+P  P  +P   M  NM P    Q  G   +MP+Q
Sbjct: 169 QRVSGFDMAPPASAMLAAGAAVTGQVPPAPPTLPGAGMFPNMFPLPTGQSFGGLSMMPIQ 228

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
            MTQQATRHARRVYVGGL P ANEQ++ATFFSQVM A+GGN+AGPGDAVVNVYINHEKKF
Sbjct: 229 AMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINHEKKF 288

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRV 253
           AFVEMR+VEEASNAM+LDGIIFEG  V+V
Sbjct: 289 AFVEMRSVEEASNAMSLDGIIFEGAPVKV 317


>gi|440792998|gb|ELR14199.1| U2 snRNP auxilliary factor, large subunit, splicing factor
           subfamily protein [Acanthamoeba castellanii str. Neff]
          Length = 462

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 126/350 (36%), Positives = 180/350 (51%), Gaps = 62/350 (17%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           R ARR+YVG +PP                   G S      VV+  +N +K F+F+E  T
Sbjct: 172 RQARRLYVGSIPP-------------------GVSDPDHRPVVSSQLNPDKSFSFIEFST 212

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           ++EA+  MALDGI   G+ ++VRRP DY +P  A A   G       +   G+ S  +  
Sbjct: 213 IDEATAGMALDGITMNGMTLKVRRPKDYVSPPTAQAPASG------GIHIPGIVSTNV-- 264

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
            + P+++F+GGLP Y  E Q+KELL +FG L  F+LVKD  TGNSKGY F  Y D +VTD
Sbjct: 265 PDSPNKIFIGGLPSYLNEAQVKELLTAFGPLKAFNLVKDTATGNSKGYAFFEYLDASVTD 324

Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
            AC  LNG+K+GDKTL V+RA    +       I+           M +  SGM  L   
Sbjct: 325 RACQGLNGMKLGDKTLLVQRANIGAKQDGTGGLIM-----------MPMDPSGM--LNAS 371

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
            S       ++      +  + L  DE++ +I+ED+R+EC KYG +++            
Sbjct: 372 PSAASLLNLQL---LNLVRPEELVSDEDHADIVEDVRQECEKYGNVMS------------ 416

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
                 VF+E+ D    A A++ L+GRKF G+ V   Y+ ED Y  +D+S
Sbjct: 417 ------VFVEFADREDAARAQSNLAGRKFNGHVVLTSYFDEDDYEQRDFS 460


>gi|406867695|gb|EKD20733.1| U2 snRNP auxilliary factor [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 613

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 137/443 (30%), Positives = 209/443 (47%), Gaps = 76/443 (17%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 201 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMSQP 252

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +T  A     +R A+R+ V  LPP  NE+AI  FF+  +   G N     D  V+  
Sbjct: 253 SGSVTNAALKPSNSRQAKRLLVSNLPPTVNEEAIVNFFNLQLN--GLNVIEGSDPCVSAQ 310

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE-----GVA-------VRVRRPTDYNPTLAA 265
           I+ +K FA +E +   +A+ A+ALDGI  E     G A       + +RRP DY   +  
Sbjct: 311 ISKDKTFALLEFKQTSDATIALALDGITMEDEHMNGSATNGDTQGLSIRRPKDY---IVP 367

Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
            +    P         G+ S  +   +  +++ +  +P Y T+ Q+ ELL SFG L  F 
Sbjct: 368 TITDETP------FEAGVISNVV--VDTQNKISISNIPLYLTDEQVTELLVSFGELKAFV 419

Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
           LVKD  T  S+G  FC Y DP  TDIA   LNG+++GDK L V+RA+             
Sbjct: 420 LVKDNGTDESRGIAFCEYVDPVATDIAVEGLNGMELGDKHLKVQRASIG----------- 468

Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE----- 440
                H  +  + +  + M+ L G  S  G    +VL L   +T + L D+E+YE     
Sbjct: 469 -----HTQVSGLEMSVNAMSMLAGTTSQ-GLEEGRVLQLLNMVTPEELIDNEDYEGTILL 522

Query: 441 ------------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA 488
                       EI ED++EEC KYG ++++ +PRP     ++ GVGK+++++ ++    
Sbjct: 523 TTLIQSLLTMYPEICEDVKEECEKYGKVLDMKVPRPTGGSRQSNGVGKIYVKFDNSDSAG 582

Query: 489 TAKNALSGRKFGGNTVNAFYYPE 511
            A   L+GRKF   TV   Y+ E
Sbjct: 583 KALRTLAGRKFADRTVVVTYFSE 605


>gi|443924699|gb|ELU43686.1| rRNA primary transcript binding protein [Rhizoctonia solani AG-1
           IA]
          Length = 678

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 134/382 (35%), Positives = 198/382 (51%), Gaps = 70/382 (18%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           R +RR+YVG +   ANE  +  FF++ M  +   + G GD V+ V INHEK +AFVE R+
Sbjct: 233 RQSRRLYVGNITYEANENNLQEFFNRKMVEMKIGTGGGGDPVIGVQINHEKSYAFVEFRS 292

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPT-LAAALG---PGQPSPNLNLAAVGLASGA 287
            E+A+ AMA DGI+F+   +++RRP DY  + L+A +G   PG  S N+           
Sbjct: 293 AEDATAAMAFDGIMFQSGPLKIRRPKDYTGSDLSAPMGVHVPGVVSTNV----------- 341

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC-VYQDP 346
               +  +++FVGGLP Y  E Q+ ELL+SFG L  F+LV  R+ GN      C VY DP
Sbjct: 342 ---PDSINKIFVGGLPTYLDENQVMELLKSFGELKAFNLV--RENGNGAFRRDCQVYVDP 396

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTE--QESILAQAQQHIAIQKMAL-QTSG 403
           +VTDIA   LNG+++GD+ L V+RA+   +S        + A A   I    M + +T+ 
Sbjct: 397 SVTDIAIQGLNGMELGDRFLVVQRASVGAKSGIPGVPPELFAPA---IPRPIMPITETAD 453

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
            N           + + +L L   +  + L DD EY EI+ED+REEC K+G + ++ IPR
Sbjct: 454 PN----------PSDSTILLLLNMVAPEDLTDDGEYTEIVEDVREECSKFGPVRSLAIPR 503

Query: 464 P--------DQNGG-------------------------ETPGVGKVFLEYYDAVGCATA 490
           P        D N G                         E  GVG+V++++    G ++A
Sbjct: 504 PAKKEKSKWDANAGALVTAGGAVLAPGAVGATAGDGKTDEQRGVGRVYIKFETPEGASSA 563

Query: 491 KNALSGRKFGGNTVNAFYYPED 512
              L+GR F G ++ A    +D
Sbjct: 564 LRGLAGRAFAGRSIIATILADD 585


>gi|297706019|ref|XP_002829852.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Pongo abelii]
          Length = 352

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 124/289 (42%), Positives = 177/289 (61%), Gaps = 23/289 (7%)

Query: 188 EQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE 247
           ++A+  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA DGIIF+
Sbjct: 74  KEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQ 132

Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFT 307
           G ++++RRP DY P       PG  S N ++   G+ S  +   +   ++F+GGLP Y  
Sbjct: 133 GQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLN 183

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLT 367
           + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  VTD A A LNG+++GDK L 
Sbjct: 184 DDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLL 243

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
           V+RA+   ++ T    +    Q  + +Q   L +S +  +GG  +       +VLCL   
Sbjct: 244 VQRASVGAKNAT----LSTINQTPVTLQVPGLMSSQVQ-MGGHPT-------EVLCLMNM 291

Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
           +  + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK
Sbjct: 292 VLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGK 339


>gi|295663747|ref|XP_002792426.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226279096|gb|EEH34662.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 567

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 136/465 (29%), Positives = 227/465 (48%), Gaps = 85/465 (18%)

Query: 93  RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGV 136
           R+R +S SP    R P+           + +R + +D+ PP    +    A + G  P +
Sbjct: 140 RDRKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFP-L 198

Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPV------QVMTQQATRHARRVYVGGLPPLANEQA 190
           P A  + A +      ++L AF   PV       V+    +R A+R++V  +PP A E++
Sbjct: 199 PGAPRQQAVD-----PSRLQAFMNQPVPGISTNTVLRPSNSRQAKRLFVHNIPPSATEES 253

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+  +   G N     D  V+  ++ ++ F  +E ++  +A+ A+A DGI  E   
Sbjct: 254 LVQFFNLQLN--GLNVIKGVDPCVSAQLSKDRSFGLLEFKSSADATVALAFDGITMEDTG 311

Query: 251 ---------------VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
                          + +RRP DY       +G G+  P+      G+ S  +   + P+
Sbjct: 312 DMDTSNGEANGSNQGLSLRRPKDY----ILPVG-GEEEPHRE----GVVSNVV--PDTPN 360

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++ V  +P +  E  +  LL SFG L  F LVKD +TG S+G  FC Y DP  TDIA   
Sbjct: 361 KICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYLDPMSTDIAVEN 420

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
           LNG+++G+K L V RA+                          +Q +G++     MS++ 
Sbjct: 421 LNGMELGNKRLKVVRASIG-----------------------TIQATGLDMGINAMSMYA 457

Query: 416 ETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
           +T +      +VL L   +T D L D+++YEEI +D+R+EC KYG +V + +PRP  N  
Sbjct: 458 KTTSQDIEAGRVLQLLNMVTTDELLDNDDYEEICDDVRDECSKYGQVVELKVPRPTGNNK 517

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++ GVGK+++++ ++   + A  AL+GRKF   TV   Y+ E+ +
Sbjct: 518 QSAGVGKIYVKFDNSESASKALKALAGRKFQDRTVVTTYFSEENF 562


>gi|409081516|gb|EKM81875.1| hypothetical protein AGABI1DRAFT_70425 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 375

 Score =  197 bits (502), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 127/371 (34%), Positives = 196/371 (52%), Gaps = 39/371 (10%)

Query: 161 MPVQVM------TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
           MPVQ            +R +RR+Y+G +    NEQ +A FF+  M  +   +   G+ V+
Sbjct: 1   MPVQSFGMGIGGNPNLSRQSRRLYIGSITQEVNEQNLADFFNAKMAEMNIGTGITGNPVL 60

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
            V  N+EK +AFVE R+ E+A+ AMA DGIIF    +++RRP DY       + PG   P
Sbjct: 61  AVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDYGGD--TIVSPGVHVP 118

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
                  G+ S  +   +  ++VFVGGLP Y  E Q+ ELL+SFG L  F+LV++  TG 
Sbjct: 119 -------GVVSTNV--PDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFNLVRENGTGT 169

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           SKG+ F  Y D AVTD+A  +LNG+++GD+ L V+RA+   +  T          Q   I
Sbjct: 170 SKGFAFFEYVDQAVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPNLPYDQFPEI 229

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
            +  +      T          + +++L +   +T + L +D+EY ++ +D++ EC KYG
Sbjct: 230 PRPIMPAGKDQT---------SSESRILLMLNMVTPEDLHEDDEYGDLYDDVKAECSKYG 280

Query: 455 TLVNVVIPRP-------------DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
            L ++ IPRP              Q   E  GVG+V+++Y ++   + A NAL+GR F G
Sbjct: 281 ELEDLRIPRPVKKDKTSSMSAQDAQRIDEAAGVGRVYVKYTNSKSASAALNALAGRSFAG 340

Query: 502 NTVNAFYYPED 512
            ++ A    +D
Sbjct: 341 RSIIATLLSDD 351


>gi|225677913|gb|EEH16197.1| splicing factor U2AF 65 kDa subunit ) [Paracoccidioides
           brasiliensis Pb03]
 gi|226287345|gb|EEH42858.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides brasiliensis
           Pb18]
          Length = 567

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 136/465 (29%), Positives = 227/465 (48%), Gaps = 85/465 (18%)

Query: 93  RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGV 136
           R+R +S SP    R P+           + +R + +D+ PP    +    A + G  P +
Sbjct: 140 RDRKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFP-L 198

Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPV------QVMTQQATRHARRVYVGGLPPLANEQA 190
           P A  + A +      ++L AF   PV       V+    +R A+R++V  +PP A E++
Sbjct: 199 PGAPRQQAVD-----PSRLQAFMNQPVPGTSINTVLRPSNSRQAKRLFVHNIPPSATEES 253

Query: 191 IATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA 250
           +  FF+  +   G N     D  V+  ++ ++ F  +E ++  +A+ A+A DGI  E   
Sbjct: 254 LVQFFNLQLN--GLNVIKGVDPCVSAQLSKDRSFGLLEFKSSADATVALAFDGITMEDTG 311

Query: 251 ---------------VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
                          + +RRP DY       +G G+  P+      G+ S  +   + P+
Sbjct: 312 DMDTSNGEANGSNQGLSLRRPKDY----ILPVG-GEEEPHRE----GVVSNVV--PDTPN 360

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++ V  +P +  E  +  LL SFG L  F LVKD +TG S+G  FC Y DP  TDIA   
Sbjct: 361 KICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYLDPMSTDIAVEN 420

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
           LNG+++G+K L V RA+                          +Q +G++     MS++ 
Sbjct: 421 LNGMELGNKRLRVVRASIG-----------------------TIQATGLDMGINAMSMYA 457

Query: 416 ETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
           +T +      +VL L   +T D L D+++YEEI +D+R+EC KYG +V + +PRP  N  
Sbjct: 458 KTTSQDIEAGRVLQLLNMVTTDELLDNDDYEEICDDVRDECSKYGQVVELKVPRPTGNNK 517

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++ GVGK+++++ ++   + A  AL+GRKF   TV   Y+ E+ +
Sbjct: 518 QSAGVGKIYVKFDNSESASKALKALAGRKFQDRTVVTTYFSEENF 562


>gi|240279650|gb|EER43155.1| splicing factor u2af large subunit [Ajellomyces capsulatus H143]
 gi|325092783|gb|EGC46093.1| splicing factor u2af large subunit [Ajellomyces capsulatus H88]
          Length = 572

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 137/464 (29%), Positives = 221/464 (47%), Gaps = 78/464 (16%)

Query: 93  RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LP 134
           R R +S SP    R P+           + +R + +D+ PP    +    A + G   LP
Sbjct: 140 RERKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFPLP 199

Query: 135 GVP---SAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
           G P   +  P   Q  +    T   A       V+    +R A+R++V  LP  A E+++
Sbjct: 200 GAPRQQAVDPSRLQAFIHPPTTTSTAPGTSTNTVLKPSNSRQAKRLFVHNLPSSATEESL 259

Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE---- 247
             FF+  +   G N     D  V   ++++K FA +E R   + + A+A DGI  E    
Sbjct: 260 VQFFNLQLN--GLNVIKGVDPCVTAQLSNDKTFALLEFRNAADTTVALAFDGITMEDNDE 317

Query: 248 -----------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDR 296
                         + +RRP DY   L +A+  G+P+        G+ S  +   + P++
Sbjct: 318 MDTTNGDSNGSNQGLSIRRPKDY--ILPSAVE-GEPNQE------GVVSNVV--PDSPNK 366

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           + V  +P +  E  +  LL SFG L  F LVKD +TG S+G  FC Y+DP  TDIA   L
Sbjct: 367 ICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYRDPMSTDIAVENL 426

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGE 416
           NG+++G+K L V RA+                           Q +G++     MS++ +
Sbjct: 427 NGMELGNKKLKVVRASIG-----------------------TTQAAGLDMGVNAMSMYAK 463

Query: 417 TLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
           T +      +VL L   +T + L D+++YEEI +D+R+EC KYG +V + +PRP  N  +
Sbjct: 464 TTSQDIEASRVLQLLNMVTTEELIDNDDYEEICDDVRDECSKYGEVVELKVPRPTGNNKQ 523

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           + GVGK+++++ ++     A  AL+GRKF   TV   Y+ E+ +
Sbjct: 524 SAGVGKIYVKFDNSESATKALRALAGRKFQDRTVVTTYFSEENF 567


>gi|388580158|gb|EIM20475.1| hypothetical protein WALSEDRAFT_55089 [Wallemia sebi CBS 633.66]
          Length = 391

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 125/382 (32%), Positives = 193/382 (50%), Gaps = 61/382 (15%)

Query: 157 AFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
           +F  +P  +      R +RRV++G L P  NE+ +   F+  M+ I   +  PG+ VVNV
Sbjct: 15  SFSNLPPTLEDFNGARASRRVFIGDLKPNHNEENLTKLFNDKMSTIDQVAKIPGEPVVNV 74

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
            + H++ +A++E R  +EA+ A+  DG IF+G  ++++RP +    L    G        
Sbjct: 75  TVKHDRGYAYIEFRNTDEAAYALQFDGTIFQGEGIQIKRPQEVLDELQRKQG-------- 126

Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
                   SG +  ++   ++FVG LP +  + Q+ ELL SFG L  F+LVK+  +  SK
Sbjct: 127 -----HTVSGTVPDSD--QKIFVGSLPTFLNDEQVMELLGSFGELRSFNLVKEGTSDVSK 179

Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQK 396
           G+ FC Y DPA+TDIA   LNG+++GD+ L V+R++     K                  
Sbjct: 180 GFAFCEYMDPALTDIAIQGLNGMEVGDRKLVVQRSSTGPMGK------------------ 221

Query: 397 MALQTSGMNTLGGGMSLFGETLA---KVLCLTEAITADALADDEEYEEILEDMREECGKY 453
             +   G +++   + L  ET A    VL L   +TA+ L DD +Y+EI ED++EEC +Y
Sbjct: 222 --IGVGGTSSIAQILPLASETQAYRTNVLLLLNMVTAEELKDDLDYQEICEDIQEECSQY 279

Query: 454 GTLVNVVIPRPD----------------------QNGGETPGVGKVFLEYYDAVGCATAK 491
           G ++ + IPRP                       +   E  GVGK+F+ Y      + A 
Sbjct: 280 GEIIKIKIPRPPRPDDPVFSTPGVTLSSGEDLRFEAASEELGVGKIFILYKTEEQASKAL 339

Query: 492 NALSGRKFGGNTV-NAFYYPED 512
            AL+GR FGG TV  A+  PED
Sbjct: 340 KALAGRVFGGRTVIGAYGKPED 361


>gi|154280004|ref|XP_001540815.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150412758|gb|EDN08145.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 571

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 139/466 (29%), Positives = 222/466 (47%), Gaps = 83/466 (17%)

Query: 93  RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LP 134
           R R +S SP    R P+           + +R + +D+ PP    +    A + G   LP
Sbjct: 140 RERKRSASPPPKKREPTPDLTDVVPVLERKRRLTQWDIKPPGYEHVTAEQAKLSGMFPLP 199

Query: 135 GVP---SAVPEMAQNML--PFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQ 189
           G P   +  P   Q  +  P  +T  G        V+    +R A+R++V  LP  A E+
Sbjct: 200 GAPRQQAVDPSRLQAFIHPPTTSTAPGT---STNTVLKPSNSRQAKRLFVHNLPSSATEE 256

Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE-- 247
           ++  FF+  +   G N     D  V   ++++K FA VE R   + + A+A DGI  E  
Sbjct: 257 SLVQFFNLQLN--GLNVIKGVDPCVTAQLSNDKTFALVEFRNAADTTVALAFDGITMEDN 314

Query: 248 -------------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
                           + +RRP DY   L +A+  G+P         G+ S  +   + P
Sbjct: 315 DEMDTTNGNSNGSNQGLSIRRPKDY--ILPSAVE-GEPHQE------GVVSNVV--PDSP 363

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
           +++ V  +P +  E  +  LL SFG L  F LVKD +TG S+G  FC Y+DP  TDIA  
Sbjct: 364 NKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYRDPMSTDIAVE 423

Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
            LNG+++G+K L V RA+                           Q +G++     MS++
Sbjct: 424 NLNGMELGNKKLKVVRASIG-----------------------TTQAAGLDMGVNAMSMY 460

Query: 415 GETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
            +T +      +VL L   +T + L D+++YEEI +D+R+EC KYG +V + +PRP  N 
Sbjct: 461 AKTTSQDIEASRVLQLLNMVTTEELIDNDDYEEICDDVRDECSKYGEVVELKVPRPTGNN 520

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            ++ GVGK+++++ ++     A  AL+GRKF   TV   Y+ E+ +
Sbjct: 521 KQSAGVGKIYVKFDNSESATKALRALAGRKFQDRTVVTTYFSEENF 566


>gi|367020820|ref|XP_003659695.1| hypothetical protein MYCTH_2297048 [Myceliophthora thermophila ATCC
           42464]
 gi|347006962|gb|AEO54450.1| hypothetical protein MYCTH_2297048 [Myceliophthora thermophila ATCC
           42464]
          Length = 567

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 132/430 (30%), Positives = 213/430 (49%), Gaps = 59/430 (13%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   T+L AF   P
Sbjct: 170 RKRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMNHP 221

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +   A     +R ++R+ V  LPP A E+++  FF+  +   G N     D  +  +
Sbjct: 222 GGAVNSAALKPTNSRQSKRLIVSNLPPSATEESLVNFFNLQLN--GLNVIETADPCLQAH 279

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-------------VAVRVRRPTDYNPTLA 264
           I  ++ FA +E R   +A+ A+ALDGI  E                + +RRP DY   + 
Sbjct: 280 IAPDRSFAMLEFRHNTDATVALALDGITMEAEDADAANGNGAATQGLHLRRPKDY--IVP 337

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
           A +      PN +  +   +S  +   + P+++ V  LP Y T+ Q+ ELL SFG L  F
Sbjct: 338 AVVE----DPNYDPDSDTPSSVVL---DSPNKISVTNLPLYLTDDQVMELLVSFGKLKSF 390

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
            LVKD  T  S+G  F  Y DP+ T++A   LN + +G++ L V++A+            
Sbjct: 391 VLVKDNGTQESRGIAFLEYADPSATNVAVQGLNNMMLGERALKVQKASIG---------- 440

Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
           + Q    + +  M++    M+   GG        ++VL L   +T + L D+++YEEI E
Sbjct: 441 ITQVSGEMGVNAMSMLAGTMSADAGG--------SRVLQLLNMVTPEELMDNDDYEEIRE 492

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D++EEC K+G ++++ IPRP     ++ GVGK++++Y  A     A  AL+GRKF   TV
Sbjct: 493 DVQEECQKFGKILSLKIPRPVGGSRQSAGVGKIYIKYETAESATKALRALAGRKFADRTV 552

Query: 505 NAFYYPEDKY 514
              Y+PE+ +
Sbjct: 553 VTTYFPEENF 562


>gi|195585954|ref|XP_002082743.1| GD25073 [Drosophila simulans]
 gi|194194752|gb|EDX08328.1| GD25073 [Drosophila simulans]
          Length = 445

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 125/367 (34%), Positives = 192/367 (52%), Gaps = 45/367 (12%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFAF 226
           TR ARR+YVG +P    E+ +  FF+  +TA+G  +       G AV+    N EK FAF
Sbjct: 103 TRQARRLYVGNIPFGVTEEEMMQFFNHRITALGYEAKSSHYMDGKAVLTCQTNLEKNFAF 162

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP-------------S 273
           +E R+++EA+ A+  DG++F G  +++RRP DY P  + +    +               
Sbjct: 163 LEFRSIDEATQALNFDGMVFRGQTLKIRRPHDYQPVPSISFSAMENYRSFRVPDTTIANP 222

Query: 274 PNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
           PN+ +    +        + P++++VGGLP    + Q+KELL+SFG L G +LV D +T 
Sbjct: 223 PNVTIPVTTIV------PDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDGNTS 276

Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
            +KG+ F  Y DP VTD A A L+G+ +GD+ L V+R+   G++     +          
Sbjct: 277 LNKGFAFFEYYDPLVTDHAIAGLHGMLLGDRRLVVQRSIPGGKNAFPGHT---------- 326

Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
                +Q  G++TL     L   +  + LCL   +  + L DDEE+E+I  D+++EC K+
Sbjct: 327 --APVVQVPGISTL-----LDPGSPTETLCLLNMVRPEELLDDEEFEDIRTDIKQECAKF 379

Query: 454 GTLVNVVIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           G + ++ IPRP    G  P  G GKVF+++        A  ALSGRKF    V   YY  
Sbjct: 380 GEVRSIKIPRPI---GPFPKRGCGKVFVQFESVEDSQKALKALSGRKFSDRIVMTSYYDP 436

Query: 512 DKYFNKD 518
           +KY   D
Sbjct: 437 EKYLADD 443


>gi|70946422|ref|XP_742927.1| U2 snRNP auxiliary factor [Plasmodium chabaudi chabaudi]
 gi|56522174|emb|CAH84932.1| U2 snRNP auxiliary factor, putative [Plasmodium chabaudi chabaudi]
          Length = 561

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 141/490 (28%), Positives = 229/490 (46%), Gaps = 53/490 (10%)

Query: 64  DRTDRHRDYNRDKERRHRHRSR---SHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
           D +D++   +  +ER+   R +   S S +  +  +K + P R  SK       D +  A
Sbjct: 88  DNSDKNYSDDSTRERKKNARDKNDISMSEEDSKKENKEIKPKRKKSK---WDTVDESLLA 144

Query: 121 AAMLPGAAVPGQLPGVPS-AVPEMAQNMLPFGAT-QLGAFPLMPVQVMTQQATRHARRVY 178
             ML  +     L GV       +  N+LP     QLG  P         +  +  R++Y
Sbjct: 145 NNMLIDS---NNLSGVLQYQRLSLNGNLLPGNKMPQLGRNP------YELEGDKKQRKLY 195

Query: 179 VGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDA----VVNVYI-NHEKKFAFVEMR 230
           +G LPP + ++ I  FF+  +++I   S+     GD     VV   I N + +F F+E R
Sbjct: 196 IGNLPPNSKQEEIVEFFNNTLSSIIKGSSLEVKIGDVQLLPVVKCEIFNPDSRFCFLEFR 255

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
           T++    ++ LD + +    +R+ RP DY P       P +  P L +    +  G +  
Sbjct: 256 TMDITWLSLKLDSMSYNNYCLRINRPHDYMP-------PPEGDPALTVVFPDIDMGLLES 308

Query: 291 AEGP------------DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            + P            +++++  LP+   + QI +LL  FG L GF+++KD +TG +KGY
Sbjct: 309 FKPPKIAPVRSTGDDDNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNIIKDLNTGLNKGY 368

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI------ 392
           GF  Y+D + T +A  ALNG   G   L V++AT +                ++      
Sbjct: 369 GFFEYEDSSCTQVAIHALNGFVCGKNILNVKKATFNKNPNNIPNPNNIALANNVDVPVSL 428

Query: 393 ---AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
              +I +  L  S +          GE  ++V+ LT A+  + L  + +YEEIL+D++EE
Sbjct: 429 LPNSISQKILSNSIIGLQIQASRKIGEKSSRVIQLTNAVFQEDLIINSQYEEILKDVKEE 488

Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
             KYGTL ++VIP+P+ +   T GVGK+FL Y D      A+   +GR F    V A +Y
Sbjct: 489 AEKYGTLQSIVIPKPNIDLSYTEGVGKIFLHYADETAARKAQYMFNGRLFEKRVVCASFY 548

Query: 510 PEDKYFNKDY 519
            EDK+    Y
Sbjct: 549 SEDKFLEGKY 558


>gi|261196608|ref|XP_002624707.1| splicing factor u2af large subunit [Ajellomyces dermatitidis
           SLH14081]
 gi|239595952|gb|EEQ78533.1| splicing factor u2af large subunit [Ajellomyces dermatitidis
           SLH14081]
 gi|239609528|gb|EEQ86515.1| splicing factor u2af large subunit [Ajellomyces dermatitidis ER-3]
 gi|327350238|gb|EGE79095.1| splicing factor u2af large subunit [Ajellomyces dermatitidis ATCC
           18188]
          Length = 570

 Score =  196 bits (497), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 136/469 (28%), Positives = 217/469 (46%), Gaps = 89/469 (18%)

Query: 93  RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGV 136
           R R +S SP    R P+           + +R + +D+ PP    +    A + G  P +
Sbjct: 139 RERKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFP-L 197

Query: 137 PSAVPEMAQNMLPFGATQLGAFPLMPVQ----------VMTQQATRHARRVYVGGLPPLA 186
           P A  + A        ++L AF   P            V+    +R A+R++V  LPP A
Sbjct: 198 PGAPRQQA-----VDPSRLQAFIHPPTTITAPGSSTNTVLKPSNSRQAKRLFVHNLPPSA 252

Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
            E+ +  FF+  +   G N     D  ++  ++ +K FA +E R   + + A+A DGI  
Sbjct: 253 TEERLVQFFNLQLN--GLNVIKGVDPCLSAQLSRDKTFALLEFRNAADTTVALAFDGITM 310

Query: 247 E---------------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           E                  + +RRP DY    A    P Q          G+ S  +   
Sbjct: 311 EDSGEMDTSNGDVDGPSQGLSIRRPKDYILPSAVEEEPQQE---------GVVSNVV--P 359

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           + P+++ V  +P +  E  +  LL SFG L  F LVKD +TG S+G  FC Y DP  T+I
Sbjct: 360 DSPNKICVSNIPPFIEEEPVTMLLVSFGELKSFILVKDSETGESRGIAFCEYLDPTSTEI 419

Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGM 411
           A   LNG+++G+K L V RA+                           Q +G++     M
Sbjct: 420 AVENLNGMELGNKRLKVVRASVG-----------------------TTQAAGLDMGVNAM 456

Query: 412 SLFGETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
           S++ +T +      +VL L   +T + L D+++YEEI +D+R+EC KYG +V + +PRP 
Sbjct: 457 SMYAKTTSQDIEASRVLQLLNMVTTEELMDNDDYEEICDDVRDECSKYGEVVELKVPRPT 516

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            N  ++ GVGK+++E+ ++     A  AL+GRKF   TV   Y+ E+ +
Sbjct: 517 GNNKQSAGVGKIYVEFDNSESATKALKALAGRKFQDRTVVTTYFSEENF 565


>gi|241166827|ref|XP_002409934.1| splicing factor u2af large subunit, putative [Ixodes scapularis]
 gi|215494685|gb|EEC04326.1| splicing factor u2af large subunit, putative [Ixodes scapularis]
          Length = 444

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 131/361 (36%), Positives = 187/361 (51%), Gaps = 40/361 (11%)

Query: 171 TRHARRVYVGGLP--------PLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           TR ARR+YVG +P        PL   + +  +F+  M A G + A PG+ V+   IN +K
Sbjct: 112 TRQARRLYVGNIPFGCSEASRPLLLREEMMDYFNAQMHACGFSQA-PGNPVLACQINLDK 170

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY--NPTLAAALGPGQPSPNLNLAA 280
            FAF+E+  ++           +            DY   P     LG G  +P      
Sbjct: 171 NFAFLEVSALDTDLGTPCCPTFVL----------YDYLSPPFSGNCLGAGNGTPGDTW-- 218

Query: 281 VGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
           +G  SG I     + P ++F+GGLP Y  E Q++ELL SFG L  F+LVKD  TG SKGY
Sbjct: 219 LGFLSGVISTVVQDSPHKIFIGGLPNYLNEDQVRELLMSFGQLRAFNLVKDSATGLSKGY 278

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            FC Y +   TD A   LNG+++GDK L V+RA+    +K  Q ++   A   I +  + 
Sbjct: 279 AFCEYVEVTTTDQAIMGLNGMQLGDKKLIVQRASVG--AKNSQMNVSRDAPVQIQVPGLQ 336

Query: 399 LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
           LQ       G G         +VLCL   +  + L D+EEYE+ILED+ EEC KYG + +
Sbjct: 337 LQG------GAGPP------TEVLCLMNLVCPEELKDEEEYEDILEDIHEECNKYGVVKS 384

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
           + IPRP + G + PG GK F+E+   + C  A+ +L+GRKF    V   Y+  DKY  ++
Sbjct: 385 IEIPRPIE-GVDVPGCGKAFVEFNSVIDCQKAQQSLTGRKFSNRVVVTSYFDPDKYHRRE 443

Query: 519 Y 519
           +
Sbjct: 444 F 444


>gi|225562835|gb|EEH11114.1| splicing factor u2af large subunit [Ajellomyces capsulatus G186AR]
          Length = 571

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 138/466 (29%), Positives = 222/466 (47%), Gaps = 83/466 (17%)

Query: 93  RNRSKSLSP---SRSPS-----------KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LP 134
           R R +S SP    R P+           + +R + +D+ PP    +    A + G   LP
Sbjct: 140 RERKRSASPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYEHVTAEQAKLSGMFPLP 199

Query: 135 GVP---SAVPEMAQNML--PFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQ 189
           G P   +  P   Q  +  P  +T  G        V+    +R A+R++V  LP  A E+
Sbjct: 200 GAPRQQAVDPSRLQAFIHPPTTSTAPGT---STNTVLKPSNSRQAKRLFVHNLPSSATEE 256

Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE-- 247
           ++  FF+  +   G N     D  V   ++++K FA +E R   + + A+A DGI  E  
Sbjct: 257 SLVQFFNLQLN--GLNVIKGVDPCVTAQLSNDKTFALLEFRNAADTTVALAFDGITMEDN 314

Query: 248 -------------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
                           + +RRP DY   L +A+  G+P         G+ S  +   + P
Sbjct: 315 DDMDTTNGDSNGSNQGLSIRRPKDY--ILPSAVE-GEPHQE------GVVSNVV--PDSP 363

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
           +++ V  +P +  E  +  LL SFG L  F LVKD +TG S+G  FC Y+DP  TDIA  
Sbjct: 364 NKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYRDPMSTDIAVE 423

Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
            LNG+++G+K L V RA+                           Q +G++     MS++
Sbjct: 424 NLNGMELGNKKLKVVRASIG-----------------------TTQAAGLDMGVNAMSMY 460

Query: 415 GETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
            +T +      +VL L   +T + L D+++YEEI +D+R+EC KYG +V + +PRP  N 
Sbjct: 461 AKTTSQDIEASRVLQLLNMVTTEELIDNDDYEEICDDVRDECSKYGEVVELKVPRPTGNN 520

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            ++ GVGK+++++ ++     A  AL+GRKF   TV   Y+ E+ +
Sbjct: 521 KQSAGVGKIYVKFDNSESATKALRALAGRKFQDRTVVTTYFSEENF 566


>gi|119592808|gb|EAW72402.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_a [Homo
           sapiens]
          Length = 376

 Score =  194 bits (494), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 122/311 (39%), Positives = 173/311 (55%), Gaps = 21/311 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T    +++   QH        
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LVSPPAQHHQSDACDP 359

Query: 400 QTSGMNTLGGG 410
            ++G++ L G 
Sbjct: 360 ASAGLDELPGA 370


>gi|345569109|gb|EGX51978.1| hypothetical protein AOL_s00043g712 [Arthrobotrys oligospora ATCC
           24927]
          Length = 569

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 132/450 (29%), Positives = 216/450 (48%), Gaps = 54/450 (12%)

Query: 81  RHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA--AAMLPGAAVPGQLPGVP 137
           R   RS S  + +  +  L+     S+ KRR + +D+ PP   +     A + G  P +P
Sbjct: 153 RQPRRSPSPPKVKEPTPDLTDIIPVSERKRRLTMWDIKPPGYESVTAEQAKLSGMFP-LP 211

Query: 138 SAVPEMAQNMLPFGATQLGAFPLMPVQ--------VMTQQATRHARRVYVGGLPPLANEQ 189
            A  + A        +++ AF   P +         +    +R A+R+    LPP+  E+
Sbjct: 212 GAPRQTA-----LDPSRMAAFVNHPPKEGSTPQPTALKPSNSRQAKRLLCQNLPPMCTEE 266

Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
            I +FFS  + ++    +   + ++ VY+N     A +E R+   A+  +A DG+ F+  
Sbjct: 267 TIYSFFSSFLKSLNAVDSE-NEPLITVYLNPTGTMAMLEFRSTAYATLCLAFDGMEFDDT 325

Query: 250 AVRVR--RPTDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
            V++R  RP DY     + ++    G  SPN+               +  +++ V  +P 
Sbjct: 326 EVKIRLSRPKDYIIPQYSESSESHNGDISPNV--------------PDSINKICVSNIPT 371

Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
           +  + Q+ ELL++FG L  F LVKD++   SKG  FC Y DP + +IA   LNGL + ++
Sbjct: 372 HLADQQVMELLQTFGPLKSFFLVKDKEMDESKGVAFCEYLDPNIAEIAIEGLNGLDINEQ 431

Query: 365 TLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCL 424
            L V+RA+   +         A A+  I    +   T+     GG          +VL L
Sbjct: 432 LLNVKRASIGVKQS-------AGAEAGIPAMTVIAATTSAEMEGG----------RVLQL 474

Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
              +TAD L D EEYEEILED+ +EC K+G ++++ IPRP  N     GVGK+++ + + 
Sbjct: 475 LNMVTADELLDQEEYEEILEDVTDECNKFGPIIDIKIPRPSGNQRAAAGVGKIYVRFEEH 534

Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                A  +L+GRKF   TV   Y+ E+ Y
Sbjct: 535 ESAEKALKSLAGRKFADRTVIVSYFSEENY 564


>gi|148699340|gb|EDL31287.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2,
           isoform CRA_b [Mus musculus]
          Length = 356

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 117/285 (41%), Positives = 162/285 (56%), Gaps = 27/285 (9%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAML-----PGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           RSP    K K R  +D+ PP    +           GQ+P   + +P M  + L    T 
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPAT-ALLPTMTPDGLAVTPT- 135

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
                  PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+
Sbjct: 136 -------PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVL 187

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
            V IN +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S 
Sbjct: 188 AVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SE 240

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
           N ++   G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG 
Sbjct: 241 NPSVYVPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 298

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT 379
           SKGY FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T
Sbjct: 299 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT 343


>gi|119592809|gb|EAW72403.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_b [Homo
           sapiens]
          Length = 356

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 117/280 (41%), Positives = 161/280 (57%), Gaps = 17/280 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 245

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 246 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 303

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT 379
           FC Y D  VTD A A LNG+++GDK L V+RA+   ++ T
Sbjct: 304 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT 343


>gi|221059061|ref|XP_002260176.1| U2 snRNP auxiliary factor [Plasmodium knowlesi strain H]
 gi|193810249|emb|CAQ41443.1| U2 snRNP auxiliary factor, putative [Plasmodium knowlesi strain H]
          Length = 865

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 116/375 (30%), Positives = 193/375 (51%), Gaps = 25/375 (6%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDAVVNVYI-----NH 220
           +  +  R++Y+G +PP + ++ +  FF+  + +I  +S+     GD V+   +     N 
Sbjct: 489 EGDKKQRKLYIGNIPPNSKQEELIDFFNNTLGSIIKDSSLEIKIGDIVLMPILKCEIFNV 548

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY------NPTLAAALGPGQPSP 274
           E +F F+E R++E     + LD I F   A+R+ RP D+      +P L       Q   
Sbjct: 549 ESRFCFLEFRSLEITWLCLRLDAITFNNYALRIARPHDFVPPPGGDPALTVVFTDIQHEV 608

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
              +  + +A     G +  +++++  LP+   + QI++LL+ FG L GF+++KD+ TG 
Sbjct: 609 FEMVKPIKIAPVRSTG-DDDNKLYIQNLPHDLGDVQIRDLLQQFGKLKGFNVIKDQSTGL 667

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHI- 392
           +KGYGF  Y+D   T IA  ALNG   G   L+V++AT    Q+ T+  + ++     + 
Sbjct: 668 NKGYGFFEYEDSNCTPIAMHALNGFVCGQNILSVKKATFGKSQNSTQNANTISLGSGSVD 727

Query: 393 --------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
                   +I +  L  S +          GE  ++V+ LT A+  + L  D +YEEIL 
Sbjct: 728 LPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVVQLTNAVFQEDLIIDSQYEEILR 787

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D++EE  KYG L N+VIP+P+++   T GVGK+FL Y D      A+   +GR F    V
Sbjct: 788 DIKEEAEKYGPLQNIVIPKPNKDLSYTEGVGKIFLHYADETTARKAQYMFNGRLFEKRVV 847

Query: 505 NAFYYPEDKYFNKDY 519
            A +Y E+K+    Y
Sbjct: 848 CAAFYSEEKFLAGKY 862


>gi|361132025|gb|EHL03640.1| putative Splicing factor U2AF 50 kDa subunit [Glarea lozoyensis
           74030]
          Length = 568

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 133/407 (32%), Positives = 202/407 (49%), Gaps = 59/407 (14%)

Query: 124 LPGAAVPGQLPGVPSAVPE-MAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGL 182
           LPGA  P Q P  PS +   MAQ   P G+    A        +    +R A+R+ V  L
Sbjct: 200 LPGA--PRQQPMDPSKLQAIMAQ---PSGSVTNAA--------LKPSNSRQAKRLLVHNL 246

Query: 183 PPLA-NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMAL 241
           P  + +E+AI  FF+  M   G N     D  ++  I+ +K FA +E +T  +A+ A+A 
Sbjct: 247 PSTSISEEAIINFFNLQMN--GLNIVEGSDPCISAQISKDKSFALLEFKTPSDATLALAF 304

Query: 242 DGIIFEGV-------------AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           DGI  +                + +RRP DY     +   P +P         G+ S  +
Sbjct: 305 DGITMDDSEYVNREANGGDTKGLSIRRPKDYIVPAVSDETPQEP---------GVVSSVV 355

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
              +  +++ +  +P Y T+ Q+ ELL SFG L  F LVKD  TG S+G  FC Y DPA 
Sbjct: 356 --VDTQNKICMSNVPLYLTDEQVIELLTSFGELKAFVLVKDNSTGESRGIAFCEYADPAA 413

Query: 349 -TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
            TDIA   LNG+++GDK L V+RA+                  +  +  + +  + M+ L
Sbjct: 414 ATDIAVEGLNGMELGDKHLRVQRASIG----------------NTQVSGLEMGVNAMSML 457

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
            G  S  G    +VL L   +T + L D+++YEEI ED++EEC KYG ++++ +PRP   
Sbjct: 458 AGTTSA-GLEDGRVLQLLNMVTPEELVDNDDYEEICEDVKEECEKYGKVLDMKVPRPTGG 516

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
             ++ GVGK+F+++        A  AL+GRKF   TV   Y+ E+ +
Sbjct: 517 SRQSNGVGKIFVKFDTPESAGKALRALAGRKFADRTVVTTYFSEENF 563


>gi|307106531|gb|EFN54776.1| hypothetical protein CHLNCDRAFT_58050 [Chlorella variabilis]
          Length = 354

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 100/228 (43%), Positives = 139/228 (60%), Gaps = 28/228 (12%)

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
           + GA+  +R+F+GGLPYY TE Q +ELL SFG +  FDLVKD++TG SKGYGFCVY+DP 
Sbjct: 148 MSGADAAERIFIGGLPYYLTEEQCRELLASFGAIKSFDLVKDKETGQSKGYGFCVYEDPR 207

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
           VTD+AC  LNG++MGD+TLTVRRAT   + + EQ+                         
Sbjct: 208 VTDVACQGLNGMRMGDRTLTVRRATEGQRQQAEQKQ------------------------ 243

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV-VIPRPDQ 466
                ++    A+V+ L+ A+T + L DD+EY +I+EDM+EECGKYGT+V V +      
Sbjct: 244 ---QDMYVGNTARVVKLSHAVTLEELGDDQEYGDIMEDMKEECGKYGTVVQVHIPRPAPP 300

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           +    PG+GK+ +E+ +      A+NA+ GRKFGG  V A    +  Y
Sbjct: 301 SAPPPPGLGKIIIEFAETPAAMAARNAMHGRKFGGRVVEAVMMGDSDY 348


>gi|258576333|ref|XP_002542348.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902614|gb|EEP77015.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 621

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 134/431 (31%), Positives = 209/431 (48%), Gaps = 70/431 (16%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q + P   ++L AF   P
Sbjct: 155 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQ-----QTVDP---SRLQAFMNQP 206

Query: 163 V-----QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
                  ++    +R A+R++V  LPP  +E  +A FF+  +   G N     D  ++  
Sbjct: 207 AGNANSTLLKPSNSRQAKRLFVHNLPPSVSEDTLAQFFNLQLN--GLNVISGVDPCISAQ 264

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAA 266
           ++ + KFA +E +T  +A+ A+ALDGI  E           G  + ++RP DY     A 
Sbjct: 265 VSSDGKFALLEFKTASDATVALALDGISLEHDDANGTSSAPGQGLSLKRPKDYIVPSEAD 324

Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
               Q          G+ S  +   + P+++ V  +P +  E Q+  LL SFG L  F L
Sbjct: 325 DSNRQD---------GVVSNEV--PDSPNKICVTNIPPFIQEEQVTMLLVSFGELKSFVL 373

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
           VKD  T  S+G  FC Y DP+ T+IA   LNG+++GDK L V RA+              
Sbjct: 374 VKDSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRASIG------------ 421

Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYE 440
                      A Q +G++     MS+F +T +      +VL L   +TA+ L D +EYE
Sbjct: 422 -----------ATQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTAEELMDSDEYE 470

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
           EI +D+REEC KYG ++++ IPRP     +  GVGK+++++      + A  AL+GRKF 
Sbjct: 471 EICDDVREECSKYGQVLDLKIPRPTGGSRQAAGVGKIYVKFDSYDSASKAMKALAGRKFQ 530

Query: 501 GNTVNAFYYPE 511
             TV   ++ E
Sbjct: 531 DRTVVTTFFSE 541


>gi|259480265|tpe|CBF71237.1| TPA: splicing factor u2af large subunit (AFU_orthologue;
           AFUA_7G05310) [Aspergillus nidulans FGSC A4]
          Length = 547

 Score =  191 bits (485), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 179/360 (49%), Gaps = 53/360 (14%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R A+R++V  LPP A  + + +FF+  +   G N     D  ++  I+ +  FA +E +
Sbjct: 220 SRQAKRLFVYNLPPNATVENLVSFFNLQLN--GLNVIQSVDPCISAQISDDHSFALLEFK 277

Query: 231 TVEEASNAMALDGIIF--------EGVA--VRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           +  + + A+ALDGI           G A  + VRRP DY              PNL    
Sbjct: 278 SPNDTTVALALDGITMGEHESNGENGAAKGLEVRRPKDYI------------VPNLAEQD 325

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
           +  ASG     + P+++ V  +P Y  E  +  LL+SFG L  F LVKD  T  S+G  F
Sbjct: 326 LEGASGMKDVPDSPNKICVSNIPQYIPEEPVTMLLKSFGELKSFVLVKDSSTEESRGIAF 385

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
           C Y DP  T IA   LNG+++GD+ L V RA+                           Q
Sbjct: 386 CEYADPNTTTIAVQGLNGMELGDRHLKVVRASIG-----------------------MTQ 422

Query: 401 TSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
            +G++     MS+F +T +      +VL L   +T + L D+E+YEEI +D+R+EC K+G
Sbjct: 423 AAGLDMGVNAMSMFAKTTSQDLESSRVLQLLNMVTPEELMDNEDYEEICDDVRDECSKFG 482

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            ++ + IPRP     ++PGVGK+F+++        A  +L+GRKF   TV   Y+ E+ +
Sbjct: 483 RVLELKIPRPTGGSRQSPGVGKIFVKFETIEATTAALKSLAGRKFSDRTVVTTYFSEENF 542


>gi|124810295|ref|XP_001348830.1| U2 snRNP auxiliary factor, putative [Plasmodium falciparum 3D7]
 gi|23497731|gb|AAN37269.1|AE014827_12 U2 snRNP auxiliary factor, putative [Plasmodium falciparum 3D7]
          Length = 833

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 121/394 (30%), Positives = 193/394 (48%), Gaps = 54/394 (13%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDA----VVNVYI-N 219
           Q   +  R++Y+G +PP + ++ +  FF+  + A+  +S+     GD     V+   I N
Sbjct: 449 QDTDKKQRKLYIGNIPPNSKQEDVVDFFNNSILAVIKDSSLDVKIGDVQLMPVIKCEIFN 508

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-----------------T 262
            + +F F+E RTV+     + LD I +    +R+ RP DY P                  
Sbjct: 509 SDSRFCFLEFRTVQITWLCLKLDSIPYNNYCLRIGRPHDYIPPPEGDPAFTTVFTDINMD 568

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
           +   L P +P   +N+            ++  +R+++  LP+   + QIK+LLE FG L 
Sbjct: 569 VFEKLRPSKP---VNVKT---------SSDEENRLYIQNLPHDLKDEQIKDLLEQFGDLK 616

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ--SKTE 380
            F+++KD +TG +KGYGF  Y+D + T +A  ALNG   G   L V++AT + Q  + T 
Sbjct: 617 AFNIIKDLNTGLNKGYGFFEYEDSSCTQLAIHALNGFVCGQNILNVKKATFNKQPTTITT 676

Query: 381 QESILAQAQQHIA---------------IQKMALQTSGMNTLGGGMSLFGETLAKVLCLT 425
             ++  Q    IA               I +  L  S +          GE  +KV+ LT
Sbjct: 677 NNNMNNQNPNFIALPNNSDVPVTLLPSSISQKILSNSIIGLQVQASRKIGEKSSKVVQLT 736

Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
            A+  + L  D +YEEIL++++EE  KYGTL N+VIP+P+++   T GVGK+FL Y D  
Sbjct: 737 NAVFQEDLIVDSQYEEILKEVKEEAEKYGTLQNIVIPKPNKDLSYTEGVGKIFLHYADEA 796

Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
               A+   +GR F    V A +Y E+ +    Y
Sbjct: 797 TARKAQYMFNGRLFEKRVVCAAFYSEEHFLKGKY 830


>gi|242803779|ref|XP_002484243.1| splicing factor u2af large subunit [Talaromyces stipitatus ATCC
           10500]
 gi|218717588|gb|EED17009.1| splicing factor u2af large subunit [Talaromyces stipitatus ATCC
           10500]
          Length = 543

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 135/438 (30%), Positives = 209/438 (47%), Gaps = 76/438 (17%)

Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
           + +R + +D+ PP            + M P    P Q    PS +  +  N    G T+ 
Sbjct: 147 RKRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQQAVDPSRLQALV-NQPAAGTTEN 205

Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
            A  L P        +R A+R++   LPP   E A+ +FF+  +   G N     D  V+
Sbjct: 206 SA--LRPAN------SRQAKRLFAHNLPPNVTEAALVSFFNLQLN--GLNVIEGIDPCVS 255

Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDY-NPTL 263
             I+ +  FA +E +   E + A+ALDGI  E              + +RRP DY  P++
Sbjct: 256 AQISKDHSFALLEFKGANETTVALALDGITMEEHESAATANGGARGLELRRPKDYIVPSV 315

Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHG 323
                P Q S   N              + P+++ +  +P Y  E  +  LL+S G L  
Sbjct: 316 PEDQQPHQESVISNHVP-----------DSPNKLCITNIPLYIPEEPVTMLLKSIGELKA 364

Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
           F LVKD  T  S+G  FC Y D A T IA  +LNG+++GDK L +  A+           
Sbjct: 365 FVLVKDSGTDESRGIAFCEYVDAASTAIAVESLNGMELGDKHLKITHASIG--------- 415

Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDE 437
                         A Q +G++     MS+F +T +      ++L L   +TAD L +++
Sbjct: 416 --------------ATQAAGLDMGVNAMSMFAKTTSADLETSRILQLLNMVTADELINND 461

Query: 438 EYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALSG 496
           +YEEILED+++EC KYG +++V IPRP     ++ GVGK++++ +D+V  AT A  AL+G
Sbjct: 462 DYEEILEDVQDECSKYGQVLDVKIPRPAGGSRQSAGVGKIYVK-FDSVESATNALKALAG 520

Query: 497 RKFGGNTVNAFYYPEDKY 514
           RKF   TV   Y+PE+ +
Sbjct: 521 RKFSDRTVVTTYFPEESF 538


>gi|367042858|ref|XP_003651809.1| hypothetical protein THITE_2112508 [Thielavia terrestris NRRL 8126]
 gi|346999071|gb|AEO65473.1| hypothetical protein THITE_2112508 [Thielavia terrestris NRRL 8126]
          Length = 563

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 135/432 (31%), Positives = 216/432 (50%), Gaps = 63/432 (14%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   T+L AF   P
Sbjct: 166 RKRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMNQP 217

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +   A     +R ++R+ V  LP  A E+++  F +  +   G N     D  +   
Sbjct: 218 GGSVNSAALRPTNSRQSKRLVVENLPASATEESMVNFINLQLN--GLNVIENTDPCLQCL 275

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-------------VAVRVRRPTDYNPTLA 264
           I  ++ FA +E R   +A+ A+A DGI  E                +R+RRP DY   + 
Sbjct: 276 IAPDRSFAMLEFRNSPDATVALAFDGISMEADDAHAANGNGAAPAGLRIRRPKDY--IVP 333

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
           A +      PN +  +   +S  I   + P+++ V  LP Y T+ Q+ ELL SFG L  F
Sbjct: 334 AVVE----DPNYDPDSDVPSSVVI---DSPNKISVTNLPLYLTDDQVMELLVSFGKLKSF 386

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
            LVKD  T  S+G  F  Y DP+VT++A   LN + +G++ L V++A+            
Sbjct: 387 VLVKDNGTQESRGIAFLEYVDPSVTNVAIQGLNNMMLGERALKVQKAS------------ 434

Query: 385 LAQAQQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
                  I I ++A  +  + M+ L G  S   + +++VL L   +TAD L D+++YEEI
Sbjct: 435 -------IGITQVAGEMGVNAMSMLAGTTSTDSD-VSRVLQLLNMVTADELMDNDDYEEI 486

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+++EC K+GT++++ IPRP   G ++ GVGK+F+++        A   L+GRKF   
Sbjct: 487 RDDVQDECEKFGTVLSLKIPRPTGGGRQSAGVGKIFIKFETPEVATKALRGLAGRKFADR 546

Query: 503 TVNAFYYPEDKY 514
           TV A Y+PE+ +
Sbjct: 547 TVVATYFPEENF 558


>gi|341038664|gb|EGS23656.1| hypothetical protein CTHT_0003520 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 584

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 185/358 (51%), Gaps = 44/358 (12%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R A+R+ V  LPP A E+++  FF+  +   G N     D  +  +I  ++ FA +E R
Sbjct: 252 SRQAKRLVVRNLPPSATEESLVNFFNLQLN--GLNVIETTDPCLQAHIAPDRSFAMLEFR 309

Query: 231 TVEEASNAMALDGIIF----------EGV--AVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
              EA+ A+A DGI            E V   +++ RP DY   + A +      PN + 
Sbjct: 310 NSSEATVALAFDGISMDADDAGANGAEAVHGGLQITRPKDY--IVPAVVE----DPNYDP 363

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            +   +S  I   + P+++ V  +P Y  E Q+ ELL SFG L  F LVKD  T  S+G 
Sbjct: 364 DSDVPSSVVI---DSPNKISVANIPPYLNEDQVMELLVSFGKLKSFVLVKDNGTQESRGI 420

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            F  Y DP+V+++A   LN + +G++ L V++A+                   I I ++A
Sbjct: 421 AFLEYVDPSVSNVAIQGLNDMPLGEQKLKVKKAS-------------------IGITQVA 461

Query: 399 --LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
             +  + M+ L G  S   E  ++VL L   +T + L D+++YEEI ED+ EEC K+G +
Sbjct: 462 GEMSVNAMSMLAGTTSTHAEASSRVLQLLNMVTPEELMDNDDYEEIREDVLEECKKFGNV 521

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           +++ IPRP     ++ GVGK+++++        A  AL+GRKF   TV   Y+PE+ Y
Sbjct: 522 LSLKIPRPIGGNRQSAGVGKIYVKFEQVESATKALRALAGRKFSDRTVVTTYFPEENY 579


>gi|453082700|gb|EMF10747.1| hypothetical protein SEPMUDRAFT_48483 [Mycosphaerella populorum
           SO2202]
          Length = 432

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 144/471 (30%), Positives = 231/471 (49%), Gaps = 72/471 (15%)

Query: 74  RDKERR-HRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA----------- 120
           R  ER+  R  S S    + R  +  L+   S  + KRR + +D+ PP            
Sbjct: 2   RQLERQIKREASGSPPPKKPREPTPDLTDVVSVLERKRRLTQWDIKPPGYDNVTAEQAKL 61

Query: 121 AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVG 180
           + M P    P Q P  P+ +    Q  +     Q  +  L P       + R ++RV V 
Sbjct: 62  SGMFPLPGAPRQQPMDPAKL----QAFMNQPGNQASSSALKP------SSARQSKRVMVH 111

Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMA 240
            LPP A ++++  FF+  +   G N     D  ++   + +K +A VE +T E+A+NAMA
Sbjct: 112 NLPPSATDESMVDFFNLQLN--GLNITRGVDPCISAQCSKDKTYALVEFKTPEDATNAMA 169

Query: 241 LDGIIFEGVA-------------VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           LDGI  +  A             ++++RP DY       + P       N +  GL S  
Sbjct: 170 LDGITMDHDAMDTSGASNGAPKGLQIKRPRDY-------IVPNVIDETENES--GLLSNT 220

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
           +   +  +++ +  LP +  E QI+ELL SFG L  F LV+++ +G S+G  FC Y+DP+
Sbjct: 221 VPDTQ--NKISITNLPSFLAEEQIQELLMSFGELKSFVLVRNQSSGESRGIAFCEYKDPS 278

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA--LQTSGMN 405
           VT +A  +LNG+++ D  + V+ A+                   I IQ+++  +  + M+
Sbjct: 279 VTKVAVDSLNGMELADTAMRVKLAS-------------------IGIQQVSSEMSVNAMS 319

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD 465
            + G  S   +   +VL L   IT + L D ++ +EILED++EEC KYG L+ V +PRP 
Sbjct: 320 LMAGAKSTDADN-GRVLALMNMITPEELMDPDDADEILEDVKEECAKYGPLLEVKMPRPT 378

Query: 466 QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               ++ GVGK++L+Y  +   A A  AL+GRKF   TV   Y+ E+ YF+
Sbjct: 379 GGSRQSTGVGKIYLKYETSEHAAKALAALAGRKFADRTVVVTYFGEE-YFD 428


>gi|195346998|ref|XP_002040041.1| GM15574 [Drosophila sechellia]
 gi|194135390|gb|EDW56906.1| GM15574 [Drosophila sechellia]
          Length = 445

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/361 (33%), Positives = 191/361 (52%), Gaps = 33/361 (9%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP----GDAVVNVYINHEKKFAF 226
           TR ARR+YVG +P    E+ +  FF+  + A G  +       G AV+    + EK FAF
Sbjct: 103 TRQARRLYVGNIPFGVTEEEMMQFFNHRIMAQGYEAKSSHYMDGKAVLTCQTHLEKNFAF 162

Query: 227 VEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           +E R+++EA+ A+  DG+++ G  +++RRP DY P  + +    +   +  + A  +A+ 
Sbjct: 163 LEFRSIDEATQALNFDGMVYRGQTLKIRRPHDYQPVPSISFSAMENYRSFRVPATTIANP 222

Query: 287 -------AIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
                       + P++++VGGLP    + Q+KELL+SFG L G +LV D +T  +KG+ 
Sbjct: 223 PNVTIPVTTIVPDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDGNTSLNKGFA 282

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMAL 399
           F  Y DP VTD A A L+G+ +GD+ L V+R+   G++     +               +
Sbjct: 283 FFEYYDPLVTDHAIAGLHGMLLGDRRLVVQRSIPGGKNAFPGHT------------GPVV 330

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
           Q  G++TL     L   +  + LCL   +  + L DDEE+E+I  D+++EC K+G + ++
Sbjct: 331 QVPGISTL-----LDPGSPTETLCLLNMVRPEELLDDEEFEDIRTDIKQECAKFGEVRSI 385

Query: 460 VIPRPDQNGGETP--GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
            IPRP    G  P  G GKVF+++        A  ALSGRKF    V   YY  +KY   
Sbjct: 386 KIPRPI---GPFPKRGCGKVFVQFESVEDSQKALKALSGRKFSDRIVMTSYYDPEKYLAD 442

Query: 518 D 518
           D
Sbjct: 443 D 443


>gi|67541022|ref|XP_664285.1| hypothetical protein AN6681.2 [Aspergillus nidulans FGSC A4]
 gi|40738434|gb|EAA57624.1| hypothetical protein AN6681.2 [Aspergillus nidulans FGSC A4]
          Length = 624

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 116/357 (32%), Positives = 177/357 (49%), Gaps = 53/357 (14%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R A+R++V  LPP A  + + +FF+  +   G N     D  ++  I+ +  FA +E +
Sbjct: 220 SRQAKRLFVYNLPPNATVENLVSFFNLQLN--GLNVIQSVDPCISAQISDDHSFALLEFK 277

Query: 231 TVEEASNAMALDGIIF--------EGVA--VRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           +  + + A+ALDGI           G A  + VRRP DY              PNL    
Sbjct: 278 SPNDTTVALALDGITMGEHESNGENGAAKGLEVRRPKDYI------------VPNLAEQD 325

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
           +  ASG     + P+++ V  +P Y  E  +  LL+SFG L  F LVKD  T  S+G  F
Sbjct: 326 LEGASGMKDVPDSPNKICVSNIPQYIPEEPVTMLLKSFGELKSFVLVKDSSTEESRGIAF 385

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
           C Y DP  T IA   LNG+++GD+ L V RA+                           Q
Sbjct: 386 CEYADPNTTTIAVQGLNGMELGDRHLKVVRASIG-----------------------MTQ 422

Query: 401 TSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
            +G++     MS+F +T +      +VL L   +T + L D+E+YEEI +D+R+EC K+G
Sbjct: 423 AAGLDMGVNAMSMFAKTTSQDLESSRVLQLLNMVTPEELMDNEDYEEICDDVRDECSKFG 482

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
            ++ + IPRP     ++PGVGK+F+++        A  +L+GRKF   TV   Y+ E
Sbjct: 483 RVLELKIPRPTGGSRQSPGVGKIFVKFETIEATTAALKSLAGRKFSDRTVVTTYFSE 539


>gi|391864554|gb|EIT73849.1| splicing factor U2AF, large subunit [Aspergillus oryzae 3.042]
          Length = 538

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 131/438 (29%), Positives = 204/438 (46%), Gaps = 76/438 (17%)

Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
           + +R + +D+ PP            + M P    P Q P  PS    +   M   GA   
Sbjct: 142 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPS---RLQAFMSQPGAGTA 198

Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
            +  L P        +R A+R++V  LP  A  + + +FF+  +   G N     D  ++
Sbjct: 199 ESASLKPSN------SRQAKRLFVSNLPASATGENLLSFFNLQLN--GLNVIHSVDPCIS 250

Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPT 262
             ++ ++ FA +E +T  +A+ A+A DGI  +                + VRRP DY   
Sbjct: 251 AQVSDDRSFALLEFKTPNDATVALAFDGITMDESEAAGNGAANGAPQGLEVRRPKDY--- 307

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
                    PS N      G+    +   + P+++ V  +P+Y  E  +  LL+SFG L 
Sbjct: 308 -------IVPSGNEQEYQEGVLLNEV--PDSPNKICVSNIPHYIPEEPVTMLLKSFGELK 358

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DP  T IA   LNG+++GD+ L V RA+          
Sbjct: 359 SFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLKVVRAS---------- 408

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                    I I     Q +G++     MS+F +T +      +VL L   +T + L D+
Sbjct: 409 ---------IGIT----QAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMDN 455

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           ++Y+EI +D+REEC KYG +V + IPRP     ++PGVGK+F+++        A  AL+G
Sbjct: 456 DDYDEICDDVREECAKYGQVVELKIPRPSGGSRQSPGVGKIFVKFDSVESTTNALKALAG 515

Query: 497 RKFGGNTVNAFYYPEDKY 514
           RKF   TV   Y+ E+ +
Sbjct: 516 RKFSDRTVVTTYFSEENF 533


>gi|317139209|ref|XP_001817348.2| splicing factor u2af large subunit [Aspergillus oryzae RIB40]
          Length = 538

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 131/438 (29%), Positives = 204/438 (46%), Gaps = 76/438 (17%)

Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
           + +R + +D+ PP            + M P    P Q P  PS    +   M   GA   
Sbjct: 142 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPS---RLQAFMSQPGAGTA 198

Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
            +  L P        +R A+R++V  LP  A  + + +FF+  +   G N     D  ++
Sbjct: 199 ESASLKPSN------SRQAKRLFVSNLPASATGENLLSFFNLQLN--GLNVIHSVDPCIS 250

Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPT 262
             ++ ++ FA +E +T  +A+ A+A DGI  +                + VRRP DY   
Sbjct: 251 AQVSDDRSFALLEFKTPNDATVALAFDGITMDESEAAGNGAANGAPQGLEVRRPKDY--- 307

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
                    PS N      G+    +   + P+++ V  +P+Y  E  +  LL+SFG L 
Sbjct: 308 -------IVPSGNEQEYQEGVLLNEV--PDSPNKICVSNIPHYIPEEPVTMLLKSFGELK 358

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DP  T IA   LNG+++GD+ L V RA+          
Sbjct: 359 SFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLKVVRAS---------- 408

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                    I I     Q +G++     MS+F +T +      +VL L   +T + L D+
Sbjct: 409 ---------IGIT----QAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMDN 455

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           ++Y+EI +D+REEC KYG +V + IPRP     ++PGVGK+F+++        A  AL+G
Sbjct: 456 DDYDEICDDVREECAKYGQVVELKIPRPSGGSRQSPGVGKIFVKFDSVESTTNALKALAG 515

Query: 497 RKFGGNTVNAFYYPEDKY 514
           RKF   TV   Y+ E+ +
Sbjct: 516 RKFSDRTVVTTYFSEENF 533


>gi|238482353|ref|XP_002372415.1| splicing factor u2af large subunit [Aspergillus flavus NRRL3357]
 gi|220700465|gb|EED56803.1| splicing factor u2af large subunit [Aspergillus flavus NRRL3357]
          Length = 556

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 131/438 (29%), Positives = 204/438 (46%), Gaps = 76/438 (17%)

Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
           + +R + +D+ PP            + M P    P Q P  PS    +   M   GA   
Sbjct: 160 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPS---RLQAFMSQPGAGTA 216

Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
            +  L P        +R A+R++V  LP  A  + + +FF+  +   G N     D  ++
Sbjct: 217 ESASLKPSN------SRQAKRLFVSNLPASATGENLLSFFNLQLN--GLNVIHSVDPCIS 268

Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPT 262
             ++ ++ FA +E +T  +A+ A+A DGI  +                + VRRP DY   
Sbjct: 269 AQVSDDRSFALLEFKTPNDATVALAFDGITMDESEAAGNGAANGAPQGLEVRRPKDY--- 325

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
                    PS N      G+    +   + P+++ V  +P+Y  E  +  LL+SFG L 
Sbjct: 326 -------IVPSGNEQEYQEGVLLNEV--PDSPNKICVSNIPHYIPEEPVTMLLKSFGELK 376

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DP  T IA   LNG+++GD+ L V RA+          
Sbjct: 377 SFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLKVVRAS---------- 426

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                    I I     Q +G++     MS+F +T +      +VL L   +T + L D+
Sbjct: 427 ---------IGIT----QAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMDN 473

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           ++Y+EI +D+REEC KYG +V + IPRP     ++PGVGK+F+++        A  AL+G
Sbjct: 474 DDYDEICDDVREECAKYGQVVELKIPRPSGGSRQSPGVGKIFVKFDSVESTTNALKALAG 533

Query: 497 RKFGGNTVNAFYYPEDKY 514
           RKF   TV   Y+ E+ +
Sbjct: 534 RKFSDRTVVTTYFSEENF 551


>gi|68068227|ref|XP_676023.1| U2 snRNP auxiliary factor [Plasmodium berghei strain ANKA]
 gi|56495523|emb|CAI00540.1| U2 snRNP auxiliary factor, putative [Plasmodium berghei]
          Length = 630

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 115/380 (30%), Positives = 188/380 (49%), Gaps = 36/380 (9%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDA----VVNVYI-NH 220
           +  +  R++Y+G LPP + ++ I  FF+  +++I   S+     GD     VV   I N 
Sbjct: 255 EGDKKQRKLYIGNLPPNSKQEEIVEFFNNTISSIIKGSSLEVKIGDVQLLPVVKCEIFNA 314

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           + +F F+E RT++     + LD + +    +R+ RP DY P       P +  P L +  
Sbjct: 315 DSRFCFLEFRTMDITWLCLKLDSMSYNNYCLRINRPHDYMP-------PPEGDPALTVVF 367

Query: 281 VGLASGAIGGAEGP------------DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
             +  G +   + P            +++++  LP+   + QI +LL  FG L GF+++K
Sbjct: 368 PDIDMGLLESFKPPKIAPVRSTGDDDNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNIIK 427

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D +TG +KGYGF  Y+D + T +A  ALNG   G   L V++AT +  S     S     
Sbjct: 428 DLNTGLNKGYGFFEYEDSSCTQVAIHALNGFVCGKNILNVKKATFNKNSNNAPNSNNIVL 487

Query: 389 QQHI---------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEY 439
             ++         +I +  L  S +          GE  ++V+ LT A+  + L  D +Y
Sbjct: 488 ANNVDVPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVIQLTNAVFQEDLIIDSQY 547

Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
           +EIL+D++EE  KYG L ++VIP+P+ +   T GVGK+FL Y D      A+   +GR F
Sbjct: 548 DEILKDVKEEAEKYGPLQSIVIPKPNTDLSYTEGVGKIFLHYVDETAARKAQYMFNGRLF 607

Query: 500 GGNTVNAFYYPEDKYFNKDY 519
               V A +Y E+K+    Y
Sbjct: 608 EKRVVCASFYSEEKFLKGKY 627


>gi|357155772|ref|XP_003577233.1| PREDICTED: splicing factor U2af large subunit B-like [Brachypodium
           distachyon]
          Length = 446

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 96/150 (64%), Positives = 111/150 (74%), Gaps = 7/150 (4%)

Query: 113 GFDMAPPAA----AMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ 168
           GFD+ P +A       P    P QLPG  S++P M  NMLPF   Q     + P Q MTQ
Sbjct: 141 GFDLGPTSAQAVVPQFPTIPAPSQLPG--SSIPGMFPNMLPFAVGQFNPLVMQP-QAMTQ 197

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           QATRHARRVYVGGLPP ANEQ++A +F+QVM AIGGN+AGPGDAV+NVYINH+KKFAFVE
Sbjct: 198 QATRHARRVYVGGLPPTANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 257

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTD 258
           MR+VEEASNAMALDGI+FEG  V+    TD
Sbjct: 258 MRSVEEASNAMALDGILFEGAPVKDLNVTD 287



 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 102/179 (56%), Positives = 124/179 (69%), Gaps = 14/179 (7%)

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           +D  VTDIACAALNG+KMGDKTLTVRRA   S Q + EQE+IL QAQQ + +QK+  Q  
Sbjct: 281 KDLNVTDIACAALNGIKMGDKTLTVRRANQGSAQPRPEQENILLQAQQQVQLQKLVYQVG 340

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
            + T             KV+CLT+ +TAD L DDEEYE+I+EDMR E GKYGTLV VVIP
Sbjct: 341 ALPT-------------KVICLTQVVTADELKDDEEYEDIMEDMRLEAGKYGTLVKVVIP 387

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           RP  +G    GVGKVFLEY D  G   AK A+ GRKFGGN V A +YPE+K+ ++++ A
Sbjct: 388 RPHPSGEPVAGVGKVFLEYADVDGSTKAKTAMHGRKFGGNPVVAVFYPENKFSDEEFDA 446


>gi|384246661|gb|EIE20150.1| hypothetical protein COCSUDRAFT_30785 [Coccomyxa subellipsoidea
           C-169]
          Length = 212

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 92/206 (44%), Positives = 133/206 (64%), Gaps = 3/206 (1%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           A R ARR+YVGGLPP   +  +  + +++M + GG  A  G  + +  I  EK +AF+E 
Sbjct: 6   AARPARRIYVGGLPPETTDADLRQYINELMVSTGG-CAATGYPIASCKIYTEKSYAFLEF 64

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R+VEEASN MA DG+ F+   +RVRRP +Y+  +A  LGP  P P ++++ + +    + 
Sbjct: 65  RSVEEASNCMAFDGVAFKDSYLRVRRPNNYDINVAVMLGPTDPDPTMDVSNLDIVKTVV- 123

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
             + P ++F+GGLP  +TE Q+KELL  FG+L  F+LV D++TGNSKGY FC YQD  +T
Sbjct: 124 -QDSPHKLFIGGLPCDWTEDQVKELLLPFGSLKAFNLVMDKNTGNSKGYAFCEYQDIGLT 182

Query: 350 DIACAALNGLKMGDKTLTVRRATASG 375
           D     LNG ++G+K LTV+RA   G
Sbjct: 183 DYVIQNLNGKQIGNKFLTVKRALQPG 208


>gi|156099808|ref|XP_001615700.1| U2 snRNP auxiliary factor [Plasmodium vivax Sal-1]
 gi|148804574|gb|EDL45973.1| U2 snRNP auxiliary factor, putative [Plasmodium vivax]
          Length = 914

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 115/375 (30%), Positives = 189/375 (50%), Gaps = 25/375 (6%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDAVVNVYI-----NH 220
           +  +  R++Y+G +PP + ++ +  FF+  + +I  +S+     GD V+   +     N 
Sbjct: 538 EGDKKQRKLYIGNIPPNSKQEELIDFFNNTLASIIKDSSLEIKIGDIVLLPILKCEIFNV 597

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY------NPTLAAALGPGQPSP 274
           E +F F+E R++E     + LD I F    +R+ RP D+      +P L           
Sbjct: 598 ESRFCFLEFRSLEITWLCLRLDAISFNNYCLRIARPHDFVPPPGGDPALTVVFTDINHEV 657

Query: 275 NLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
              +  V +A     G +  +++++  LP+   + QI++LL+ FG L GF+++KD +TG 
Sbjct: 658 FEMVKPVKIAPVRSTG-DDDNKLYIQNLPHDLRDDQIRDLLQQFGKLKGFNIIKDLNTGL 716

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQHI- 392
           +KGYGF  Y+D   T IA  ALNG   G   L V++AT    Q+ T+  + ++     + 
Sbjct: 717 NKGYGFFEYEDSNCTPIAMHALNGFVCGQNILNVKKATFGKSQNSTQNANTISLPTGSVD 776

Query: 393 --------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
                   +I +  L  S +          GE  ++V+ LT A+  + L  D +YEEIL 
Sbjct: 777 LPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVVQLTNAVFQEDLLIDSQYEEILR 836

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D++EE  KYG L N+VIP+P ++   T GVGK+FL Y D      A+   +GR F    V
Sbjct: 837 DIKEEAEKYGPLQNIVIPKPSKDLSYTEGVGKIFLHYADETTARKAQYMFNGRLFEKRVV 896

Query: 505 NAFYYPEDKYFNKDY 519
            A +Y E+K+    Y
Sbjct: 897 CAAFYSEEKFLAGKY 911


>gi|115400045|ref|XP_001215611.1| hypothetical protein ATEG_06433 [Aspergillus terreus NIH2624]
 gi|114191277|gb|EAU32977.1| hypothetical protein ATEG_06433 [Aspergillus terreus NIH2624]
          Length = 413

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 136/435 (31%), Positives = 205/435 (47%), Gaps = 76/435 (17%)

Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
           + +R + +D+ PP            + M P    P Q P  PS + +   N    G+   
Sbjct: 22  RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRL-QAFMNQSTTGSAD- 79

Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
            A  L P        +R A+R++V  +PP    +A+ +FF+  +   G N     D  ++
Sbjct: 80  -AASLKPSH------SRQAKRLFVYNIPPNVTGEALLSFFNLQLN--GLNVVQSVDPCIS 130

Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA------------VRVRRPTDYNPTL 263
             ++ +  FA +E ++  EA+ A+A DGI  +  A            + VRRP DY    
Sbjct: 131 AQVSDDHSFALLEFKSPNEATVALAFDGITMDEHASMDGAGKGEVKGLEVRRPKDYIVPN 190

Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHG 323
            +A    Q    LN              + P+++ V  +P Y  E  +  LL+SFG L  
Sbjct: 191 GSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIQEEAVIMLLKSFGELKS 239

Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
           F LVKD  T  S+G  FC Y DP  T IA   LNG+++GD+ L V RA+           
Sbjct: 240 FVLVKDASTEESRGIAFCEYADPTATSIAVEGLNGMEIGDRPLKVVRASIG--------- 290

Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDE 437
                           Q +G++     MS+F +T +      +VL L   +TA+ L D++
Sbjct: 291 --------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTAEELIDND 336

Query: 438 EYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALSG 496
           +YEEI ED+REEC KYG ++ + +PRP     ++PGVGK+F++ +D V  AT A  AL+G
Sbjct: 337 DYEEICEDVREECSKYGQVLELKVPRPSGGSRQSPGVGKIFVK-FDTVESATKALKALAG 395

Query: 497 RKFGGNTVNAFYYPE 511
           RKF   TV   YY E
Sbjct: 396 RKFSDRTVVTTYYGE 410


>gi|212539736|ref|XP_002150023.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
           18224]
 gi|210067322|gb|EEA21414.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
           18224]
          Length = 551

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 130/433 (30%), Positives = 207/433 (47%), Gaps = 66/433 (15%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
           + +R + +D+ PP    +    A + G  P +P A  + A +      ++L A    P  
Sbjct: 155 RKRRMTQWDIKPPGYDNVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQALVNQPSA 208

Query: 165 VMTQQAT------RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
             T+ +T      R A+R++   LPP   + A+ +FF+  +   G N     D  V+  I
Sbjct: 209 TTTESSTLRPANSRQAKRLFAYNLPPNVTDAALISFFNLQLN--GLNVIEGIDPCVSSQI 266

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAAL 267
           + +  FA +E +   EA+ A+ALDGI  E              + +RRP DY       +
Sbjct: 267 SKDHAFALLEFKGPNEATVALALDGISMEEHEAAATTNGGARGLELRRPKDY-------I 319

Query: 268 GPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
            P  P  +      G+ S  +   + P+++ V  +P Y  E  +  LL+S G L  F LV
Sbjct: 320 VPSSPE-DQQPYQEGVISNQV--PDSPNKLCVTNIPLYIPEEPVTMLLKSIGELRAFVLV 376

Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           KD  T  S+G  FC Y D   T IA  +LNG+++GDK L +  A+               
Sbjct: 377 KDSGTDESRGIAFCEYVDATATAIAVESLNGMELGDKHLKITHASIG------------- 423

Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEE 441
                       Q +G++     MS+F +T +      +VL L   +TAD L ++E+YEE
Sbjct: 424 ----------VTQAAGLDMGVNAMSMFAKTTSADLETTRVLQLLNMVTADELINNEDYEE 473

Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
           ILED+++EC KYG ++++ IPRP     ++ GVGK+F+++        A  AL+GRKF  
Sbjct: 474 ILEDVQDECSKYGQVLDLKIPRPAGGSRQSAGVGKIFVKFDTVESATNALKALAGRKFSD 533

Query: 502 NTVNAFYYPEDKY 514
            TV   Y+PE+ +
Sbjct: 534 RTVVTTYFPEESF 546


>gi|83765203|dbj|BAE55346.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 563

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 131/435 (30%), Positives = 202/435 (46%), Gaps = 76/435 (17%)

Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
           + +R + +D+ PP            + M P    P Q P  PS    +   M   GA   
Sbjct: 163 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPS---RLQAFMSQPGAGTA 219

Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
            +  L P        +R A+R++V  LP  A  + + +FF+  +   G N     D  ++
Sbjct: 220 ESASLKPSN------SRQAKRLFVSNLPASATGENLLSFFNLQLN--GLNVIHSVDPCIS 271

Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV-------------AVRVRRPTDYNPT 262
             ++ ++ FA +E +T  +A+ A+A DGI  +                + VRRP DY   
Sbjct: 272 AQVSDDRSFALLEFKTPNDATVALAFDGITMDESEAAGNGAANGAPQGLEVRRPKDY--- 328

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
                    PS N      G+    +   + P+++ V  +P+Y  E  +  LL+SFG L 
Sbjct: 329 -------IVPSGNEQEYQEGVLLNEV--PDSPNKICVSNIPHYIPEEPVTMLLKSFGELK 379

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DP  T IA   LNG+++GD+ L V RA+          
Sbjct: 380 SFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLKVVRAS---------- 429

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                    I I     Q +G++     MS+F +T +      +VL L   +T + L D+
Sbjct: 430 ---------IGIT----QAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMDN 476

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           ++Y+EI +D+REEC KYG +V + IPRP     ++PGVGK+F+++        A  AL+G
Sbjct: 477 DDYDEICDDVREECAKYGQVVELKIPRPSGGSRQSPGVGKIFVKFDSVESTTNALKALAG 536

Query: 497 RKFGGNTVNAFYYPE 511
           RKF   TV   Y+ E
Sbjct: 537 RKFSDRTVVTTYFSE 551


>gi|171684585|ref|XP_001907234.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942253|emb|CAP67905.1| unnamed protein product [Podospora anserina S mat+]
          Length = 585

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 130/431 (30%), Positives = 212/431 (49%), Gaps = 62/431 (14%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   T+L AF   P
Sbjct: 189 RRRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMSQP 240

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +   A     +R A+R+ +  +P  A + +I  FF+  +   G N     D  +   
Sbjct: 241 GGAVNSAALKPTNSRQAKRLILSNIPASATDDSIVNFFNLQLN--GLNVIEQTDPCLLCN 298

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPTLAA 265
           I+ ++ FA +E R   +A+ A+ALDGI  +               +++RRP DY   + A
Sbjct: 299 ISPDRSFAMLEFRNNTDATVALALDGITMDADDHQANGNGAAATGLKIRRPKDY--IVPA 356

Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
            +      P+ ++ +  +        +GP+++ V  +P Y TE Q+ ELL SFG L  F 
Sbjct: 357 IVEDPNYDPDSSVPSTNVV-------DGPNKISVTNIPPYLTEDQVMELLVSFGKLKSFV 409

Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
            VKD  T   +G  F  Y D +VTD+A + LN + +G+K L V++A+             
Sbjct: 410 FVKDNGTQEPRGIAFLEYADSSVTDVAISGLNNMMLGEKALKVQKAS------------- 456

Query: 386 AQAQQHIAIQKMA--LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEIL 443
                 I I ++A  L  + M+ L G      +   +VL L   +TAD L D+++YEEI 
Sbjct: 457 ------IGITQVAGELSVNAMSMLAGTTPSDNDA-GRVLQLLNMVTADELMDNDDYEEIR 509

Query: 444 EDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNT 503
           +D++EEC K+G ++++ IPRP     ++ GVGK+F+++ +      A  AL+GRKF   T
Sbjct: 510 DDVQEECEKFGKILSLKIPRPVGGSRQSAGVGKIFIKFENHEAANKALRALAGRKFADRT 569

Query: 504 VNAFYYPEDKY 514
           V   Y+PE+ +
Sbjct: 570 VVTTYFPEENF 580


>gi|326475623|gb|EGD99632.1| splicing factor u2af large subunit [Trichophyton tonsurans CBS
           112818]
 gi|326483752|gb|EGE07762.1| splicing factor U2AF subunit [Trichophyton equinum CBS 127.97]
          Length = 565

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 133/442 (30%), Positives = 203/442 (45%), Gaps = 72/442 (16%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
           + +R + +D+ PP    +    A V G  P +P A  + A +      ++L AF   P  
Sbjct: 156 RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 209

Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
                  ++    +R ++R++   +PP   E  +  FF+  +   G N     D   +V 
Sbjct: 210 SGSSNNTLLKPSNSRQSKRLFAHNIPPSVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 267

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
           I+ + KFA +E  T  +A+ A+A DGI  E                  + + RP DY   
Sbjct: 268 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRESNGESNGEVKGLTIVRPKDYIVP 327

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
           L     P Q          G+ S  +   + P+++ V  +P +  E Q+  LL SFG L 
Sbjct: 328 LPTDEEPRQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 376

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DPA T IA   LNG+++GD+ L V RA+          
Sbjct: 377 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 428

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                           +Q +G++     MS+F +T +      +VL L   +TAD L D+
Sbjct: 429 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 473

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           E+YEEI ED++EEC KYG +  + IPRP     +  GVGK+++++        A  AL+G
Sbjct: 474 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYVKFDTPEAATKALQALAG 533

Query: 497 RKFGGNTVNAFYYPEDKYFNKD 518
           RKF   TV   Y+ E  Y N +
Sbjct: 534 RKFQDRTVVTTYFSEASYPNSN 555


>gi|358369529|dbj|GAA86143.1| splicing factor u2af large subunit [Aspergillus kawachii IFO 4308]
          Length = 571

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 127/439 (28%), Positives = 200/439 (45%), Gaps = 78/439 (17%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 175 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQP 226

Query: 163 ------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
                    +    +R A+R++V  +P     + +  FF+  +   G N     D  ++ 
Sbjct: 227 AGGNADTSTLKPSNSRQAKRLFVYNIPQTVTGETLLAFFNVQLN--GLNVIESVDPCISA 284

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNPTL 263
            +  +  FA +E ++  +A+ A+A DGI  E                + VRRP DY    
Sbjct: 285 QVAQDHSFALLEFKSPNDATVALAFDGIAMEEHEAAGNGAANGAAQGLEVRRPKDY---- 340

Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTL 321
              + PG        A      G +     + P+++ V  +P+Y  E  +  LL+SFG L
Sbjct: 341 ---IVPGG-------AEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTMLLKSFGEL 390

Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
             F LVKD  T  S+G  FC Y DP+ T IA   LNG+++GD+ L V RA+         
Sbjct: 391 KSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASIG------- 443

Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
                             Q +G++     MS+F +T +      +VL L   +T + L D
Sbjct: 444 ----------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMD 487

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
            E+YEEI +D+R+EC KYGT+V + +PRP     ++PGVGK+F+++        A  AL+
Sbjct: 488 PEDYEEICDDVRDECSKYGTVVELKVPRPTGGSRQSPGVGKIFVKFDTVESTTNALKALA 547

Query: 496 GRKFGGNTVNAFYYPEDKY 514
           GRKF   TV   Y+ E+ +
Sbjct: 548 GRKFSDRTVVTTYFSEENF 566


>gi|159123253|gb|EDP48373.1| splicing factor u2af large subunit [Aspergillus fumigatus A1163]
          Length = 567

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 131/439 (29%), Positives = 206/439 (46%), Gaps = 76/439 (17%)

Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           ++ +R + +D+ PP            + M P    P Q P  PS + +   N    G+  
Sbjct: 170 TRKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRL-QAFMNQSGGGSAD 228

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
             A        +    +R ARR++V  LPP  + + + + F+  +   G N     D  +
Sbjct: 229 NSA--------LKPSNSRQARRLFVYNLPPGVSSEHLVSLFNLQLN--GLNVIHHVDPCI 278

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPT 262
           +  I+ +  FA +E +T  +A+ A+A DGI  E               + VRRP DY   
Sbjct: 279 SAQISEDHSFALLEFKTPNDATVALAFDGITMEEHEPVSGAENGAPKGLEVRRPKDYIVP 338

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
             +A    Q    LN              + P+++ V  +P Y  E  +  LL+SFG L 
Sbjct: 339 NGSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIPEEPVTMLLKSFGELK 387

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DP+ T IA   LNG+++GD+ L V RA+          
Sbjct: 388 SFVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASIG-------- 439

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                            Q +G++     MS+F +T +      +VL L   +T + L D+
Sbjct: 440 ---------------MTQAAGLDMGVNAMSMFAKTTSQDLESSRVLQLLNMVTPEELLDN 484

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALS 495
           ++YEEI +D+REEC KYG ++++ +PRP     ++PGVGK++++ +D V  AT A  AL+
Sbjct: 485 DDYEEICDDVREECFKYGKVLDLKVPRPSGGSRQSPGVGKIYVK-FDTVEAATNALKALA 543

Query: 496 GRKFGGNTVNAFYYPEDKY 514
           GRKF   TV   Y+ E+ +
Sbjct: 544 GRKFSDRTVVTTYFSEENF 562


>gi|146324846|ref|XP_748978.2| splicing factor u2af large subunit [Aspergillus fumigatus Af293]
 gi|129556630|gb|EAL86940.2| splicing factor u2af large subunit [Aspergillus fumigatus Af293]
          Length = 563

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 131/439 (29%), Positives = 206/439 (46%), Gaps = 76/439 (17%)

Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           ++ +R + +D+ PP            + M P    P Q P  PS + +   N    G+  
Sbjct: 166 TRKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRL-QAFMNQSGGGSAD 224

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
             A        +    +R ARR++V  LPP  + + + + F+  +   G N     D  +
Sbjct: 225 NSA--------LKPSNSRQARRLFVYNLPPGVSSEHLVSLFNLQLN--GLNVIHHVDPCI 274

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPT 262
           +  I+ +  FA +E +T  +A+ A+A DGI  E               + VRRP DY   
Sbjct: 275 SAQISEDHSFALLEFKTPNDATVALAFDGITMEEHEPVSGAENGAPKGLEVRRPKDYIVP 334

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
             +A    Q    LN              + P+++ V  +P Y  E  +  LL+SFG L 
Sbjct: 335 NGSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIPEEPVTMLLKSFGELK 383

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DP+ T IA   LNG+++GD+ L V RA+          
Sbjct: 384 SFVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASIG-------- 435

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                            Q +G++     MS+F +T +      +VL L   +T + L D+
Sbjct: 436 ---------------MTQAAGLDMGVNAMSMFAKTTSQDLESSRVLQLLNMVTPEELLDN 480

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALS 495
           ++YEEI +D+REEC KYG ++++ +PRP     ++PGVGK++++ +D V  AT A  AL+
Sbjct: 481 DDYEEICDDVREECFKYGKVLDLKVPRPSGGSRQSPGVGKIYVK-FDTVEAATNALKALA 539

Query: 496 GRKFGGNTVNAFYYPEDKY 514
           GRKF   TV   Y+ E+ +
Sbjct: 540 GRKFSDRTVVTTYFSEENF 558


>gi|82540696|ref|XP_724646.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23479360|gb|EAA16211.1| splicing factor-like protein, putative [Plasmodium yoelii yoelii]
          Length = 714

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 114/380 (30%), Positives = 188/380 (49%), Gaps = 36/380 (9%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA---GPGDA----VVNVYI-NH 220
           +  +  R++Y+G LPP + ++ I  FF+  +++I   S+     GD     VV   I N 
Sbjct: 339 EGDKKQRKLYIGNLPPNSKQEEIVEFFNNTISSIIKGSSLEVKIGDVQLLPVVKCEIFNA 398

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           + +F F+E RT++     + LD + +    +R+ RP DY P       P +  P L +  
Sbjct: 399 DSRFCFLEFRTMDITWLCLKLDSMSYNNYCLRINRPHDYMP-------PPEGDPALTVVF 451

Query: 281 VGLASGAIGGAEGP------------DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
             +  G +   + P            +++++  LP+   + QI +LL  FG L GF+++K
Sbjct: 452 PDIDMGLLESFKPPKIAPVRSTGDDDNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNIIK 511

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D +TG +KGYGF  Y+D + T +A  ALNG   G   L V++AT +  S     S     
Sbjct: 512 DLNTGLNKGYGFFEYEDSSCTQVAIHALNGFVCGKNILNVKKATFNKNSNNAPNSNNIAL 571

Query: 389 QQHI---------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEY 439
             ++         +I +  L  S +          GE  ++V+ LT A+  + L  + +Y
Sbjct: 572 ANNVDVPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVIQLTNAVFQEDLIINSQY 631

Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
           +EIL+D++EE  KYG L ++VIP+P+ +   T GVGK+FL Y D      A+   +GR F
Sbjct: 632 DEILKDVKEEAEKYGPLQSIVIPKPNTDLSYTEGVGKIFLHYVDETAARKAQYMFNGRLF 691

Query: 500 GGNTVNAFYYPEDKYFNKDY 519
               V A +Y E+K+    Y
Sbjct: 692 EKRVVCASFYSEEKFLEGKY 711


>gi|121711505|ref|XP_001273368.1| splicing factor u2af large subunit [Aspergillus clavatus NRRL 1]
 gi|119401519|gb|EAW11942.1| splicing factor u2af large subunit [Aspergillus clavatus NRRL 1]
          Length = 583

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 130/437 (29%), Positives = 202/437 (46%), Gaps = 77/437 (17%)

Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           ++ +R + +D+ PP            + M P    P Q P  PS +        P G + 
Sbjct: 167 TRKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRLQAFMNQ--PGGGSA 224

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
             A        +    +R A+R++V  LPP  + + + +FF+  +   G N     D  +
Sbjct: 225 DNA-------ALKPSNSRQAKRLFVYNLPPGVSNEHLVSFFNLQLN--GLNVIHNVDPCI 275

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNP 261
           +  I+ +  FA +E ++  + + A+A DGI  E                + VRRP DY  
Sbjct: 276 SAQISEDHTFALLEFKSPNDTTVALAFDGITMEEHEAMGAGAENGASKGLEVRRPKDYVV 335

Query: 262 TLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTL 321
              +A    Q    LN              + P+++ V  +P Y  E  +  LL+SFG L
Sbjct: 336 PNGSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIPEEPVTMLLKSFGEL 384

Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
             F LVKD  T  S+G  FC Y DP+ T IA   LNG+++GD+ L V RA+         
Sbjct: 385 KSFVLVKDSSTEESRGIAFCEYADPSATPIAVEGLNGMELGDRHLKVVRASIG------- 437

Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
                             Q  G++     MS+F +T +      +VL L   +T + L D
Sbjct: 438 ----------------MTQAVGLDMGVNAMSMFAKTTSQDLESGRVLQLLNMVTPEELMD 481

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNAL 494
           +++YEEI +D+REEC KYG ++++ +PRP     ++PGVGK+F++ +D V  AT A  AL
Sbjct: 482 NDDYEEICDDVREECSKYGKVLDLKVPRPSGGSRQSPGVGKIFVK-FDTVESATNALKAL 540

Query: 495 SGRKFGGNTVNAFYYPE 511
           +GRKF   TV   Y+ E
Sbjct: 541 AGRKFSDRTVVTTYFAE 557


>gi|452979953|gb|EME79715.1| hypothetical protein MYCFIDRAFT_81194 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 552

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 145/461 (31%), Positives = 220/461 (47%), Gaps = 59/461 (12%)

Query: 77  ERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--AAVPGQ- 132
           ER+   +S S    + R  +  L+   S  + KRR + +D+ PP    +    A + G  
Sbjct: 126 ERQTARKSASPPPKKPREPTPDLTDVVSILERKRRLTQWDIKPPGYENVTAEQAKLSGMF 185

Query: 133 -LPGVPSAVPEMAQNMLPF---GATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANE 188
            LPG P   P   Q +  F      Q  +  L P       + R ++R+ +  +P  A +
Sbjct: 186 PLPGAPRQQPMDPQKLQAFMNQPGNQASSTALKP------SSARQSKRLLIHNIPAAATD 239

Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF-- 246
             I  FF+  +   G N     D  V+  I+ E  +A VE +T E+A+NAMA DGI    
Sbjct: 240 DNIVDFFNLQLN--GLNVTRGQDPCVSAQISKENGYALVEFKTPEDATNAMAFDGINMMP 297

Query: 247 ---------EGVA--VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
                     G A  ++++RP DY       + P       N +  G+ SG +   +  +
Sbjct: 298 DAMDTNGDSNGTAKGLQIKRPKDY-------IVPNVTDETENPS--GILSGVVPDTQ--N 346

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++ +  LP +  E QI+ELL SFG L  F LVKD  T  S+G  FC Y+DP+VT  A  +
Sbjct: 347 KISITNLPTFLGEDQIQELLNSFGELRNFVLVKDTSTEESRGIAFCEYKDPSVTKTAVES 406

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
           LNG+++GD  + V+ A+   Q          Q    +++  M+L          G    G
Sbjct: 407 LNGMELGDAAMKVKLASIGIQ----------QVPGEMSVNAMSLM--------AGTQAEG 448

Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
               +VLCL   IT + L D +E +EIL D++EE  KYG L++V +PRP     +  G+G
Sbjct: 449 TEKGRVLCLMNMITPEELMDADEADEILVDVKEEVSKYGPLLDVKMPRPTGGSRQNNGIG 508

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           K++L+Y      A A  AL+GRKF   TV   Y+ E+ YF+
Sbjct: 509 KIYLKYESPDSAAKALAALAGRKFADRTVVVTYFGEE-YFD 548


>gi|212539738|ref|XP_002150024.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
           18224]
 gi|210067323|gb|EEA21415.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
           18224]
          Length = 556

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 130/435 (29%), Positives = 207/435 (47%), Gaps = 66/435 (15%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
           + +R + +D+ PP    +    A + G  P +P A  + A +      ++L A    P  
Sbjct: 155 RKRRMTQWDIKPPGYDNVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQALVNQPSA 208

Query: 165 VMTQQAT------RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
             T+ +T      R A+R++   LPP   + A+ +FF+  +   G N     D  V+  I
Sbjct: 209 TTTESSTLRPANSRQAKRLFAYNLPPNVTDAALISFFNLQLN--GLNVIEGIDPCVSSQI 266

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAAL 267
           + +  FA +E +   EA+ A+ALDGI  E              + +RRP DY       +
Sbjct: 267 SKDHAFALLEFKGPNEATVALALDGISMEEHEAAATTNGGARGLELRRPKDY-------I 319

Query: 268 GPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
            P  P  +      G+ S  +   + P+++ V  +P Y  E  +  LL+S G L  F LV
Sbjct: 320 VPSSPE-DQQPYQEGVISNQV--PDSPNKLCVTNIPLYIPEEPVTMLLKSIGELRAFVLV 376

Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           KD  T  S+G  FC Y D   T IA  +LNG+++GDK L +  A+               
Sbjct: 377 KDSGTDESRGIAFCEYVDATATAIAVESLNGMELGDKHLKITHASIG------------- 423

Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEE 441
                       Q +G++     MS+F +T +      +VL L   +TAD L ++E+YEE
Sbjct: 424 ----------VTQAAGLDMGVNAMSMFAKTTSADLETTRVLQLLNMVTADELINNEDYEE 473

Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
           ILED+++EC KYG ++++ IPRP     ++ GVGK+F+++        A  AL+GRKF  
Sbjct: 474 ILEDVQDECSKYGQVLDLKIPRPAGGSRQSAGVGKIFVKFDTVESATNALKALAGRKFSD 533

Query: 502 NTVNAFYYPEDKYFN 516
            TV   Y+PE  + +
Sbjct: 534 RTVVTTYFPEVSFLS 548


>gi|315044445|ref|XP_003171598.1| splicing factor U2AF subunit [Arthroderma gypseum CBS 118893]
 gi|311343941|gb|EFR03144.1| splicing factor U2AF subunit [Arthroderma gypseum CBS 118893]
          Length = 565

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 131/438 (29%), Positives = 202/438 (46%), Gaps = 72/438 (16%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
           + +R + +D+ PP    +    A V G  P +P A  + A +      ++L AF   P  
Sbjct: 165 RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 218

Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
                  ++    +R ++R++   +PP   E  +  FF+  +   G N     D   +V 
Sbjct: 219 SGSNTNTLLKPSNSRQSKRLFTHNIPPSVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 276

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
           I+ + KFA +E  T  +A+ A+A DGI  E                  + + RP DY   
Sbjct: 277 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRENNGESNGEVKGLSIIRPKDYIVP 336

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
           L     P Q          G+ S  +   + P+++ V  +P +  E Q+  LL SFG L 
Sbjct: 337 LPTDEEPHQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 385

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DPA T IA   LNG+++GD+ L V RA+          
Sbjct: 386 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 437

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                           +Q +G++     MS+F +T +      +VL L   +TAD L D+
Sbjct: 438 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 482

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           E+YEEI ED++EEC KYG +  + IPRP     +  GVGK+++++        A  AL+G
Sbjct: 483 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYIKFDTPESATKALQALAG 542

Query: 497 RKFGGNTVNAFYYPEDKY 514
           RKF   TV   Y+ E+ +
Sbjct: 543 RKFQDRTVVTTYFSEENF 560


>gi|281207514|gb|EFA81697.1| RNA-binding region RNP-1 domain-containing protein [Polysphondylium
           pallidum PN500]
          Length = 682

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 116/368 (31%), Positives = 181/368 (49%), Gaps = 32/368 (8%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           VQ  +    + +RR+YVG +PP   E  I  FF+  + A    +  PG  V+   I   K
Sbjct: 287 VQSASAALAKQSRRLYVGNIPPNVTEAQIVEFFNAAIIAAALTTK-PGQPVLLCQITTGK 345

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+ EEA+  M LDGI   G ++++RRP DY        G  +P P       G
Sbjct: 346 SFAFIEFRSSEEATLGMGLDGISLSGYSLKIRRPKDYQS------GSNEPMP------TG 393

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           L+  +    +  +++F+GGLP    E QIK +L + G L  F+LVKD  TG SKG+ FC 
Sbjct: 394 LSIVSTNVPDSENKIFLGGLPPTLNEEQIKSMLSAIGRLKAFNLVKDTKTGISKGFAFCE 453

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           + DP  TD ACA LNG K GDK+L V++A+   ++     +     Q      K+ + +S
Sbjct: 454 FLDPENTDKACAELNGTKFGDKSLLVQKASLGKEAIANNNNNNNNNQSLNNPSKVKVDSS 513

Query: 403 GMNTLGGGMSLFGETL---------------AKVLCLTEAITADALADDEEYEEILEDMR 447
            +++L    +   + L               ++V+ L      + + DD  YE +L D +
Sbjct: 514 -VSSLLNLTTALPQVLGAIRSNVSSDNNSKPSRVVQLLNMTDKEEIQDDNNYENLLLDTK 572

Query: 448 EECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAF 507
           + C ++G + ++ I RP  +  +   + KVF+ +        A   L GRK+   T+   
Sbjct: 573 DACEEFGEIESIFISRPKDSPLD---IIKVFVCFSQLESAQKAWVGLGGRKYNYRTIITA 629

Query: 508 YYPEDKYF 515
           +YPED Y 
Sbjct: 630 FYPEDLYI 637


>gi|221483471|gb|EEE21790.1| U2 snRNP auxiliary factor, putative [Toxoplasma gondii GT1]
 gi|221507941|gb|EEE33528.1| U2 snRNP splicing factor, putative [Toxoplasma gondii VEG]
          Length = 553

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 184/366 (50%), Gaps = 28/366 (7%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS-------AGPGDAVVNV----YINH 220
           R  +R+YVG LPP + +  +  FF+  + A+   +       A  G+ ++ V      N 
Sbjct: 185 RKQKRLYVGNLPPGSTQPDVVGFFNGALLAVNAQTGFVKEDEATAGEQLLPVERCEVFNE 244

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
             +F F+E+R  + A   + LDGI + G ++RV RP DY P        G P+    +  
Sbjct: 245 SSRFCFIELRNEQYAILCVKLDGITYNGYSLRVGRPHDYVPPPG-----GDPAHQAYIPL 299

Query: 281 VGLASGAIGGAE---------GPD-RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDR 330
           +  A       +         GPD ++++  LP    E Q+++LLE FGTL   +L+++ 
Sbjct: 300 LDDAKKVKREEKREKPSRPETGPDNKIYIQNLPPEMGEEQVRDLLEQFGTLRVLNLIRNV 359

Query: 331 DTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-EQESILAQAQ 389
            TG  KGYGF  Y+DP VTD A  ALNG   G   L+V+RA  SG   T E    +A   
Sbjct: 360 QTGQHKGYGFFEYEDPEVTDQAILALNGFVCGANMLSVQRANFSGSVVTRETNRTMAVTS 419

Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
              ++ +  L    +          GE  +KV+ L   +  + L D +EYE I +D+++E
Sbjct: 420 LPNSMTQKLLSDPLVAVQVQAARKIGERPSKVVQLLNCVYQEDLIDPKEYEAICDDIKQE 479

Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFY 508
             K+G L  V++P+P+++     GVGKVFL Y D      A+  L+GR+F  N  V A +
Sbjct: 480 AEKHGALEEVLVPKPNEDLSYREGVGKVFLRYSDVTAARKAQLMLNGRRFDSNRVVCAAF 539

Query: 509 YPEDKY 514
           +PE+K+
Sbjct: 540 FPEEKF 545


>gi|116192501|ref|XP_001222063.1| hypothetical protein CHGG_05968 [Chaetomium globosum CBS 148.51]
 gi|88181881|gb|EAQ89349.1| hypothetical protein CHGG_05968 [Chaetomium globosum CBS 148.51]
          Length = 566

 Score =  184 bits (468), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 130/430 (30%), Positives = 208/430 (48%), Gaps = 59/430 (13%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   T+L AF   P
Sbjct: 169 RRRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAPRQ-----QAMDP---TKLQAFMNQP 220

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +   A     +R ++R+ +  LP    ++++  FF+  +   G N     D  +  +
Sbjct: 221 GGAVNSAALKPSNSRQSKRLIISNLPASVTDESLTNFFNLQLN--GLNVIETADPCLQAH 278

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE----------GVA---VRVRRPTDYNPTLA 264
           I  E+ FA VE R   +A+ A+ALDGI  E          G A   + +RRP DY   + 
Sbjct: 279 IAAERAFAMVEFRNNTDATVALALDGISMEADDAHAANGNGTAPQGLHIRRPKDY--IVP 336

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
           A +      PN +  +   +S  +   + P+++ V  LP Y TE Q+ ELL SFG L  F
Sbjct: 337 AVV----EDPNYDPDSDRPSSVVV---DSPNKISVTNLPLYLTEDQVMELLVSFGKLKSF 389

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI 384
            LVKD  T  S+G  F  Y DP VT +A   L+ + +G++ L V++A+            
Sbjct: 390 VLVKDNGTEESRGIAFLEYADPGVTTVAIQGLHNMMLGERALKVQKASIG---------- 439

Query: 385 LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE 444
           + Q    + +  M++     +   G          +VL L   +T D L D+++YEEI +
Sbjct: 440 ITQVSGEMGVNAMSMLAGTTSADAGA--------GRVLQLLNMVTPDELMDNDDYEEIRD 491

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D++EEC K+G ++++ IPRP     ++ GVGK+F+++        A  AL+GRKF   TV
Sbjct: 492 DVQEECEKFGKILSIKIPRPAGGSRQSAGVGKIFIKFETPETATKALQALAGRKFADRTV 551

Query: 505 NAFYYPEDKY 514
              Y+PE+ +
Sbjct: 552 VTTYFPEENF 561


>gi|317029342|ref|XP_001391373.2| splicing factor u2af large subunit [Aspergillus niger CBS 513.88]
          Length = 561

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 126/439 (28%), Positives = 200/439 (45%), Gaps = 78/439 (17%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 165 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQP 216

Query: 163 ------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
                    +    +R A+R++V  +P     + +  FF+  +   G N     D  ++ 
Sbjct: 217 AGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAFFNVQLN--GLNVIQSVDPCISA 274

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNPTL 263
            +  +  FA +E ++  +A+ A+A DGI  E                + VRRP DY    
Sbjct: 275 QVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGNGAANGAAQGLEVRRPKDY---- 330

Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTL 321
              + PG        A      G +     + P+++ V  +P+Y  E  +  LL+SFG L
Sbjct: 331 ---IVPGG-------AEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTMLLKSFGEL 380

Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
             F LVKD  T  S+G  FC Y DP+ T IA   LNG+++GD+ L V RA+         
Sbjct: 381 KSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASIG------- 433

Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
                             Q +G++     MS+F +T +      +VL L   +T + L D
Sbjct: 434 ----------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMD 477

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
            E+Y+EI +D+R+EC KYGT+V + +PRP     ++PGVGK+F+++        A  AL+
Sbjct: 478 PEDYDEICDDVRDECSKYGTVVELKVPRPTGGSRQSPGVGKIFVKFDTVESTTNALKALA 537

Query: 496 GRKFGGNTVNAFYYPEDKY 514
           GRKF   TV   Y+ E+ +
Sbjct: 538 GRKFSDRTVVTTYFSEENF 556


>gi|119189253|ref|XP_001245233.1| hypothetical protein CIMG_04674 [Coccidioides immitis RS]
 gi|392868136|gb|EAS33879.2| U2 snRNP auxilliary factor, large subunit, splicing factor
           [Coccidioides immitis RS]
          Length = 545

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 126/432 (29%), Positives = 211/432 (48%), Gaps = 66/432 (15%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP-- 162
           + +R + +D+ PP    +    A + G  P +P A  + A +      ++L AF   P  
Sbjct: 151 RKRRLTQWDIKPPGYENVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQAFMNQPGG 204

Query: 163 ---VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
                ++    +R A+R++V  L P  +E +IA FF+  +   G N     D  V+  ++
Sbjct: 205 NASNTLLKPSNSRQAKRLFVYNLSPSLSEDSIAQFFNLQLN--GLNVVSGVDPCVSAQLS 262

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAALG 268
            +  FA +E +T  +A+ A+A DG+  E              + +RRP DY       + 
Sbjct: 263 TDGTFALLEFKTAADATVALAFDGVSMEPDDANGHTNGSSQGLSIRRPKDY-------IV 315

Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
           P +   +      G+ S  +   + P+++ V  +P +  E Q+  LL SFG L  F LVK
Sbjct: 316 PSETDDSNRQE--GVVSNEV--PDSPNKICVTNIPPFIQEEQVTMLLVSFGELKSFVLVK 371

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D  T  S+G  FC Y DP+ T+IA   LNG+++GDK L V RA+                
Sbjct: 372 DSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRASIG-------------- 417

Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEI 442
                    A Q +G++     MS+F +T +      +VL L   +TA+ L D+++YEEI
Sbjct: 418 ---------ATQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTAEELMDNDDYEEI 468

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+R+EC KYG ++ + +PRP     ++ GVGK+++++ +      A  AL+GRKF   
Sbjct: 469 CDDVRDECSKYGQILEMKVPRPTGGSRQSAGVGKIYVKFDNYESAYKAMKALAGRKFQDR 528

Query: 503 TVNAFYYPEDKY 514
           TV   ++ E+ +
Sbjct: 529 TVVTTFFSEENF 540


>gi|400602736|gb|EJP70338.1| splicing factor U2AF 65 kDa subunit [Beauveria bassiana ARSEF 2860]
          Length = 576

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 158/521 (30%), Positives = 246/521 (47%), Gaps = 87/521 (16%)

Query: 33  RDRHHRDFKSGGDDRRRDKNYKYDR-------------EGIRDHDRTDRHRD-------- 71
           RDR   +  +GG DRR D+ +  DR              G RD D  D +R         
Sbjct: 99  RDREREERYTGGRDRRGDREWDRDRGSSRRDARRDDDGHGRRDRDGFDENRRGGRDRRDD 158

Query: 72  -YNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA--AAMLPGA 127
            + R +ERR      S S  + R  +  L+      + KRR + +D+ PP   A     A
Sbjct: 159 GFARQQERR------SPSPPKRREPTPDLTDVIPVLERKRRMTQWDIKPPGYEAVTSEQA 212

Query: 128 AVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPV--QV----MTQQATRHARRVYV 179
            + G   LPG P       Q M P   T+L AF   P   QV    +    +R A+R+ V
Sbjct: 213 KMSGMFPLPGAPRQ-----QQMDP---TKLQAFMNQPAGGQVSSAGLKASNSRQAKRLLV 264

Query: 180 GGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAM 239
             LP    + A+  FF+  +  +   SA   D      ++++K FA +E +   +A+ A+
Sbjct: 265 SNLPSGTTDDALVAFFNLQLNGLNVISAT--DPCALSQLSNDKSFAVLEFKNTSDATVAL 322

Query: 240 ALDGIIFE--GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD-- 295
           ALDGI  E  G  + +RRP DY   + A        P+  +    + S ++     PD  
Sbjct: 323 ALDGISMEANGPGLSIRRPKDY--VMPAV-------PDDIMYNPDVVSDSV-----PDTI 368

Query: 296 -RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP-AVTDIAC 353
            ++ +  LP + TE Q+ ELL +FG    F LVKDR T  S+G  F  Y +P +  + A 
Sbjct: 369 HKLSITNLPPFLTEEQVLELLAAFGKPKAFVLVKDRTTEESRGIAFAEYAEPGSANEAAL 428

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
            AL+G+ +G K L + +A   G      ++ +       AI  +A Q +G          
Sbjct: 429 KALSGMDVGGKPLKITKACIGGTQVANFDAGIN------AISNLAGQGNG---------- 472

Query: 414 FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
            GE   +VL L   +TA+ L D+++YEEI +D+R+EC KYG ++++ +PRP     ++ G
Sbjct: 473 -GEA-TRVLQLLNMVTAEELLDNDDYEEICDDVRDECSKYGKILDLKVPRPAGGSRQSAG 530

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           VG++F+++       +A  AL+GRKF   TV   Y+PE+ +
Sbjct: 531 VGRIFVKFESVDATTSALKALAGRKFADRTVVTTYFPEENF 571


>gi|378731414|gb|EHY57873.1| U2AF domain-containing protein (UHM) kinase 1 [Exophiala
           dermatitidis NIH/UT8656]
          Length = 574

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 133/439 (30%), Positives = 203/439 (46%), Gaps = 79/439 (17%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P       Q M P   ++L AF  +P
Sbjct: 174 RKRRLTQWDIKPPGYENVTAEQAKMSGMFPLPGAPRQ-----QQMDP---SRLQAFMNLP 225

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
                  A     +R ARR++V  LP   +E+A+  FF+  +   G N     D      
Sbjct: 226 SSSSNNTALKPSNSRQARRLFVHNLPASVSEEALVQFFNLQLN--GLNVTKAVDPCAQAN 283

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-------------------VAVRVRRPTD 258
           I  ++ FA VE +   +A+ A+ALDGI                         + +RRP D
Sbjct: 284 IAEDRSFALVEFKNASDATLALALDGITMPEHHSEMNGNGDANGNGTAAPKGLEIRRPKD 343

Query: 259 YNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESF 318
           Y            PS +    A G  S  +   +  +++ V  LP + T+ Q+ ELL++F
Sbjct: 344 Y----------IVPSADEATYAEGEISSEV--PDTANKLAVTNLPPFLTDDQVIELLKAF 391

Query: 319 GTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
           G +  F LV++ D+  S+G  FC Y DPA T +A   LNG+ +   ++ V RA+   Q  
Sbjct: 392 GEVKAFVLVREPDSQESRGIAFCEYADPASTAVAIEGLNGMDLAGNSIKVTRASIGYQ-- 449

Query: 379 TEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADA 432
                                Q +G++     MS+F  T +      +VL L   +T + 
Sbjct: 450 ---------------------QAAGLDMGVNAMSMFAGTTSDAHDEGRVLQLLNMVTPED 488

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
           L D+++YEEI ED+ EEC KYG ++++ IPRP     ++ GVGK+FL+Y DA     A  
Sbjct: 489 LMDNDDYEEICEDVMEECSKYGKILSMKIPRPSGGSRQSAGVGKIFLKYEDAESAKKALQ 548

Query: 493 ALSGRKFGGNTVNAFYYPE 511
           AL+GRKF   TV   Y+ E
Sbjct: 549 ALAGRKFADRTVVTTYFDE 567


>gi|320031290|gb|EFW13263.1| splicing factor u2af large subunit [Coccidioides posadasii str.
           Silveira]
          Length = 545

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 126/432 (29%), Positives = 211/432 (48%), Gaps = 66/432 (15%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP-- 162
           + +R + +D+ PP    +    A + G  P +P A  + A +      ++L AF   P  
Sbjct: 151 RKRRLTQWDIKPPGYENVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQAFMNQPGG 204

Query: 163 ---VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
                ++    +R A+R++V  L P  +E +IA FF+  +   G N     D  V+  ++
Sbjct: 205 NASNTLLKPSNSRQAKRLFVYNLSPSLSEDSIAQFFNLQLN--GLNVVSGVDPCVSAQLS 262

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAALG 268
            +  FA +E +T  +A+ A+A DG+  E              + +RRP DY       + 
Sbjct: 263 TDGTFALLEFKTAADATVALAFDGVSMEPDDANGHTNGSSQGLSIRRPKDY-------IV 315

Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
           P +   +      G+ S  +   + P+++ V  +P +  E Q+  LL SFG L  F LVK
Sbjct: 316 PSETDDSNRQE--GVVSNEV--PDSPNKICVTNIPPFIQEEQVTMLLVSFGELKSFILVK 371

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D  T  S+G  FC Y DP+ T+IA   LNG+++GDK L V RA+                
Sbjct: 372 DSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRASIG-------------- 417

Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEI 442
                    A Q +G++     MS+F +T +      +VL L   +TA+ L D+++YEEI
Sbjct: 418 ---------ATQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTAEELMDNDDYEEI 468

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+R+EC KYG ++ + +PRP     ++ GVGK+++++ +      A  AL+GRKF   
Sbjct: 469 CDDVRDECSKYGQVLEMKVPRPTGGSRQSAGVGKIYVKFDNYESAYKAMKALAGRKFQDR 528

Query: 503 TVNAFYYPEDKY 514
           TV   ++ E+ +
Sbjct: 529 TVVTTFFSEENF 540


>gi|237839189|ref|XP_002368892.1| U2 snRNP auxiliary factor or splicing factor, putative [Toxoplasma
           gondii ME49]
 gi|211966556|gb|EEB01752.1| U2 snRNP auxiliary factor or splicing factor, putative [Toxoplasma
           gondii ME49]
          Length = 553

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 117/366 (31%), Positives = 184/366 (50%), Gaps = 28/366 (7%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS-------AGPGDAVVNV----YINH 220
           R  +R+YVG LPP + +  +  FF+  + A+   +       A  G+ ++ V      N 
Sbjct: 185 RKQKRLYVGNLPPGSTQPDVVGFFNGALLAVNAQTGFVKEDEATAGEQLLPVERCEVFNE 244

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
             +F F+E+R  + A   + LDGI + G ++RV RP DY P        G P+    +  
Sbjct: 245 SSRFCFIELRNEQYAILCVKLDGITYNGYSLRVGRPHDYVPPPG-----GDPAHQAYIPL 299

Query: 281 VGLASGAIGGAE---------GP-DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDR 330
           +  A       +         GP +++++  LP    E Q+++LLE FGTL   +L+++ 
Sbjct: 300 LDDAKKVKREEKREKPSRPETGPNNKIYIQNLPPEMGEEQVRDLLEQFGTLRVLNLIRNV 359

Query: 331 DTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-EQESILAQAQ 389
            TG  KGYGF  Y+DP VTD A  ALNG   G   L+V+RA  SG   T E    +A   
Sbjct: 360 QTGQHKGYGFFEYEDPEVTDQAILALNGFVCGANMLSVQRANFSGSVVTRETNRTMAVTS 419

Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
              ++ +  L    +          GE  +KV+ L   +  + L D +EYE I +D+++E
Sbjct: 420 LPNSMTQKLLSDPLVAVQVQAARKIGERPSKVVQLLNCVYQEDLIDPKEYEAICDDIKQE 479

Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFY 508
             K+G L  V++P+P+++     GVGKVFL Y D      A+  L+GR+F  N  V A +
Sbjct: 480 AEKHGALEEVLVPKPNEDLSYREGVGKVFLRYSDVTAARKAQLMLNGRRFDSNRVVCAAF 539

Query: 509 YPEDKY 514
           +PE+K+
Sbjct: 540 FPEEKF 545


>gi|296811258|ref|XP_002845967.1| splicing factor U2AF subunit [Arthroderma otae CBS 113480]
 gi|238843355|gb|EEQ33017.1| splicing factor U2AF subunit [Arthroderma otae CBS 113480]
          Length = 557

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 129/438 (29%), Positives = 203/438 (46%), Gaps = 72/438 (16%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
           + +R + +D+ PP    +    A + G  P +P A  + A +      ++L AF   P  
Sbjct: 157 RKRRLTQWDIKPPGYENVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQAFINPPTA 210

Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
                  ++    +R ++R++   +PP   E  +  FF+  +   G N     D   +V 
Sbjct: 211 SGSSNNTLLKPSNSRQSKRLFAHNIPPSVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 268

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
           I+ + KFA +E  T  +A+ A+A DGI  E                  + + RP DY   
Sbjct: 269 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANQESNGESNGQVKGLSIVRPKDYIVP 328

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
           L     P Q          G+ S  +   + P+++ V  +P +  E Q+  LL SFG L 
Sbjct: 329 LPTEEEPRQE---------GVLSSNV--PDSPNKICVSNIPPFIQEDQVTMLLISFGELK 377

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DPA T IA   LNG+++GD+ L V RA+          
Sbjct: 378 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 429

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                           +Q +G++     MS+F +T +      +VL L   +TAD L D+
Sbjct: 430 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 474

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           ++YEEI ED+++EC KYG +  + IPRP     +  GVGK+++++  A     A  AL+G
Sbjct: 475 DDYEEICEDVQDECSKYGVVEELKIPRPSGGSRQAAGVGKIYVKFDTAESATKALQALAG 534

Query: 497 RKFGGNTVNAFYYPEDKY 514
           RKF   TV   Y+ E+ +
Sbjct: 535 RKFQDRTVVTTYFSEENF 552


>gi|350635494|gb|EHA23855.1| hypothetical protein ASPNIDRAFT_209800 [Aspergillus niger ATCC
           1015]
          Length = 566

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 126/436 (28%), Positives = 198/436 (45%), Gaps = 78/436 (17%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 150 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQP 201

Query: 163 ------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
                    +    +R A+R++V  +P     + +  FF+  +   G N     D  ++ 
Sbjct: 202 AGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAFFNVQLN--GLNVIQSVDPCISA 259

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNPTL 263
            +  +  FA +E ++  +A+ A+A DGI  E                + VRRP DY    
Sbjct: 260 QVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGNGAANGAAQGLEVRRPKDY---- 315

Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTL 321
              + PG        A      G +     + P+++ V  +P+Y  E  +  LL+SFG L
Sbjct: 316 ---IVPGG-------AEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTMLLKSFGEL 365

Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
             F LVKD  T  S+G  FC Y DP+ T IA   LNG+++GD+ L V RA+         
Sbjct: 366 KSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASIG------- 418

Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
                             Q +G++     MS+F +T +      +VL L   +T + L D
Sbjct: 419 ----------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMD 462

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
            E+Y+EI +D+R+EC KYGT+V + +PRP     ++PGVGK+F+++        A  AL+
Sbjct: 463 PEDYDEICDDVRDECSKYGTVVELKVPRPTGGSRQSPGVGKIFVKFDTVESTTNALKALA 522

Query: 496 GRKFGGNTVNAFYYPE 511
           GRKF   TV   Y+ E
Sbjct: 523 GRKFSDRTVVTTYFSE 538


>gi|303323229|ref|XP_003071606.1| splicing factor, putative [Coccidioides posadasii C735 delta SOWgp]
 gi|240111308|gb|EER29461.1| splicing factor, putative [Coccidioides posadasii C735 delta SOWgp]
          Length = 545

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 126/432 (29%), Positives = 210/432 (48%), Gaps = 66/432 (15%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMP-- 162
           + +R + +D+ PP    +    A + G  P +P A  + A +      ++L AF   P  
Sbjct: 151 RKRRLTQWDIKPPGYENVTAEQAKLSGMFP-LPGAPRQQAVD-----PSRLQAFMNQPGG 204

Query: 163 ---VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
                ++    +R A+R++V  L P  +E +IA FF+  +   G N     D  V+  ++
Sbjct: 205 NASNTLLKPSNSRQAKRLFVYNLSPSLSEDSIAQFFNLQLN--GLNVVSGVDPCVSAQLS 262

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFE-----------GVAVRVRRPTDYNPTLAAALG 268
            +  FA +E +T  +A+ A+A DG+  E              + +RRP DY       + 
Sbjct: 263 TDGTFALLEFKTAADATVALAFDGVSMEPDDANGHTNGSSQGLSIRRPKDY-------IV 315

Query: 269 PGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
           P +   +      G+ S  +   + P ++ V  +P +  E Q+  LL SFG L  F LVK
Sbjct: 316 PSETDDSNRQE--GVVSNEV--PDSPSKICVTNIPPFIQEEQVTMLLVSFGELKSFILVK 371

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           D  T  S+G  FC Y DP+ T+IA   LNG+++GDK L V RA+                
Sbjct: 372 DSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRASIG-------------- 417

Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEI 442
                    A Q +G++     MS+F +T +      +VL L   +TA+ L D+++YEEI
Sbjct: 418 ---------ATQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTAEELMDNDDYEEI 468

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+R+EC KYG ++ + +PRP     ++ GVGK+++++ +      A  AL+GRKF   
Sbjct: 469 CDDVRDECSKYGQVLEMKVPRPTGGSRQSAGVGKIYVKFDNYESAYKAMKALAGRKFQDR 528

Query: 503 TVNAFYYPEDKY 514
           TV   ++ E+ +
Sbjct: 529 TVVTTFFSEENF 540


>gi|134075845|emb|CAL00224.1| unnamed protein product [Aspergillus niger]
          Length = 598

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 126/436 (28%), Positives = 198/436 (45%), Gaps = 78/436 (17%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 182 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQP 233

Query: 163 ------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
                    +    +R A+R++V  +P     + +  FF+  +   G N     D  ++ 
Sbjct: 234 AGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAFFNVQLN--GLNVIQSVDPCISA 291

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDGIIFE-------------GVAVRVRRPTDYNPTL 263
            +  +  FA +E ++  +A+ A+A DGI  E                + VRRP DY    
Sbjct: 292 QVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGNGAANGAAQGLEVRRPKDY---- 347

Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTL 321
              + PG        A      G +     + P+++ V  +P+Y  E  +  LL+SFG L
Sbjct: 348 ---IVPGG-------AEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTMLLKSFGEL 397

Query: 322 HGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
             F LVKD  T  S+G  FC Y DP+ T IA   LNG+++GD+ L V RA+         
Sbjct: 398 KSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASIG------- 450

Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALAD 435
                             Q +G++     MS+F +T +      +VL L   +T + L D
Sbjct: 451 ----------------MTQAAGLDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEELMD 494

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
            E+Y+EI +D+R+EC KYGT+V + +PRP     ++PGVGK+F+++        A  AL+
Sbjct: 495 PEDYDEICDDVRDECSKYGTVVELKVPRPTGGSRQSPGVGKIFVKFDTVESTTNALKALA 554

Query: 496 GRKFGGNTVNAFYYPE 511
           GRKF   TV   Y+ E
Sbjct: 555 GRKFSDRTVVTTYFSE 570


>gi|398403643|ref|XP_003853288.1| hypothetical protein MYCGRDRAFT_100024 [Zymoseptoria tritici
           IPO323]
 gi|339473170|gb|EGP88264.1| hypothetical protein MYCGRDRAFT_100024 [Zymoseptoria tritici
           IPO323]
          Length = 544

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 148/462 (32%), Positives = 217/462 (46%), Gaps = 79/462 (17%)

Query: 93  RNRSKSLSPSRSP--------------SKSKRRSGFDMAPPAAAMLPG--AAVPGQ--LP 134
           +++ KS SP R P               K +R + +D+ PP    +    A + G   LP
Sbjct: 120 QSKRKSASPPRKPKEPTPDLTDIVPILEKPRRMTQWDVKPPGYENVTAEQAKLSGMFPLP 179

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
           G P  VP M  N L     Q G+        +   + R ++R+ V  LP  A + ++  F
Sbjct: 180 GAPR-VPAMDANRLKEFMAQPGS--QANTSALKPSSARQSKRLLVYNLPASATDDSLMDF 236

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG------ 248
           F+  +   G N     D  ++  I+    +A +E +T E+A+NAMA+DGI  E       
Sbjct: 237 FNLQLN--GLNVTKGADPCISANISQGNGYALLEFKTPEDATNAMAMDGIKMEADVDMGN 294

Query: 249 -------VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGG 301
                    + ++RP DY       + P       N +  GL S  +   +  +++ +  
Sbjct: 295 GESNGTSKGLEIKRPKDY-------IVPTVSDETENTS--GLFSSIVPDTQ--NKISITN 343

Query: 302 LPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP-AVTDIACAALNGLK 360
           +P Y  E Q+ ELL SFG L  F LVKD+ T  S+G  F  Y+DP + T IA  ALNG+ 
Sbjct: 344 IPVYLQEEQVVELLTSFGQLKNFVLVKDKSTEESRGIAFVEYKDPDSTTKIALEALNGMD 403

Query: 361 MGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA- 419
           +GD  L V+ A+                   I IQ    Q SG  T+G  M L   T + 
Sbjct: 404 LGDAALKVKLAS-------------------IGIQ----QVSGEMTVGA-MGLIAGTKST 439

Query: 420 -----KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
                +VLCL   IT + L D +E +EILED++EEC KYG L+ V +PRP     +  G+
Sbjct: 440 DADNGRVLCLMNMITPEELMDADEADEILEDVKEECAKYGELMEVKMPRPTGGSRQNNGI 499

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           GK++L+Y      A A  AL+GRKF   TV   Y+ E+ YF+
Sbjct: 500 GKIYLKYKAPDSAAKALGALAGRKFADRTVVVTYFGEE-YFD 540


>gi|237838479|ref|XP_002368537.1| RNA binding motif-containing protein [Toxoplasma gondii ME49]
 gi|211966201|gb|EEB01397.1| RNA binding motif-containing protein [Toxoplasma gondii ME49]
 gi|221505828|gb|EEE31473.1| RNA binding motif-containing protein, putative [Toxoplasma gondii
           VEG]
          Length = 816

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 93/230 (40%), Positives = 137/230 (59%), Gaps = 23/230 (10%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +RHAR+VYVG LP    +  +  +F++++T +      PGD +V+VY+N  ++FAF+E R
Sbjct: 275 SRHARKVYVGNLPVPVTQAEVQQYFNELLTTLLPKKV-PGDTIVHVYVNPSRRFAFLEHR 333

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA--------AALG---------PGQPS 273
           ++EEA+  + LDG+ +   A+ +RRP DYNPTLA        A LG         P Q +
Sbjct: 334 SIEEANFTLGLDGVSWRNCALSLRRPQDYNPTLAEQQYREERARLGSMTGFAVPPPSQAA 393

Query: 274 PNLNLAAVGLASGAIGGA-----EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
              + A   L +GA+G       + P ++F+GGLP+  TE   K+LLE+FG L    +VK
Sbjct: 394 TPASPAESSLIAGALGIVSTTVPDSPHKIFIGGLPHSITEQGCKQLLEAFGQLRALHVVK 453

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
           D+  G+ KG+ FC Y DP VTD+A A LN +++ D+ L VRRA   GQ K
Sbjct: 454 DQQRGDCKGFAFCEYLDPNVTDVAVAGLNNMRIADRVLQVRRAMPHGQMK 503


>gi|302511201|ref|XP_003017552.1| hypothetical protein ARB_04434 [Arthroderma benhamiae CBS 112371]
 gi|291181123|gb|EFE36907.1| hypothetical protein ARB_04434 [Arthroderma benhamiae CBS 112371]
          Length = 501

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 131/442 (29%), Positives = 203/442 (45%), Gaps = 72/442 (16%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
           + +R + +D+ PP    +    A V G  P +P A  + A +      ++L AF   P  
Sbjct: 94  RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 147

Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
                  ++    +R ++R++   +PP   E  +  FF+  +   G N     D   +V 
Sbjct: 148 SGSSNNTLLKPSNSRQSKRLFAHNIPPNVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 205

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
           I+ + KFA +E  T  +A+ A+A DGI  E                  + + RP DY   
Sbjct: 206 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRESNGESNGEVKGLTIVRPKDYIVP 265

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
           +     P Q          G+ S  +   + P+++ V  +P +  E Q+  LL SFG L 
Sbjct: 266 IPTDEEPRQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 314

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DPA T IA   LNG+++GD+ L V RA+          
Sbjct: 315 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 366

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                           +Q +G++     MS+F +T +      +VL L   +TAD L D+
Sbjct: 367 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 411

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           E+YEEI ED++EEC KYG +  + IPRP     +  GVGK+++++        A  AL+G
Sbjct: 412 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYVKFDSPESATKALQALAG 471

Query: 497 RKFGGNTVNAFYYPEDKYFNKD 518
           RKF   TV   Y+ E  + N +
Sbjct: 472 RKFQDRTVVTTYFSEASHPNSN 493


>gi|119482894|ref|XP_001261475.1| splicing factor u2af large subunit [Neosartorya fischeri NRRL 181]
 gi|119409630|gb|EAW19578.1| splicing factor u2af large subunit [Neosartorya fischeri NRRL 181]
          Length = 563

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 133/443 (30%), Positives = 208/443 (46%), Gaps = 84/443 (18%)

Query: 106 SKSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
           ++ +R + +D+ PP    +    A + G   LPG P           P   ++L AF   
Sbjct: 166 TRKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSRLQAFMNQ 217

Query: 162 P------VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
           P         +    +R ARR++V  LP + + + + +FF+  +   G N     D  ++
Sbjct: 218 PGGGSADNSALKPSNSRQARRLFVYNLPSVVSSEHLVSFFNLQLN--GLNVIHSVDPCIS 275

Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPTL 263
             I+ +  FA +E +T  + + A+A DGI  E               + VRRP DY    
Sbjct: 276 AQISEDHSFALLEFKTPNDTTVALAFDGITMEEHEPASGTENGAPKGLEVRRPKDYIVPN 335

Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHG 323
            +A    Q    LN              + P+++ V  +P Y  E  +  LL+SFG L  
Sbjct: 336 GSADQEYQEGVLLNEVP-----------DSPNKICVSNIPQYIPEEPVTMLLKSFGELKS 384

Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-----ASGQSK 378
           F LVKD  T  S+G  FC Y DP+ T IA   LNG+++GD+ L V RA+     A+G   
Sbjct: 385 FVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASIGMTQAAGLDM 444

Query: 379 TEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL------AKVLCLTEAITADA 432
                                   G+N     MS+F +T       ++VL L   +T + 
Sbjct: 445 ------------------------GVN----AMSMFAKTTSQDLESSRVLQLLNMVTPEE 476

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AK 491
           L D+++YEEI +D+REEC KYG ++++ +PRP     ++PGVGK++++ +D V  AT A 
Sbjct: 477 LLDNDDYEEICDDVREECSKYGKVLDLKVPRPSGGSRQSPGVGKIYVK-FDTVESATNAL 535

Query: 492 NALSGRKFGGNTVNAFYYPEDKY 514
            AL+GRKF   TV   Y+ E+ +
Sbjct: 536 KALAGRKFSDRTVVTTYFSEENF 558


>gi|221484193|gb|EEE22489.1| RNA binding motif-containing protein, putative [Toxoplasma gondii
           GT1]
          Length = 820

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 93/230 (40%), Positives = 137/230 (59%), Gaps = 23/230 (10%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +RHAR+VYVG LP    +  +  +F++++T +      PGD +V+VY+N  ++FAF+E R
Sbjct: 279 SRHARKVYVGNLPVPVTQAEVQQYFNELLTTLLPKKV-PGDTIVHVYVNPSRRFAFLEHR 337

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA--------AALG---------PGQPS 273
           ++EEA+  + LDG+ +   A+ +RRP DYNPTLA        A LG         P Q +
Sbjct: 338 SIEEANFTLGLDGVSWRNCALSLRRPQDYNPTLAEQQYREERARLGSMTGFAVPPPSQAA 397

Query: 274 PNLNLAAVGLASGAIGGA-----EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVK 328
              + A   L +GA+G       + P ++F+GGLP+  TE   K+LLE+FG L    +VK
Sbjct: 398 TPASPAESSLIAGALGIVSTTVPDSPHKIFIGGLPHSITEQGCKQLLEAFGQLRALHVVK 457

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
           D+  G+ KG+ FC Y DP VTD+A A LN +++ D+ L VRRA   GQ K
Sbjct: 458 DQQRGDCKGFAFCEYLDPNVTDVAVAGLNNMRIADRVLQVRRAMPHGQMK 507


>gi|389642205|ref|XP_003718735.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae 70-15]
 gi|351641288|gb|EHA49151.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae 70-15]
          Length = 620

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 118/363 (32%), Positives = 175/363 (48%), Gaps = 55/363 (15%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R ++R+ +  LP    E ++ +F +  +  +  N     D  +   +  +  FA VE R
Sbjct: 289 SRQSKRLILSNLPAGTTEDSLISFLNLQLNGL--NVIEASDPCLACQMAPDGSFAMVEFR 346

Query: 231 TVEEASNAMALDGIIFE----------GVAVR---VRRPTDYNPTLAAALGPGQPSPNLN 277
           +  + + A ALDGI  E          G A +   +RRP DY   + A +      P   
Sbjct: 347 SPSDTTVAYALDGISMEAEDAGNGDANGAASKGLAMRRPKDY--IVPAVVDDTGYEP--- 401

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
               G+ S  +   + P ++ V  LP Y T+ Q+ ELL SFG L    L KD  T  S+G
Sbjct: 402 ----GVVSSRV--VDTPHKISVTNLPAYLTDEQVVELLSSFGELKALVLAKDSSTEESRG 455

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
             FC Y D   TD+A   LNG+++GDK L VR+A+                   I I   
Sbjct: 456 IAFCEYVDVTNTDVAIEGLNGMELGDKRLKVRKAS-------------------IGIT-- 494

Query: 398 ALQTSGMNTLGGGMSLFGETLAK------VLCLTEAITADALADDEEYEEILEDMREECG 451
             Q SGM      MS+   T+A+      VL L   +TAD L D+++YEEI ED++EEC 
Sbjct: 495 --QVSGMEMGVNAMSMLAGTVAQDPDLSPVLQLLNMVTADELMDNDDYEEICEDVQEECA 552

Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           KYGT++ + +PRP     +  GVGK+++++        A  AL GRKF   TV   Y+PE
Sbjct: 553 KYGTVIELKVPRPSSGAKQAAGVGKIYVKFDSIESSTKALKALGGRKFADRTVVTTYFPE 612

Query: 512 DKY 514
           + +
Sbjct: 613 ENF 615


>gi|440468063|gb|ELQ37246.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae Y34]
 gi|440489023|gb|ELQ68704.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae P131]
          Length = 640

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 118/363 (32%), Positives = 175/363 (48%), Gaps = 55/363 (15%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R ++R+ +  LP    E ++ +F +  +  +  N     D  +   +  +  FA VE R
Sbjct: 289 SRQSKRLILSNLPAGTTEDSLISFLNLQLNGL--NVIEASDPCLACQMAPDGSFAMVEFR 346

Query: 231 TVEEASNAMALDGIIFE----------GVAVR---VRRPTDYNPTLAAALGPGQPSPNLN 277
           +  + + A ALDGI  E          G A +   +RRP DY   + A +      P   
Sbjct: 347 SPSDTTVAYALDGISMEAEDAGNGDANGAASKGLAMRRPKDY--IVPAVVDDTGYEP--- 401

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
               G+ S  +   + P ++ V  LP Y T+ Q+ ELL SFG L    L KD  T  S+G
Sbjct: 402 ----GVVSSRV--VDTPHKISVTNLPAYLTDEQVVELLSSFGELKALVLAKDSSTEESRG 455

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
             FC Y D   TD+A   LNG+++GDK L VR+A+                   I I   
Sbjct: 456 IAFCEYVDVTNTDVAIEGLNGMELGDKRLKVRKAS-------------------IGIT-- 494

Query: 398 ALQTSGMNTLGGGMSLFGETLAK------VLCLTEAITADALADDEEYEEILEDMREECG 451
             Q SGM      MS+   T+A+      VL L   +TAD L D+++YEEI ED++EEC 
Sbjct: 495 --QVSGMEMGVNAMSMLAGTVAQDPDLSPVLQLLNMVTADELMDNDDYEEICEDVQEECA 552

Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           KYGT++ + +PRP     +  GVGK+++++        A  AL GRKF   TV   Y+PE
Sbjct: 553 KYGTVIELKVPRPSSGAKQAAGVGKIYVKFDSIESSTKALKALGGRKFADRTVVTTYFPE 612

Query: 512 DKY 514
           + +
Sbjct: 613 ENF 615


>gi|164657478|ref|XP_001729865.1| hypothetical protein MGL_2851 [Malassezia globosa CBS 7966]
 gi|159103759|gb|EDP42651.1| hypothetical protein MGL_2851 [Malassezia globosa CBS 7966]
          Length = 473

 Score =  181 bits (459), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 119/399 (29%), Positives = 196/399 (49%), Gaps = 75/399 (18%)

Query: 128 AVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLAN 187
           A+PG +P  PSA+ E+         T   AF                RR++V  +  +  
Sbjct: 124 AMPGTIP--PSAMAELE------ATTSAAAF--------NASMYLETRRLHVSPVSSVKT 167

Query: 188 EQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE 247
            Q +  F +  M      S+G  +    V ++ ++ +A++E R  +EASNA+ LDG+ F 
Sbjct: 168 SQQLRIFINAKMNERLLCSSGSLEPCYAVDMHLDEGYAYLEFRNPDEASNALLLDGVAFL 227

Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYY 305
           G  + + RP  Y    A       P+P           GAI  +  +GP+++++G +P +
Sbjct: 228 GHRLHIERPKGYVGQDAV------PAP-----------GAIETSVPDGPNKLYIGNVPVF 270

Query: 306 FTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKT 365
             E Q+ ELL++FG +  FDL++D +T  S+G  FC + + AVTD+AC  L+GL++G++ 
Sbjct: 271 LNEQQVMELLKAFGDVRHFDLIRDPETQRSRGMAFCEFHEDAVTDLACEGLDGLEVGEQR 330

Query: 366 LTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLT 425
           L VRR  AS  + T +++                +TS             +T  + + + 
Sbjct: 331 LMVRRVNASTNTHTHEDT---------------QETS-------------DTPTRAMLML 362

Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP------------DQNGGETPG 473
             +T D L DD EY++I ED+  EC ++GT+ +V IPRP              +G E  G
Sbjct: 363 NMVTTDELLDDTEYQDIKEDVHSECSRHGTVTSVYIPRPLAAAAGHATSADAASGMEPKG 422

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
           VG+V++++  A  C  A  A++GR+F G TV   Y  +D
Sbjct: 423 VGRVYVQFVHADECEAALRAIAGRQFDGRTVICAYVRDD 461


>gi|302656965|ref|XP_003020217.1| hypothetical protein TRV_05722 [Trichophyton verrucosum HKI 0517]
 gi|291184026|gb|EFE39599.1| hypothetical protein TRV_05722 [Trichophyton verrucosum HKI 0517]
          Length = 486

 Score =  181 bits (459), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 130/435 (29%), Positives = 200/435 (45%), Gaps = 72/435 (16%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
           + +R + +D+ PP    +    A V G  P +P A  + A +      ++L AF   P  
Sbjct: 94  RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 147

Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
                  ++    +R ++R++   +PP   E  +  FF+  +   G N     D   +V 
Sbjct: 148 SGSSNNTLLKPSNSRQSKRLFAHNIPPNVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 205

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
           I+ + KFA +E  T  +A+ A+A DGI  E                  + + RP DY   
Sbjct: 206 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRESNGESNGDVKGLTIVRPKDYIVP 265

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
           +     P Q          G+ S  +   + P+++ V  +P +  E Q+  LL SFG L 
Sbjct: 266 IPTDEEPRQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 314

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y DPA T IA   LNG+++GD+ L V RA+          
Sbjct: 315 SFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGDRRLKVNRASIG-------- 366

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                           +Q +G++     MS+F +T +      +VL L   +TAD L D+
Sbjct: 367 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 411

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           E+YEEI ED++EEC KYG +  + IPRP     +  GVGK+++++        A  AL+G
Sbjct: 412 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYVKFDSPESATKALQALAG 471

Query: 497 RKFGGNTVNAFYYPE 511
           RKF   TV   Y+ E
Sbjct: 472 RKFQDRTVVTTYFSE 486


>gi|327297188|ref|XP_003233288.1| splicing factor u2af large subunit [Trichophyton rubrum CBS 118892]
 gi|326464594|gb|EGD90047.1| splicing factor u2af large subunit [Trichophyton rubrum CBS 118892]
          Length = 563

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 131/442 (29%), Positives = 202/442 (45%), Gaps = 72/442 (16%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPV- 163
           + +R + +D+ PP    +    A V G  P +P A  + A +      ++L AF   P  
Sbjct: 156 RKRRLTQWDIKPPGYENVTAEQAKVSGMFP-LPGAPRQQAVD-----PSRLQAFMNPPAA 209

Query: 164 ------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
                  ++    +R ++R++   +PP   E  +  FF+  +   G N     D   +V 
Sbjct: 210 SGSGNNTLLKPSNSRQSKRLFAHNIPPNVTEDTLQQFFNLQLN--GLNVISGVDPCQSVQ 267

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG---------------VAVRVRRPTDYNPT 262
           I+ + KFA +E  T  +A+ A+A DGI  E                  + + RP DY   
Sbjct: 268 ISKDGKFALLEFNTAADATVALAFDGITMEEHEANRESNGESNGNVKGLTIVRPKDYIVP 327

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
           L     P Q          G+ S  +   + P+++ V  +P +  E Q+  LL SFG L 
Sbjct: 328 LPTDEEPRQE---------GVVSSNV--PDSPNKICVSNIPPFIQEDQVTMLLVSFGELK 376

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            F LVKD  T  S+G  FC Y D A T IA   LNG+++GD+ L V RA+          
Sbjct: 377 SFVLVKDVGTDESRGIAFCEYLDSASTGIAVEGLNGMELGDRRLKVNRASIG-------- 428

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADD 436
                           +Q +G++     MS+F +T +      +VL L   +TAD L D+
Sbjct: 429 ---------------TVQAAGLDMGVNAMSMFAKTTSQDLETGRVLQLLNMVTADELIDN 473

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
           E+YEEI ED++EEC KYG +  + IPRP     +  GVGK+++++        A  AL+G
Sbjct: 474 EDYEEICEDVQEECSKYGVVEELKIPRPSAGSRQAAGVGKIYVKFDTPESATKALQALAG 533

Query: 497 RKFGGNTVNAFYYPEDKYFNKD 518
           RKF   TV   Y+ E  + N +
Sbjct: 534 RKFQDRTVVTTYFSEASHSNSN 555


>gi|449299113|gb|EMC95127.1| hypothetical protein BAUCODRAFT_526859 [Baudoinia compniacensis
           UAMH 10762]
          Length = 432

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 134/438 (30%), Positives = 209/438 (47%), Gaps = 72/438 (16%)

Query: 107 KSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
           + +R + +D+ PP            + M P    P Q P  PS +    Q  +     Q 
Sbjct: 35  RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAPRQQPMDPSRL----QAFMNQPGNQA 90

Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
               L P         R ++R++   +PP  NE  I+ FF+  +   G N     D  ++
Sbjct: 91  NTSALKP------STARQSKRLFAYNIPPNVNESMISDFFNLQLN--GLNVTRGVDPCIS 142

Query: 216 VYINHEKKFAFVEMRTVEEASNAMALDGI-------IFEGVA------VRVRRPTDYNPT 262
             ++ +  +A ++ +T E+A+NAMALDGI       +  G A      + ++RP DY   
Sbjct: 143 AQLSQDLTYALLDFKTSEDATNAMALDGITMPEHMEVMNGSANGNSQGLIIQRPKDY--I 200

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVF---VGGLPYYFTETQIKELLESFG 319
           + A +   +          G+ S  +     PD  F   +  +P Y TE Q++ELL SFG
Sbjct: 201 VPAVVDDTE-------HEAGVLSSTV-----PDTQFKISITHIPSYLTEEQVQELLVSFG 248

Query: 320 TLHGFDLVKDRDTGNSKGYGFCVYQDPA-VTDIACAALNGLKMGDKTLTVRRATASGQSK 378
            L  F LVKD  T  S+G  FC Y+D    TDIA  +LNG+++GD  L V+RA+   Q  
Sbjct: 249 ELKNFVLVKDAGTDQSRGIAFCEYKDAKNTTDIAVESLNGMELGDSHLKVQRASIGTQ-- 306

Query: 379 TEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEE 438
                   Q    + +  M++  S      GG        ++VLCL   IT + L D +E
Sbjct: 307 --------QVGGEMTVNAMSMMASA----AGGAD---RDASRVLCLMNMITPEELMDADE 351

Query: 439 YEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
            +EILED++EEC KYG +++V +PRP     ++ G+GK++++Y        A  AL+GRK
Sbjct: 352 ADEILEDVKEECAKYGAIIDVKMPRPSSGSRQSNGIGKIYVKYEKPEAAQKALAALAGRK 411

Query: 499 FGGNTVNAFYYPEDKYFN 516
           F   TV   ++ E+ YF+
Sbjct: 412 FADRTVVVTFFGEE-YFD 428


>gi|95103124|gb|ABF51503.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 2
           [Bombyx mori]
          Length = 306

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 96/221 (43%), Positives = 136/221 (61%), Gaps = 10/221 (4%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+    FF+Q M  + G +   G+ V+   I
Sbjct: 83  PQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQMH-LSGLAQAAGNPVLACQI 141

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY P       PG  +P +N+
Sbjct: 142 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPM------PGTENPAINV 195

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            A G+ S  +   + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  TG SKGY
Sbjct: 196 PA-GVISTVV--PDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGY 252

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT 379
            F  Y D ++TD A A LNG+++GDK L V+RA+   ++ T
Sbjct: 253 AFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNST 293


>gi|452841884|gb|EME43820.1| hypothetical protein DOTSEDRAFT_71600 [Dothistroma septosporum
           NZE10]
          Length = 433

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 139/463 (30%), Positives = 217/463 (46%), Gaps = 55/463 (11%)

Query: 74  RDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--AAVP 130
           R  ER+   +S S    + +  +  L+   S  + KRR + +D+ PP    +    A + 
Sbjct: 2   RQLERQTARKSASPPPRKPKEPTPDLTEVTSVLERKRRLTQWDIKPPGYENVTAEQAKLS 61

Query: 131 GQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANE 188
           G   LPG P   P   Q +  F     G      ++  T    R ++R+ +  +P  A E
Sbjct: 62  GMFPLPGAPRQQPMDPQKLQAFMNQPGGEANKTALKPST---ARQSKRLLIYNIPASATE 118

Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG 248
             I  FF+  +   G N     D  ++  ++ +K +A +E +T E+A+NAMA DGI  E 
Sbjct: 119 DTIMDFFNLQLN--GLNVTRGADPCISAQLSQDKAYALLEFKTPEDATNAMAFDGINMEP 176

Query: 249 VA---------------VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
            A               + ++RP DY   +   +  G  +        G+ S  +   + 
Sbjct: 177 EAMVTSGNEDENGGARGLDIKRPKDY---IVPVVTDGTEN------DAGVLSNVVPDTQ- 226

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
            +++ +  +P Y  E Q  ELL SFG L  F LVKD  T  S+G  FC Y+DP  T +A 
Sbjct: 227 -NKISITNIPAYVDEEQTMELLNSFGELKNFVLVKDASTEESRGIAFCEYKDPNSTKVAV 285

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
            +L+G+ +GD  + VR A+   Q          Q    +++  M+L        G G   
Sbjct: 286 ESLHGMTLGDAAMKVRLASIGIQ----------QVSGEMSVNAMSLMAGTARADGEG--- 332

Query: 414 FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
                 +VL L   IT + L D +E +EILED++EEC KYG L++V +PRP     ++ G
Sbjct: 333 -----GRVLSLMNMITPEELMDPDEADEILEDVKEECAKYGPLLDVKMPRPTGGSRQSNG 387

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           +GK++L+Y      A A  AL+GRKF   TV   Y+ E+ YF+
Sbjct: 388 IGKIYLKYESTESAAKALAALAGRKFADRTVVVTYFGEE-YFD 429


>gi|255931767|ref|XP_002557440.1| Pc12g05960 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211582059|emb|CAP80223.1| Pc12g05960 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 554

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 126/429 (29%), Positives = 195/429 (45%), Gaps = 72/429 (16%)

Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           ++ +R + +DM PP            + M P    P Q P  PS + +      P G ++
Sbjct: 173 TRKRRLTQWDMKPPGYENVTAEQAKISGMFPLPGAPRQQPMDPSRMKDFLNP--PTGDSE 230

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
             A        +    +R ++R++V  +PP  +  A+  FF+  +   G N     D  +
Sbjct: 231 NAA--------LKPSNSRQSKRLFVYNIPPGVSGDAVIAFFNLQLN--GLNVIRSVDPCI 280

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFE---GVAVRVRRPTDYNPTLAAALGPGQ 271
           +  ++ +K FA +E +   +A+ A+ALDGI         + VRRP DY     +A  P Q
Sbjct: 281 SAQVSEDKTFALLEFKDPNDATVALALDGITMPESGDKGLEVRRPKDYIVPDGSAAQPVQ 340

Query: 272 PSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
           P   LN              + P+++ +  +P Y  E  I  LL+SFG L  F LVKD  
Sbjct: 341 PGVVLNEVP-----------DSPNKICISNIPTYINEEAIIMLLKSFGDLKSFILVKDAA 389

Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
           T  S+G  F  Y DP  T +A   LNG+++ D+ L   RA+                   
Sbjct: 390 TEESRGIAFYEYVDPNNTALAVEGLNGMELADRRLKFVRASIG----------------- 432

Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILED 445
                   Q +G++     M +F +T +      +VL L   +T D L +DE+YEEILED
Sbjct: 433 ------TTQATGLDMGVNAMQMFAKTTSQDLETTQVLQLLNMVTLDELLNDEDYEEILED 486

Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           + EEC K+G ++ + IPR         G GK+F++Y  A     A  AL+GRKF   TV 
Sbjct: 487 VGEECSKFGKMIGIKIPRRGH------GAGKIFIKYDTAESATNALKALAGRKFSDRTVV 540

Query: 506 AFYYPEDKY 514
           A Y+  + +
Sbjct: 541 ASYFSVENF 549


>gi|312078073|ref|XP_003141580.1| U2af splicing factor protein 1 [Loa loa]
          Length = 460

 Score =  178 bits (451), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 103/280 (36%), Positives = 155/280 (55%), Gaps = 31/280 (11%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           V V+    T  +RR+YVG +P   +E A+  FF+Q M  + G +  PG+ V+   +N +K
Sbjct: 178 VPVVGPSVTCQSRRLYVGNIPFGCSEDAMLDFFNQQMH-LCGLAQAPGNPVLACQMNLDK 236

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            FAF+E R+++E +  MA DGI F G  +++RRP DY P           S + +L  + 
Sbjct: 237 NFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPM----------STSYDLGNMM 286

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           +++      + P ++F+GGLP Y    Q+KELL SFG L  F+LV ++ TG SKGY F  
Sbjct: 287 VSNIV---PDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAE 343

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           Y DP++TD A A LNG+++GDK L V+ + A+ ++   Q +               +Q +
Sbjct: 344 YLDPSLTDQAIAGLNGMQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVA 391

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
           G++   G     G    +VLCL   +T D L DDEEYE I
Sbjct: 392 GIDLSHGA----GPP-TEVLCLMNMVTEDELKDDEEYEGI 426


>gi|342873171|gb|EGU75391.1| hypothetical protein FOXB_14096 [Fusarium oxysporum Fo5176]
          Length = 661

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 129/430 (30%), Positives = 205/430 (47%), Gaps = 67/430 (15%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 177 RKRRLTQWDIKPPGYENVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMNQP 228

Query: 163 VQVMTQ-----QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +T        +R ++R+ V  +P   +E+A+ +FF+  +   G N     D  V   
Sbjct: 229 GGQVTSAGLKANNSRQSKRLLVSRIPSGTSEEALMSFFNLQLN--GLNVIDTTDPCVLCQ 286

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGV-----------AVRVRRPTDYNPTLAAA 266
            ++++ FA +E +   EA+ A+A+DGI  E              + +RRP DY   +  A
Sbjct: 287 FSNDRSFAVIEFKDAPEATVALAMDGISMEASDASNGTDGGHRGLEIRRPRDY---VVPA 343

Query: 267 LGPGQPSPNLNLAAVGLASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
           +             V   S  +     +  +++ +  +P + TE QI ELL SFG    F
Sbjct: 344 V----------TEEVSYDSEVVSNIVPDTVNKLSITNIPTFLTEEQIIELLASFGKPKAF 393

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTD-IACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
            LVKDR T  S+G  F  YQDPA ++  A   LNG+++G K L V +A+           
Sbjct: 394 VLVKDRGTEESRGIAFAEYQDPAASNPTALDTLNGMEIGGKKLKVSKAS----------- 442

Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL--AKVLCLTEAITADALADDEEYEE 441
                   I   ++A    G+  + G  S     +  ++VL L   +TA+ L D+++YEE
Sbjct: 443 --------IGPTQVANFDVGITAISGLASQTANEVESSRVLQLLNMVTAEELLDNDDYEE 494

Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
           I ED++EEC K+G +++V +PRP     ++ GVGK+F++Y  A   A A  AL+GRKF  
Sbjct: 495 ICEDVKEECSKFGKIIDVKVPRPTGGSRQSAGVGKIFVKYEKAEDTAKALQALAGRKFAD 554

Query: 502 NTVNAFYYPE 511
            TV   Y+PE
Sbjct: 555 RTVVTTYFPE 564


>gi|408397958|gb|EKJ77095.1| hypothetical protein FPSE_02739 [Fusarium pseudograminearum CS3096]
          Length = 554

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 129/431 (29%), Positives = 201/431 (46%), Gaps = 63/431 (14%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 159 RQRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMNQP 210

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +T        +R ++R+ V  +PP  +E  +  FF+  +   G N     D  V   
Sbjct: 211 GGQVTSAGLKASNSRQSKRLLVSRIPPGTSEDTLIAFFNLQLN--GLNVIDTTDPCVLCQ 268

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-----------VAVRVRRPTDYNPTLAAA 266
            ++++ FA +E +   E + A+ALDGI  E              + +RRP DY   + A 
Sbjct: 269 FSNDRSFAVIEFKDAPETTVALALDGISMEANDASNGADGGHRGLEIRRPRDY--VVPAV 326

Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
                  P +    V          +  +++ +  +P + TE QI ELL SFG    F L
Sbjct: 327 TEDVAYDPEVVSNVV---------PDTVNKLSITNIPPFLTEEQIIELLASFGKPKAFVL 377

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTD-IACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
           VKDR T  S+G  F  YQDPAV++  A   LNG+ +G K + V +A+             
Sbjct: 378 VKDRGTEESRGIAFAEYQDPAVSNPTALDTLNGMDIGGKQIKVSKAS------------- 424

Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL--AKVLCLTEAITADALADDEEYEEIL 443
                 I   ++A    G+  + G  S     +  ++VL L   +TA+ L D+++YEEI 
Sbjct: 425 ------IGPTQVANFDVGITAISGLASQTANEVESSRVLQLLNMVTAEELLDNDDYEEIC 478

Query: 444 EDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNT 503
           ED+REEC KYG +++V +PRP     ++ GVGK+F++Y        A  AL+GRKF   T
Sbjct: 479 EDVREECSKYGKILDVKVPRPTGGSRQSAGVGKIFVKYEHTEDTTKALQALAGRKFADRT 538

Query: 504 VNAFYYPEDKY 514
           V   Y+PE+ +
Sbjct: 539 VVTTYFPEENF 549


>gi|403331270|gb|EJY64574.1| RNA-binding proteins (RRM domain) [Oxytricha trifallax]
          Length = 565

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 123/391 (31%), Positives = 187/391 (47%), Gaps = 62/391 (15%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
           H+RR+YVGG+P   ++  +  F +Q +   GG     GD V+    N EK++ F+E+R+V
Sbjct: 188 HSRRLYVGGVPTSQSDVQVVQFLTQTLRKAGG-ILEEGDPVIKSQNNPEKRYTFLELRSV 246

Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
           EEAS  + LDGI F    +R+RRP DY+           P  +   AA+G+ S  +   E
Sbjct: 247 EEASTMIQLDGIKFMDSTLRIRRPEDYDKYPQIPPRRPIPQIDT--AALGIISTKV--EE 302

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
            P ++FVGGLP  F E QIK LL  +G L  F LVK  +   S+G+ FC Y D      A
Sbjct: 303 TPLKIFVGGLPKEFNEEQIKNLLLRYGQLKSFHLVKHTNIDQSRGFAFCEYTDEKGVQNA 362

Query: 353 CAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ------------------------- 387
              LNGLK+G +++ VRR  AS  S  +Q  I  +                         
Sbjct: 363 IQFLNGLKIGSRSINVRRTGAST-STVQQNQISEEDKKKFEDKLDDFINETGKFMQNANM 421

Query: 388 -------AQQHIAI-------QKMA---LQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
                   QQH          Q+MA    Q+ G  T+   M +      K+  L++ I  
Sbjct: 422 YNIDKQSVQQHNPFDELERHNQQMASYIQQSYGYTTISTVMEI------KIPALSQQI-- 473

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRP---DQNGGETP-GVGKVFLEYYDAVG 486
             L +D+E++E+++D+ +E  K+G +  +++PR     Q     P  +GK F+E+ +   
Sbjct: 474 --LDNDQEHDELVKDLTQELQKFGKIRTLLLPRSLDMMQTSTVKPSAIGKAFVEFEEVTS 531

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
                N L+GR+F G  V   +Y +D +  K
Sbjct: 532 GFACYNLLNGRQFMGMPVEINFYNKDLFVTK 562


>gi|340780291|pdb|2YH0|A Chain A, Solution Structure Of The Closed Conformation Of Human
           U2af65 Tandem Rrm1 And Rrm2 Domains
 gi|340780292|pdb|2YH1|A Chain A, Model Of Human U2af65 Tandem Rrm1 And Rrm2 Domains  With
           Eight-Site Uridine Binding
          Length = 198

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 131/204 (64%), Gaps = 10/204 (4%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+
Sbjct: 4   ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 62

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
           E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G+ S  +   + 
Sbjct: 63  ETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDS 113

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  VTD A 
Sbjct: 114 AHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAI 173

Query: 354 AALNGLKMGDKTLTVRRATASGQS 377
           A LNG+++GDK L V+RA+   ++
Sbjct: 174 AGLNGMQLGDKKLLVQRASVGAKN 197


>gi|402073699|gb|EJT69251.1| splicing factor U2AF 50 kDa subunit [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 623

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 114/365 (31%), Positives = 177/365 (48%), Gaps = 59/365 (16%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R ++R+ V    P   E+A+  F +  +  +  N     D  +   +  +  FA +E R
Sbjct: 292 SRQSKRLIVTNFAPGTTEEALVAFMNLQLNGL--NVIESTDPCLLCQMAPDSSFAILEFR 349

Query: 231 TVEEASNAMALDGIIFE-------GVA------VRVRRPTDYNPTLAAALGPGQPSPNLN 277
           +  E + A+ALDGI  E       G A      + +RRP DY   +  A+          
Sbjct: 350 SPAETTVALALDGITMEAEDTPMEGAANGTPQGLELRRPKDY---IVPAV---------- 396

Query: 278 LAAVGLASGAIGG--AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNS 335
           +   G   G +     + P ++ +  L  Y TE Q+ ELL SFG L    LVKD  T  S
Sbjct: 397 VEDTGYERGVVSSRVVDTPHKIGITNLAPYLTEEQVTELLVSFGELKALVLVKDSGTEES 456

Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQ 395
           +G  FC Y DP  TD+A   LN +++G+K L V++A+                   I I 
Sbjct: 457 RGIAFCEYVDPVATDVAIHGLNNMELGEKRLRVKKAS-------------------IGIT 497

Query: 396 KMALQTSGMNTLGGGMSLFGET------LAKVLCLTEAITADALADDEEYEEILEDMREE 449
           +++    G+N     MS+   T      L++VL L   +TAD L D+++YEEI +D+REE
Sbjct: 498 QVSGIEMGIN----AMSMLAGTVAQDPDLSRVLQLLNMVTADELLDNDDYEEICDDVREE 553

Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
           C K+GT++ + IPRP     +  GVGK+++++        A  AL+GRKF   TV   Y+
Sbjct: 554 CSKFGTILELKIPRPSGGARQLAGVGKIYVKFDTIESSTEALKALAGRKFADRTVVTTYF 613

Query: 510 PEDKY 514
           PE+ +
Sbjct: 614 PEENF 618


>gi|429854658|gb|ELA29655.1| splicing factor u2af large subunit [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 559

 Score =  174 bits (442), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 159/526 (30%), Positives = 243/526 (46%), Gaps = 67/526 (12%)

Query: 16  RHKSSWVSGRSRTGER--GRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYN 73
           R +    SGR R G+R   RDR      +  DD    +  +   +  R   R DR +   
Sbjct: 69  REREDRYSGRDRRGDREWDRDRGSSRRDARRDDDDHRRRDRDPYDDRRRGGRGDRQQQQQ 128

Query: 74  RDKERRHRHRSRSHSSD---RFRNRSKSLSPSRSPSKSKRR-SGFDMAPPA--------- 120
            D     R   R  S+    + R  +  L+   S  + KRR + +D+ PP          
Sbjct: 129 HDDGGMGRRGDRQRSATPPPKKREPTPDLTNVTSVLERKRRLTQWDIKPPGYENVTAEQA 188

Query: 121 --AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVY 178
             + M P    P Q P  PS +    Q ++     Q+ +  L P        +R A+R+ 
Sbjct: 189 KLSGMFPLPGAPRQQPMDPSKL----QAIMNQPGGQVNSAALKPSN------SRQAKRLL 238

Query: 179 VGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNA 238
           +  LPP A E +I  FF+  +   G N     D   +  ++ +  FA VE R   EA+ A
Sbjct: 239 INNLPPSATEDSIVGFFNLQLN--GLNVIESTDPCTSCQLSKDHSFAVVEFRNASEATVA 296

Query: 239 MALDGIIFE------GVA----VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           +ALDGI  E      G A    + +RRP DY         P +P         G+ S  +
Sbjct: 297 LALDGITMEADDATNGAAGSNGLVIRRPKDYIVPAVVDDVPYEP---------GVVSNIV 347

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
              + P+++ +  +P Y ++ Q+ ELL SFG L  F LV+D+ T  S+G  FC Y +P+ 
Sbjct: 348 --IDTPNKISIANMPPYLSDEQVTELLVSFGELKAFVLVRDKSTEESRGIAFCEYVEPSA 405

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           TD+A   LNG+ +GDK L V++A+                   +A  +M +  + M+ L 
Sbjct: 406 TDVAIQGLNGMDLGDKKLRVQKASV--------------GVTQVAGVEMGV--AAMSMLA 449

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNG 468
           G  S   E   +VL L   +T + L D+++YEEI ED+ EEC K+G +++V IPRP    
Sbjct: 450 GTTSTDSEE-TRVLQLLNMVTPEELMDNDDYEEIKEDVEEECTKFGKVLDVKIPRPVGGS 508

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            ++ GVGK+F+ +        A  AL+GRKF   TV   Y+PE+ +
Sbjct: 509 RQSAGVGKIFVRFESKEVAKKALQALAGRKFADRTVVTTYFPEENF 554


>gi|407929464|gb|EKG22293.1| hypothetical protein MPH_00360 [Macrophomina phaseolina MS6]
          Length = 824

 Score =  174 bits (442), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 119/366 (32%), Positives = 178/366 (48%), Gaps = 56/366 (15%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           +R A+R++V   P  + + +I  FF+  +  +  N     D  ++V I+ ++ FA  E +
Sbjct: 18  SRQAKRLFVYNFPAASTDDSIQDFFNLQLNHL--NVISSSDPCISVQISKDRTFALCEFK 75

Query: 231 TVEEASNAMALDGIIFEG------------VAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           T E+ + A+ALDG   E               +++ RP DY       + P Q S + + 
Sbjct: 76  TPEDTTMALALDGQSMEAEDASNGASNGGHSGIKISRPKDY-------IVPAQ-SDDADY 127

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
              G+ S  +   +GP ++ V  +P Y TE Q+ +LL +FG L  F LVKD  T  SKG 
Sbjct: 128 QE-GVVSNKV--KDGPHKICVAQIPVYLTEEQVMDLLSAFGGLKAFTLVKDTGTDQSKGI 184

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            FC Y DP  TD A   L+G+++    L V++A                    + IQ   
Sbjct: 185 AFCEYVDPDTTDPAVEGLDGMEIAQDHLKVKKAC-------------------VGIQ--- 222

Query: 399 LQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILEDMREECGK 452
            Q SG+      MS+   T +      +VL L   +T + L D +EYEEI ED+ EEC K
Sbjct: 223 -QASGLEMGVNAMSMLAGTSSGDVEQGRVLMLLNMVTPEELMDPQEYEEIQEDVHEECSK 281

Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
           YG +  + IPRP Q   E  GVGK+F++Y        A  AL+GRKF   TV   ++ E+
Sbjct: 282 YGKVEELKIPRP-QPPKENKGVGKIFVKYDTPESAQKALRALAGRKFADRTVVVTFFGEE 340

Query: 513 KYFNKD 518
            YF+ D
Sbjct: 341 -YFDVD 345


>gi|322693990|gb|EFY85833.1| splicing factor, putative [Metarhizium acridum CQMa 102]
          Length = 584

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 160/521 (30%), Positives = 232/521 (44%), Gaps = 66/521 (12%)

Query: 16  RHKSSWVSGRSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRD 75
           R +    SGR R GER  DR     +        D+  + DREG  D  R  R  D   D
Sbjct: 86  REREDRYSGRDRRGERDWDRDRGSSRRDARRDEDDRPNRRDREGFDDRRRGGRGGDRRDD 145

Query: 76  KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--AAVPGQ 132
                +   RS S  + R  +  L+      + KRR + +D+ PP   ++    A + G 
Sbjct: 146 GGFARQESRRSPSPAKPREPTPDLTDIIPVLERKRRMTQWDIKPPGYELVTAEQAKLSGM 205

Query: 133 --LPGVPSAVPEMAQNMLPFGATQLGAFPLMP-----VQVMTQQATRHARRVYVGGLPPL 185
             LPG P           P   T+L AF   P        +    +R A+R+ V  +P  
Sbjct: 206 FPLPGAP--------RQQPMDPTKLQAFMKEPNGGVSSAGLKASNSRQAKRLIVSNIPQG 257

Query: 186 ANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGII 245
            +E+++ +FF+  +   G N     D       + ++ FA +E R   +A+ A+ALDGI 
Sbjct: 258 NSEESLISFFNLQLN--GLNVIESSDPCNLCQFSTDRSFAVLEFRNAGDATVALALDGIN 315

Query: 246 FEG-----------VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
            E              + +RRP DY   + A        PN+    V            P
Sbjct: 316 MEADDTMNGDGGEKQGLSIRRPKDY--VMPAIPEEMAYDPNVVSNVV------------P 361

Query: 295 DRVF---VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP-AVTD 350
           D V    +  +P + TE QI ELL +FG    F LVKDR T  S+G  F  Y +P +  +
Sbjct: 362 DTVHKLSITNIPTFLTEDQIIELLAAFGKPKAFVLVKDRSTEESRGIAFAEYLEPGSANE 421

Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
            A +ALNG+ +G K L V +A+  G ++     +        AI  +A QTS     G  
Sbjct: 422 PALSALNGMDVGGKKLKVAKASI-GPTQVANFDV-----GITAISGLASQTSTDAEKG-- 473

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                    +VL L   +T + L D EEYEEI ED+REEC K+G ++ + IPRP     +
Sbjct: 474 ---------RVLQLLNMVTPEELMDTEEYEEICEDVREECSKFGNILELKIPRPVGGSRQ 524

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           + GVGK+F+++     C  A  AL+GRKF   TV   Y+PE
Sbjct: 525 SAGVGKIFVKFDTPDSCHKALTALAGRKFADRTVVTTYFPE 565


>gi|47217926|emb|CAG02209.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 600

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 123/212 (58%), Gaps = 17/212 (8%)

Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
            ++KELL SFG L  F+LVKD  T  SKGY FC Y D   TD A A LNG+++GDK L V
Sbjct: 405 VKVKELLTSFGPLKAFNLVKDGATSLSKGYAFCEYVDVGATDQAVAGLNGMQLGDKKLIV 464

Query: 369 RRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTLGGGMSLFGETLAKVLCLTEA 427
           +RA+    +K    S  A+A   + +  +  LQTSG+ T             +VLCL   
Sbjct: 465 QRASVG--AKNANPSAAAEAPVTLQVPGLQRLQTSGVPT-------------EVLCLLNM 509

Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
           +  + L DDE+YEEILED+REEC KYG + ++ IPRP  +G E PG GK+F+EY  A  C
Sbjct: 510 VVPEELVDDEDYEEILEDVREECCKYGGVRSIEIPRP-VDGVEVPGCGKIFVEYVSASDC 568

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             A  AL+GRKF    V   YY  D Y   ++
Sbjct: 569 QKAMQALTGRKFANRVVVTKYYDPDMYHRHEF 600



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 66/254 (25%), Positives = 98/254 (38%), Gaps = 63/254 (24%)

Query: 100 SPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           +P R P  + R     + PPAA  +P  A                  +L   A+      
Sbjct: 112 APPRLPGPAPRLPSVSLWPPAAGQIPTMA------------------LLATAASAGVVAA 153

Query: 160 LMPVQVMTQQATRHARRVYVGGLP---------PLANEQAIATFFSQVMTAIGGNSAGPG 210
             PV V   Q TR ARR+YVG +P           + ++++A FF+  M  + G S  P 
Sbjct: 154 PTPVPVAGSQMTRQARRLYVGNIPFGLTEALRRLCSPQESMAEFFNAQMR-LAGLSQAPS 212

Query: 211 DAVVNVYINHEKKFAFVEMR-------------TVEEASNAMALDGIIFEGVAVRVRR-- 255
           + V+ V IN +K FAF+E+R                +   ++ L G    G  +R  R  
Sbjct: 213 NPVLAVQINQDKNFAFLEVRPGFSAAAALPAAAAAADVCVSVPLGGRDHAGHGLRRHRVP 272

Query: 256 ------PTDYN-PTLAAALG-PGQPSPNL----------NLAAVGLASGAIGGAEGPDRV 297
                 PT    P  A  LG  G P P                 G+ S  +   + P ++
Sbjct: 273 GSGSEDPTASRLPASARHLGAAGVPRPRFLRAAARHAARVGRRPGVVSTVV--PDSPHKL 330

Query: 298 FVGGLPYYFTETQI 311
           F+GGLP Y  + Q+
Sbjct: 331 FIGGLPNYLNDDQV 344


>gi|389585165|dbj|GAB67896.1| U2 snRNP auxiliary factor [Plasmodium cynomolgi strain B]
          Length = 894

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 105/318 (33%), Positives = 163/318 (51%), Gaps = 17/318 (5%)

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY------NPTLAAALGPGQ 271
            N E +F F+E R++E     + LD I F    +R+ RP D+      +P L        
Sbjct: 575 FNVESRFCFLEFRSLEITWLCLRLDAISFNNYCLRIARPHDFVPPPGGDPALTVVFTDIN 634

Query: 272 PSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
                 +  V +A     G +  +++++  LP+   + QI++LL+ FG L GF+++KD +
Sbjct: 635 HEVFEMVKPVKIAPVRSTG-DDDNKLYIQNLPHDLRDDQIRDLLQQFGKLKGFNVIKDLN 693

Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-ASGQSKTEQESILAQAQQ 390
           TG +KGYGF  Y+D   T IA  ALNG   G   L V++AT    Q+ T+  +  + A  
Sbjct: 694 TGLNKGYGFFEYEDSNCTPIAMHALNGFVCGQNILNVKKATFGKSQNSTQNANTTSLATG 753

Query: 391 HI---------AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEE 441
            +         +I +  L  S +          GE  ++V+ LT A+  + L  D +YEE
Sbjct: 754 SVDLPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVVQLTNAVFQEDLLIDSQYEE 813

Query: 442 ILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
           IL+D++EE  KYG L N+VIP+P+++   T GVGK+FL Y D      A+  L+GR F  
Sbjct: 814 ILKDIKEEAEKYGPLQNIVIPKPNKDLSYTEGVGKIFLHYADETTARKAQYMLNGRLFEK 873

Query: 502 NTVNAFYYPEDKYFNKDY 519
             V A +Y E+K+    Y
Sbjct: 874 RVVCAAFYSEEKFLAGKY 891


>gi|344251408|gb|EGW07512.1| Splicing factor U2AF 65 kDa subunit [Cricetulus griseus]
          Length = 422

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 115/285 (40%), Positives = 160/285 (56%), Gaps = 30/285 (10%)

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++   G+ S  + 
Sbjct: 26  RSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVYVPGVVSTVV- 77

Query: 290 GAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
               PD   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D 
Sbjct: 78  ----PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDI 133

Query: 347 AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
            VTD A A LNG+++GDK L V+RA+   ++ T    +    Q  + +Q   L +S +  
Sbjct: 134 NVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT----LSTINQTPVTLQVPGLMSSQVQ- 188

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
           +GG  +       +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  
Sbjct: 189 MGGHPT-------EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-V 240

Query: 467 NGGETPGVGK--VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
           +G E PG GK    +EY    G            +   T+  F Y
Sbjct: 241 DGVEVPGCGKAMTLMEYLIKTGSERVSQQCKENMYAVQTLKDFQY 285


>gi|167515386|ref|XP_001742034.1| RNA-binding region RNP-1 containing protein [Monosiga brevicollis
           MX1]
 gi|163778658|gb|EDQ92272.1| RNA-binding region RNP-1 containing protein [Monosiga brevicollis
           MX1]
          Length = 431

 Score =  171 bits (433), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 138/464 (29%), Positives = 211/464 (45%), Gaps = 69/464 (14%)

Query: 75  DKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPP------AAAMLPGAA 128
           D++RR R   R       R  S+  +P  S +      G++  PP            G  
Sbjct: 16  DRDRRSRSLERDS-----RRSSERDAPEPSEAWDVPPPGYENMPPKVYKDYVCTYFAGLP 70

Query: 129 VPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANE 188
           +  +LPG+ S++P                 PL          TR ARR+YVGG+P  AN+
Sbjct: 71  ISAELPGLRSSMPN----------------PL----------TRGARRLYVGGIPNGAND 104

Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEG 248
             +A FF+  +T   G + GPG  VV+  IN EK FAF+E+R+ EEA++ +A D I+F G
Sbjct: 105 MELAEFFNMQLTQ-QGLTIGPGAPVVSAQINEEKSFAFLELRSPEEATSCIAFDNIMFMG 163

Query: 249 VAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD--------RVFVG 300
             +R+RRP DY     A  G     P +++                         R+ V 
Sbjct: 164 NQLRIRRPKDYQ----APAGGTSEVPKVDMPMPRPMPMPTPMPMPTPMPMLVPSGRLNVT 219

Query: 301 GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLK 360
            +P    E Q++EL   FGT+   +L K+ +T    G     + D    D     LN +K
Sbjct: 220 NIPLAMDEEQLRELFSVFGTIASLELRKEPETDKFAGDAIVEF-DTRAPDF----LNQVK 274

Query: 361 MGDKTLTVRRATASGQS-KTEQESILAQAQQHIAIQKMALQTSGMNT---LGGGM--SLF 414
            G + +        GQ  K EQ   + +   +  ++   +  S +N    +GG    ++ 
Sbjct: 275 AGLEDIDF-----EGQKLKVEQ---VVRWWSYCGLRASYIAPSLVNASPFVGGAAAPAVP 326

Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
                +VL L   +T + L DDEEY++I+ED+REECGK+G + ++ IPRP   G +  G+
Sbjct: 327 DVEATEVLVLMNMVTKEELQDDEEYKDIMEDIREECGKFGNITDLKIPRPVAEGEQPIGL 386

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKD 518
            K+F+ Y        A+ ALSGR+F   TV   Y    K+ N +
Sbjct: 387 EKIFIRYATVDEARNAQRALSGRRFANRTVVVSYLDVAKFENDE 430


>gi|328865493|gb|EGG13879.1| RapGAP/RanGAP domain-containing protein [Dictyostelium fasciculatum]
          Length = 3032

 Score =  171 bits (433), Expect = 9e-40,   Method: Composition-based stats.
 Identities = 114/373 (30%), Positives = 186/373 (49%), Gaps = 38/373 (10%)

Query: 169  QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN-SAGPGDAVVNVYINHEKKFAFV 227
            Q  + +RR+Y+G +PP   +  +  FF+  +TA   + S+  G  V++  IN  K FAF+
Sbjct: 2570 QQNKQSRRLYIGNIPPNITDNTLIDFFNTAITAANLHLSSKTGPVVLSCQINSAKNFAFL 2629

Query: 228  EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
            E R+ EEA+NAM LDGI     ++++RRPTDY P    +  P  P       ++ + S  
Sbjct: 2630 EFRSAEEATNAMGLDGISLFTFSLKIRRPTDYQPPANESSMPSAP------VSMSIVSTN 2683

Query: 288  IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
            +  +E  +++F+GG+P    E QIK +L +FG L  F+LVKD  TG+SKGY FC Y +  
Sbjct: 2684 VPDSE--NKIFIGGIPTTLNEEQIKSMLLAFGRLKAFNLVKDPKTGSSKGYAFCEYYETE 2741

Query: 348  VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ------------------ 389
             T+     LNG K G+K+L V+R++   +  +   +                        
Sbjct: 2742 ETNDCINGLNGTKFGEKSLVVQRSSVGTKDPSTTSTNNNNNNNNNNNNNNANMNSNRKSV 2801

Query: 390  ---QHIAIQKMALQTSGMNTLGGGMSLF----GETLAKVLCLTEAITADALADDEEYEEI 442
                    Q + L +S    LG   S          + V+ L   +  + + DD +YE +
Sbjct: 2802 TTFDQSVTQMLNLASSIPQVLGTIRSNIPSDSNTKSSTVIQLFNLVDREDIQDDSDYENL 2861

Query: 443  LEDMREECGKYGTLVNVVIPRPDQNGGETP-GVGKVFLEYYDAVGCATAKNALSGRKFGG 501
            L D++EEC ++G + ++ I RP +   E P  + KVF+++        A  ++ GR++  
Sbjct: 2862 LIDVKEECEEFGEVESIFISRPKE---ENPLDIVKVFVKFVSLESAQRAWMSIGGRRYNY 2918

Query: 502  NTVNAFYYPEDKY 514
             T+   +YPED Y
Sbjct: 2919 RTIITAFYPEDFY 2931


>gi|384500209|gb|EIE90700.1| hypothetical protein RO3G_15411 [Rhizopus delemar RA 99-880]
          Length = 490

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 163/350 (46%), Gaps = 73/350 (20%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           AT+ ARR+YVG +PP   E+ +A FF+  M  +      P   V  V INHEK +AFVE 
Sbjct: 214 ATKQARRLYVGQIPPGLEEKPLADFFNATMHQLQMQDRTP---VAAVQINHEKSYAFVEF 270

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           +T E+A+  MA DGI+F+G  +++RRP DY P             ++++   GL S  + 
Sbjct: 271 QTAEQATACMAFDGIMFQGQQLKIRRPKDYQPPAEG---------DVSMQLPGLVSTNV- 320

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
             + P+++F+GGLP Y  + Q+ ELL+SF                               
Sbjct: 321 -PDTPNKIFIGGLPVYLNDDQVIELLKSF------------------------------- 348

Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG 409
                       GD+ L V+RA+                 +HI    M+      N +  
Sbjct: 349 ------------GDRKLIVQRASVGA--------------KHIPPDYMSGPMLPANYV-P 381

Query: 410 GMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG 469
             S   +   +VL L   +T + L DDEEY++I ED+ EEC K+G ++++ IP+P Q   
Sbjct: 382 VTSAKEDDATRVLQLMNMVTPEELEDDEEYQDIWEDIAEECAKFGNVLDMKIPKP-QKDQ 440

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           E PG G +F+ +        A  AL+GRKF   TV A +  E  Y    +
Sbjct: 441 EVPGCGLIFVRFETKDQTLDALRALAGRKFADRTVVATFIDEQNYLTDSF 490


>gi|425773483|gb|EKV11835.1| Splicing factor u2af large subunit [Penicillium digitatum Pd1]
          Length = 585

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 125/424 (29%), Positives = 191/424 (45%), Gaps = 72/424 (16%)

Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           ++ +R + +DM PP            + M P    P Q P  PS + +      P G + 
Sbjct: 204 TRKRRLTQWDMKPPGYENVTAEQAKISGMFPLPGAPRQQPMDPSRMKDFLNP--PTGDSD 261

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
             A  L P        +R ++R++V  +P   +  A+  FF+  +   G N     D  +
Sbjct: 262 NAA--LKPSN------SRQSKRLFVYNIPSGVSGDAVIAFFNLQLN--GLNVVHSVDPCI 311

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFE---GVAVRVRRPTDYNPTLAAALGPGQ 271
           +  ++ +K FA +E +   +A+ A+A DGI         + VRRP DY     +A  P Q
Sbjct: 312 SAQVSEDKTFALLEFKDPNDATVALAFDGITMAESGDKGLEVRRPKDYIVPDGSASQPVQ 371

Query: 272 PSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
               LN              + P+++ +  +P Y  E  I  LL+SFG L  F LVKD  
Sbjct: 372 AGVVLNEVP-----------DSPNKICISNIPTYINEEAIIMLLKSFGDLKSFVLVKDAA 420

Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
           T  S+G  F  Y DP  T +A   LNG+++ D+ L   RA+                   
Sbjct: 421 TEESRGIAFYEYVDPNNTALAVEGLNGMELVDRHLKFVRASIG----------------- 463

Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILED 445
                   Q SG++     M +F +T +      +VL L   +T D L +DE+YEEI+ED
Sbjct: 464 ------TTQASGLDMGVNAMQMFAKTTSQDLETTQVLQLLNMVTLDELLNDEDYEEIMED 517

Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           + +EC K+GT++ + IPR         G GK+F++Y  A     A  AL+GRKF   TV 
Sbjct: 518 VSDECSKFGTILGIKIPRRGH------GAGKIFIKYDAAESATNALKALAGRKFSDRTVV 571

Query: 506 AFYY 509
           A Y+
Sbjct: 572 ASYF 575


>gi|425775779|gb|EKV14031.1| Splicing factor u2af large subunit [Penicillium digitatum PHI26]
          Length = 585

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 123/424 (29%), Positives = 190/424 (44%), Gaps = 72/424 (16%)

Query: 106 SKSKRRSGFDMAPPA-----------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
           ++ +R + +DM PP            + M P    P Q P  PS + +      P G + 
Sbjct: 204 TRKRRLTQWDMKPPGYENVTAEQAKISGMFPLPGAPRQQPMDPSRMKDFLNP--PTGDSD 261

Query: 155 LGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV 214
             A        +    +R ++R++V  +P   +  A+  FF+  +   G N     D  +
Sbjct: 262 NAA--------LKPSNSRQSKRLFVYNIPSGVSGDAVIAFFNLQLN--GLNVVHSVDPCI 311

Query: 215 NVYINHEKKFAFVEMRTVEEASNAMALDGIIFE---GVAVRVRRPTDYNPTLAAALGPGQ 271
           +  ++ +K FA +E +   +A+ A+A DGI         + VRRP DY     +A  P Q
Sbjct: 312 SAQVSEDKTFALLEFKDPNDATVALAFDGITMAESGDKGLEVRRPKDYIVPDGSASQPVQ 371

Query: 272 PSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
               LN              + P+++ +  +P Y  E  I  LL+SFG L  F LVKD  
Sbjct: 372 AGVVLNEVP-----------DSPNKICISNIPTYINEEAIIMLLKSFGDLKSFVLVKDAA 420

Query: 332 TGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
           T  S+G  F  Y DP  T +A   LNG+++ D+ L   RA+                   
Sbjct: 421 TEESRGIAFYEYVDPNNTALAVEGLNGMELVDRHLKFVRASIG----------------- 463

Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLA------KVLCLTEAITADALADDEEYEEILED 445
                   Q SG++     M +F +T +      +VL L   +T D L +DE+YEEI+ED
Sbjct: 464 ------TTQASGLDMGVNAMQMFAKTTSQDLETTQVLQLLNMVTLDELLNDEDYEEIMED 517

Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           + +EC K+GT++ + IPR         G GK+F++Y  A     A  AL+GRKF   TV 
Sbjct: 518 VSDECSKFGTILGIKIPRRGH------GAGKIFIKYDAAESATNALKALAGRKFSDRTVV 571

Query: 506 AFYY 509
           A Y+
Sbjct: 572 ASYF 575


>gi|76154831|gb|AAX26240.2| SJCHGC03157 protein [Schistosoma japonicum]
          Length = 258

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 112/273 (41%), Positives = 158/273 (57%), Gaps = 20/273 (7%)

Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
           A++ RRP  + P L         S   ++   G+ S  +   + P ++FVGGLPYY  E 
Sbjct: 3   ALKFRRPRVFAPLLGV-------SEQQSVIVPGVVSTVV--QDSPHKIFVGGLPYYLNED 53

Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
           Q+KELL SFG L GF+LVKD  TG SKGY FC Y D  VTD ACA LNG+++GDK L V+
Sbjct: 54  QVKELLLSFGPLKGFNLVKDGSTGLSKGYAFCEYVDSNVTDHACAGLNGMQLGDKKLIVQ 113

Query: 370 RATASGQSKTEQESILAQAQQHIA-IQKMALQTSGMNTLGGGMSLF--GETLAKVLCLTE 426
           RA+   +  T    +L Q    ++ +++ A+Q    NT G G      G    +VLCL  
Sbjct: 114 RASVGAKHTT---GVLPQCLLQMSGLEEGAVQ----NTTGSGNLTVRSGGPPTEVLCLMN 166

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
            I    L DDEEYE+I+ED+R EC KYG + ++ IPRP + G + PGVGK+++E+   + 
Sbjct: 167 MIETSELEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIR-GIDVPGVGKIYVEFASLID 225

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           C  A  AL+GRKF    V   ++  + Y  +++
Sbjct: 226 CQKAATALTGRKFNQRLVVTSFFSPNSYHRREF 258


>gi|112490659|pdb|2G4B|A Chain A, Structure Of U2af65 Variant With Polyuridine Tract
          Length = 172

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 91/199 (45%), Positives = 121/199 (60%), Gaps = 30/199 (15%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+
Sbjct: 4   ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 62

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
           E + AMA DGIIF+G ++++RRP DY           QP P                  G
Sbjct: 63  ETTQAMAFDGIIFQGQSLKIRRPHDY-----------QPLP------------------G 93

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  VTD A 
Sbjct: 94  AHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAI 153

Query: 354 AALNGLKMGDKTLTVRRAT 372
           A LNG+++GDK L V+RA+
Sbjct: 154 AGLNGMQLGDKKLLVQRAS 172


>gi|449802099|pdb|3VAF|A Chain A, Structure Of U2af65 Variant With Bru3 Dna
 gi|449802100|pdb|3VAF|B Chain B, Structure Of U2af65 Variant With Bru3 Dna
 gi|449802101|pdb|3VAG|A Chain A, Structure Of U2af65 Variant With Bru3c2 Dna
 gi|449802102|pdb|3VAG|B Chain B, Structure Of U2af65 Variant With Bru3c2 Dna
 gi|449802103|pdb|3VAH|A Chain A, Structure Of U2af65 Variant With Bru3c4 Dna
 gi|449802104|pdb|3VAH|B Chain B, Structure Of U2af65 Variant With Bru3c4 Dna
 gi|449802105|pdb|3VAI|A Chain A, Structure Of U2af65 Variant With Bru3c5 Dna
 gi|449802106|pdb|3VAI|B Chain B, Structure Of U2af65 Variant With Bru3c5 Dna
 gi|449802107|pdb|3VAJ|A Chain A, Structure Of U2af65 Variant With Bru5c6 Dna
 gi|449802108|pdb|3VAJ|B Chain B, Structure Of U2af65 Variant With Bru5c6 Dna
 gi|449802109|pdb|3VAK|A Chain A, Structure Of U2af65 Variant With Bru5 Dna
 gi|449802110|pdb|3VAK|B Chain B, Structure Of U2af65 Variant With Bru5 Dna
 gi|449802113|pdb|3VAL|A Chain A, Structure Of U2af65 Variant With Bru5c1 Dna
 gi|449802114|pdb|3VAL|B Chain B, Structure Of U2af65 Variant With Bru5c1 Dna
 gi|449802115|pdb|3VAL|D Chain D, Structure Of U2af65 Variant With Bru5c1 Dna
 gi|449802116|pdb|3VAL|I Chain I, Structure Of U2af65 Variant With Bru5c1 Dna
 gi|449802117|pdb|3VAM|A Chain A, Structure Of U2af65 Variant With Bru5c2 Dna
 gi|449802118|pdb|3VAM|B Chain B, Structure Of U2af65 Variant With Bru5c2 Dna
          Length = 174

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 91/199 (45%), Positives = 121/199 (60%), Gaps = 30/199 (15%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+
Sbjct: 6   ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 64

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
           E + AMA DGIIF+G ++++RRP DY           QP P                  G
Sbjct: 65  ETTQAMAFDGIIFQGQSLKIRRPHDY-----------QPLP------------------G 95

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  VTD A 
Sbjct: 96  AHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAI 155

Query: 354 AALNGLKMGDKTLTVRRAT 372
           A LNG+++GDK L V+RA+
Sbjct: 156 AGLNGMQLGDKKLLVQRAS 174


>gi|358378060|gb|EHK15743.1| hypothetical protein TRIVIDRAFT_79964 [Trichoderma virens Gv29-8]
          Length = 503

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 152/533 (28%), Positives = 233/533 (43%), Gaps = 96/533 (18%)

Query: 33  RDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKER-------------- 78
           RDR   D  S G DRR D+ +  DR   R   R D     NR +ER              
Sbjct: 11  RDREREDRYSSGRDRRGDREWDRDRGSYRRDARRDDDERPNR-REREPYDDRRRGGGRER 69

Query: 79  --------RHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--A 127
                   R     RS S  + R  +  L+      + KRR + +D+ PP   ++    A
Sbjct: 70  ERRDDGFARQEQPRRSPSPPKKREPTPDLTDIVPVLERKRRLTQWDIKPPGYDLVTAEQA 129

Query: 128 AVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA-----TRHARRVYVG 180
            + G   LPG P           P   T+L AF   P   +T        +R A+R+ V 
Sbjct: 130 KLSGMFPLPGAP--------RQQPMDPTKLQAFMTQPGGQVTSAGLKASNSRQAKRLLVS 181

Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMA 240
            +P    + A+ +FF+  +   G N     D  V    + +K FA +E R   +A+ A+A
Sbjct: 182 NVPSSVTDDALISFFNLQLN--GLNVIDSSDPCVLSQFSQDKAFAVLEFRNASDATVALA 239

Query: 241 LDGIIFE------GVA------VRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGA 287
           LDGI  E      G A      + +RRP DY  P L   + P  P    N+         
Sbjct: 240 LDGITMEADDAQNGTANGGNHGLVIRRPKDYVMPALPDEM-PYDPEVISNVV-------- 290

Query: 288 IGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
                 PD   ++ +  +P +  E Q+ ELL +FG    F LVKDR T  S+G  F  Y 
Sbjct: 291 ------PDTVHKLCITNIPSFLNEDQVIELLAAFGKPKAFVLVKDRSTEESRGIAFTEYL 344

Query: 345 DPAV-TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
           +P+   + A  +LNG+ +G K L V +A+                   I   ++A    G
Sbjct: 345 EPSTANEPALNSLNGMDVGGKKLKVTKAS-------------------IGPTQVANFDVG 385

Query: 404 MNTLGGGMSLFGETLAK--VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           +  + G  S     + +  V+ L   +T + L D+++YEEI ED+++EC K+G +V + +
Sbjct: 386 ITAISGLASQTSNDIERSSVIQLLNMVTPEELMDNDDYEEICEDVQDECSKFGKVVELKV 445

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           PRP     ++ GVGK+++++        A  AL+GRKF   TV + Y+PE+ +
Sbjct: 446 PRPSGGSRQSTGVGKIYVKFDSEESATKALTALAGRKFADRTVVSTYFPEENF 498


>gi|170053756|ref|XP_001862821.1| splicing factor U2AF 50 kDa subunit [Culex quinquefasciatus]
 gi|167874130|gb|EDS37513.1| splicing factor U2AF 50 kDa subunit [Culex quinquefasciatus]
          Length = 382

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 92/221 (41%), Positives = 129/221 (58%), Gaps = 18/221 (8%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 115 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQMH-LSGLAQAAGNPVLACQI 173

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           N +K FAF+E R+++E + AMA DGI F+G ++++RRP DY           QP P +  
Sbjct: 174 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDY-----------QPMPGMTD 222

Query: 279 AAVGLASGAIGGA------EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
           +AV        G       + P ++F+GGLP Y  E Q+KELL SFG L  F+LVKD  T
Sbjct: 223 SAVAPVQEKFSGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAAT 282

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           G SKGY F  Y + ++TD A A LNG+++GDK L V+RA+ 
Sbjct: 283 GLSKGYAFAEYVEYSITDQAIAGLNGMQLGDKKLIVQRASV 323


>gi|302916595|ref|XP_003052108.1| hypothetical protein NECHADRAFT_38412 [Nectria haematococca mpVI
           77-13-4]
 gi|256733047|gb|EEU46395.1| hypothetical protein NECHADRAFT_38412 [Nectria haematococca mpVI
           77-13-4]
          Length = 564

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 134/455 (29%), Positives = 214/455 (47%), Gaps = 60/455 (13%)

Query: 79  RHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRR-SGFDMAPPAAAMLPG--AAVPGQ--L 133
           RH +R  +    + R  +  L+      + KRR + +D+ PP    +    A + G   L
Sbjct: 127 RHENRRSASPPPKKREPTPDLTNIVPILERKRRLTQWDIKPPGYDNVTAEQAKLSGMFPL 186

Query: 134 PGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ-----QATRHARRVYVGGLPPLANE 188
           PG P           P   ++L AF   P   +T        +R ++R+ V  +P   +E
Sbjct: 187 PGAP--------RQQPMDPSKLQAFMNQPGGQVTSAGLKANNSRQSKRLLVSKIPSGTSE 238

Query: 189 QAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE- 247
           +A+ +FF+  +   G N     D  +    ++++ FA +E R   EA+ A+ALDG   E 
Sbjct: 239 EALISFFNLQLN--GLNVIDATDPCILCQFSNDRSFAVLEFREASEATVALALDGTSMEP 296

Query: 248 ----------GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRV 297
                        + +RRP DY   + A       +P++      +    I      +++
Sbjct: 297 DDANGASNGESRGLEIRRPRDY--VVPAVTEEVSYNPDV---VSNIVPDTI------NKL 345

Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP-AVTDIACAAL 356
            +  +P +  E Q+ ELL +FG    F LVKDR T  S+G  F  YQDP A    A   L
Sbjct: 346 CITNIPPFLAEDQVIELLAAFGKPKAFVLVKDRGTEESRGIAFAEYQDPNAANPTALDTL 405

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGE 416
           NG+ +G K L V +A+  G ++     +        AI  +A QT+  N + G       
Sbjct: 406 NGMDVGGKKLKVTKASI-GPTQVANFDV-----GITAISGLASQTA--NDVEG------- 450

Query: 417 TLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
             ++VL L   +TA+ L D+++YEEI ED++EEC K+G ++++ IPRP     ++ GVGK
Sbjct: 451 --SRVLQLLNMVTAEELLDNDDYEEICEDVKEECSKFGKIIDMKIPRPTGGSRQSAGVGK 508

Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           +F++Y        A  AL+GRKF   TV   Y+PE
Sbjct: 509 IFVKYETIEDTTKALKALAGRKFADRTVVTTYFPE 543


>gi|348681357|gb|EGZ21173.1| hypothetical protein PHYSODRAFT_488481 [Phytophthora sojae]
          Length = 640

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 116/404 (28%), Positives = 181/404 (44%), Gaps = 88/404 (21%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           A + AR +YVG LPP      +  F S ++  +G  +  PG+ ++N +I+ +  FAF EM
Sbjct: 272 AQKPARELYVGNLPPNVTGPQLQEFLSTIIQQVGLTTQ-PGNPIINTWISTDGHFAFCEM 330

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPS---------------- 273
           R+VEE + A+ L+ +   G  ++  RP  +       +GP QP                 
Sbjct: 331 RSVEECNLALLLNQLSLLGQPLKFGRPRSF-------MGPPQPMPQISARTQTALTNLGC 383

Query: 274 ------------PNLNLAAVG-------------------------LASGAIGGAEGPDR 296
                       P+L+ AA                           +AS  +  +    R
Sbjct: 384 TPNPAWFAQPAVPSLDEAAAAPVGDSSTLAGATAAAVAAAQPAVPAVASTTVDASLSAHR 443

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           + +  +P   TE Q+KEL+E FG L  F LVKD  TG S G     Y+D +VT  A   L
Sbjct: 444 LIMSNIPVVLTEDQVKELVEPFGALKSFTLVKDTATGASMGSALFEYEDDSVTAQAVEGL 503

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGE 416
           NGL +G   L+V+   ASG +                                    F +
Sbjct: 504 NGLSIGGILLSVQCQPASGAALPAAPGATPN--------------------------FED 537

Query: 417 TLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
             + VL +   ++ D L DD+EY ++ ED+ EEC ++G +  + IPRP ++G E PG+G 
Sbjct: 538 QPSAVLKMANMVSIDELRDDDEYADLAEDVEEECKRFGNVTGLEIPRP-KDGEEVPGLGC 596

Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           +++ + +      A  AL+GRKFGGN V   Y+P DK+  +++S
Sbjct: 597 IYVRFEEEKNAVDALKALNGRKFGGNIVKVTYFPLDKFDKQEFS 640


>gi|347968829|ref|XP_003436304.1| AGAP002908-PC [Anopheles gambiae str. PEST]
 gi|333467822|gb|EGK96709.1| AGAP002908-PC [Anopheles gambiae str. PEST]
          Length = 250

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 93/212 (43%), Positives = 125/212 (58%), Gaps = 21/212 (9%)

Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
           Q+KELL SFG L  F+LVKD  TG  KGY F  Y +  VTD A A LNG+++GDK L V+
Sbjct: 58  QVKELLLSFGQLKAFNLVKDAATGLGKGYAFAEYVEYTVTDQAIAGLNGMQLGDKKLIVQ 117

Query: 370 RATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGET--LAKVLCLTEA 427
           RA+    +K    +++A  Q  +                 G+SL G +    +VLCL   
Sbjct: 118 RASVG--AKNSNAAVVAPVQIQVP----------------GLSLVGSSGPPTEVLCLLNM 159

Query: 428 ITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
           +T D L D+EEYE+ILED+REEC KYG + +V IPRP + G + PG GKVF+E+   V C
Sbjct: 160 VTPDELKDEEEYEDILEDIREECNKYGVVRSVEIPRPIE-GVDVPGCGKVFVEFNSIVDC 218

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 219 QKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 250


>gi|355727237|gb|AES09128.1| U2 small nuclear RNA auxiliary factor 2 [Mustela putorius furo]
          Length = 301

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 104/251 (41%), Positives = 141/251 (56%), Gaps = 17/251 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 65  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 120

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 121 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 179

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 180 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 232

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 233 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 290

Query: 340 FCVYQDPAVTD 350
           FC Y D  VTD
Sbjct: 291 FCEYVDINVTD 301


>gi|395529346|ref|XP_003766777.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Sarcophilus
           harrisii]
          Length = 462

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 104/251 (41%), Positives = 141/251 (56%), Gaps = 17/251 (6%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 126 RSPRHEKKKKIRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 181

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 182 PAPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 240

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P       PG  S N ++ 
Sbjct: 241 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPL------PGM-SENPSVY 293

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 294 VPGVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 351

Query: 340 FCVYQDPAVTD 350
           FC Y D  VTD
Sbjct: 352 FCEYVDINVTD 362


>gi|301121478|ref|XP_002908466.1| splicing factor U2af large subunit, putative [Phytophthora
           infestans T30-4]
 gi|262103497|gb|EEY61549.1| splicing factor U2af large subunit, putative [Phytophthora
           infestans T30-4]
          Length = 597

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 115/386 (29%), Positives = 174/386 (45%), Gaps = 77/386 (19%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           A + AR +YVG LPP      +  F S ++  +G  +  PG+ ++N + + +  FAF EM
Sbjct: 242 AQKPARELYVGNLPPNVTGPQLQEFLSTIIQQVGLTTQ-PGNPIINTWTSTDGHFAFCEM 300

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL---------NLAA 280
           R+VEE + A+ L+ +   G  ++  RP  +       +GP QP P +         NL  
Sbjct: 301 RSVEECNLALLLNQLSLLGQPLKFGRPRSF-------MGPPQPMPQVSARTQTALTNLGC 353

Query: 281 V-----------------------------GLASGAIGGAEGP---DRVFVGGLPYYFTE 308
                                          +A+    G+E     +R+ +  +P    E
Sbjct: 354 TPNPAWFAQHTVSSTETTTTETTLAEATLSAIAAAQPAGSEAVSSGNRLIMSNIPVVLAE 413

Query: 309 TQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
            Q+KEL+E FG L  F LVKD  TG S G     Y+D  V   A   LNGL +G   L+V
Sbjct: 414 EQVKELVEPFGKLKSFTLVKDSATGASLGSALFEYEDSDVAAQAVEGLNGLSIGGILLSV 473

Query: 369 RRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAI 428
           +R  AS  +     +      Q  A+ KMA                             +
Sbjct: 474 QRQPASSAAALPSAAAANPEDQPSAVLKMA---------------------------NMV 506

Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA 488
           + D L DDEEY ++ ED+ EEC ++G +  + IPRP ++G E PG+G +++ +       
Sbjct: 507 SIDELRDDEEYADLAEDVEEECKRFGGVTGMEIPRP-KDGEEVPGLGCIYVRFGKEEDAV 565

Query: 489 TAKNALSGRKFGGNTVNAFYYPEDKY 514
           +A  AL+GRKFGGN V   Y+P DK+
Sbjct: 566 SALKALNGRKFGGNIVKVTYFPVDKF 591


>gi|320590609|gb|EFX03052.1| splicing factor u2af large subunit [Grosmannia clavigera kw1407]
          Length = 420

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 152/297 (51%), Gaps = 35/297 (11%)

Query: 226 FVEMRTVEEASNAMALDGIIFEG--------VAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
            VE +   +A+ A+AL+GI  E           + ++RP DY            P     
Sbjct: 1   MVEFKEPIDATVALALNGISMEAEDASGSGQSGLSIQRPKDYIVPAVVDYSVYHP----- 55

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
               G+ S  +   + P ++ +  +P Y ++ Q+ ELL SFG L  F L+KDR T  S+G
Sbjct: 56  ----GVVSNVV--IDTPFKIAITNIPSYLSDEQVTELLVSFGELRAFVLLKDRSTEESRG 109

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
             FC Y +P  TD+A   LNG+ +GD+ L V++A+      T  E               
Sbjct: 110 VAFCEYTEPQSTDVAIQGLNGMDLGDRKLRVQKASIGITQVTSVE--------------- 154

Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
            +  + M+ L G +S     +++V+ L   +TA+ L ++++YE+I ED+ EEC K+G ++
Sbjct: 155 -MGVNAMSLLAGTISQEASDVSRVVQLLNMVTAEELVNNDDYEDICEDVTEECAKFGPVM 213

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            + +PRP   G  +PGVGK+F+++        A  AL+GRKF   TV A Y+PE+ +
Sbjct: 214 GLKVPRPASGGRHSPGVGKIFVKFDSRDSATKALKALAGRKFSDRTVVATYFPEENF 270


>gi|346324367|gb|EGX93964.1| splicing factor u2af large subunit [Cordyceps militaris CM01]
          Length = 583

 Score =  164 bits (416), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 126/424 (29%), Positives = 203/424 (47%), Gaps = 58/424 (13%)

Query: 107 KSKRRSGFDMAPPA--AAMLPGAAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP   A     A + G   LPG P       Q + P   T+L A    P
Sbjct: 197 RKRRMTQWDIKPPGYEAVTSEQAKMSGMFPLPGAPRQ-----QQVDP---TKLQALMNQP 248

Query: 163 V--QV----MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNV 216
              QV    +    +R ARR+ V  +P    E A+  FF+  +  +  N     D     
Sbjct: 249 AGGQVSSAGLKANNSRQARRLLVSDIPSGTTEDALVAFFNLQLNGL--NVIEATDPCALC 306

Query: 217 YINHEKKFAFVEMRTVEEASNAMALDG--IIFEGVAVRVRRPTDYNPTLAAALGPGQPSP 274
            ++++K FA +E +   +A+ A+ALDG  ++ +   + +RRP DY       + P  P  
Sbjct: 307 QLSNDKSFAVLEFKNTGDATVALALDGSSMVADTPGLSIRRPKDY-------VMPAVPDE 359

Query: 275 NLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRD 331
            +    V   S ++     PD   ++ +  +P + TE Q+ ELL +FG    F LVK+R 
Sbjct: 360 IIFNPEV--VSNSV-----PDTIHKLCITNIPPFLTEDQVLELLAAFGKPKAFVLVKERS 412

Query: 332 TGNSKGYGFCVYQDPA-VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQ 390
           T  S+G  F  Y +P    + A   LNG+ +G K L  R+A   G      ++ +     
Sbjct: 413 TEESRGIAFAEYVEPTNANEPALNTLNGMDVGGKKLKARKACVGGTQVANFDAGIN---- 468

Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC 450
             AI  +A Q +G +              +VL L   +TA+ L D+++YEEI ED+R+EC
Sbjct: 469 --AISNLAGQGNGGDA------------TRVLQLLNMVTAEELLDNDDYEEICEDVRDEC 514

Query: 451 GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYP 510
            KYG +++V +PRP     ++ GVG++F+++        A  AL+GRKF   TV   Y+P
Sbjct: 515 SKYGKVLDVKVPRPAGGSRQSAGVGRIFVKFESVDSTTGALKALAGRKFADRTVVTTYFP 574

Query: 511 EDKY 514
           E+ +
Sbjct: 575 EENF 578


>gi|66808005|ref|XP_637725.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
           discoideum AX4]
 gi|60466159|gb|EAL64222.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
           discoideum AX4]
          Length = 671

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 83/201 (41%), Positives = 120/201 (59%), Gaps = 10/201 (4%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           + +RR+YVG +PP  ++  +  FF+  + A   N+  PG  VV   IN  K FAF+E R+
Sbjct: 265 KQSRRIYVGNIPPGISDSELMEFFNAAVLAANLNTK-PGPPVVFCQINAPKCFAFIEFRS 323

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
            EEA+NAM  DGI  +   +++RRP DY  T     G     P++    V          
Sbjct: 324 PEEATNAMRFDGISLKNFTLKIRRPKDYQSTSDNTGGNASLLPSIVPTNV---------P 374

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           +  ++++VGGLP   +E Q+K LL ++G L  F+LVKD +TG SKG+ FC YQD  VTD+
Sbjct: 375 DSENKIYVGGLPSNLSEEQVKSLLSAYGKLKAFNLVKDTNTGVSKGFAFCEYQDSEVTDV 434

Query: 352 ACAALNGLKMGDKTLTVRRAT 372
           AC+ LNG+ + DKTL V+RA+
Sbjct: 435 ACSKLNGIPLADKTLVVQRAS 455



 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 56/95 (58%), Gaps = 3/95 (3%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           V+ +   +  + + DD+EY+ IL D++EEC ++G + ++ +P P +N  E   V +V++E
Sbjct: 570 VIQILNLVDREDIFDDKEYDNILIDVKEECEQFGEVQSIWLPLPSKNPLE---VTRVYVE 626

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           +        A  AL GRK+ G  + + YYPED +F
Sbjct: 627 FSQVEFAQKACLALGGRKYNGRVLFSAYYPEDLFF 661


>gi|325179530|emb|CCA13927.1| splicing factor U2af large subunit putative [Albugo laibachii Nc14]
          Length = 833

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 119/402 (29%), Positives = 179/402 (44%), Gaps = 86/402 (21%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           A + AR +YVG LP       +  F   ++  +G  S  PG+ +++V+I+ +  FAF EM
Sbjct: 467 AMKPARELYVGNLPATITGPQLQEFLGTIIQQVGL-STQPGNPILSVWISTDGHFAFCEM 525

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSP-----------NLNL 278
           R+VEE + A+ L+ +   G  ++  RP  +       +GP QP P           NL  
Sbjct: 526 RSVEECNLALLLNQLPLLGQPLKFGRPRSF-------MGPPQPMPIVSARTQTALVNLGC 578

Query: 279 A--------------------------------------AVGLASGAIGGAEGPD--RVF 298
                                                  A  LA         P+  ++ 
Sbjct: 579 TPNPVWFASPDVTSFGSDPMGFGNGLNGFLSSSSSSLMSATALADSLASLPSDPNATQLL 638

Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
           +  +P    E Q+KEL++ FG L  F L+KD  TG S G  F  YQ+  VT  A   L+G
Sbjct: 639 MSNIPGVLAEEQVKELVQPFGELRFFKLIKDPITGQSTGTAFFEYQENQVTTEALNGLDG 698

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
           L +G   L+VRRA  +  +K  Q ++L                     + G     GE  
Sbjct: 699 LDIGGVKLSVRRAPDA--TKYPQIAVL---------------------MPGAA---GEEP 732

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
             VL +   ++ D L +DEE+ ++ ED+ EEC ++GT++ + IPR  Q+G E  G G +F
Sbjct: 733 GPVLRMANMVSEDELKNDEEFADLKEDVEEECKRFGTIIALDIPR-SQDGEEIAGTGNIF 791

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           + Y D      A+ AL GRKFGGN V   Y+   K+  K+YS
Sbjct: 792 VRYSDTKEATAAQKALCGRKFGGNVVKVTYFSLSKFEAKEYS 833


>gi|71028054|ref|XP_763670.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large subunit
           [Theileria parva strain Muguga]
 gi|68350624|gb|EAN31387.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large
           subunit, putative [Theileria parva]
          Length = 380

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 115/362 (31%), Positives = 175/362 (48%), Gaps = 41/362 (11%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFS-QVMTAIGGNSAGPGDAVVN---VYINHEKK 223
           ++A +  +R+YVG LP     Q +  FF+  +M  + GN+  P D +V    +Y N ++ 
Sbjct: 49  EEAKKRQKRLYVGNLPSGTKLQDVVDFFNGALMAMVPGNTIDPRDPLVTKTEIY-NPDQG 107

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           + F+E +T E A  A  LDGI   G ++++RRP D+N                       
Sbjct: 108 YCFLEFKTPELADLAFKLDGITCNGYSLKLRRPLDFN----------------------- 144

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
               +G      +VFV  +P   TE Q+KELLE  G L   +L+KD  TG SKGYGF  +
Sbjct: 145 ----LGTNSDDTKVFVQNIPLDVTEDQMKELLEKHGKLKLANLLKDPATGVSKGYGFFEF 200

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT----ASGQSKTEQESILAQAQQHIAIQKMAL 399
           +D   + +A   LNG  +G   L+V+ A     ASG    + ++  +     I    ++ 
Sbjct: 201 EDARSSKLAVLHLNGSVLGKNVLSVKHAAFGYFASGGKPIDCKA--SNLPNSITQSILSN 258

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
              G+    G     G   +KV+ L   +  + L  D  Y EI+  ++EE  KYG L  V
Sbjct: 259 PLLGLQLQNG--RRIGSNPSKVIQLLNMVFHEDLISDYNYNEIVRLVKEEAQKYGPLQEV 316

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKYFNKD 518
           VIPRPD++     GVGKVF+ Y D +    A+   +GR F  N  V + ++PED +    
Sbjct: 317 VIPRPDKDLTFKEGVGKVFIRYEDLLSARKAQYMFNGRVFDKNRIVCSAFFPEDLFITGK 376

Query: 519 YS 520
           Y+
Sbjct: 377 YT 378


>gi|46125343|ref|XP_387225.1| hypothetical protein FG07049.1 [Gibberella zeae PH-1]
          Length = 564

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 123/414 (29%), Positives = 193/414 (46%), Gaps = 63/414 (15%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP    +    A + G   LPG P           P   ++L AF   P
Sbjct: 159 RQRRLTQWDIKPPGYDNVTAEQAKLSGMFPLPGAP--------RQQPMDPSKLQAFMNQP 210

Query: 163 VQVMTQQA-----TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
              +T  +     +R ++R+ V  +PP  +E A+  FF+  +   G N     D  V   
Sbjct: 211 GGQVTSASLKASNSRQSKRLLVSRIPPGTSEDALIAFFNLQLN--GLNVIDTTDPCVLCQ 268

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFEG-----------VAVRVRRPTDYNPTLAAA 266
            ++++ FA +E +   E + A+ALDGI  E              + +RRP DY   + A 
Sbjct: 269 FSNDRSFAVIEFKDAPETTVALALDGISMEANDASNGADGGHRGLEIRRPRDY--VVPAV 326

Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
                  P +    V          +  +++ +  +P + TE QI ELL SFG    F L
Sbjct: 327 TEDVAYDPEVVSNVV---------PDTVNKLSITNIPPFLTEEQIIELLASFGKPKAFVL 377

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTD-IACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
           VKDR T  S+G  F  YQDPAV++  A   LNG+ +G K + V +A+             
Sbjct: 378 VKDRGTEESRGIAFAEYQDPAVSNPTALDTLNGMDIGGKQIKVSKAS------------- 424

Query: 386 AQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL--AKVLCLTEAITADALADDEEYEEIL 443
                 I   ++A    G+  + G  S     +  ++VL L   +TA+ L D+++YEEI 
Sbjct: 425 ------IGPTQVANFDVGITAISGLASQTANEVESSRVLQLLNMVTAEELLDNDDYEEIC 478

Query: 444 EDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGR 497
           ED+REEC KYG +++V +PRP     ++ GVGK+F++Y        A  AL+GR
Sbjct: 479 EDVREECSKYGKILDVKVPRPTGGSRQSAGVGKIFVKYEHTEDTTKALQALAGR 532


>gi|406699650|gb|EKD02849.1| splicing factor (U2 snRNP auxiliary factor large subunit)
           [Trichosporon asahii var. asahii CBS 8904]
          Length = 487

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 172/366 (46%), Gaps = 62/366 (16%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
             +R+Y  G+    NE  +   F++V+  +G +    G+AV  V IN EK + +VE  + 
Sbjct: 135 QKKRIYFAGVTDAMNENRLRKLFNKVLRDVGYD----GEAVSGVEINKEKDYVWVEFVSS 190

Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDY---NPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           + A        + F+G  +  +RP D+   +P L      G P+                
Sbjct: 191 DLAQVVFNKKDLDFDGAPIEPKRPKDFVGIDPALGFMGVSGDPN---------------- 234

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
                +++FVGGLP      ++KELL  FG L  F+LVK+ +   SKG+ F  + DPAVT
Sbjct: 235 -----NKLFVGGLPTTLGSDEVKELLTPFGELRTFNLVKEGNGSVSKGFAFVEFLDPAVT 289

Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQ-ESILAQAQQHIAIQKMALQTSGMNTLG 408
           DIA   LNG ++GD+ L V+RA  +G+S +    S  AQ     +I + A + +      
Sbjct: 290 DIAIQGLNGFQLGDRALVVQRAATTGRSASSTGVSGTAQFLAQSSILEKADEPA------ 343

Query: 409 GGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP---- 464
                      +V+ +   + AD L DD++Y +ILED+R+EC K+G +  V IPRP    
Sbjct: 344 --------PATRVILMLNMVGADELYDDQDYADILEDIRDECSKFGEVEGVRIPRPVPKS 395

Query: 465 ---------------DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
                          ++   +  GVG+V++ Y D      A  AL GR+F G T+     
Sbjct: 396 TKWEPSDSAAQTAEKNRRIDQENGVGRVYVMYADTESAVKAMRALGGRQFAGRTILVASC 455

Query: 510 PEDKYF 515
            E+ + 
Sbjct: 456 SEEDFL 461


>gi|440290938|gb|ELP84237.1| splicing factor u2af large subunit, putative [Entamoeba invadens
           IP1]
          Length = 623

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 127/472 (26%), Positives = 208/472 (44%), Gaps = 67/472 (14%)

Query: 74  RDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQL 133
           RD   RHR R  SH  +R R+R  S  P R     + RS     PP       A  P   
Sbjct: 38  RDYHERHRDRYESHYDNRPRDRYDS--PKRRYYDKEDRS-----PPRHDERRKARSPS-- 88

Query: 134 PGVPSAVPEMAQNMLPFGATQLGAFPLMPV---QVMTQQATRH----ARRVYVGGLPPLA 186
                        + P G      +   PV    ++ QQ   H    +RRVYVG +    
Sbjct: 89  -------------LSPLGKKIKSRWDEQPVADASLLQQQLNVHQEKGSRRVYVGNINTTT 135

Query: 187 NEQAIATFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDG 243
            EQ I   F+  M     +  N     D +V+  +N+EK +AF+E RT ++A  A++LDG
Sbjct: 136 TEQDIVEAFNDAMRRGDYVDKNDKS--DIIVSTEVNYEKSYAFIEFRTFDQAVKALSLDG 193

Query: 244 IIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLP 303
           +  +G +V+VRRP D+NP L       Q      L  VG      G       +++G +P
Sbjct: 194 LTIKGASVKVRRPKDFNPVLPFISSLSQ------LMEVGTTKPRDGV------MYMGNIP 241

Query: 304 YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV--YQDPAVTDIACAALNGLKM 361
              ++ QI++ LE+   L  + +V+D   G  +G  +C+  YQ+P   D    A NG+ +
Sbjct: 242 LQMSDEQIQKKLENLNPLKKYVVVRDPSLGAPQGKCYCLFEYQNPEYKD-KVLAFNGIIL 300

Query: 362 GDKTLTVRRATASGQS--KTEQESILAQAQQHIAIQKMALQTSG-MNTLGGGMSLFGETL 418
           G   + V     SG    K    + L +    +  Q+  + T+  +N+  G   +F   L
Sbjct: 301 GGDKIEV----CSGLEGFKHFPTAALNELCMKMFPQRTDIITATLLNSSVGYSDVFERVL 356

Query: 419 -----------AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
                       +V+ L      + L +++ Y E+++D+RE C  YG ++++ IPRP + 
Sbjct: 357 HNSEDLSQYECTRVIVLFNMFFPEDLNNEQRYIELVDDIREACIAYGEVISISIPRPTET 416

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
                G+G+ F+E+ D     T    +  R++    + A +Y E KY ++ +
Sbjct: 417 NKRPSGIGRAFVEFKDVEMAKTCWREIVKRRYDNRQIVAGFYSESKYNSRSF 468


>gi|167395950|ref|XP_001741817.1| hexokinase [Entamoeba dispar SAW760]
 gi|165893477|gb|EDR21726.1| hexokinase, putative [Entamoeba dispar SAW760]
          Length = 974

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 124/494 (25%), Positives = 219/494 (44%), Gaps = 103/494 (20%)

Query: 54  KYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG 113
           K  RE  R  DR DRHR   R  E  H +R+    +DR        SPS SP   K  S 
Sbjct: 286 KTRREYSRSEDREDRHR---RVAEEEHYNRNIRRRADR--------SPSLSPLGDKLHSR 334

Query: 114 FDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRH 173
           +D  P A                               + Q+        Q +     R 
Sbjct: 335 WDEQPKA-----------------------------IDSVQIS-------QQLNVHQERA 358

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           A+R+YVG +    +E+ I   F++ M     +  N   P D + ++ +N+E+ +AF+E R
Sbjct: 359 AKRIYVGNINSSTSEKDIVDAFNEAMRRGDYVDKND--PRDIITHIEVNYERSYAFLEFR 416

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPT------LAAALGPGQPSPNLNLAAVGLA 284
           T+EEA  A++LDG+  +G +V+VRRP DYNP       L+  + PG  +P  ++      
Sbjct: 417 TLEEAVKALSLDGLTIKGASVKVRRPKDYNPVLPFISGLSQLMEPGTTNPRESI------ 470

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV-- 342
                       +++G +P   T+ QI++ LE+   L  F +++D D G  +G  +C+  
Sbjct: 471 ------------LYMGNIPLQMTDEQIRKKLENLNPLKKFFVIRDPDLGAPQGKCYCLFE 518

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           YQ+P   +      +G+ +G   + V     SG    +    L +A  +    KM   T+
Sbjct: 519 YQNPEYKE-KILTFDGINLGGNKIEV----CSGVDGFKH---LPKASLNELFSKMFPHTT 570

Query: 403 G------MNTLGGGMSLFGETL-----------AKVLCLTEAITADALADDEEYEEILED 445
                  +N+  G  ++F + L           ++++ +   +  + L D + Y E+++D
Sbjct: 571 DLVIGTLLNSSVGYSTVFEKILKPSEKIEDQHVSRIIVIFNMVYPEDLIDQQRYIELIDD 630

Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           +R  C +YG + ++ IPRP +   +  G+G+VF+E+    G       +  +++   ++ 
Sbjct: 631 IRFVCQEYGEVESISIPRPTEENKKPSGLGRVFIEFKTIDGAIRCWKEIVKKRYDNRSLL 690

Query: 506 AFYYPEDKYFNKDY 519
             +Y E KY N+ +
Sbjct: 691 VGFYSEKKYANRMF 704


>gi|401402634|ref|XP_003881297.1| rna recognition motif (RRM)-containing protein,related [Neospora
           caninum Liverpool]
 gi|325115709|emb|CBZ51264.1| rna recognition motif (RRM)-containing protein,related [Neospora
           caninum Liverpool]
          Length = 555

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/221 (38%), Positives = 127/221 (57%), Gaps = 23/221 (10%)

Query: 180 GGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAM 239
           G LP    +  +  +F++++ ++      PGD +V+VY+N  ++FAF+E R++EEA+  +
Sbjct: 16  GNLPVPVTQGEVQQYFNELLNSLLPQKV-PGDTIVHVYVNPARRFAFLEHRSIEEANFTL 74

Query: 240 ALDGIIFEGVAVRVRRPTDYNPTLA--------AALG---------PGQPSPNLNLAAVG 282
            LDG+ +   A+ +RRP DYNPTLA        A LG         P Q +     A   
Sbjct: 75  GLDGVSWRNCALSLRRPQDYNPTLADQQYREERARLGSMTGFAVPPPSQAATPAAPAESS 134

Query: 283 LASGAIGGA-----EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           L +GA+G       + P ++F+GGLP+  TE   K+LLE+FG L    +VKD+  G+ KG
Sbjct: 135 LIAGALGIVSTTVPDSPHKIFIGGLPHSITEQGCKQLLEAFGQLRALHVVKDQQRGDCKG 194

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
           + FC Y DP VTD+A A LN +++ D+ L VRRA   GQ K
Sbjct: 195 FAFCEYLDPNVTDVAVAGLNNMRIADRVLQVRRAMPHGQMK 235


>gi|397633851|gb|EJK71162.1| hypothetical protein THAOC_07424, partial [Thalassiosira oceanica]
          Length = 449

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 92/226 (40%), Positives = 133/226 (58%), Gaps = 21/226 (9%)

Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFF------SQVMTAIGGNSAG------PGDAV 213
           M    TRHARR+Y+G +P ++ E  I +FF      S +M     N A         D +
Sbjct: 230 MNPNQTRHARRLYIGNIPDIS-ETEIHSFFRKTIEKSLIMNRDDKNYAQLQEEYIANDPI 288

Query: 214 VNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVA-VRVRRPTDYNPTLAAALGPGQP 272
           V+VYIN E++FAF+E RT++  +  ++LDGI  EG   V+V+RP DYN +LA     G  
Sbjct: 289 VSVYINRERRFAFIEFRTMDITTACLSLDGIDVEGRGKVKVKRPNDYNASLAPQTSNGV- 347

Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
             +L+ A +GL S  +   +GP+++F+GGLPY+ TE+Q+ ELL +FG++  F LVK   +
Sbjct: 348 --SLDTAKLGLVSSTV--PDGPNKIFIGGLPYHLTESQVLELLGAFGSVRAFHLVKSEPS 403

Query: 333 G-NSKGYGFCVYQDPAVTDIACAALNGLKM-GDKTLTVRRATASGQ 376
              SKGY F  Y DP +T +AC  LNG+ + G K L+ R A    Q
Sbjct: 404 ATTSKGYCFVEYADPNITQVACMGLNGMDLGGGKQLSCRMAVQGLQ 449


>gi|357017085|gb|AET50571.1| hypothetical protein [Eimeria tenella]
          Length = 527

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 184/385 (47%), Gaps = 43/385 (11%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS-------AGPGDAVVNVY 217
           V   ++ R ARR+++  +PP   E  I  FF+  + A+   +       A     ++ V 
Sbjct: 155 VTHSESDRIARRLFISNIPPGTTEADICGFFNGALLAVNAQTGYTDLSLASDKPQLLPVE 214

Query: 218 ----INHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP--GQ 271
               +    +  F+++R+ E     + LDGI F   +++V RP +Y       + P  G 
Sbjct: 215 RCEGLQENSRHCFLDLRSHEWVVLCLKLDGITFNNNSLKVLRPKEY-------VQPPGGD 267

Query: 272 PSPNLNLAAVGLASGAIGG---AEGPDR-----VFVGGLPYYFTETQIKELLESFGTLHG 323
           P+  +++  +   +        A  P R     +++  LP    E Q+++LLE FG L  
Sbjct: 268 PAKTVHIPELERGTKPQQNEVRATAPPRSADCKLYIQNLPPEMGEDQVRDLLEQFGKLRV 327

Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-----ASGQSK 378
            +L+K+R TG  +GYGF  Y+DP VTD A  ALNG   G   L+V+R+         +  
Sbjct: 328 LNLIKNRQTGKHRGYGFFEYEDPEVTDQAIEALNGFVCGASVLSVQRSNFMPDLLPTKQH 387

Query: 379 TEQESILAQAQQHIAIQK--MALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADD 436
           T + + L  +  +  +    +A+Q     T+       GE  ++V+ L   I  + +  D
Sbjct: 388 TTEVTALPSSTSYAVLSDPVVAIQVRAGRTI-------GEKPSRVVQLLNTIYPEDIMTD 440

Query: 437 EEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSG 496
             +E  ++D R E  KYG L  V+IPRP+++    PGVGKVFL Y D      A+  L+G
Sbjct: 441 SSHEAAVKDTRSEAEKYGPLEEVLIPRPNEDLSYKPGVGKVFLVYGDVTSARRAQYMLNG 500

Query: 497 RKFGGN-TVNAFYYPEDKYFNKDYS 520
           R+F     V A ++PE K+ +  Y+
Sbjct: 501 RRFDQTRVVCAAFFPEQKFKDGQYT 525


>gi|84996015|ref|XP_952729.1| splicing factor [Theileria annulata strain Ankara]
 gi|65303726|emb|CAI76103.1| splicing factor, putative [Theileria annulata]
          Length = 380

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 174/362 (48%), Gaps = 41/362 (11%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFS-QVMTAIGGNSAGPGDAVVN---VYINHEKK 223
           ++A +  +R+YVG LP     Q +  FF+  +M  + GN+  P D +V    +Y N ++ 
Sbjct: 49  EEARKRQKRLYVGNLPSGTKLQDVVDFFNGALMAMVPGNTMDPRDPLVTKTEIY-NPDQG 107

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           + F+E +T E A     LDGI   G ++++RRP D+N                       
Sbjct: 108 YCFLEFKTPELADLGFKLDGITCNGYSLKIRRPLDFN----------------------- 144

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
               +G      +VFV  +P   TE ++K LLE  G L   +L+KD  TG SKGYGF  +
Sbjct: 145 ----LGANSDDTKVFVQNIPLDVTEDEMKALLEKHGKLKMANLLKDPATGVSKGYGFFEF 200

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRAT----ASGQSKTEQESILAQAQQHIAIQKMAL 399
           +D   + +A   LNG  +G   L+V+ A     ASG    + ++  +     I    ++ 
Sbjct: 201 EDARSSKLAVLHLNGSVLGKNVLSVKHAAFGYFASGGKPIDCKA--SNLPNSITQSILSN 258

Query: 400 QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNV 459
              G+    G     G   +KV+ L   +  + L  D  Y EI+  ++EE  KYG L  V
Sbjct: 259 PLLGLQLQNG--RRIGSNPSKVIQLLNMVFHEDLISDYNYNEIVRLVKEEAQKYGPLQEV 316

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKYFNKD 518
           VIPRPD++     GVGKVF+ Y + +    A+   +GR F  N  + + ++PED + +  
Sbjct: 317 VIPRPDKDLTFKEGVGKVFIRYENLLSARKAQYMFNGRVFDKNRIICSAFFPEDLFISGK 376

Query: 519 YS 520
           Y+
Sbjct: 377 YT 378


>gi|407043289|gb|EKE41863.1| U2 snRNP auxiliary factor large subunit, putative [Entamoeba
           nuttalli P19]
          Length = 628

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 116/467 (24%), Positives = 205/467 (43%), Gaps = 80/467 (17%)

Query: 73  NRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQ 132
           N D+E RHR        +R   R    SPS SP   K  S +D  P A            
Sbjct: 76  NEDREDRHRRVPEEERYNRSIRRRADRSPSLSPLGDKLPSRWDEQPKA------------ 123

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
                              + Q+        Q +     R A+R+YVG +    +E+ I 
Sbjct: 124 -----------------IDSVQIS-------QQLNVHQERAAKRIYVGNINSSTSEKDIV 159

Query: 193 TFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
             F++ M     +  N     D + ++ +N+E+ +AF+E RT+EEA  A++LDG+  +G 
Sbjct: 160 DAFNEAMRRGDYVDKNDTR--DIITHIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGA 217

Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
           +V+VRRP DYNP L    G  Q           +  G     E    +++G +P   T+ 
Sbjct: 218 SVKVRRPKDYNPVLPFISGLSQ----------LMEPGTTNPRESI--LYMGNIPLQMTDE 265

Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV--YQDPAVTDIACAALNGLKMGDKTLT 367
           QI++ LE+   L  F +++D D G  +G  +C+  YQ+P   +      +G+ +G   + 
Sbjct: 266 QIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLFEYQNPEYKE-KILTFDGINLGGNKIE 324

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG------MNTLGGGMSLFGETL--- 418
           V     SG    +    L +A  +    KM   T+       +N+  G  ++F + L   
Sbjct: 325 V----CSGVDGFKH---LPKASLNELFSKMFPHTTDLVIGTLLNSSVGYSTVFEKILKPS 377

Query: 419 --------AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                   ++++ +   +  + L D + Y E+++D+R  C +YG + ++ IPRP +   +
Sbjct: 378 EKIEDQHVSRIIIIFNMVYPEDLTDQQRYIELIDDIRFVCQEYGEVESISIPRPTEENKK 437

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
             G+G+VF+E+    G       +  +++   ++   +Y E KY N+
Sbjct: 438 PSGLGRVFIEFKTIEGAIKCWKEIIKKRYDNRSLLVGFYSEKKYANR 484


>gi|449707077|gb|EMD46798.1| RNA recognition motif domain containing protein [Entamoeba
           histolytica KU27]
          Length = 712

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 116/467 (24%), Positives = 205/467 (43%), Gaps = 80/467 (17%)

Query: 73  NRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQ 132
           N D+E RHR        +R   R    SPS SP   K  S +D  P A            
Sbjct: 76  NEDREDRHRRVPEEERYNRSIRRRADRSPSLSPLGDKLPSRWDEQPKA------------ 123

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
                              + Q+        Q +     R A+R+YVG +    +E+ I 
Sbjct: 124 -----------------IDSVQIS-------QQLNVHQERAAKRIYVGNINSSTSEKDIV 159

Query: 193 TFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
             F++ M     +  N     D + ++ +N+E+ +AF+E RT+EEA  A++LDG+  +G 
Sbjct: 160 DAFNEAMRRGDYVDKNDTR--DIITHIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGA 217

Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
           +V+VRRP DYNP L    G  Q           +  G     E    +++G +P   T+ 
Sbjct: 218 SVKVRRPKDYNPVLPFISGLSQ----------LMEPGTTNPRESI--LYMGNIPLQMTDE 265

Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV--YQDPAVTDIACAALNGLKMGDKTLT 367
           QI++ LE+   L  F +++D D G  +G  +C+  YQ+P   +      +G+ +G   + 
Sbjct: 266 QIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLFEYQNPEYKE-KILTFDGINLGGNKIE 324

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG------MNTLGGGMSLFGETL--- 418
           V     SG    +    L +A  +    KM   T+       +N+  G  ++F + L   
Sbjct: 325 V----CSGVDGFKH---LPKASLNELFSKMFPHTTDLVIGTLLNSSVGYSTVFEKILKPS 377

Query: 419 --------AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                   ++++ +   +  + L D + Y E+++D+R  C +YG + ++ IPRP +   +
Sbjct: 378 EKIEDQHVSRIIIIFNMVYPEDLTDQQRYIELIDDIRFVCQEYGEVESISIPRPTEENKK 437

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
             G+G+VF+E+    G       +  +++   ++   +Y E KY N+
Sbjct: 438 PSGLGRVFIEFKTIEGAIKCWKEIIKKRYDNRSLLVGFYSEKKYANR 484


>gi|67475980|ref|XP_653619.1| U2 snRNP auxiliary factor large subunit [Entamoeba histolytica
           HM-1:IMSS]
 gi|56470591|gb|EAL48233.1| U2 snRNP auxiliary factor large subunit, putative [Entamoeba
           histolytica HM-1:IMSS]
          Length = 712

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 116/467 (24%), Positives = 205/467 (43%), Gaps = 80/467 (17%)

Query: 73  NRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQ 132
           N D+E RHR        +R   R    SPS SP   K  S +D  P A            
Sbjct: 76  NEDREDRHRRVPEEERYNRSIRRRADRSPSLSPLGDKLPSRWDEQPKA------------ 123

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
                              + Q+        Q +     R A+R+YVG +    +E+ I 
Sbjct: 124 -----------------IDSVQIS-------QQLNVHQERAAKRIYVGNINSSTSEKDIV 159

Query: 193 TFFSQVM---TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
             F++ M     +  N     D + ++ +N+E+ +AF+E RT+EEA  A++LDG+  +G 
Sbjct: 160 DAFNEAMRRGDYVDKNDTR--DIITHIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGA 217

Query: 250 AVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTET 309
           +V+VRRP DYNP L    G  Q           +  G     E    +++G +P   T+ 
Sbjct: 218 SVKVRRPKDYNPVLPFISGLSQ----------LMEPGTTNPRESI--LYMGNIPLQMTDE 265

Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV--YQDPAVTDIACAALNGLKMGDKTLT 367
           QI++ LE+   L  F +++D D G  +G  +C+  YQ+P   +      +G+ +G   + 
Sbjct: 266 QIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLFEYQNPEYKE-KILTFDGINLGGNKIE 324

Query: 368 VRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG------MNTLGGGMSLFGETL--- 418
           V     SG    +    L +A  +    KM   T+       +N+  G  ++F + L   
Sbjct: 325 V----CSGVDGFKH---LPKASLNELFSKMFPHTTDLVIGTLLNSSVGYSTVFEKILKPS 377

Query: 419 --------AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                   ++++ +   +  + L D + Y E+++D+R  C +YG + ++ IPRP +   +
Sbjct: 378 EKIEDQHVSRIIIIFNMVYPEDLTDQQRYIELIDDIRFVCQEYGEVESISIPRPTEENKK 437

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
             G+G+VF+E+    G       +  +++   ++   +Y E KY N+
Sbjct: 438 PSGLGRVFIEFKTIEGAIKCWKEIIKKRYDNRSLLVGFYSEKKYANR 484


>gi|358391563|gb|EHK40967.1| hypothetical protein TRIATDRAFT_135674 [Trichoderma atroviride IMI
           206040]
          Length = 558

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 150/532 (28%), Positives = 234/532 (43%), Gaps = 94/532 (17%)

Query: 33  RDRHHRDFKSGG-------DDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSR 85
           RDR   D  S         D  R   +Y+ D     D   + R RD   D+ R  R R  
Sbjct: 66  RDREREDRYSSARDRRGDRDWDRDRGSYRRDARRDDDERPSRRERDPYDDRRRGGRDRRD 125

Query: 86  SHSSDRFRNRSKSLSPS----RSPS-----------KSKRRSGFDMAPPAAAMLPG--AA 128
              + +   +    SPS    R P+           + +R + +D+ PP   ++    A 
Sbjct: 126 DGFARQQEQQQPRRSPSPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYDLVTAEQAK 185

Query: 129 VPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQA-----TRHARRVYVGG 181
           + G   LPG P           P   T+L AF   P   +T        +R A+R+ V  
Sbjct: 186 LSGMFPLPGAP--------RQQPMDPTKLQAFITQPGGQVTSAGLKASNSRQAKRLLVSN 237

Query: 182 LPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMAL 241
           +P  A E A+ +FF+  +   G N     D  V    + ++ FA +E R   +A+ A+AL
Sbjct: 238 VPSGAGEDALISFFNLQLN--GLNVIESSDPCVLCQFSADRAFAVLEFRNASDATVALAL 295

Query: 242 DGIIFE----------GVA--VRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           DGI  E          GV+  + +RRP DY  P L   + P  P    N+          
Sbjct: 296 DGISMEADDAMNGTADGVSSGLNIRRPKDYVMPALPDEM-PFDPEVISNVV--------- 345

Query: 289 GGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
                PD   ++ +  +P + TE Q+ ELL +FG    F LVKD+ T  S+G  F  Y +
Sbjct: 346 -----PDTVHKLCITNIPSFLTEEQVIELLAAFGKPKAFVLVKDQSTEESRGIAFTEYLE 400

Query: 346 P-AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM 404
           P +  + A  +LNG+ +G K L V +A+                   I   ++A    G+
Sbjct: 401 PSSANEPALNSLNGMDVGGKKLKVTKAS-------------------IGPTQVANFDVGI 441

Query: 405 NTLGGGMSLFGETLAK--VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
             + G  S     + +  V+ L   +T + L D+++YEEI ED+++EC K+G +V + +P
Sbjct: 442 TAISGLASQTSNDIERSSVIQLLNMVTPEELIDNDDYEEICEDVQDECAKFGKVVELKVP 501

Query: 463 RPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           RP     ++ GVGK++++Y        A  AL+GRKF   TV A Y+PE+ +
Sbjct: 502 RPSGGSRQSAGVGKIYVKYDSEESATKALTALAGRKFADRTVVATYFPEENF 553


>gi|330931856|ref|XP_003303563.1| hypothetical protein PTT_15819 [Pyrenophora teres f. teres 0-1]
 gi|311320368|gb|EFQ88342.1| hypothetical protein PTT_15819 [Pyrenophora teres f. teres 0-1]
          Length = 578

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 126/396 (31%), Positives = 189/396 (47%), Gaps = 54/396 (13%)

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMP------VQVMTQQATRHARRVYVGGLPPLA 186
           LPG P A P     M P   ++L AF + P         +   A + ++R+YV  LP   
Sbjct: 218 LPGAPRAAP-----MDP---SKLAAF-ISPSAGTATAAALATSAAKQSKRLYVHNLPSGC 268

Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
             Q I  FF+  +   G N     D  ++ +I   K++A +E +  E+A+ A+A++GI  
Sbjct: 269 TSQEIMEFFNNQLN--GLNVVSGNDPCLSAHIATSKEYAALEFKAPEDATLALAMNGISM 326

Query: 247 --EGVA-----VRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVF 298
             EG A     + +RRP DY  PT      P    P   +++V          + P+++ 
Sbjct: 327 RDEGGAPDRSGLSIRRPKDYITPTADENAYP----PGDEVSSVV--------KDSPNKLS 374

Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
           +  +P Y  E QI+EL+E+ G L  F LVKD  T   +G  FC Y D  + D     LN 
Sbjct: 375 IVNIPTYIEEEQIRELVETMGKLKAFILVKDTGTDQHRGIAFCEYADNEIIDAVIEGLND 434

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
           + +GD  L V RAT   Q  T  +  +                  ++ L G  +      
Sbjct: 435 IPLGDGNLKVSRATVGLQQSTGLDGGVG----------------AISMLAGASAAENHEH 478

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           ++V+CL   +T+D L +D+EYEEI ED+ EECGKYG +V   IPRP        GVGK++
Sbjct: 479 SRVVCLMNMVTSDELLNDDEYEEIKEDIEEECGKYGPIVETKIPRP-AGARVNLGVGKIY 537

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++Y D      A  AL+GR+F   TV A  + E+ +
Sbjct: 538 IKYQDTESAQKAIKALAGRQFSRRTVVATEFSEEGF 573


>gi|340520531|gb|EGR50767.1| predicted protein [Trichoderma reesei QM6a]
          Length = 539

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 128/422 (30%), Positives = 196/422 (46%), Gaps = 72/422 (17%)

Query: 107 KSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMAQNMLPFGATQLGAFPLMP 162
           + +R + +D+ PP   ++    A + G   LPG P           P   T+L AF   P
Sbjct: 160 RKRRLTQWDIKPPGYDLVTAEQAKLSGMFPLPGAP--------RQQPMDPTKLQAFMTQP 211

Query: 163 V-QV----MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY 217
             QV    +    +R A+R+ V  +P    E+A+  FF+  +   G N     D  V   
Sbjct: 212 GGQVSSAGLKASNSRQAKRLLVYNVPSGVTEEALIAFFNLQLN--GLNVIETPDPCVLCQ 269

Query: 218 INHEKKFAFVEMRTVEEASNAMALDGIIFE------GVA------VRVRRPTDYNPTLAA 265
            + +K FA VE R   +A+ A+ALDGI  E      G A      + +RRP DY      
Sbjct: 270 FSSDKTFAVVEFRNASDATVALALDGITMEADDAQNGTANGGSHGLDIRRPKDY------ 323

Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPD---RVFVGGLPYYFTETQIKELLESFGTLH 322
            + PG P        +      I     PD   ++ +  +P +  E QI ELL +FG   
Sbjct: 324 -VMPGIPD------DIPYDPEVISNVV-PDTVHKLCITNIPTFLNEEQIIELLAAFGKPK 375

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPA-VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
            F LVKDR T  S+G  F  Y DP+   + A  +LNG+ +  K L V +A+         
Sbjct: 376 SFVLVKDRSTEESRGIAFTEYLDPSSANEPALNSLNGMDVAGKKLKVTKAS--------- 426

Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAK--VLCLTEAITADALADDEEY 439
                     I   ++A    G+  + G  S     + +  V+ L   +T + L D+++Y
Sbjct: 427 ----------IGPTQVANFDVGITAISGLASQTSNDIERSSVIQLLNMVTPEELLDNDDY 476

Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALSGRK 498
           EEI ED+++EC K+G +V + +PRP     ++ GVGK+F++ +D+V  AT A  AL+GRK
Sbjct: 477 EEICEDVQDECSKFGKVVELKVPRPTGGSRQSAGVGKIFVK-FDSVESATKALTALAGRK 535

Query: 499 FG 500
           F 
Sbjct: 536 FA 537


>gi|403367221|gb|EJY83425.1| RNA-binding proteins (RRM domain) [Oxytricha trifallax]
          Length = 543

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 129/530 (24%), Positives = 233/530 (43%), Gaps = 55/530 (10%)

Query: 13  EGSRHKSSWVSGRSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDY 72
           +  +H+SS    ++R     R    R      + RRRDK    DR+G RD   T   + Y
Sbjct: 45  DDKKHRSSRDIDKARDESSKRKDDKRRTSDSREGRRRDK----DRKGKRD---TSEEKKY 97

Query: 73  NRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQ 132
           +R +++  R++  + SSD  ++     S S +  +  +     +       L    +  Q
Sbjct: 98  SRSEKKDRRNKDGNMSSDSQKSSPLVSSSSSNTPEKYKGDDILITEGYKLYLERKKLRKQ 157

Query: 133 LP-GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAI 191
                 +A+ +  +  L      L  FP             + R++ +  LPP   E+ +
Sbjct: 158 EEEKKAAALQDGMEGGLKLRKVDLKDFP------------NYKRKLVIQNLPPDITEEDV 205

Query: 192 ATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE-GVA 250
             +F  V+++         + +++V    +  F  +E R  +E    + LDG  +  G  
Sbjct: 206 MNYFFTVISSFS-KVEYQKNPIMSVIKYKDLGFVTLEFRKRDEGEICLTLDGTEYRTGYK 264

Query: 251 VRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE-----GPD---------- 295
           +R+ R   +     A +  G+          G++  + G  +      PD          
Sbjct: 265 MRIMRVKRFIDDWNADIDKGKNPIEAMTRGKGVSLFSTGNNQFKEPAKPDQKAGKKEKVE 324

Query: 296 ----RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD-RDTGNSKGYGFCVYQDPAVTD 350
               R+++G +P    +  +K++ ESFG L  F+LVKD  +   +KGY F  Y D    D
Sbjct: 325 EVDNRLYMGNIPNSMKDEDVKKMCESFGRLKAFNLVKDPMNPDLNKGYAFFEYVDERSID 384

Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGG 410
            A  +LNGL   +K L V++A+A  ++        +Q Q  I + K       +      
Sbjct: 385 KAIKSLNGLDFKEKKLKVQKASAHQKT--------SQTQIQIGMYKNVPDEKRL-----P 431

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
           + LF  T ++V+     I+ + L +++E   + +D+ +EC  YG ++++ IP+PD+ G  
Sbjct: 432 IPLFAMTPSRVVQFINMISVEDLFEEDEIIHVKDDLLQECKNYGEIISIEIPKPDEQGHA 491

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           T GVGK+F+++   V    A+  LSGRK+ G TV   +YPE  +  K++S
Sbjct: 492 TYGVGKIFVKFNHIVAAKQARYKLSGRKYNGRTVVVSFYPEHYFDIKEFS 541


>gi|403224363|dbj|BAM42493.1| splicing factor [Theileria orientalis strain Shintoku]
          Length = 377

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 173/366 (47%), Gaps = 50/366 (13%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN---VYINHEKKF 224
           ++  +  +R+Y+G LP       +  FF+  + A+  ++    D +V+   +Y N E+ +
Sbjct: 47  EENKKRQKRLYIGNLPAGMKLGDVVEFFNGALLAMVPSNQTTKDPLVSKTEIY-NPEQGY 105

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
            F+E +T E    A  LDGI   G ++++RRP D+                         
Sbjct: 106 CFLEFKTPELTDLAFKLDGITCNGYSLKIRRPIDFTQ----------------------- 142

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
               G      ++F+  +    TE +++ELLE  G L  F+L+KD  TG SKGYGF  Y+
Sbjct: 143 ----GNQLEDTKIFIQNVATDVTEAELRELLEKHGKLKLFNLIKDPITGASKGYGFFEYE 198

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT----ASGQSKTEQESI-----LAQAQQHIAIQ 395
           D     +A   LNG  +    L+V+ A     ASG    + ++      + Q+  +  + 
Sbjct: 199 DSRSAKMAVLHLNGQALKQNVLSVKHAAFGYFASGGKPIDCKASNLPNSITQSILNNPLL 258

Query: 396 KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
            + LQ S          + G    +V+ L   + ++ L  D  Y EI+   +EE GKYG 
Sbjct: 259 GLQLQNS---------KIVGAKPTRVVQLLNMVFSEDLLSDYNYNEIVRLTKEEAGKYGA 309

Query: 456 LVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKY 514
           L  +V+PRP ++     GVGKVFL+Y + +    A++  +GR F  N  V A +YPEDKY
Sbjct: 310 LDEIVVPRPSKDLTFKSGVGKVFLKYKEVLHARKAQHMFNGRIFDKNRVVCAAFYPEDKY 369

Query: 515 FNKDYS 520
              +Y+
Sbjct: 370 SRGEYT 375


>gi|451849636|gb|EMD62939.1| hypothetical protein COCSADRAFT_200575 [Cochliobolus sativus
           ND90Pr]
          Length = 576

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 126/396 (31%), Positives = 191/396 (48%), Gaps = 54/396 (13%)

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMP------VQVMTQQATRHARRVYVGGLPPLA 186
           LPG P A P     M P   ++L AF + P         +   A + ++R+YV  LP   
Sbjct: 216 LPGAPRAAP-----MDP---SKLAAF-ISPSTGTATAAALATSAAKQSKRLYVHNLPSGC 266

Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
             Q I  FF+  +  +   S GP D  V+ +I   K++A +E +  E+A+ A+A++GI  
Sbjct: 267 TSQEIMEFFNTQLNGLNVVS-GP-DPCVSAHIATSKEYAALEFKAPEDATLALAMNGISM 324

Query: 247 -------EGVAVRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVF 298
                  +   + +RRP DY  PT      P    P   +++V          + P+++ 
Sbjct: 325 RDDGGAPDRAGLSIRRPKDYITPTADENAYP----PGDEVSSVV--------KDSPNKLS 372

Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
           +  +P Y  E QI+EL+E+ G L  F LVKD  T   +G  FC Y D  + D     LN 
Sbjct: 373 IVNIPTYIEEEQIRELVETMGKLKAFILVKDTSTDQHRGIAFCEYADNEIIDAVIEGLND 432

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
           + +GD  L V RAT   Q  T  +  +                  ++ L G  ++     
Sbjct: 433 IPLGDGNLKVSRATVGLQQTTGLDGGVG----------------AISMLAGASAVENREH 476

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           ++V+CL   +T+D L +DEEYEEI ED+ EECGKYGT++   IPRP        GVGK++
Sbjct: 477 SRVVCLMNMVTSDELLNDEEYEEIKEDIEEECGKYGTILETKIPRP-AGARVNLGVGKIY 535

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++Y D      A  AL+GR+F   TV A  + E+ +
Sbjct: 536 IKYQDIESAQKAIKALAGRQFSRRTVVATEFSEEGF 571


>gi|156085070|ref|XP_001610018.1| RNA recognition motif (RRM)-containing protein [Babesia bovis]
 gi|154797270|gb|EDO06450.1| RNA recognition motif (RRM)-containing protein [Babesia bovis]
          Length = 383

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 170/361 (47%), Gaps = 48/361 (13%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG----PGDAVVNVYINHEKKFAFV 227
           RH RR+Y+G LP     +A+  F S  +     +S      P  +   ++ N ++ + F+
Sbjct: 57  RH-RRLYIGNLPSGTTYKALVEFLSAALRLPNDDSGQTVQVPHISKTEIF-NEDQGYCFL 114

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           E  T E A     LDGI F+G  +++RRP DY  T ++                      
Sbjct: 115 EFSTPELADACFKLDGINFKGKLLKIRRPIDYGTTSSSE--------------------- 153

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
                   +VFV  +P   +E ++KELLE  G +   +LVKD  TG +KGYGF  + D  
Sbjct: 154 ------DTKVFVQNIPPTMSEAEVKELLEKHGKIKSSNLVKDLKTGQNKGYGFFEFDDSR 207

Query: 348 VTDIACAALNGLKMGDKTLTVRRAT----ASGQSKTEQESIL---AQAQQHIAIQKMALQ 400
              +A   LNG  +G   L+V+ A     A+G   T+ ++     +  Q  ++   + LQ
Sbjct: 208 AAKMAVCHLNGHIIGKNVLSVKHAAFSYFAAGGKLTDCKATNLPNSVTQSILSNPLLGLQ 267

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
                 +G   S       +++ L   +  + L  D+ Y E+ + + EE  KYG L ++V
Sbjct: 268 MQSGRRIGSKPS-------RIVQLINIVFHEDLIQDKRYHEVKDAIMEEAKKYGHLEDIV 320

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKYFNKDY 519
           IPRP+ +     GVGKVFL++ D +    A+  L+GR F GN  V A ++P D++    Y
Sbjct: 321 IPRPNDDLSYKEGVGKVFLKFGDEISSRRAQYMLNGRVFDGNRIVCAAFFPLDRFLKGKY 380

Query: 520 S 520
           +
Sbjct: 381 T 381


>gi|189204129|ref|XP_001938400.1| splicing factor U2AF 65 kDa subunit [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187985499|gb|EDU50987.1| splicing factor U2AF 65 kDa subunit [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 572

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 127/396 (32%), Positives = 188/396 (47%), Gaps = 54/396 (13%)

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMP------VQVMTQQATRHARRVYVGGLPPLA 186
           LPG P A P     M P   ++L AF + P         +   A + ++R+YV  LP   
Sbjct: 212 LPGAPRAAP-----MDP---SKLAAF-ISPSAGTATAAALATSAAKQSKRLYVHNLPSGC 262

Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
             Q I  FF+  +   G N     D  ++ +I   K++A +E +  E+A+ A+A+ GI  
Sbjct: 263 TSQEIMEFFNNQLN--GLNVVSGNDPCLSAHIATSKEYAALEFKAPEDATLALAMTGISM 320

Query: 247 --EGVA-----VRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVF 298
             EG A     + +RRP DY  PT      P    P   +++V          + P+++ 
Sbjct: 321 RDEGGAPDRSGLSIRRPKDYITPTADENAYP----PGDEVSSVV--------KDSPNKLS 368

Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
           +  +P Y  E QI+EL+E+ G L  F LVKD  T   +G  FC Y D  + D     LN 
Sbjct: 369 IVNIPTYIEEEQIRELVETMGKLKAFILVKDTGTDQHRGIAFCEYADNEIIDAVIEGLND 428

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
           + +GD  L V RAT   Q  T  +  +                  ++ L G  +      
Sbjct: 429 IPLGDGNLKVSRATVGLQQSTGLDGGVG----------------AISMLAGASAAENHEH 472

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           ++V+CL   +T+D L +DEEYEEI ED+ EECGKYG +V   IPRP        GVGK++
Sbjct: 473 SRVVCLMNMVTSDELLNDEEYEEIKEDIEEECGKYGPIVETKIPRP-AGARVNLGVGKIY 531

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++Y D      A  AL+GR+F   TV A  + E+ +
Sbjct: 532 IKYQDTESAQKAIKALAGRQFSRRTVVATEFSEEGF 567


>gi|296083697|emb|CBI23686.3| unnamed protein product [Vitis vinifera]
          Length = 882

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 150/538 (27%), Positives = 230/538 (42%), Gaps = 113/538 (21%)

Query: 25  RSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRD-KERRHRHR 83
           RSR  E+   R HR   +G D++ R++N       ++ HD   RH     D KERR    
Sbjct: 222 RSRKSEKESKRKHR---TGEDEKNRERN------SMKKHDPGKRHESEFLDRKERRESPP 272

Query: 84  SRSHSSDRFRNRS---------------------------------KSLSPS-RSPSKSK 109
           SR   SD  RNR                                  K+ SP+ RSP K  
Sbjct: 273 SRRQHSDADRNRISNNGSSSHFRRHGGSASGLGGYSPRKRRTEAAIKTPSPTNRSPEK-- 330

Query: 110 RRSGFDMAPPAA-AMLPGAAVPGQLP-GVPSAVPEMAQNMLP-------FGATQLGAFPL 160
           + +G+D+ P     M  G+ +  +LP  VP AVP  A    P          ++     +
Sbjct: 331 KSAGWDLPPSRTDGMNAGSVLSNELPSAVPVAVPVTATTAKPPLPRIYSDAVSKNKNVSI 390

Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
             +Q+   QATR  RR+YV  LP  ++E+A+    +  + + G N        ++  I+ 
Sbjct: 391 DSIQLT--QATRPMRRLYVENLPVSSSEKALMECLNNFLLSSGINHVQGTPPCISCIIHK 448

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           EK  A VE  T E+AS A++ DGI F G  +++RRP D+                  L A
Sbjct: 449 EKGQALVEFLTPEDASAALSFDGISFSGSILKIRRPKDFVDMTGV---------QEKLVA 499

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
              A   I   + P ++F+GG+    +   + E+  +FG L  +    + D G  +   F
Sbjct: 500 APDAISDI-VKDSPHKIFIGGISRALSSDMLMEIAAAFGPLKAYRFQVNEDLG--EPCAF 556

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQ 400
             Y D +VT  ACA LNG+K+G + LTV +A  +                      +A++
Sbjct: 557 LEYVDQSVTLKACAGLNGMKLGGQVLTVVQAIPNA---------------------LAME 595

Query: 401 TSGMNTLGGGMSLFGETL----AKVLCLTEAITADALA--DDEEYEEILEDMREECGKYG 454
            +G N    G+    + L     +VL L   +  D L+   + E EEILED+R EC ++G
Sbjct: 596 NTG-NLPFYGIPEHAKPLLERPTQVLKLKNVVNPDDLSSLSEAELEEILEDIRLECTRFG 654

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDA-------VGCATAKNALSGRKFGGNTVN 505
           T+ +V I + + +   T       LE Y+A       +GC    N++     GG T N
Sbjct: 655 TVKSVNIVKYNNSHVST-------LEVYEAADNTGSNLGCDG--NSMKAETLGGGTDN 703


>gi|320170643|gb|EFW47542.1| splicing factor u2af large subunit [Capsaspora owczarzaki ATCC
           30864]
          Length = 393

 Score =  144 bits (364), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 106/355 (29%), Positives = 165/355 (46%), Gaps = 66/355 (18%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           R +RR+Y+GG+ P   +  I  F ++ M   G  S+ PG+ V+ + +  +K FAF++MRT
Sbjct: 98  RQSRRLYIGGIVPGTPDVLIVDFLNREMNQRGMTSS-PGNPVLAIQMTPDKNFAFLDMRT 156

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYN-------PTLAAALGPGQPSPNLNLAAVGLA 284
            EEA+  +ALDGI FEG   R++RP +Y        P+L           +    A G +
Sbjct: 157 SEEATMCIALDGIPFEGTVFRIKRPKEYEGREANDPPSLFGMPSSSGGGFSSQGGAQGGS 216

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
            G   G + P+++++GGLP+   E QI+EL      L  F ++++          F + +
Sbjct: 217 FGGSMGNDNPNKIYIGGLPFSLDEQQIREL------LQTFGVIRN----------FSLVR 260

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM 404
           +                            +GQSK +Q                     G 
Sbjct: 261 E---------------------------GNGQSKGQQPP--------------PSMPYGA 279

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP 464
            +  G   +       V+ L   +T + L D EEY++I++D+REEC KYG +V+V IPRP
Sbjct: 280 PSSFGAAPITPMQATPVVQLLNMVTPEELMDPEEYQDIVDDIREECSKYGEVVSVAIPRP 339

Query: 465 DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
              G E  GVGKV++E+ +      A  ALSGRKF    V   +Y  D Y   ++
Sbjct: 340 -VPGREVSGVGKVYVEFSNVDHAYQALQALSGRKFASRIVVTSFYGLDAYRRSEF 393


>gi|452001453|gb|EMD93912.1| hypothetical protein COCHEDRAFT_1020092 [Cochliobolus
           heterostrophus C5]
          Length = 352

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 175/358 (48%), Gaps = 53/358 (14%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           + ++R+YV  LP     Q I  FF+  +  +   S GP D  V+ +I   K++A +E + 
Sbjct: 28  KQSKRLYVHNLPSGCTSQEIMEFFNTQLNGLNVVS-GP-DPCVSAHIATSKEYAALEFKA 85

Query: 232 VEEASNAMALDGIIF-------EGVAVRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGL 283
            E+A+ A+A++GI         +   + +RRP DY  PT      P    P   +++V  
Sbjct: 86  PEDATLALAMNGISMRDDGGAPDRAGLSIRRPKDYITPTADENAYP----PGDEVSSVV- 140

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
                   + P+++ +  +P Y  E QI+EL+E+ G L  F LVKD  T   +G  FC Y
Sbjct: 141 -------KDSPNKLSIVNIPTYIEEEQIRELVETMGKLKAFILVKDTSTDQHRGIAFCEY 193

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
            D  + D     LN + +GD  L V RAT   Q                       QT+G
Sbjct: 194 ADNEIIDAVIEGLNDIPLGDGNLKVSRATVGLQ-----------------------QTTG 230

Query: 404 MNTLGGGMSLFGETLA-------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTL 456
           ++   G +S+     A       +V+CL   +T+D L +DEEYEEI ED+ EECGKYGT+
Sbjct: 231 LDGGVGAISMLAGASAAENREHSRVVCLMNMVTSDELLNDEEYEEIKEDIEEECGKYGTI 290

Query: 457 VNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           +   IPRP        GVGK++++Y D      A  AL+GR+F   TV A  + E+ +
Sbjct: 291 LETKIPRP-AGARVNLGVGKIYIKYQDTESAQKAIKALAGRQFSRRTVVATEFSEEGF 347


>gi|399216439|emb|CCF73127.1| unnamed protein product [Babesia microti strain RI]
          Length = 424

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 114/404 (28%), Positives = 189/404 (46%), Gaps = 50/404 (12%)

Query: 150 FGATQLGAFPL-MPVQVMTQQATRHARRVYVGGLPP------LANEQA-IATFFSQVMTA 201
           FG    G+  L +P   +  +A R  RR+Y+G +P       L + Q+ I  F +  +  
Sbjct: 34  FGFDSSGSSALAIPAADLDPEAERRHRRLYIGNVPAGNHNTNLGSSQSDIVAFLNGALLT 93

Query: 202 IGGNS---AGPGDAVVNVY--INHEKKFAFVEMRTVEEASNAMALDGII-------FEGV 249
           +  N+   A P D  +      N E +F F+E+R V+     + +DGI        + G 
Sbjct: 94  VLSNTGMPATPADTPITKCESFNSENRFCFIELRNVDVTLVCLKMDGISLVDSGINYNGN 153

Query: 250 AVRVRRPTDYNP----TLAAALGPG--QPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLP 303
           A+++ RP+DY P     LA  + P   QP     +A                ++ +  +P
Sbjct: 154 ALKISRPSDYVPPSNNELATQMQPTIQQPPRGFTMALQVF------------KLHIQNIP 201

Query: 304 YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGD 363
               E  + EL++ FG +    ++KD  TG  K   F  ++D    + A  AL G ++  
Sbjct: 202 TTMAEDGVLELVKEFGDVKYVYIIKDT-TGQHKNTAFVEFKDSVSLEPASKALTGKEVEG 260

Query: 364 KTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL-------FGE 416
           ++LT +  T S Q+ T     LA  + ++    ++   S        +S+        G 
Sbjct: 261 QSLTAKIVT-SNQADTLAS--LAAGKYNLGATHLSTSISRKILSDPLLSIGVQSGRKIGA 317

Query: 417 TLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
           T++ V+ L   +  + L DD+ Y+ +LED+R+E  KYGTL ++VIPRP+ +     GVGK
Sbjct: 318 TVSTVVQLLNIVFHEDLIDDDSYQSLLEDIRKEAKKYGTLEDIVIPRPNLDKTFNEGVGK 377

Query: 477 VFLEYYDAVGCATAKNALSGRKFGGN-TVNAFYYPEDKYFNKDY 519
           VFL++ D +    A+  L+GR+F     V A +YP DK+  K Y
Sbjct: 378 VFLQFADELSSRKAQYMLNGRRFDAKRVVCAAFYPLDKFLEKTY 421


>gi|428671645|gb|EKX72563.1| conserved hypothetical protein [Babesia equi]
          Length = 455

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 166/369 (44%), Gaps = 52/369 (14%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           ATR  R +YVG +PP+++   +  F ++ + AI G S  PG+  +  +I+ +  +AF+E+
Sbjct: 72  ATRPYREIYVGNIPPVSDVSTLLDFLNEALIAINGTSM-PGNPCLKGWISSDSHYAFIEL 130

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGL----- 283
           RT+EEASN M L G+   G  +RV RP  Y P  LA A  P  P+ + +L A+GL     
Sbjct: 131 RTMEEASNCMQLTGLNCMGYNIRVNRPKTYTPEMLALAPSPTVPTLDPSLLAMGLKALKN 190

Query: 284 ------ASGAIGGAEG----PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
                 A+  I   E      DR+ +  +P    ++ +K  +E+ G +     + D    
Sbjct: 191 AREQIVAASDILATEKAKAMTDRLCIIDIPSETQDSDLKSAIEAIGQVKYIHFIND---D 247

Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
            SK      YQ      I   AL  L    K +    A   G       S + Q  +   
Sbjct: 248 PSKRVCLFEYQHIEQQKI---ALEQLPANHKVIMAIDAVTQG---IINPSYIRQQLEKCE 301

Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           I +                   E   +VL L+  ++ + L DD EY +I++D+R EC  Y
Sbjct: 302 IMR------------------PEVPTRVLWLSNLVSKEELDDDAEYFDIIDDVRTECEDY 343

Query: 454 GTLVNVVIPR-------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           G ++ + +PR        +    +T  VG  F+ +    GC  A+  L GR+FG   V+A
Sbjct: 344 GQVIRLELPRVPKGLTEEEMKTVDTSSVGCAFVLFTTIDGCTKARKILGGRRFGPRIVDA 403

Query: 507 FYYPEDKYF 515
            Y+ E  YF
Sbjct: 404 HYFSE-LYF 411


>gi|396472864|ref|XP_003839217.1| hypothetical protein LEMA_P028900.1 [Leptosphaeria maculans JN3]
 gi|312215786|emb|CBX95738.1| hypothetical protein LEMA_P028900.1 [Leptosphaeria maculans JN3]
          Length = 587

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 119/402 (29%), Positives = 186/402 (46%), Gaps = 67/402 (16%)

Query: 133 LPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQAT------RHARRVYVGGLPPLA 186
           LPG P A P     M P   ++L AF + P       A       + ++R+YV  LP   
Sbjct: 228 LPGAPRAAP-----MDP---SKLAAF-MTPSAGSASSAALAPSAAKQSKRLYVHNLPSGV 278

Query: 187 NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF 246
           + + +  FF+  +   G N     D  ++  I   K++A +E +T E+A+ A+A++GI  
Sbjct: 279 SSEELMEFFNLQLN--GLNVVSGQDPCLSAQIATSKEYAALEFKTPEDATVALAMNGISM 336

Query: 247 -------EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFV 299
                  +   + +RRP DY            P+ + N       S  +   + P+++ +
Sbjct: 337 REESGGPDRSGLSIRRPKDYI----------TPTADDNAYTGDEVSSVV--KDSPNKLSI 384

Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
             +P Y  E Q++EL+ + G L  F LVKD  T   +G  FC Y D  + D     LN +
Sbjct: 385 VNIPTYIEEEQVRELVGTMGKLKAFVLVKDESTDQHRGIAFCEYADNEIVDAVIEGLNDI 444

Query: 360 KMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
            +GD  L V RAT   Q                       QT+G++   G +S+     A
Sbjct: 445 PLGDGNLKVTRATVGLQ-----------------------QTAGLDGGVGAISMLAGASA 481

Query: 420 -------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                  +V+CL   +T++ L +DEEYEEI ED+ EECGK+GT++   IPRP        
Sbjct: 482 AENREHSRVICLMNMVTSEELINDEEYEEIKEDIEEECGKFGTILETKIPRP-AGARVNL 540

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           GVGK++++Y D      A  AL+GR+F   TV    + E+ +
Sbjct: 541 GVGKIYIKYQDTESAQKAIKALAGRQFSRRTVVVTEFSEEGF 582


>gi|169602913|ref|XP_001794878.1| hypothetical protein SNOG_04461 [Phaeosphaeria nodorum SN15]
 gi|160706286|gb|EAT88221.2| hypothetical protein SNOG_04461 [Phaeosphaeria nodorum SN15]
          Length = 594

 Score =  138 bits (347), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 128/442 (28%), Positives = 204/442 (46%), Gaps = 60/442 (13%)

Query: 94  NRSKSLSPSRSP-----SKSKRRSGFDMAPPAAAMLPG--AAVPGQ--LPGVPSAVPEMA 144
           N+++  +P  +P      + +R + +D+ P     +    A + G   LPG P A P   
Sbjct: 187 NKAREPTPDLAPFTNILKRERRMTQWDIKPAGYENITAEQAKLSGMFPLPGAPRAAP--- 243

Query: 145 QNMLPFGATQLGAF-----PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVM 199
             M P   ++L AF            +   A++ ++R+YV  LP     + +  FF+  +
Sbjct: 244 --MDP---SKLAAFMSPSAGTASAAALAPGASKQSKRLYVHNLPSGTTSEELLEFFNLQL 298

Query: 200 TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIF-------EGVAVR 252
              G N     D  ++  I   K +A +E +T E+A+ A+A+ GI         +   + 
Sbjct: 299 N--GLNVVSGQDPCLSAQIASSKTYAALEFKTPEDATVALAMSGISMRDDGGGPDRSGLS 356

Query: 253 VRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIK 312
           +RRP DY            PS + N       S  +   + P+++ +  +P +  E QI+
Sbjct: 357 IRRPKDYI----------TPSADENAYPGDEVSSVV--KDSPNKLSIVNIPTFIEEEQIR 404

Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
           EL+E+ G L+ F LVKD  +   +G  FC Y D  V +     LN + +G+  L V RAT
Sbjct: 405 ELVETMGKLNAFVLVKDISSEQHRGIAFCEYADNEVVNAVIEGLNDITLGEGNLKVSRAT 464

Query: 373 ASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
              Q    Q + L      I++   A  TS +               +V+CL   +T+D 
Sbjct: 465 VGMQ----QNAGLDGGVNAISMLASAEPTSNLEH------------GRVVCLMNMVTSDE 508

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
           L +DEEYEEI ED+ EEC KYG +V   IPRP      + GVGK++++Y D      A  
Sbjct: 509 LINDEEYEEIKEDIEEECQKYGPIVETKIPRP-AGARSSLGVGKIYIKYQDTESAQRAIK 567

Query: 493 ALSGRKFGGNTVNAFYYPEDKY 514
           AL+GR+F   TV A  + E+ +
Sbjct: 568 ALAGRQFSRRTVVATQFSEEGF 589


>gi|145536694|ref|XP_001454069.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421813|emb|CAK86672.1| unnamed protein product [Paramecium tetraurelia]
          Length = 426

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 149/311 (47%), Gaps = 42/311 (13%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIF-EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           K +  +E  + E     +  D + F     ++V +P  +   L   L P      LN   
Sbjct: 141 KSWVVLECSSKEAKRALVTQDQVQFVNNCKIKVEKPRKF---LERILNPQAKEAELN--- 194

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD--RDTGNSKGY 338
                 A    E   R+++GGLP Y  +  + +L++SFGT   F+LVKD   +T  SKGY
Sbjct: 195 ------ADQKQEDNTRLYLGGLPTYLRDEDVMKLIQSFGTTKYFNLVKDTTSNTEISKGY 248

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTV-RRATASGQSKTEQESILA--------QAQ 389
            F  Y+  A T  A  ALN L++GDK L + ++     Q      S LA        Q Q
Sbjct: 249 CFFEYEKTASTAKALKALNNLQIGDKKLKICKKINGRDQPSNYAGSFLASCDLLRIPQVQ 308

Query: 390 QHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREE 449
           Q + I + AL  S                 KV+      + + L +D+ YEE++ED+R E
Sbjct: 309 QMLTIPQSALIPS-----------------KVVQFLNMCSIEDLYEDDIYEELMEDIRSE 351

Query: 450 CGKYGTLVNVVIPRPDQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           C ++G +  + IPRPD++ G   P VGK+F+++Y  +    AK  L+GR +   T+   +
Sbjct: 352 CIRFGQIEKIEIPRPDKDSGFCNPAVGKIFVKFYYQIPAKKAKFHLAGRTYNKRTIITSF 411

Query: 509 YPEDKYFNKDY 519
           YPE+++  KDY
Sbjct: 412 YPEEQFDYKDY 422


>gi|213408691|ref|XP_002175116.1| splicing factor U2AF subunit [Schizosaccharomyces japonicus yFS275]
 gi|212003163|gb|EEB08823.1| splicing factor U2AF subunit [Schizosaccharomyces japonicus yFS275]
          Length = 511

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 105/414 (25%), Positives = 178/414 (42%), Gaps = 35/414 (8%)

Query: 109 KRRSGFDMAPPAAAMLPG--AAVPG--QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
           ++RS +DM PP    +    A + G   LPG P +     + +  F  +  G+       
Sbjct: 120 RKRSMWDMKPPGYENVTADQAKMSGLFPLPGAPRSATADPEKLAAFARSTAGSIIAP-PP 178

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
            +   A+R ARR+ V  LP     + +   F + ++    +       V  +Y    +++
Sbjct: 179 PIQPGASRQARRLKVKELPAEFEVEDLKNVFEESISTSSFHKDRDTKHVTAIYPCKTERY 238

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY-NPTLAAALGPGQPSPNLNLAAVGL 283
           A +E+ T E+A+       + F+   V + R   Y  P +++ +   +P  +LN   +  
Sbjct: 239 AIIELATPEDATFIWGARKLKFKNETVLIDRLEGYIVPQISSEVAQKRPKNDLNQKVLDS 298

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
           A          D+V++G LP Y  E QI ELL+ FG L    L K+     S+GY FC Y
Sbjct: 299 A----------DKVYIGSLPLYLNEDQISELLKPFGELQSLFLAKNSADMTSRGYAFCEY 348

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
                   A   LN ++ GD  L V+ A    Q        +  A   I + K + + + 
Sbjct: 349 ISSESATAAVQGLNNMEFGDTRLMVQFACVGIQQPVPSPRSVGMAAL-IELSKSSTEAAP 407

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
                           +VL +   + AD   D E+YE+I + ++ +C +YG ++++ +PR
Sbjct: 408 ---------------TRVLQIHNLLDADETLDTEDYEDIRKSVQNKCNEYGQVLDLKLPR 452

Query: 464 PDQNGGET---PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
              +   T   PGVG  F+ +      A A   +SG +F   ++   YYPED Y
Sbjct: 453 ETSSSDNTSAPPGVGVTFVRFGSIKDAANALQHMSGLRFDDRSIVIAYYPEDCY 506


>gi|118390069|ref|XP_001028025.1| U2 snRNP auxilliary factor, splicing factor [Tetrahymena
           thermophila]
 gi|89309795|gb|EAS07783.1| U2 snRNP auxilliary factor, splicing factor [Tetrahymena
           thermophila SB210]
          Length = 480

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 103/364 (28%), Positives = 177/364 (48%), Gaps = 39/364 (10%)

Query: 177 VYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEAS 236
           + V  +P +  +  I  FF+ +++ +    A P   VV V +    +FA + M      S
Sbjct: 130 LIVSDIPRMITDIEIKEFFNILISKLRPELAEPS-PVVKVDVMTNGQFATMHMSCKLAKS 188

Query: 237 NAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDR 296
            A+ L G+ F+   + + +P  Y       +   Q         V +  GA+   +  ++
Sbjct: 189 FALTLRGVEFQKCKLMIEKPKQY--FFRMYMEKQQND----DVMVDVDDGALQQMQM-NK 241

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN---SKGYGFCVYQDPAVTDIAC 353
           +++GGLP Y  +  +++L E+FG L  F++ K ++      SKGY F  Y+DP +T+ A 
Sbjct: 242 IYMGGLPTYLKDIDVRKLCETFGKLKYFNVAKQQNENKEQVSKGYCFFEYEDPNITEKAI 301

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQ----KMALQTSGMNTLGG 409
            ALNGL  GD+ L V R T   Q+K      LA  QQ   IQ    K+A   +  + LGG
Sbjct: 302 KALNGLPCGDRKLKVSRVT-KDQNK------LANTQQ---IQSEKNKLAPSNNSGSFLGG 351

Query: 410 GMSLFGETLAKVLCLTE-------------AITADALADDEEYEEILEDMREECGKYGTL 456
              +  +   K+L + E              ++ + L +D+  +++ +D+  EC K G +
Sbjct: 352 SDLIRKDEFQKLLTIPEFTSLPSRVIQLLNMVSIEDLFEDDIVDDLYQDVMTECEKIGPV 411

Query: 457 VNVVIPRPDQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
             + IP+P +  G   P +GKVF+++   +    A+ +L+GR +   TV A +YPEDK+ 
Sbjct: 412 EKIEIPKPCKTTGICPPCIGKVFVKFKYMLKAKKARYSLNGRTYNRRTVIASFYPEDKFD 471

Query: 516 NKDY 519
            KD+
Sbjct: 472 RKDF 475


>gi|294954867|ref|XP_002788334.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large
           subunit, putative [Perkinsus marinus ATCC 50983]
 gi|239903646|gb|EER20130.1| U2 small nuclear ribonucleoprotein, auxiliary factor, large
           subunit, putative [Perkinsus marinus ATCC 50983]
          Length = 543

 Score =  135 bits (339), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 171/357 (47%), Gaps = 24/357 (6%)

Query: 186 ANEQAIATFFSQVMTAIGGN--SAGPGDAVVNVYI---NHEKKFAFVEMRTVEEASNAMA 240
           +++Q++  FF   + A+ GN     P   VV+V+    +   + A VE RT   A+ AM 
Sbjct: 113 SSQQSVMDFFKGALFAVTGNGGKTTPLHPVVSVFFLISDGHSRTALVEFRTPIAATVAMR 172

Query: 241 LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD-RVFV 299
           L+GI  +G  + + RP  YN    +     +    + +  +   S     A G + ++ +
Sbjct: 173 LNGIDLDGRKLAITRPHGYNKEDPSKSITAEDIQKVTIEELCGGSSTKKTAPGSNLQLGI 232

Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
             LP   TET +++LLE FG L    L++D+ TG SKGYGFC ++DP   D    AL+  
Sbjct: 233 YHLPPVMTETYLRDLLEQFGALTMVSLIRDKTTGLSKGYGFCQFEDPNDADRCLYALDQF 292

Query: 360 KMGDKTLTVRRAT---------ASGQSKTEQESILAQAQQHIA-IQKMALQTSGMNTLGG 409
            +G+ +L+V R             G +     + LA     +A +Q M  +      L  
Sbjct: 293 VLGNYSLSVTRLVPDAQQGGAAGIGGAGVGPATNLADGSSGVAVVQSMTARVLANPALAA 352

Query: 410 GMSL---FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ 466
            +      G T + V+ L  A+  + L  + E + I +++REE  ++GT++ V +PRP  
Sbjct: 353 QLKAGREIGSTPSTVVQLLNAVYIEDLMSETEVKSIEDEIREEAQRHGTVLEVRVPRP-- 410

Query: 467 NGGETP---GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           +   TP   GVGK+F+++ D       +   +GRKF    + A +YP D+Y    Y+
Sbjct: 411 SASLTPYANGVGKIFVQFADITAARKFQATNNGRKFDDRVMCAAFYPTDRYKMGKYT 467


>gi|18406905|ref|NP_564764.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
 gi|12323801|gb|AAG51869.1|AC079675_4 U2 snRNP auxiliary factor, large subunit, putative; 15147-15692
           [Arabidopsis thaliana]
 gi|332195616|gb|AEE33737.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
          Length = 111

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 63/105 (60%), Positives = 78/105 (74%)

Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
           G T  K++CLT+ +TAD L DD EY +I+EDM +E GK+G LVNVVIPRP+ +   TPGV
Sbjct: 5   GGTPTKIVCLTQVVTADDLRDDAEYADIMEDMSQEGGKFGNLVNVVIPRPNPDHDPTPGV 64

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           GKVFLEY D  G + A++ ++GRKFGGN V A YYPEDKY   DY
Sbjct: 65  GKVFLEYADVDGSSKARSGMNGRKFGGNQVVAVYYPEDKYAQGDY 109


>gi|19112188|ref|NP_595396.1| U2AF large subunit (U2AF-59) [Schizosaccharomyces pombe 972h-]
 gi|549144|sp|P36629.1|U2AF2_SCHPO RecName: Full=Splicing factor U2AF 59 kDa subunit; AltName: Full=U2
           auxiliary factor 59 kDa subunit; Short=U2AF59; AltName:
           Full=U2 snRNP auxiliary factor large subunit
 gi|410322|gb|AAA03578.1| splicing factor U2AF large subunit [Schizosaccharomyces pombe]
 gi|5441489|emb|CAB46760.1| U2AF large subunit (U2AF-59) [Schizosaccharomyces pombe]
          Length = 517

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 110/413 (26%), Positives = 188/413 (45%), Gaps = 37/413 (8%)

Query: 109 KRRSGFDMAPPAAAMLPG--AAVPG--QLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQ 164
           ++RS +D+ PP   ++    A + G   LPG P A     + +L F  +  G+  + P  
Sbjct: 130 RKRSLWDIKPPGYELVTADQAKMSGVFPLPGAPRAAVTDPEKLLEFARSAEGSI-IAPPP 188

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
            +   A+R ARR+ V G+P    E A  +F   +  +   +         +V +  E+ F
Sbjct: 189 PLQPGASRQARRLVVTGIPNEFVEDAFVSFIEDLFISTTYHKPE-TKHFSSVNVCKEENF 247

Query: 225 AFVEMRTVEEASNAMALDGIIFEG-VAVRVRRPTDYNPTLAAALGPGQPSPNLNLA-AVG 282
           A +E+ T E+A+    L    +   V ++ +R  +Y       + P Q +P ++   +  
Sbjct: 248 AILEVATPEDATFLWGLQSESYSNDVFLKFQRIQNY-------IVP-QITPEVSQKRSDD 299

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
            A   +   +  D++++  LP    E Q+ ELL+ FG L  F L+K+   G+SKG+ FC 
Sbjct: 300 YAKNDV--LDSKDKIYISNLPLNLGEDQVVELLKPFGDLLSFQLIKNIADGSSKGFCFCE 357

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS 402
           +++P+  ++A + L+G       L  + A             L QA        M  +++
Sbjct: 358 FKNPSDAEVAISGLDGKDTYGNKLHAQFACVG----------LNQA--------MIDKSN 399

Query: 403 GMNTLGGGMSLFGETL-AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           GM  L        +++  +VL L   IT D + D +EYE+I E ++ +   YG L+++ I
Sbjct: 400 GMAILTELAKASSQSIPTRVLQLHNLITGDEIMDVQEYEDIYESVKTQFSNYGPLIDIKI 459

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           PR         G GKVF+ Y D      A   + G KF   T+   +Y ED Y
Sbjct: 460 PRSIGTRNSGLGTGKVFVRYSDIRSAEVAMEEMKGCKFNDRTIVIAFYGEDCY 512


>gi|145544238|ref|XP_001457804.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124425622|emb|CAK90407.1| unnamed protein product [Paramecium tetraurelia]
          Length = 435

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 147/320 (45%), Gaps = 51/320 (15%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIF-EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           K +  +E  + E     +  D + F     ++V RP  +   L   L P      L    
Sbjct: 141 KSWVVLECSSKEAKRALVTQDQVQFVNNCKIKVERPRKF---LERILNPQAREGEL---- 193

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD--RDTGNSKGY 338
                 A    E   R+++GGLP Y  +  + +L++SFGT   F+LVKD   +T  SKGY
Sbjct: 194 -----SAEQKQEDNTRLYLGGLPTYLRDEDVMKLIQSFGTTKYFNLVKDTTSNTEISKGY 248

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-----EQESI--------- 384
            F  Y++   T  A  ALN L++GDK L + +     Q        EQ S          
Sbjct: 249 CFFEYENTGSTAKALKALNNLQIGDKKLKICKVQGEPQQNKKINGREQPSNYAGSFLASC 308

Query: 385 ----LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
               L Q QQ + I + AL  S                 KV+      + + L +D+ YE
Sbjct: 309 DLLRLPQIQQMLTIPQSALIPS-----------------KVVQFLNMCSVEDLYEDDLYE 351

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGG-ETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
           E++ED+R EC ++G +  + IPRPD+  G   P VGK+F+++Y  +    AK  L+GR +
Sbjct: 352 ELMEDIRSECIRFGQIEKIEIPRPDKESGFCNPAVGKIFVKFYYQIPAKKAKFHLAGRTY 411

Query: 500 GGNTVNAFYYPEDKYFNKDY 519
              TV   +YPE+++  KDY
Sbjct: 412 NKRTVVTSFYPEEQFDYKDY 431


>gi|156086444|ref|XP_001610631.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797884|gb|EDO07063.1| conserved hypothetical protein [Babesia bovis]
          Length = 400

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 172/374 (45%), Gaps = 53/374 (14%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           AT+  R +Y+G +PP A+   +  F +  +TA+ G S  PG+     +I+ +  +AFVEM
Sbjct: 20  ATKPYREIYIGNIPPQADVNNLLEFLNDALTAVNGTSI-PGNPCQKGWISADSHYAFVEM 78

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGL----- 283
           RT+EEASN + L GI +   ++R+ RP  YNP  L  A  P  P+ + +L A+G+     
Sbjct: 79  RTMEEASNCIQLSGINYMNYSLRINRPKTYNPEILTEAPSPTIPTLDPSLLALGIAGLKC 138

Query: 284 ASGAIGGAEG----------PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTG 333
           AS  I  A             DR+ V  +     E  +K  LE+ G L  +  + + +  
Sbjct: 139 ASEQISAAADMLATERAKAMTDRLCVLNVT---DEPALKRELEAQGNLKYYQYITEDNKP 195

Query: 334 NSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIA 393
                  C+++   + ++   AL GLK  D  + V  A  + +     E  + Q  +   
Sbjct: 196 -----PLCIFEYEHI-EMQNIALEGLKKRD--VKVELAVDALERGAMSEDFMKQQIESCD 247

Query: 394 IQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
           I K  + T                  +VL L   ++ + L DD EY +I++D+R EC +Y
Sbjct: 248 IMKSQIPT------------------RVLLLANLVSKEDLEDDAEYYDIIDDVRCECEEY 289

Query: 454 GTLVNVVIPR-------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           G +V V +PR        +    +   VG  F+ + +  G + A+  L GRKFG   V  
Sbjct: 290 GPVVRVEMPRVPKGLTLDEIRNMDFSAVGCAFVLFSNIEGASKARKVLDGRKFGHRIVEC 349

Query: 507 FYYPEDKYFNKDYS 520
            ++ E  +   ++S
Sbjct: 350 HFFSELLFHVGEFS 363


>gi|146170296|ref|XP_001470832.1| hypothetical protein TTHERM_00484731 [Tetrahymena thermophila]
 gi|146145092|gb|EDK31651.1| hypothetical protein TTHERM_00484731 [Tetrahymena thermophila
           SB210]
          Length = 471

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 118/220 (53%), Gaps = 7/220 (3%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG--NSAGPGDAVVNVYINH 220
           +++  Q   RHARR+Y+G +P   N++ ++ +  + + A GG   S    + +V   I+ 
Sbjct: 22  IKLDNQSGYRHARRLYIGNIPETINQEYLSEWLYRSLEAAGGLQPSLPSENPIVKCEIDP 81

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN--PTLAAALGPGQPSPNLNL 278
           + +FAF E+R++EE +  + LDGII     +R+RRPT+Y   P +        P  N +L
Sbjct: 82  KGRFAFTELRSIEETTALLQLDGIILWHRQLRIRRPTEYEKFPKVQGQFEANIPKLNFDL 141

Query: 279 -AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQI-KEL-LESFGTLHGFDLVKDRDTGNS 335
              VG+        +GP+++F+  LP    E  I  EL L   G +  F LVKD  T  S
Sbjct: 142 FKTVGIVIIPTIVDDGPNKIFLANLPTKMDELMILDELKLRDMGEIKAFHLVKDNQTNQS 201

Query: 336 KGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
           KGY F  ++DP++TD     L+G++   +TLT +R+   G
Sbjct: 202 KGYAFFEFKDPSLTDNCIETLHGMQYAGRTLTCKRSQIGG 241


>gi|298713809|emb|CBJ27181.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1141

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/342 (28%), Positives = 161/342 (47%), Gaps = 47/342 (13%)

Query: 175  RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE-KKFAFVEMRTVE 233
            R ++VGGLP   +   +  F +  M  +   ++  G+ V+ + +  +   FAF+E+RT E
Sbjct: 749  RELHVGGLPHGVSGVQLQDFLNAAMQYLKIATSA-GNPVIRIAMGPDGTNFAFIELRTEE 807

Query: 234  EASNAMA-LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
            E +  +  + GI      ++  RP  +     A       +P    ++V           
Sbjct: 808  ETNATLGRMSGIQCGTGHLKFGRPKAHAAGATAV------APKKEESSV----------- 850

Query: 293  GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
                + V  LP   T+  ++ELL  FG L  F+L+KD  +G SKG     Y D     +A
Sbjct: 851  ----LMVMNLPDSLTDDHVRELLSPFGELKKFNLLKD-SSGKSKGTAVFEYTDMENGQLA 905

Query: 353  CAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMS 412
             + L+GL +G   L V+R  A          + A   + + ++++  +   +        
Sbjct: 906  LSGLSGLPVGKGKLMVQRVPAM---------MAATLLKPVKVKEVEDEQDNVEPTC---- 952

Query: 413  LFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                    V+ L+  +  + LADD EY EI  D+ EEC +YG + +  +PRP ++G E  
Sbjct: 953  --------VVRLSNMVEVEELADDTEYAEIKGDVVEECEQYGKVKSAEVPRP-EDGKEVL 1003

Query: 473  GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            G+G++F+E+ D  G    +NAL+GRKFGG  V A YYP D +
Sbjct: 1004 GLGEIFVEFEDVAGATKGRNALAGRKFGGKAVKATYYPLDLF 1045


>gi|307106441|gb|EFN54687.1| hypothetical protein CHLNCDRAFT_53018 [Chlorella variabilis]
          Length = 247

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 81/211 (38%), Positives = 109/211 (51%), Gaps = 36/211 (17%)

Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN-VYINHEK 222
           Q+    A R A+RVYVG LP   +E  +    +++M        G GD + N ++   +K
Sbjct: 68  QLFNPDAARPAKRVYVGNLPAAVSEAELRQAVNELM--------GNGDLLFNGMHQVQDK 119

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVG 282
            +AFVE R+VEEASNAMALDG+ F    +++                           VG
Sbjct: 120 GYAFVEFRSVEEASNAMALDGVKFHDSYLKL---------------------------VG 152

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           L        + P ++F+GGLP  ++E Q+KE+L  FG L  F+LV DR TGNSKGY F  
Sbjct: 153 LEVVKTVVQDSPHKLFIGGLPCDWSEDQVKEMLMPFGQLKAFNLVMDRGTGNSKGYAFAE 212

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           + D  VTDI    LNG     K LTV+RA A
Sbjct: 213 FMDVHVTDIVIQNLNGKPCNTKFLTVKRALA 243


>gi|145542929|ref|XP_001457151.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424966|emb|CAK89754.1| unnamed protein product [Paramecium tetraurelia]
          Length = 429

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 145/320 (45%), Gaps = 51/320 (15%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIF-EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           K +  +E  + E     +  D + F     ++V RP  +   L   L P      LN   
Sbjct: 135 KSWVVLECSSKEAKRALVTQDQVQFVNNCKIKVERPRKF---LERILNPQTKDGELN--- 188

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD--RDTGNSKGY 338
                      E   R+++GGLP Y  +  + +L++SFG    F+LVKD   +T  SKGY
Sbjct: 189 ------PDQKQEDNTRLYLGGLPTYLRDEDVMKLIQSFGITKYFNLVKDTTSNTEISKGY 242

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE----------SILA-- 386
            F  Y++   T  A  ALN L++GDK L + +     Q   +            S LA  
Sbjct: 243 CFFEYENAQSTAKALKALNNLQIGDKKLKICKVQGETQQNKKINGKDQPSNYAGSFLASC 302

Query: 387 ------QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
                 Q QQ + I + AL  S                 KV+      +   L +D+ +E
Sbjct: 303 DLLRIPQVQQMLTIPQSALIPS-----------------KVVQFLNMCSIQDLYEDDIFE 345

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGG-ETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
           E++ED+R EC +YG +  + IPRPD+  G   P VGK+F+++Y  +    AK  L+GR +
Sbjct: 346 ELMEDIRSECMRYGQIEKIEIPRPDKESGFCNPAVGKIFVKFYYQIPAKKAKFHLAGRTY 405

Query: 500 GGNTVNAFYYPEDKYFNKDY 519
              T+   +YPE+++  KDY
Sbjct: 406 NKRTIITSFYPEEQFDYKDY 425


>gi|71027151|ref|XP_763219.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68350172|gb|EAN30936.1| hypothetical protein TP03_0201 [Theileria parva]
          Length = 509

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 97/369 (26%), Positives = 172/369 (46%), Gaps = 39/369 (10%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
           R +Y+G +PP+ + + +    +Q + ++ G S  PG+  +  +I+ +  +AF+E+RT+EE
Sbjct: 104 REIYIGNIPPVGDIEILMDIINQALISVNGTSM-PGNPCLKGWISSDGHYAFIELRTMEE 162

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGL---------- 283
           ASN M L G+   G  ++V RP  ++    + A  P  P+ + +L A+G+          
Sbjct: 163 ASNCMQLTGLNIMGHNIKVNRPKTFDADVFSKAPSPTVPTLDPSLLAMGVQALKSAKEQI 222

Query: 284 -ASGAIGGAEG----PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            A+  I  AE      DR+ + G+P    +  + + L   GT+   + +   +  N    
Sbjct: 223 AAASDILAAEKAKPITDRLCLVGIPKDTDQQTVVDTLRLHGTIKFTNFIMGIENFNYITV 282

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI----AI 394
            + +Y    +T +        + G+  +         Q KT  ES+  Q  + I    A+
Sbjct: 283 IYVIYIYQLMTIV--------EKGEMVVLFEYENLEDQ-KTALESLPKQGYRVILAIDAV 333

Query: 395 QKMALQTSGMNTLGGGMSLF-GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKY 453
            +  +    + T     SL   E   +VL L+  ++ D L DDEEY +I++D+R EC  Y
Sbjct: 334 TQGIISPQQIKTQLANCSLMRAEIPTRVLLLSNLVSKDELEDDEEYVDIIDDVRCECELY 393

Query: 454 GTLVNVVIPR-------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
           G ++ V +PR        +    +   VG  F+ +      + A+  L GRKFG  TV+A
Sbjct: 394 GVVLRVELPRVPKGLTEEEMKAFDPTSVGSAFVLFSTVESASKARKVLDGRKFGQRTVHA 453

Query: 507 FYYPEDKYF 515
            ++ E  YF
Sbjct: 454 HFFSE-LYF 461


>gi|399216014|emb|CCF72702.1| unnamed protein product [Babesia microti strain RI]
          Length = 487

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 116/460 (25%), Positives = 197/460 (42%), Gaps = 71/460 (15%)

Query: 81  RHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAV 140
           R R    +SDR R R        S S  ++R  FD +PP  A  P     G + G    +
Sbjct: 57  RGRDIDRASDRSRFR-------HSDSYDRKRFKFD-SPPKQA--PKEGFGGGVLGYVDGI 106

Query: 141 PEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMT 200
           P   +  +       G F          ++TR +R++ +   PP    + I  +F+  M 
Sbjct: 107 PVQGKRHIIMQTCLFGIF------YSEAESTRFSRQLEISNTPPNIEVEVIIEYFNMAML 160

Query: 201 AIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN 260
           A+GGN      A+   + +++K    +EMRT+EE SNA+ L+G+   G ++ + R  +  
Sbjct: 161 AVGGNCLPGNPAIRGKHNSNDKTSITIEMRTLEETSNALQLNGLNLMGKSLSITRVGNCP 220

Query: 261 PT-LAAALGPGQPSPNLNLAAVG-----------LASGAI----GGAEGPDRVFVGGLPY 304
           P  +  A  P  P+ + ++ A+G           L S AI    GGA   DR+ +  LP 
Sbjct: 221 PEYINKAPPPTVPTISPSILALGVNGLQSADIKPLLSNAITSLVGGAPKTDRLLILDLPI 280

Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLK---- 360
             +E QIK ++E FG L    L K+ D  ++   G C+ +    T++   AL  ++    
Sbjct: 281 TQSEDQIKSMVEEFGKLKYIQLFKNADDTSA---GMCLIEF-VDTNVQVEALQKMRLQYN 336

Query: 361 --MGDKTLTVR---RATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFG 415
             + +  LT R   R     Q + + E +  Q      I +  + T+ +++         
Sbjct: 337 IILAEDALTKRIIDRNLLRLQMRNQSELMKTQIPTRCIIIRNLVTTASVSS--------- 387

Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
                       +    L +D EY+E++ED+R EC   G +  V +PR          + 
Sbjct: 388 ------------VQFMILQNDREYQEVIEDIRAECDLMGQVERVEVPR-----NPPSEMA 430

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
             F+ +    G A A+ +L GR+F  N V   +Y E+++ 
Sbjct: 431 YAFVLFESIQGAAMARKSLGGRRFASNVVQVDFYNEEEFM 470


>gi|145538137|ref|XP_001454774.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422551|emb|CAK87377.1| unnamed protein product [Paramecium tetraurelia]
          Length = 426

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 144/320 (45%), Gaps = 51/320 (15%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIF-EGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           K +  +E  + E     +  D + F     ++V RP  +   L   L P      LN   
Sbjct: 132 KSWVVLECSSKEAKRALVTQDQVQFVNNCKIKVERPRKF---LERILNPQARDGELNPEQ 188

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD--RDTGNSKGY 338
                      E   R+++GGLP Y  +  + +L++SFG    F+LVKD   +T  SKGY
Sbjct: 189 ---------KQEDNTRLYLGGLPTYLRDEDVMKLIQSFGITKYFNLVKDTTSNTEISKGY 239

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE----------SILA-- 386
            F  Y+    T  A  ALN L++GD+ L + +     Q   +            S LA  
Sbjct: 240 CFFEYESAQSTAKALKALNNLQIGDRKLKICKVQGETQQNKKINGKDQPSNYAGSFLASC 299

Query: 387 ------QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
                 Q QQ + I + AL  S                 KV+      +   L +D+ +E
Sbjct: 300 DLLRIPQVQQMLTIPQSALIPS-----------------KVVQFLNMCSIQDLYEDDIFE 342

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRKF 499
           E++ED+R EC +YG +  + IPRPD+  G   P VGK+F+++Y  +    AK  L+GR +
Sbjct: 343 ELMEDIRSECVRYGQIEKIEIPRPDKESGFCNPAVGKIFVKFYYQIPAKKAKFHLAGRTY 402

Query: 500 GGNTVNAFYYPEDKYFNKDY 519
              TV   +YPE+++  KDY
Sbjct: 403 NKRTVITSFYPEEQFDYKDY 422


>gi|85000357|ref|XP_954897.1| snrnp splicing factor (U2AF) [Theileria annulata strain Ankara]
 gi|65303043|emb|CAI75421.1| snrnp splicing factor (U2AF), putative [Theileria annulata]
          Length = 486

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 174/368 (47%), Gaps = 52/368 (14%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           ++  R +Y+G +PP+ + + +    +Q + ++ G S  PG+  +  +I+ +  +AF+E+R
Sbjct: 100 SKAFREIYIGNIPPVGDIEILMDIINQALISVNGTSM-PGNPCLKGWISSDGHYAFIELR 158

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNP-TLAAALGPGQPSPNLNLAAVGL------ 283
           T+EEASN M L G+   G  ++V RP  Y+    + A  P  P+ + +L A+G+      
Sbjct: 159 TMEEASNCMQLTGLNIMGHNIKVNRPKTYDADVFSKAPSPTVPTLDPSLLAMGVQALKSA 218

Query: 284 -----ASGAIGGAEG----PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
                A+  I  AE      DR+ + G+P    +  + +LL+S GT+     + +     
Sbjct: 219 KEQIAAASDILAAEKAKSITDRLCLVGIPKDMEQQTVVDLLQSQGTIKFTHFIME----- 273

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
            KG    +++   + D   A  +  K G + +                +I A  Q  I+ 
Sbjct: 274 -KGEMVVLFEYENLEDQKSALESLPKQGYRVIM---------------AIDAVTQGIISP 317

Query: 395 QKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
           Q++  Q +  + +        E   + L L+  ++ + L DDEEY +I++D+R EC  YG
Sbjct: 318 QQIKTQLANCSLMK------AEIPTRALLLSNLVSKEELDDDEEYVDIIDDIRCECELYG 371

Query: 455 TLVNVVIPR-------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAF 507
            ++ V +PR        + N  +   VG  F+ +      + A+  L GRKFG  TV+A 
Sbjct: 372 VVLRVELPRVPKGLSEEEMNSFDPTSVGSGFVLFSTVDSASKARKVLDGRKFGQRTVHAH 431

Query: 508 YYPEDKYF 515
           ++ E  YF
Sbjct: 432 FFSE-LYF 438


>gi|340503018|gb|EGR29650.1| splicing factor u2af large subunit, putative [Ichthyophthirius
           multifiliis]
          Length = 438

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/337 (26%), Positives = 159/337 (47%), Gaps = 25/337 (7%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           Q+   HA R+Y+G +P   + + +  F  + +   GG    PG+ +++   +  KKF F+
Sbjct: 109 QKNYIHALRIYIGNIPDPIDTEDVCHFVYKSLLESGG-LLEPGNPIISKKNDPIKKFIFL 167

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
           ++R++EE S  M LDGI+++G ++R RRP DY  T+    G  +  P L+   + +    
Sbjct: 168 QLRSIEETSACMQLDGILYKGKSLRFRRPKDYT-TMPQVEG-TRKIPILDRNKLRIVQTQ 225

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
           +      +++ V  +P   +E  + ++L+++G L  F L  D  TG SKG+ FC Y    
Sbjct: 226 VENTY--NKLQVMNIPETISEEHVMQILQNYGELRSFHLAVDIYTGESKGFAFCEYLTDK 283

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE-----------------SILAQAQQ 390
            T      L+G ++ +K + V+R   +     E++                 SIL    +
Sbjct: 284 ATMDCLNQLSGQQILNKIINVKRCNPNLAPPVEEQMQPIEVLVKNLCDFINKSILESGFK 343

Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC 450
              IQ+  +Q    N       L  E    VL +   I    + +D EYE I  D++++ 
Sbjct: 344 D--IQEEYIQKVISNEGQKYSGLNQEEATSVLKIKNVIDKQVIEEDPEYEFIYNDLKQQL 401

Query: 451 GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
            K+G L  ++IPR  +   +   VG VF+E+ +   C
Sbjct: 402 VKFGRLKQMIIPRLKEK-YQPDSVGLVFVEFENEKIC 437


>gi|397623851|gb|EJK67169.1| hypothetical protein THAOC_11833 [Thalassiosira oceanica]
          Length = 436

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 174/385 (45%), Gaps = 50/385 (12%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           T+ +R ++VG  PP  +E  +  F S  M+ +   +  P D    V      KF F+E+ 
Sbjct: 66  TKLSRELFVGNTPPGTSEALLMQFLSGAMSRV---NLCPPDVTPIVTCRKNDKFCFIELA 122

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYN----PTLAAALGPGQPSPNLNLAAVGLASG 286
           TV+ A+ A+ L+GI F G ++RV RP+ Y+    P+       GQP P   +AAV   +G
Sbjct: 123 TVDLANKALNLNGIPFLGSSLRVARPSKYSGPHVPSQTWQQLTGQPLPP-GMAAVPENTG 181

Query: 287 AIGGAEGPDRV----FVGGLPYYFTETQIKELL----ESFG--TLHGFDLVKDRDTGNSK 336
                 G D++    F+G      T   +++ L    E  G  T+ G  +V  R +    
Sbjct: 182 VTMALSGEDKLSRELFIGNTTPEMTAEMLRDFLGRAMEQVGLSTMPGNPIVTVRPSAK-- 239

Query: 337 GYGFCVYQDPAVTDIACAA-LNGLKMGDKTLTVRRATASGQSKTEQ---ESILAQ----- 387
              F   +  ++ + A A  LN +      L V R +     +T     E ILA+     
Sbjct: 240 ---FAFIEVRSMQEAANALNLNNIPYLGAQLRVGRPSKYSGPETPHGNWEDILAKFMSGE 296

Query: 388 -------AQQHIAIQKMALQTSGMNTLGGGMSLFGETLAK-----VLCLTEAITADALAD 435
                   Q +  +Q+     +  ++L   ++      +K     V+ L   +T   L D
Sbjct: 297 LHLKNNATQANPLVQQAHAVAAAASSLAPSLASVPPLASKASPSPVVELRHMLTQQDLDD 356

Query: 436 DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALS 495
           D EY +ILED R+EC  +GTL N+VIPR      + PG  K+FLEY  A     A   L+
Sbjct: 357 DNEYNDILEDTRDECSSFGTLKNIVIPR------KGPGATKIFLEYMTAEDAGKAIAGLA 410

Query: 496 GRKFGGNTVNAFYYPEDKYFNKDYS 520
           GR F G  V A Y+   K+ N+DYS
Sbjct: 411 GRTFDGRKVTAVYFDTVKFANEDYS 435


>gi|255082091|ref|XP_002508264.1| RNA binding protein [Micromonas sp. RCC299]
 gi|226523540|gb|ACO69522.1| RNA binding protein [Micromonas sp. RCC299]
          Length = 493

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/246 (32%), Positives = 121/246 (49%), Gaps = 35/246 (14%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN---SAGPGDAVVNVYINHEKKFA 225
           Q TR +RR+YVG LP   N++A+  FF+  M   G     S GP  +VVN  I HEK FA
Sbjct: 112 QHTRQSRRLYVGSLPKPVNDEALHAFFNNAMVNSGAAIDPSGGP--SVVNTTITHEKGFA 169

Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY----NPTLAAALGPGQPSPNLNLAAV 281
           F+E R +E+A +A+  DGI+F G  + ++RP DY    NP + A  G   P   + L   
Sbjct: 170 FIEFRRLEDAESALMFDGIVFNGSKLIIKRPKDYDAARNP-IWAMRGQAPPQDEVKLIGE 228

Query: 282 GLASGAI--GGAE----------------------GPDRVFVGGLPYYFTETQIKELLES 317
            L  G I   G E                      GP +++ GG     T+ Q++++L+S
Sbjct: 229 ELPIGTIIVDGKEVKIPLPPPLPSEWPRLPRRTPNGPHKMYCGGFHPLHTDLQVRQVLQS 288

Query: 318 FGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQS 377
            G L  F ++ D + G   G+ F  Y+DP ++ +A   L G+++ ++ L  RR       
Sbjct: 289 VGELKSFAVMPD-ENGRPTGHAFFEYKDPRLSAVAETVLTGIRVRNRRLVCRRMNPDAAP 347

Query: 378 KTEQES 383
           +   ES
Sbjct: 348 EKPGES 353


>gi|403223258|dbj|BAM41389.1| snRNP splicing factor U2AF [Theileria orientalis strain Shintoku]
          Length = 534

 Score =  121 bits (304), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 170/371 (45%), Gaps = 51/371 (13%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           ATR  R +Y+G +PP+ +   +    +Q + ++ G S  PG+  +  +I+ +  +AFVE+
Sbjct: 119 ATRPYREIYIGNIPPVGDIAILLDIINQALISVNGTSM-PGNPCLKGWISSDGHYAFVEL 177

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA-ALGPGQPSPNLNLAAVGL----- 283
           RT+EEASN M L G+   G  ++V RP  Y+P L + A  P  P+ + +L A+GL     
Sbjct: 178 RTMEEASNCMQLTGLNIMGHNIKVNRPKTYDPDLMSKAPSPTVPTLDPSLLAMGLQALKS 237

Query: 284 ------ASGAIGGAEG----PDRVFVGGLPYYFTETQIKELLESFGTL-HGFDLVKDRDT 332
                 A+  +  AE      DR+ +  +P    +  +  L+ S G + + + + +  ++
Sbjct: 238 AREQIVAASDVLAAEKAKVMTDRLCIVDIPPEADKQTVINLVHSMGEVKYTYFVDEPAES 297

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGL-KMGDKTLTVRRATASGQSKTEQESILAQAQQH 391
           G +K      Y +    D    A+  + KM  + +    A   G    E    + +  + 
Sbjct: 298 GTNKRVFLFEYMN---MDHQKKAMEEIPKMNYRLILAIDAVTQGMIAPE---YIKKQLES 351

Query: 392 IAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG 451
            AI K  + T                  + L L   ++ + L DD EY +I++D++ EC 
Sbjct: 352 CAIMKPEVPT------------------RALLLGNLVSKEELDDDAEYVDIIDDVKTECE 393

Query: 452 KYGTLVNVVIPRPDQNGGE-------TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
            YG ++ + +PR  +   E          VG  F+ +    G + A+  L GRKFG  TV
Sbjct: 394 DYGVVLRLELPRVPKGLSEEEMRSFDESSVGSAFVLFSTVDGASKARKVLDGRKFGNRTV 453

Query: 505 NAFYYPEDKYF 515
            A ++ E  YF
Sbjct: 454 KAHFFSE-LYF 463


>gi|218192051|gb|EEC74478.1| hypothetical protein OsI_09930 [Oryza sativa Indica Group]
          Length = 1128

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 96/342 (28%), Positives = 157/342 (45%), Gaps = 42/342 (12%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
           V   QATR  RR+++  LP LA E  +    ++ ++++   +        ++  IN +K+
Sbjct: 628 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 687

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
            AFVE  T E+A+ A++ DG  F G ++++RRP +Y     A + P +PS  + L +  +
Sbjct: 688 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 745

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
                  A+ P ++F+ G+    +   + E++ SFG L  +  + + D G +    F  Y
Sbjct: 746 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEDLGGA--CAFLEY 796

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
            D ++T  ACA LNG+K+G   LT   A     + TEQ     +A     I   A     
Sbjct: 797 IDHSITSKACAGLNGMKLGGGILT---AVNVFPNSTEQ--AFNEASPFYGIPDSA----- 846

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTLVNV-V 460
                   SL  E   KVL L      +   L    E EEILED+R EC ++G + ++ V
Sbjct: 847 -------KSLLEEP-TKVLQLKNVFDQEEYLLLSKSELEEILEDVRVECARFGAVKSINV 898

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           +  P  +   T   G    E      C      +  +++GGN
Sbjct: 899 VEYPASSDNTT---GDTITE------CEDGSTKIEPKEYGGN 931


>gi|340506650|gb|EGR32741.1| u2 snrnp auxilliary splicing factor, putative [Ichthyophthirius
           multifiliis]
          Length = 276

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 83/261 (31%), Positives = 133/261 (50%), Gaps = 34/261 (13%)

Query: 278 LAAVGLASGAIGGAEG--PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN- 334
           +  + +    +   EG   +++++GGLP Y  + +IK+L E+FG L  F+L K ++    
Sbjct: 26  MDKIQVEDAILDSEEGIQENKIYMGGLPTYLKDPEIKKLCETFGKLKYFNLAKQQNENKE 85

Query: 335 --SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
             SKGY F  Y+D  VTD A  ALNGL  GD+ L V + T   Q+K      LA+ QQ  
Sbjct: 86  WVSKGYCFFEYEDKEVTDRAIKALNGLPCGDRKLKVSKVT-RDQNK------LAKTQQ-- 136

Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTE-------------AITADALADDEEY 439
                 +Q    + LG    +  E + K+L + E               + + L +D+ Y
Sbjct: 137 ------IQNDSGSYLGDCHLIKNEFVRKMLSIPEYTYQPSRVIQLLNMCSPEDLFEDDIY 190

Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGET-PGVGKVFLEYYDAVGCATAKNALSGRK 498
            EI +D++ EC K G +  V I RP +  G   P VGK+F+++   +    A++ L+GR 
Sbjct: 191 NEIYQDVQSECEKIGPIEKVEIVRPCKMTGICPPSVGKIFVKFKYLLKAKRARHVLNGRT 250

Query: 499 FGGNTVNAFYYPEDKYFNKDY 519
           +   TV A +YPE+K+  K++
Sbjct: 251 YNKRTVVASFYPEEKFDCKEF 271


>gi|118376950|ref|XP_001021657.1| hypothetical protein TTHERM_00151210 [Tetrahymena thermophila]
 gi|89303423|gb|EAS01411.1| hypothetical protein TTHERM_00151210 [Tetrahymena thermophila
           SB210]
          Length = 554

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 161/365 (44%), Gaps = 43/365 (11%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           Q+   HA RVY+G +P   + + +  F  + M   GG    PG+                
Sbjct: 210 QKQYIHALRVYIGNIPDPVDVEDVCKFVFEQMANAGG-LLEPGNP--------------- 253

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGA 287
            +R++EE S  M LDGII++G ++R RRP D+   L    G  +P P L+   + +    
Sbjct: 254 -LRSIEETSACMELDGIIYKGKSLRFRRPKDFG-VLQKVEG-TRPVPTLDKTKLKIVQTQ 310

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
           +      +++ +  LP  F+E  + +LL ++G L  F L  D+ T  SKG+ FC +    
Sbjct: 311 VENTY--NKLQIMNLPENFSEEHVMQLLLTYGDLKSFHLAVDKITSESKGFAFCEFITDR 368

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT---SGM 404
            T      L+G ++ +K + V+R       + E E IL+  Q +  + +   +T   SG 
Sbjct: 369 STVECLNKLSGQQILNKVINVKRCNPQLAPQHE-EPILSLDQLYKNLVENVNKTIIESGQ 427

Query: 405 -----NTLGGGMS--------LFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG 451
                + L   +S        L  E    VL L   +    + +D EY  I  D++ +  
Sbjct: 428 KDIQEDYLKKMLSINAPKYDGLITEDATNVLKLHNIVNKQLIEEDAEYHFIFNDLKTQLD 487

Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA-FYYP 510
           + G    ++IPR      E  G+G VF+E+ +      A   L   K+ G  V A FY P
Sbjct: 488 RIGRTKQIIIPRKKDKFLE--GIGFVFVEFDNERTSQIASFLLQKIKYDGKDVKAEFYSP 545

Query: 511 EDKYF 515
           +  YF
Sbjct: 546 Q--YF 548


>gi|428172624|gb|EKX41532.1| hypothetical protein GUITHDRAFT_112506 [Guillardia theta CCMP2712]
          Length = 514

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 164/371 (44%), Gaps = 39/371 (10%)

Query: 169 QATRHARRVYVGGLP---PLANEQAIATFFSQVMTAI-------------GGNSAGPGDA 212
           Q T  ARRVYVG LP   P  +E A+  FF Q M  +              G +  PG  
Sbjct: 133 QLTLKARRVYVGNLPQLDPPISEPALKEFFDQAMHQVQDQGAYFKAEFAQAGLTQSPGCC 192

Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP 272
           V +V+I+ EK FAF+E+RTV+EA++AM LDGI F G  +RV RP DY P    A+     
Sbjct: 193 VCDVWISSEKHFAFIEVRTVQEATSAMTLDGITFYGTPLRVNRPHDYVPPAPDAMIMTMA 252

Query: 273 SPNLNLAAVGLA---SGAIGGAEGPDRVFVGGLPY-YFTETQIKELLESFGTLHGFDLVK 328
              L  +  G+A   S  +   +   R+ VG L     T   +K+ +    ++    LV 
Sbjct: 253 QAGLMGSGGGIAANLSALMQQTKKARRIHVGNLLVGSMTSASLKQFISQ--SMQQLSLVV 310

Query: 329 -------DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQ 381
                  D        +GF   +  A  + A  AL+G++   + + V R         E 
Sbjct: 311 KPGDPCIDSFLSGDGNFGFVEMRTVAEANNA-MALSGIECNGRPIRVGRPADYVPLNAE- 368

Query: 382 ESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAK---VLCLTEAITADALADDEE 438
             ++AQ  Q   I            +G GM L G   +K   V+ +   ++ D LA+D+E
Sbjct: 369 --LIAQC-QGTGILGTPGDAGVTEAVGAGM-LNGPDESKATEVVVIRNMMSDDDLANDDE 424

Query: 439 YEEILEDMREEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGR 497
            ++I ED   +C  +YG +V  VI RP + G     +G V +++        A N L+  
Sbjct: 425 CKDIAEDTISKCEEEYGKVVRFVIVRPGREGAPADLIGNVLVQFETKESAIKAANDLNHV 484

Query: 498 KFGGNTVNAFY 508
           KF    V   Y
Sbjct: 485 KFDERVVETDY 495


>gi|145547916|ref|XP_001459639.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124427465|emb|CAK92242.1| unnamed protein product [Paramecium tetraurelia]
          Length = 402

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 90/356 (25%), Positives = 157/356 (44%), Gaps = 48/356 (13%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
           HA R+Y+G LP   ++  +  +  Q M + G     PGD V+ V +   +K+ FV+ R++
Sbjct: 86  HAVRLYLGNLPDNVDKDHLHNYIRQQMESHGA-VLDPGDPVIQVQLQPGQKYCFVQFRSI 144

Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
           EE   A+ +D I ++G  ++ +R  DY   ++  +   +  P             I   E
Sbjct: 145 EETEAALQIDTINYQGKPLKFKRVKDYE--ISPRIEGEREVP------------KIQPKE 190

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN-SKGYGFCVYQDPAVTDI 351
              ++FV GL        +  +L  +G L   ++V  RD  N  KG+ FC ++    T  
Sbjct: 191 PAQKLFVCGLAPDTDNDALANILSEYGNLKSLNVV--RDIKNVCKGFAFCEFETDLETQN 248

Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGM 411
               LN   +G + L V++                 AQ     Q   + T  +    G  
Sbjct: 249 CVNGLNNKVIGGRLLQVKK----------------NAQLPTPTQDYIIDTITL----GEQ 288

Query: 412 SLFGETLAK--------VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR 463
           S F   L +        V+ +  A+    + DD EY  I++D+++E  K G L+++V+PR
Sbjct: 289 SAFEAKLQQINQMKVSSVVVINNAVRIKNIEDDYEYNFIVKDLKKEIEKIGRLISMVVPR 348

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             +  G + G+GKVF+E+ +      A   L  +K+ G  ++  +Y    Y +K Y
Sbjct: 349 --KKDGYSEGIGKVFVEFENEQFAKIAIILLQNKKYDGREIDIAFYDPRLYADKQY 402


>gi|209878476|ref|XP_002140679.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
 gi|209556285|gb|EEA06330.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
          Length = 577

 Score =  116 bits (290), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 86/364 (23%), Positives = 164/364 (45%), Gaps = 21/364 (5%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
           R VYVG LP       +  F +Q +     N   PG+  V+ +I+ + K+AF E R++EE
Sbjct: 183 REVYVGNLPSGIGTTTLLEFMNQFLIK-NCNITTPGNPFVSAWISSDGKYAFCECRSMEE 241

Query: 235 ASNAMALDGII-FEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE- 292
           A+ A+ L+  I   G  +R+ RP     +          +  ++  +    +  +   + 
Sbjct: 242 ANMALQLNNTINLNGNILRIGRPKTIENSSNINSSNEPNNSVVSSISTQSNTTFLSNIQP 301

Query: 293 ---GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLV-------KDR-DTGNSKGYGFC 341
                DR+ + G PY +++  I++++          L+       K R ++ N      C
Sbjct: 302 IIKKADRIVISGFPYSYSDEDIEDIIREVNGNQAIKLLYVPPNSNKGRIESSNCLKIAIC 361

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            ++D  +T+     +N   + +  L   R+  + Q+K    ++L+     I   ++   +
Sbjct: 362 EFEDVVITERVIRRVNTQNVCNLKLNAFRSHEALQNKYIL-NVLSDEIHKIYDYEVKQLS 420

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVI 461
           S  + +   + L G+   + + ++  IT + L  D  Y EI++++++E  KYG + ++VI
Sbjct: 421 SDYSEISTFL-LRGQIPCRCIKISNIITPEELVVDNIYNEIMDEIKQEVCKYGNIKHIVI 479

Query: 462 PRP-----DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           PRP      ++ G     G VF+ Y D      AK  L   KF G +V   YYPE+ +  
Sbjct: 480 PRPASAFKSEDSGYFSIYGSVFVLYNDVQNAIDAKINLYKMKFSGRSVCISYYPENYFIQ 539

Query: 517 KDYS 520
            ++S
Sbjct: 540 NNFS 543


>gi|50552688|ref|XP_503754.1| YALI0E09889p [Yarrowia lipolytica]
 gi|49649623|emb|CAG79345.1| YALI0E09889p [Yarrowia lipolytica CLIB122]
          Length = 601

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 161/365 (44%), Gaps = 51/365 (13%)

Query: 171 TRHARRVYVGGLPP-LANEQAIATFFSQVMTAIGGNSAGPGDAVVN-VYINHEKKFAFVE 228
           +R ARR+ + G+P    +  AI +FF+  +   G    G  + +V+ VY +       VE
Sbjct: 262 SRVARRLILSGIPADQIDTVAIKSFFTDFIE--GLELQGSKERIVDGVYKHPRLPEVLVE 319

Query: 229 MRTVEEASNAMALDG--IIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
             + E A+ A+AL G  I + G  + +RRP++Y       + P      ++  ++     
Sbjct: 320 FFSAEMATLALALSGLGINYSGPPISIRRPSNY-------ICPTPERSEVSRRSLDEEKE 372

Query: 287 AIGGAEGPD-RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
                E  + ++ V  +P+   E Q+++L  SFG L  F L++   +  S G     Y+D
Sbjct: 373 VASVVEDSNTKIIVWDIPFNVEEDQVRQLTASFGELSAFQLIRQLPSRESAGIALVDYKD 432

Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
           P V   A + L+G  +G K L V  A   G ++                    L  S  N
Sbjct: 433 PEVVKDAVSGLSGQVIGGKNLKVMLA-CEGPTQ--------------------LSCSSNN 471

Query: 406 TLGGGMSLFGETLAK----VLCLTEAITADALADDEEYEEILEDMREECGKY--GTLVNV 459
            L G +++  +  ++    V+ L   +T D L DD  Y EI E +  EC KY  G  V +
Sbjct: 472 GLKGIVTVMNDVKSRPESSVIVLFNLVTLDELLDDVAYREITEQVESECLKYGGGEEVQI 531

Query: 460 VIPRPDQNGGET----------PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
            IPRPD    +           PGVGKV++++        A   L+G +F   +V A YY
Sbjct: 532 KIPRPDPEAMKASYRRLIFETRPGVGKVYVKFASVETSRVAMQKLTGLRFSRRSVIASYY 591

Query: 510 PEDKY 514
            E+ +
Sbjct: 592 SEECF 596


>gi|108706080|gb|ABF93875.1| RNA recognition motif family protein, expressed [Oryza sativa
           Japonica Group]
          Length = 964

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 95/342 (27%), Positives = 156/342 (45%), Gaps = 42/342 (12%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
           V   QATR  RR+++  LP LA E  +    ++ ++++   +        ++  IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
            AFVE  T E+A+ A++ DG  F G ++++RRP +Y     A + P +PS  + L +  +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
                  A+ P ++F+ G+    +   + E++ SFG L  +  + +   G +    F  Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGA--CAFLEY 632

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
            D ++T  ACA LNG+K+G   LT   A     + TEQ     +A     I   A     
Sbjct: 633 IDHSITSKACAGLNGMKLGGGILT---AVNVFPNSTEQ--AFNEASPFYGIPDSA----- 682

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTLVNV-V 460
                   SL  E   KVL L      +   L    E EEILED+R EC ++G + ++ V
Sbjct: 683 -------KSLLEEP-TKVLQLKNVFDQEEYLLLSKSELEEILEDVRVECARFGAVKSINV 734

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           +  P  +   T   G    E      C      +  +++GGN
Sbjct: 735 VKYPASSDNTT---GDTITE------CEDGSTKIEPKEYGGN 767


>gi|118489922|gb|ABK96758.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 787

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 84/303 (27%), Positives = 141/303 (46%), Gaps = 39/303 (12%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
           +   QAT   RR+Y+  +P  A+E+A+    +  + + G +        ++     EK  
Sbjct: 285 IQLTQATHPIRRLYMENIPASASEKAVMDCLNNFLISSGVHHIQGTQPCISCIRQKEKGQ 344

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           A VE  T E+AS A++ DG  F G  ++VRRP D+   +  A G         L     A
Sbjct: 345 ALVEFLTPEDASAALSFDGRSFSGSIIKVRRPKDF---IEVATG--------ELEKSAAA 393

Query: 285 SGAIGG--AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
             AIG    + P ++F+GG+    +   + E+  +FG L  +     +D    + + F  
Sbjct: 394 IDAIGDIVKDSPHKIFIGGISKVLSSKMLMEIASAFGPLKAYQFENRKDP--DEPFAFLE 451

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK-TEQESILAQAQQHIAIQKMALQT 401
           Y D +VT  ACA LNG+K+G + +T  +A  +  S  ++  S   Q  QH          
Sbjct: 452 YADESVTFKACAGLNGMKLGGQVITAIQAVPNASSSGSDGNSQFGQISQH---------- 501

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALA--DDEEYEEILEDMREECGKYGTL--V 457
                     +L  E   +VL L     +++L+   + E EE+LED+R EC ++G++  +
Sbjct: 502 --------AKALL-EKPTEVLKLKNVFDSESLSSLSNTEVEEVLEDVRLECARFGSVKSI 552

Query: 458 NVV 460
           NV+
Sbjct: 553 NVI 555


>gi|312372039|gb|EFR20089.1| hypothetical protein AND_20681 [Anopheles darlingi]
          Length = 384

 Score =  115 bits (287), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 85/284 (29%), Positives = 128/284 (45%), Gaps = 24/284 (8%)

Query: 256 PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELL 315
           P  Y    AA   P     +   AAV +    I       R++VG +P+  TE ++ E  
Sbjct: 105 PLQYKAMQAAGQIPANIVADTPQAAVPVVGSTI--TRQARRLYVGNIPFGVTEEEMMEFF 162

Query: 316 ESFGTLHGF-----DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
                L G      + V        K + F  ++    T  A  A + +    ++L +RR
Sbjct: 163 NQQMHLSGLAQAAGNPVLACQINLDKNFAFLEFRSIDETTQA-MAFDSINFKGQSLKIRR 221

Query: 371 ATASGQSKTEQESILAQAQQHIA--IQKMALQTSGMNTLGG-----------GMSLFGET 417
                      +S      +  +  I  +   ++    +GG           G+SL G +
Sbjct: 222 PHDYQPMPGMTDSATVNVPEKFSGVISTVVPDSAHKIFIGGLPNYLNEDQVPGLSLVGSS 281

Query: 418 --LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
               +VLCL   +T D L D+EEYE+ILED+REEC KYG + +V IPRP + G + PG G
Sbjct: 282 GPPTEVLCLLNMVTPDELKDEEEYEDILEDIREECNKYGVVRSVEIPRPIE-GVDVPGCG 340

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           KVF+E+   V C  A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 341 KVFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 384



 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/159 (35%), Positives = 86/159 (54%), Gaps = 12/159 (7%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P   V V+    TR ARR+YVG +P    E+ +  FF+Q M  + G +   G+ V+   I
Sbjct: 126 PQAAVPVVGSTITRQARRLYVGNIPFGVTEEEMMEFFNQQM-HLSGLAQAAGNPVLACQI 184

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP---SPN 275
           N +K FAF+E R+++E + AMA D I F+G ++++RRP DY P       PG     + N
Sbjct: 185 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPM------PGMTDSATVN 238

Query: 276 LNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKEL 314
           +     G+ S  +   +   ++F+GGLP Y  E Q+  L
Sbjct: 239 VPEKFSGVISTVV--PDSAHKIFIGGLPNYLNEDQVPGL 275


>gi|347968831|ref|XP_003436305.1| AGAP002908-PB [Anopheles gambiae str. PEST]
 gi|333467821|gb|EGK96708.1| AGAP002908-PB [Anopheles gambiae str. PEST]
          Length = 144

 Score =  114 bits (285), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 64/163 (39%), Positives = 93/163 (57%), Gaps = 21/163 (12%)

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGET- 417
           +++GDK L V+RA+    +K    +++A  Q  +                 G+SL G + 
Sbjct: 1   MQLGDKKLIVQRASVG--AKNSNAAVVAPVQIQVP----------------GLSLVGSSG 42

Query: 418 -LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
              +VLCL   +T D L D+EEYE+ILED+REEC KYG + +V IPRP + G + PG GK
Sbjct: 43  PPTEVLCLLNMVTPDELKDEEEYEDILEDIREECNKYGVVRSVEIPRPIE-GVDVPGCGK 101

Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           VF+E+   V C  A+ AL+GRKF    V   Y+  DKY  +++
Sbjct: 102 VFVEFNSIVDCQKAQQALTGRKFSDRVVVTSYFDPDKYHRREF 144


>gi|403345499|gb|EJY72120.1| Splicing factor U2af large subunit, putative [Oxytricha trifallax]
          Length = 437

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 98/377 (25%), Positives = 159/377 (42%), Gaps = 47/377 (12%)

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA--GPGDAVVNVYIN 219
           P Q +T    +  R++YVG +PP      I    +  +  +G ++     GD +V  +I+
Sbjct: 83  PSQGLTNH-NKAERQLYVGNIPPGLAVPQIMELLNTALKELGKDAGIFQEGDPIVGAWIS 141

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDY-----NPTLAAALGP----G 270
            +  +AFV+ RT EEA+   AL  +   G  ++V RP +      NP+   A  P    G
Sbjct: 142 GDGHYAFVDFRTAEEATQGFALQQVSIHGNNLKVGRPKNATGPIPNPSQLLAGNPNLMSG 201

Query: 271 QP--SPNLNLAAVGLASGAIGGAEGP------DRVFVGGLPYYFTETQIKELLESFGTLH 322
           Q   S N      GL +  +G            +V V   P   ++  I ++ E FG + 
Sbjct: 202 QNVISNNKKKTNQGLKNLQLGDQGNQIIQALNTKVMVSNFPVNHSKESIHKICEVFGKVK 261

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
             DL+KD  TG  KG     ++D          L G K+ +K L V+R T      T+ E
Sbjct: 262 NVDLLKDITTGEFKGQVNVEFEDELEAKKGYTGLMGFKIDEKVLFVKRLTTISAPTTQIE 321

Query: 383 SILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
             + +                        +L  +   + L L   I  + + + ++Y+++
Sbjct: 322 GEVFK------------------------NLIEDKPTECLMLKNCIILEEMTERDDYKDL 357

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGG--ETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
              + EE  +YG +V V  PRP   G     PGVGKV++ +        AK+ +  R+  
Sbjct: 358 EIAVEEEMSRYGKVVKVHCPRPPIFGDPYSVPGVGKVYVRFQTEEDSEKAKHGIYKRRLN 417

Query: 501 GNTVNAFYYPEDKYFNK 517
           G  V+  YY  +K FNK
Sbjct: 418 GRAVDPVYYSVEK-FNK 433


>gi|383125816|gb|AFG43490.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125818|gb|AFG43491.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125820|gb|AFG43492.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125822|gb|AFG43493.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125824|gb|AFG43494.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125826|gb|AFG43495.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125828|gb|AFG43496.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125830|gb|AFG43497.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125832|gb|AFG43498.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125834|gb|AFG43499.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
 gi|383125836|gb|AFG43500.1| Pinus taeda anonymous locus 2_3207_01 genomic sequence
          Length = 93

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 50/88 (56%), Positives = 64/88 (72%)

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
           A+  D L DD+E+E+I +DM+EECGK+G +  +VIPRP   G E PGVGKVF+EY +   
Sbjct: 1   AVNPDELLDDQEFEDIYDDMKEECGKHGEITKLVIPRPKSTGEEVPGVGKVFVEYANTQS 60

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKY 514
            A A+ +L GRKFGGN V A YYPEDK+
Sbjct: 61  SAKARASLHGRKFGGNVVVAVYYPEDKF 88


>gi|294948294|ref|XP_002785691.1| splicing factor u2af large subunit, putative [Perkinsus marinus
           ATCC 50983]
 gi|239899714|gb|EER17487.1| splicing factor u2af large subunit, putative [Perkinsus marinus
           ATCC 50983]
          Length = 370

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 84/296 (28%), Positives = 138/296 (46%), Gaps = 16/296 (5%)

Query: 239 MALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD-RV 297
           M L+GI  +G  + + RP  YN    +     +    + +  +   S     A G + ++
Sbjct: 1   MRLNGIDLDGRKLAITRPHGYNKEDPSKSITAEDIQKVTIEELCGGSSTKKTAPGSNLQL 60

Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALN 357
            +  LP   TET +++LLE FG L    L++D+ TG SKGYGFC ++DP   D    AL+
Sbjct: 61  GIYHLPPVMTETYLRDLLEQFGALTMVSLIRDKTTGLSKGYGFCQFEDPNDADRCLYALD 120

Query: 358 GLKMGDKTLTVRRATASGQSKTEQESILAQ-------AQQHIAIQKMALQTSGMNTLGGG 410
              +G+ +L+V R     Q         A        A     +Q M  +      L   
Sbjct: 121 QFVLGNYSLSVTRLVPDAQQGGAAGIGGAGVGPATNLADGSSGVQSMTARVLANPALAAQ 180

Query: 411 MSL---FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN 467
           +      G T + V+ L  A+  + L  + E + I +++REE  ++GT++ V +PRP  +
Sbjct: 181 LKAGREIGSTPSTVVQLLNAVYIEDLMSETEVKSIEDEIREEAQRHGTVLEVRVPRP--S 238

Query: 468 GGETP---GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
              TP   GVGK+F+++ D       +   +GRKF    + A +YP D+Y    Y+
Sbjct: 239 ASLTPYANGVGKIFVQFADITAARKFQATNNGRKFDDRVMCAAFYPTDRYKMGKYT 294


>gi|432095994|gb|ELK26905.1| Splicing factor U2AF 65 kDa subunit [Myotis davidii]
          Length = 171

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 68/177 (38%), Positives = 94/177 (53%), Gaps = 36/177 (20%)

Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
           MPV V     TR ARR+ VG +P    E+A+                          +N 
Sbjct: 31  MPVPVAVSNMTRQARRLCVGNIPFGITEEAM--------------------------VNR 64

Query: 221 EKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAA 280
           +K FAF+E R+V+E + AMALDGIIF+G ++++RRP DY P    +  P    P      
Sbjct: 65  DKNFAFLEFRSVDETTQAMALDGIIFQGQSLKIRRPHDYQPLPDMSENPSVYLP------ 118

Query: 281 VGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
            G+ S  +   +   ++F+GGLPYY  + Q+KELL SFG L  F+LVKD  TG S+G
Sbjct: 119 -GVVSTVV--PDSAHKLFMGGLPYYLKD-QVKELLTSFGPLKAFNLVKDGATGLSRG 171


>gi|414591751|tpg|DAA42322.1| TPA: hypothetical protein ZEAMMB73_939656 [Zea mays]
          Length = 704

 Score =  111 bits (278), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 59/106 (55%), Positives = 70/106 (66%), Gaps = 5/106 (4%)

Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD AP   A  ++    +PGQLPGV + +P +    N+    A Q     + P Q MT
Sbjct: 184 SGFDQAPTQQAVPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPHVIQP-QAMT 242

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAV 213
           QQATRHAR VYVGGLPP ANEQ +A FF+ VM AIGGN+AGPGDAV
Sbjct: 243 QQATRHARPVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAV 288


>gi|108706079|gb|ABF93874.1| RNA recognition motif family protein, expressed [Oryza sativa
           Japonica Group]
          Length = 964

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/342 (27%), Positives = 155/342 (45%), Gaps = 42/342 (12%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
           V   QATR  RR+++  LP LA E  +    ++ ++++   +        ++  IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
            AFVE  T E+A+ A++ DG  F G ++++RRP +Y     A + P +PS  + L +  +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
                  A+ P ++F+ G+    +   + E++ SFG L  +  + +   G +    F  Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGA--CAFLEY 632

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
            D ++T  ACA LNG+K+G   LT   A     + TEQ     +A     I   A     
Sbjct: 633 IDHSITSKACAGLNGMKLGGGILT---AVNVFPNSTEQ--AFNEASPFYGIPDSA----- 682

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTLVNV-V 460
                   SL  E   KVL L      +   L    E EEILED+R E  ++G + ++ V
Sbjct: 683 -------KSLLEEP-TKVLQLKNVFDQEEYLLLSKSELEEILEDVRVEYDRFGAVKSINV 734

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           +  P  +   T   G    E      C      +  +++GGN
Sbjct: 735 VKYPASSDNTT---GDTITE------CEDGSTKIEPKEYGGN 767


>gi|224077134|ref|XP_002305147.1| predicted protein [Populus trichocarpa]
 gi|222848111|gb|EEE85658.1| predicted protein [Populus trichocarpa]
          Length = 191

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 88/173 (50%), Positives = 107/173 (61%), Gaps = 23/173 (13%)

Query: 15  SRHK--SSWVSGRSRTGERGRDRHHRDFKSG------GDDRRRDKNYKYDREGIRDHDRT 66
           SRHK  SS  S   R+  RGR + H  ++ G       +D RRDK   +DR     H+R+
Sbjct: 23  SRHKTYSSRESEHDRSRTRGRGKDHDRYRGGYKDGSVRNDGRRDKFGDFDR-----HERS 77

Query: 67  DR------HRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
            R      HRDY+ D+ RR+ +RS S+S  RF+NRS+S S SRSPSKSKR+SGFDMAP  
Sbjct: 78  SRGRNYHRHRDYDGDRGRRNGNRSSSYSQGRFQNRSRSRSRSRSPSKSKRKSGFDMAPSE 137

Query: 121 AAMLPGAAV----PGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
             MLPGAAV     GQLP +P  +P + QN L FG TQ G FPLMP Q MTQQ
Sbjct: 138 VGMLPGAAVAVNDAGQLPSLPQTMPGVVQNALQFGTTQFGVFPLMPAQAMTQQ 190


>gi|429328959|gb|AFZ80718.1| hypothetical protein BEWA_001250 [Babesia equi]
          Length = 711

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 98/381 (25%), Positives = 157/381 (41%), Gaps = 83/381 (21%)

Query: 189 QAIATFFSQVM----TAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGI 244
           Q +  FF+  +    T+I  N  GP   +     N E+ + F+E  T E A     LDGI
Sbjct: 363 QDVVDFFNGALMTMSTSIDIN--GPMPVMKTEIFNQEQGYCFLEFTTAEYADLCYKLDGI 420

Query: 245 IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPY 304
              G ++++RRP D++ ++++                              ++FV  +P 
Sbjct: 421 QCNGYSLKLRRPIDFSSSMSSE---------------------------DTKIFVQNIPE 453

Query: 305 YFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDK 364
            F+E  I++LLE+ G L   +LV D  T  +KGYGF  Y+  +    A   LNG  + + 
Sbjct: 454 SFSEEDIRKLLEAHGKLKTCNLVIDPFTRLNKGYGFFEYESSSSAKEAVIHLNGHVIQNN 513

Query: 365 TLTVRRA-----TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
            L+V+ A      A G+    + S +  +  H       L   G+    G     G   +
Sbjct: 514 VLSVKHAAFSSFAAGGKPADCRASSIITSVSHCVFSNPLL---GLQMQNGRKK--GSEPS 568

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPR---------------- 463
           +V+ L   +  + + DD+ Y E+L++++EE  KYG L  + IPR                
Sbjct: 569 RVVQLLNVVYPEDILDDKNYREMLKEIKEEAQKYGPLEEIYIPRIHKREEPASIEDVKTE 628

Query: 464 -----------------------PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
                                   D+N     GVGKVFL+Y +      A+  L+GR F 
Sbjct: 629 GNDKVAVKSEETSVKTDVRQQTIEDRNKEYQLGVGKVFLKYSNETAGRKAQYMLNGRIFD 688

Query: 501 GN-TVNAFYYPEDKYFNKDYS 520
            N  V A ++P D Y    Y+
Sbjct: 689 KNRVVCAAFFPCDLYQQGKYT 709


>gi|358417046|ref|XP_001256277.3| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Bos taurus]
          Length = 330

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 68/177 (38%), Positives = 95/177 (53%), Gaps = 8/177 (4%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKVRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQIN 192

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
            +K FAF+E R+V+E + AMA DGIIF+G ++++RRP DY P    +  P    P L
Sbjct: 193 QDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGL 249


>gi|242037001|ref|XP_002465895.1| hypothetical protein SORBIDRAFT_01g047730 [Sorghum bicolor]
 gi|241919749|gb|EER92893.1| hypothetical protein SORBIDRAFT_01g047730 [Sorghum bicolor]
          Length = 969

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 99/373 (26%), Positives = 163/373 (43%), Gaps = 45/373 (12%)

Query: 102 SRSPSK-----SKRRSGFDMAPPAA--AMLPGAAVP--GQLPGVPSAVPEMAQNMLPFGA 152
           +++PSK      K+ + +D  P  A  +  P   +P  GQ+  +P +   + ++      
Sbjct: 391 TKTPSKVIQSPEKKSATWDQPPVKANQSNFPTTFLPTVGQMAPIPFSFSTIKKDPSTTVE 450

Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVM--TAIGGNSAGPG 210
           T L    L    V   QATR  RR+++  LP  A E  +    +  +  T I    + P 
Sbjct: 451 TMLVGNSLTADSVQLTQATRPLRRLHIENLPDSATEDKLIDCLNDFLLPTGIKPQRSKP- 509

Query: 211 DAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPG 270
              ++  IN EK+ AFVE  T E+A+ A++ DG    G  +R+RRP +Y  T+   + P 
Sbjct: 510 --CLSCTINREKRQAFVEFLTPEDATAALSFDGRSLNGSTLRIRRPKEYVETV--NVTPK 565

Query: 271 QPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDR 330
           +P+         L S  +  A+ P ++F+ G+    +   + E++ +FG L  +  + + 
Sbjct: 566 KPA-----EETALISDVV--ADSPHKIFIAGIAGVISSEMLMEIVSAFGPLAAYRFLFNS 618

Query: 331 DTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQ 390
           + G      F  Y D ++T  ACA LNG+ +G   LT      +   +   E     A  
Sbjct: 619 ELGGP--CAFLEYADRSITSKACAGLNGMMLGGCVLTAVHVFPNPPVEAANE-----ASP 671

Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMRE 448
              I + A             SL  E   KVL L      +   L    E EE LED+R 
Sbjct: 672 FYGIPENA------------KSLLKEP-TKVLQLKNTFEREEYMLLSKSELEETLEDVRV 718

Query: 449 ECGKYGTLVNVVI 461
           EC ++G + +V +
Sbjct: 719 ECTRFGAVKSVHV 731


>gi|356536627|ref|XP_003536838.1| PREDICTED: uncharacterized protein LOC100810537 [Glycine max]
          Length = 735

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 114/445 (25%), Positives = 177/445 (39%), Gaps = 115/445 (25%)

Query: 56  DREGIRDHDRTDRHRDYNRDKERR---------HRHRSRSHSS-----DRFRNRSKSLSP 101
           +R+ +  H   D  R  N D +R          H HR    +S        + +S++ + 
Sbjct: 134 ERKELSMHSLKDSSRTKNPDIDRNRVSTNGSSGHHHRHGVSTSGLGGYSPRKRKSEAAAK 193

Query: 102 SRSPSK---SKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAF 158
           + SPSK    K+R+G+D+ PPA    P A V    P    AV     +++   +  L   
Sbjct: 194 TPSPSKHSLEKKRAGWDL-PPAGTNNPSAVVSSSFPVSNCAVLSNMHDVVSTSSLDLALV 252

Query: 159 PLMPV---------------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIG 203
             +PV                V   QATR  RR+Y+  LP  A+E+A+   F+ ++ +  
Sbjct: 253 KPLPVSFPSDVSTGKNTNIDSVQLTQATRPIRRLYLENLPASASEKAVMDCFNNLLLSAR 312

Query: 204 GNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTL 263
            N        +   ++ +K  A VE  T ++AS A++ DG +  G  V++RRP DY   +
Sbjct: 313 VNHIQQAQPCICCILHKDKGQALVEFLTADDASAALSFDGSMLFGSIVKIRRPKDYIELM 372

Query: 264 AAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHG 323
                                   I G      VF G L  Y  ET++            
Sbjct: 373 -----------------------EIAG------VF-GSLKAYHFETKV------------ 390

Query: 324 FDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES 383
                     N+    F  Y D +VT  ACA LNG+K+G + LTV +A            
Sbjct: 391 ----------NNGPCAFLEYVDHSVTIKACAGLNGMKLGGEVLTVLQAMPDAS------- 433

Query: 384 ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL----AKVLCLTEAITADAL--ADDE 437
                          L+ +G  +L  G+    + L     +VL +     AD +    D 
Sbjct: 434 --------------PLENAG-ESLSYGVPEHAKPLLRKPTQVLEINNVFAADTILSLSDM 478

Query: 438 EYEEILEDMREECGKYGTL--VNVV 460
             EEIL+D+R EC ++GT+  +NVV
Sbjct: 479 AIEEILDDVRLECARFGTIKSINVV 503


>gi|357114131|ref|XP_003558854.1| PREDICTED: uncharacterized protein LOC100840355 [Brachypodium
           distachyon]
          Length = 840

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 82/301 (27%), Positives = 133/301 (44%), Gaps = 36/301 (11%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPG-DAVVNVYINHEKK 223
           V   QATR  RR+++  LP  A+E  +    +    +   N         ++  IN EK 
Sbjct: 327 VQLTQATRPLRRLHIENLPSSASEDMLIGCLNDFFLSSDVNHIQKSKQPCLSCTINKEKH 386

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
            AFVE  T E+A+ A++ DG  F G A+++RRP +Y       +      P   +  + L
Sbjct: 387 QAFVEFLTPEDATAALSFDGRSFNGSALKIRRPKEY-------IEMANVVPKKTVEEIKL 439

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
           AS     A+ P ++FV G+    +   + E++ SFG L  +   +D +  + +   F  Y
Sbjct: 440 ASDV---ADSPHKIFVAGISGVISSEMLMEIVSSFGQLAAYRF-QDHEALSGR-CAFLEY 494

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
            D ++TD ACA LNG+K+G                     IL   Q      +   + S 
Sbjct: 495 IDHSITDKACAGLNGMKLGG-------------------CILTAVQVFPNPLEACNEASP 535

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTL--VNV 459
             ++     +  E   +VL L      +   L    E EEI+ED+R EC ++G +  +N+
Sbjct: 536 FYSIPDSAKMLLEAPTEVLQLKNVFDREEYLLLSKSELEEIMEDIRMECARFGAVKSINI 595

Query: 460 V 460
           V
Sbjct: 596 V 596


>gi|297823139|ref|XP_002879452.1| hypothetical protein ARALYDRAFT_321074 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325291|gb|EFH55711.1| hypothetical protein ARALYDRAFT_321074 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 497

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 106/430 (24%), Positives = 176/430 (40%), Gaps = 82/430 (19%)

Query: 96  SKSLSPSRSPSKSKRRSGFDMAPPAAA-MLPGAAVPGQLPGVPSAVPEMAQ-NMLPFGAT 153
           +K++SP  + S  K+ + +D+AP   + M  G    G      +A P +++ +++     
Sbjct: 132 TKAVSPP-NLSSEKKSAKWDLAPTVTSGMFSGPVFSGLQAATQTAYPTISEASLMLLKPL 190

Query: 154 QLGAFPLMPVQVMTQ-------QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
             G F   P + +T        ++TR  RR+Y   +P  A+E+++   F+  M + G N 
Sbjct: 191 MEGTFRTPPPRQITSFDSVQLTESTRPMRRLYAENVPDSASEKSLIECFNGYMLSSGSNH 250

Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
               +  ++  IN EK  A VE  T ++AS A++LDG  F G  +++RRP DY       
Sbjct: 251 IKGSEPCISCIINKEKSQALVEFLTPQDASAALSLDGCSFAGSNLKIRRPKDY------- 303

Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
                         VG                          T + E++  FG L  +  
Sbjct: 304 --------------VG--------------------------TTLMEIVSVFGPLKAYRF 323

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILA 386
           V + D      Y    Y D +VT  ACA LNG+K+G   +T   A     S         
Sbjct: 324 VSNNDLNQQCAY--LEYTDGSVTLKACAGLNGMKLGGSVITAVCAFPDASS--------- 372

Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITAD--ALADDEEYEEILE 444
                +A+ +           G    L G+    +L L   +  +   L  ++E +EIL+
Sbjct: 373 -----VAVNE---NPPFYGIPGHAKPLLGKP-KHILKLKNVVDPEDFTLLSEQEVKEILD 423

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D+R EC ++ T   +   R D +    PG   +F+EY        A ++L GR +    V
Sbjct: 424 DVRLECARWDTDDKMEEER-DPDDLFEPGC--IFIEYGRPEATCDAAHSLHGRLYDNRIV 480

Query: 505 NAFYYPEDKY 514
            A Y  ++ Y
Sbjct: 481 KAEYVSKELY 490


>gi|71996481|ref|NP_497326.2| Protein UAF-1, isoform b [Caenorhabditis elegans]
 gi|351018335|emb|CCD62279.1| Protein UAF-1, isoform b [Caenorhabditis elegans]
          Length = 143

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/161 (37%), Positives = 84/161 (52%), Gaps = 18/161 (11%)

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
           +++GDK L V+ A A+ Q                      L  S     G  +S      
Sbjct: 1   MQLGDKQLVVQLACANQQR-----------------HNTNLPNSASAIAGIDLSQGAGRA 43

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
            ++LCL   +T D L  D+EYEEILED+R+EC KYG + ++ IPRP ++    PGVGKVF
Sbjct: 44  TEILCLMNMVTEDELKADDEYEEILEDVRDECSKYGIVRSLEIPRPYEDHP-VPGVGKVF 102

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           +E+     C  A+ AL+GRKF   TV   YY  DKY N+ +
Sbjct: 103 VEFASTSDCQRAQAALTGRKFANRTVVTSYYDVDKYHNRQF 143


>gi|296088195|emb|CBI35711.3| unnamed protein product [Vitis vinifera]
          Length = 116

 Score =  105 bits (262), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 47/69 (68%), Positives = 55/69 (79%)

Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
           +G LV+VVIPRP  NG   PGVGKVFLEY D  G ++A+NALSGRKFGGN V+A YYPED
Sbjct: 48  HGALVHVVIPRPSPNGDLIPGVGKVFLEYSDTAGSSSARNALSGRKFGGNVVSAVYYPED 107

Query: 513 KYFNKDYSA 521
           KY++ DY A
Sbjct: 108 KYYDGDYGA 116


>gi|359477752|ref|XP_002281833.2| PREDICTED: uncharacterized protein LOC100266510 [Vitis vinifera]
          Length = 895

 Score =  105 bits (262), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 133/498 (26%), Positives = 197/498 (39%), Gaps = 141/498 (28%)

Query: 56  DREGIRDHDRTDRHRDYNRDKERRHRHRSRSHS-----SDRFRNRS---KSLSPS-RSPS 106
           DR G R H   DR+R  N       R    S S     S R R      K+ SP+ RSP 
Sbjct: 295 DRSG-RQHSDADRNRISNNGSSSHFRRHGGSASGLGGYSPRKRRTEAAIKTPSPTNRSPE 353

Query: 107 KSKRRSGFDMAPPAAAMLPGAAVPGQL----PGV-------PSAVPEMAQNMLPFGATQL 155
           K  + +G+D+ P     +   +V   L    P V       PSAVP      +P  AT  
Sbjct: 354 K--KSAGWDLPPSRTDGMNAGSVLSSLQVLKPTVSSNADELPSAVPVA----VPVTATT- 406

Query: 156 GAFPLMPV---------------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMT 200
            A P +P                 +   QATR  RR+YV  LP  ++E+A+    +  + 
Sbjct: 407 -AKPPLPRIYSDAVSKNKNVSIDSIQLTQATRPMRRLYVENLPVSSSEKALMECLNNFLL 465

Query: 201 AIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN 260
           + G N        ++  I+ EK  A VE  T E+AS A++ DGI F G  +++RRP D+ 
Sbjct: 466 SSGINHVQGTPPCISCIIHKEKGQALVEFLTPEDASAALSFDGISFSGSILKIRRPKDF- 524

Query: 261 PTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGT 320
                                                             + E+  +FG 
Sbjct: 525 --------------------------------------------------LMEIAAAFGP 534

Query: 321 LHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
           L  +    + D G  +   F  Y D +VT  ACA LNG+K+G + LTV +A  +      
Sbjct: 535 LKAYRFQVNEDLG--EPCAFLEYVDQSVTLKACAGLNGMKLGGQVLTVVQAIPNA----- 587

Query: 381 QESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL----AKVLCLTEAITADALA-- 434
                           +A++ +G N    G+    + L     +VL L   +  D L+  
Sbjct: 588 ----------------LAMENTG-NLPFYGIPEHAKPLLERPTQVLKLKNVVNPDDLSSL 630

Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA-------VGC 487
            + E EEILED+R EC ++GT+ +V I + + +   T       LE Y+A       +GC
Sbjct: 631 SEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVST-------LEVYEAADNTGSNLGC 683

Query: 488 ATAKNALSGRKFGGNTVN 505
               N++     GG T N
Sbjct: 684 DG--NSMKAETLGGGTDN 699


>gi|303279322|ref|XP_003058954.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460114|gb|EEH57409.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 559

 Score =  105 bits (262), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 73/248 (29%), Positives = 113/248 (45%), Gaps = 33/248 (13%)

Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGG--NSAGPGDAVVNVYINHE 221
           Q    Q TR ARR+Y+GG+PP A    +  F + +M   G   N A  G  VV+V I HE
Sbjct: 149 QTAYAQHTRQARRLYIGGIPPGAINSDVQRFLNDLMLNSGAAINPAA-GPPVVDVKIQHE 207

Query: 222 KKFAFVEMRTVEEASNAMALDGIIF--EGVAVRVRRPTDYNPTLAAAL--------GP-- 269
           K F F E    ++A +A+  DG+++   G  +RV RP DY+P+    +        GP  
Sbjct: 208 KGFGFAEFTNCDDAQSALMFDGVVYGDTGRKIRVNRPRDYDPSKNPVVIRDGLQIEGPKG 267

Query: 270 ---------GQPSPNLNLAAVGLASGAIG--------GAEGPDRVFVGGLPYYFTETQIK 312
                      P P     A+      +           +GP++++VGG     TE Q +
Sbjct: 268 IGLLGEKQANAPPPWPEDLAIPAPPPLVSEWPKLPKRTPDGPNKLYVGGFDPLHTEGQTR 327

Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
           ++L++ G L  F ++ D   G + G+ FC + DP +T +A  AL G     + +  +RA 
Sbjct: 328 QVLQAIGELKSFCVMPDA-RGRNTGHVFCEFADPRLTVVAEEALTGAWCFRQPIVCKRAM 386

Query: 373 ASGQSKTE 380
                  E
Sbjct: 387 PDAAPAKE 394


>gi|189308116|gb|ACD86942.1| UAF-1 [Caenorhabditis brenneri]
          Length = 108

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 51/101 (50%), Positives = 65/101 (64%), Gaps = 1/101 (0%)

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
            +VLCL   +T D L  DE+YEEILED+REEC KYG + ++ IPRP  +    PGVGKVF
Sbjct: 9   TEVLCLMNMVTEDELKSDEDYEEILEDVREECSKYGIVRSLEIPRP-YDEHPVPGVGKVF 67

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           +E+     C  A+ AL+GRKF   TV   YY  DKY N+ +
Sbjct: 68  VEFASTSDCQRAQAALTGRKFANRTVVTSYYDVDKYHNRQF 108


>gi|413956976|gb|AFW89625.1| hypothetical protein ZEAMMB73_282398 [Zea mays]
          Length = 635

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/372 (25%), Positives = 160/372 (43%), Gaps = 45/372 (12%)

Query: 102 SRSPSK-----SKRRSGFDMAPPAA--AMLPGAAVP--GQLPGVPSAVPEMAQNMLPFGA 152
           +++PSK      K+ + +D  P  A  +  P   +P  GQ+   P +   + ++      
Sbjct: 53  TKTPSKVIQSPEKKSATWDQPPVKANQSNFPTTFLPTVGQMAPTPFSF-SVIKDPSTTAV 111

Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA 212
           T L    L    V   QATR  RR+++  LP  A E  +    +  + + G         
Sbjct: 112 TMLAGNSLTADSVQLTQATRPLRRLHIENLPDSATEDKLIDCLNDFLLSTGSKLQR-SKP 170

Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP 272
            ++  IN EK+ AFVE  T E+A+ A++ DG    G  +R+RRP +Y  T+       + 
Sbjct: 171 CLSCTINREKRQAFVEFLTPEDATAAISFDGRSLNGSVLRIRRPKEYVETVNVTPKKAEE 230

Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
           +         L S  +  A+ P ++F+ G+    +   + E++ +FG L  +  + + + 
Sbjct: 231 T--------ALISDVV--ADSPYKIFIAGIAGVISSKMLMEIVSAFGPLAAYRFLFNNEL 280

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
           G      F  Y D +VT  ACA LNG+ +G + LT               ++      H+
Sbjct: 281 GGP--CAFLEYADRSVTSKACAGLNGMMLGGRVLT---------------AVHVFPNPHV 323

Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITAD--ALADDEEYEEILEDMREEC 450
              + A + S    +     L  +   KVL L      +   L    E EE LED+R EC
Sbjct: 324 ---EAANEASPFYGIPDNAKLLLKEPTKVLQLKNVFEREEYMLLSKSELEETLEDVRVEC 380

Query: 451 GKYGTL--VNVV 460
            ++G +  VNVV
Sbjct: 381 TRFGAVKSVNVV 392


>gi|389582230|dbj|GAB64785.1| RNA binding domain [Plasmodium cynomolgi strain B]
          Length = 1046

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 110/495 (22%), Positives = 188/495 (37%), Gaps = 131/495 (26%)

Query: 118  PPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRV 177
            PP   M+    + GQL      + ++ ++ L  G  +L    L P       A + AR +
Sbjct: 557  PPQLNMI----LDGQLKLDSQTIQQLCKSAL--GINELCLSSLDPT------AEKTAREL 604

Query: 178  YVGGLPPLANEQAIATFFSQVMTAIGGNSAGPG--------------------------- 210
            YVG +P   + Q I  F ++ +  +    +G                             
Sbjct: 605  YVGNIPQHIDVQEIVKFLNKCLLILYNKESGSEAENGQGDEQEREKKQEQGNEQEEGSLS 664

Query: 211  -----------------------DAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFE 247
                                   D  +   I  +  +AFVE RT+++ SN M L+GI F 
Sbjct: 665  QSQNQSQNQSLSQSQNQGQGQCEDICLKACIRGDTHYAFVEFRTLQDTSNCMLLNGINFY 724

Query: 248  GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA------EGPDRVFVGG 301
            G  +R+ RP  + P    +L P    P ++  +  L+ G IG        +  +++    
Sbjct: 725  GNNLRIGRPKTF-PAELTSLIPAPTIPTID--SYYLSQGIIGLQAFAVFFQNEEKMKNAY 781

Query: 302  LPYYFTETQ---------------IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
            LP    + Q               IKELLE+FG +  F+  +  D+ +            
Sbjct: 782  LPMSMIKLQKLCVSNISKNNERGKIKELLEAFGEIRNFECFEGDDSTD------------ 829

Query: 347  AVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT 406
              T IA       +   +   +     S + + E E +       +  +K     +G+  
Sbjct: 830  --TYIALVEYTTSENAIQAQKILNQNTSYKIQFEYEILNDPLINRLIKRKYMSSENGI-- 885

Query: 407  LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP---- 462
                  L  +   + + L+   T D L D  EY +I+ED++ EC K+G++V VV+P    
Sbjct: 886  ------LSQQIPTRTVVLSRIATFDELCDPSEYRDIVEDIKIECEKFGSVVEVVLPVFSR 939

Query: 463  ---------------RPDQN--GGETPGVGKVFLEYYDAVGCAT-AKNALSGRKFGGNTV 504
                           + D+     +   +G  F+ Y++ +  AT A+  LSGRKFG N +
Sbjct: 940  ETFDFLLREAAKCAAKEDRTHPNYDLTSIGCAFI-YFETIEAATKARKELSGRKFGANII 998

Query: 505  NAFYYPEDKYFNKDY 519
             A YY E K+  K++
Sbjct: 999  EANYYSEKKFLLKNF 1013


>gi|395334381|gb|EJF66757.1| splicing factor CC1-like protein [Dichomitus squalens LYAD-421 SS1]
          Length = 624

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 152/350 (43%), Gaps = 42/350 (12%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           AR V+V  L      + +  FF      +G N+      V +      K   +VE R+VE
Sbjct: 292 ARSVFVSQLAARLTARDLGYFFED---KLGENTVMDSRIVTDRISRRSKGIGYVEFRSVE 348

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
               A+AL G +  G+ ++++    +       L PG    NLNL         +    G
Sbjct: 349 LVDKAIALSGTVVMGLPIQIQ----HTEAERNRLHPG--DGNLNLPP------GVSAPHG 396

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             +++VG L +  +E+ IK++ E FG L   DL +D  TG SKGY F  Y+      +A 
Sbjct: 397 GMQLYVGSLHFNLSESDIKQVFEPFGELEFVDLHRDPVTGRSKGYAFVQYKRAEDAKMAL 456

Query: 354 AALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGMN 405
             ++G ++  +TL V      G ++ T+Q+S+       L  A +   +QK+A     ++
Sbjct: 457 EQMDGFELAGRTLRVNTVHEKGSARYTQQDSLDEAGGGNLNAASRQALMQKLAR----ID 512

Query: 406 TLGGGMSLFGE-TLAKVLCLTEAITADALADDEEYE-----EILEDMREEC-GKYGTLVN 458
                M       + + +     +  +    +EE E     ++ ED++ EC  KYG ++ 
Sbjct: 513 PTPAKMEPIARPNIPQTMQSRSVLMKNMFNPEEETERDWDKDLAEDVKGECESKYGRVLA 572

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           + + +  Q        G++++++        A   L+GR FGG  + A +
Sbjct: 573 IKVEKESQ--------GEIYVKFETVDAAKNAIEGLNGRWFGGRQITAAF 614


>gi|390604396|gb|EIN13787.1| splicing factor CC1-like protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 433

 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 91/350 (26%), Positives = 158/350 (45%), Gaps = 35/350 (10%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
            AR V+V  L      + +  FF      +G  +      V +      K   +VE++T+
Sbjct: 101 EARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDARIVTDRLSRRSKGIGYVELKTI 157

Query: 233 EEASNAMALDGIIFEGVAVRVRR-PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           E    A+ L G +  G+ ++V+    + N T A   G G    +LNL         + G 
Sbjct: 158 ELVDQAINLSGTVVMGLPIKVQHTEAERNRTHA---GDG----SLNLPP------GVSGT 204

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
            GP +++VG L +  TE+ IK++ E FG L   DL +D  TG SKGY F  Y+      +
Sbjct: 205 HGPRQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYCFIQYKRAEDAKM 264

Query: 352 ACAALNGLKMGDKTLTVRRATASGQSK-TEQESI------LAQAQQHIAIQKMA-LQTSG 403
           A   + G ++  +TL V      G  K T+QES+      L  A +   +QK+A ++ + 
Sbjct: 265 ALEQMEGFELAGRTLRVNTVHEKGTVKYTQQESLEENGGNLNAASRQALMQKLARIEPAR 324

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEIL-EDMREEC-GKYGTLVNVVI 461
                    +  +TL     L + +   A   ++++++ L +D++ EC  KYG +  + +
Sbjct: 325 APVETVSKPVITQTLQSKSVLLKNMFDPAEETEKDWDKDLADDVKVECENKYGMVNFIKV 384

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
            +  Q        G++++++        A   L+GR FGG  V A + P+
Sbjct: 385 DKESQ--------GEIYVKFDTVDSAKKAIEGLNGRYFGGRQVTATFIPD 426


>gi|393218616|gb|EJD04104.1| splicing factor, CC1-like protein [Fomitiporia mediterranea MF3/22]
          Length = 464

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 94/351 (26%), Positives = 154/351 (43%), Gaps = 44/351 (12%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
            AR V+V  L      + +  FF      +G NS      V +      K  A+VE  ++
Sbjct: 133 EARSVFVSQLAARMTARDLGYFFED---KLGDNSVLDVRIVTDRISRRSKGIAYVEFGSI 189

Query: 233 EEASNAMALDGIIFEGVAVRVRR-PTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           E    A++L G I  G+ + ++    + N T A     G  S NL   A G   GA    
Sbjct: 190 ELVDKAISLTGTIVMGLPIMIQHTEAERNKTHA-----GDGSINLPPGASG--RGAT--- 239

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
                ++VG L +  TE+ IK++ E FG L   DL KD  TG SKGY F  Y+      +
Sbjct: 240 -----LYVGSLHFNLTESDIKQVFEPFGELDFVDLHKDSATGRSKGYAFIHYKRAEDAKM 294

Query: 352 ACAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSG 403
           A   + G ++  +TL V      GQ++ + Q+S+       L  A +   +QK+A   S 
Sbjct: 295 ALEQMEGFELAGRTLRVNTVHEKGQTRISTQDSLDESGGGNLNAASRQALMQKLARIDSA 354

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREEC-GKYGTLV 457
             T      +   T+A+ +     +  +    +EE E     ++ ED++ EC  KYG + 
Sbjct: 355 PVT---QQPIMKPTVAQPMTSKSVLMRNMFDPEEETEPAWDKDLAEDVKTECQAKYGRVQ 411

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           ++ +        E    G++++++        A N L+GR FGG  ++A +
Sbjct: 412 HIKV--------EKDSEGEIYVQFDTVDAAKAAINGLNGRWFGGKQISATF 454


>gi|108706081|gb|ABF93876.1| RNA recognition motif family protein, expressed [Oryza sativa
           Japonica Group]
          Length = 704

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 106/204 (51%), Gaps = 12/204 (5%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
           V   QATR  RR+++  LP LA E  +    ++ ++++   +        ++  IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
            AFVE  T E+A+ A++ DG  F G ++++RRP +Y     A + P +PS  + L +  +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
                  A+ P ++F+ G+    +   + E++ SFG L  +  + +   G +    F  Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGA--CAFLEY 632

Query: 344 QDPAVTDIACAALNGLKMGDKTLT 367
            D ++T  ACA LNG+K+G   LT
Sbjct: 633 IDHSITSKACAGLNGMKLGGGILT 656


>gi|312085420|ref|XP_003144672.1| U2af splicing factor protein 1 [Loa loa]
          Length = 143

 Score =  102 bits (253), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 60/162 (37%), Positives = 88/162 (54%), Gaps = 20/162 (12%)

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN-TLGGGMSLFGET 417
           +++GDK L V+ + A+ ++   Q +               +Q +G++ + G G       
Sbjct: 1   MQLGDKNLVVQLSCANARNNVAQNTF------------PQIQVAGIDLSHGAGPP----- 43

Query: 418 LAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKV 477
             +VLCL   +T D L DDEEYE+ILED+REEC KYG + ++ IPR    G +  GVGKV
Sbjct: 44  -TEVLCLMNMVTEDELKDDEEYEDILEDIREECAKYGIVKSLEIPR-SVPGVDVTGVGKV 101

Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           F+E+     C  A+ AL+GRKF   TV   YY  D Y  + +
Sbjct: 102 FVEFNSKQECQKAQAALTGRKFANRTVVTSYYDPDMYHRRQF 143


>gi|108706082|gb|ABF93877.1| RNA recognition motif family protein, expressed [Oryza sativa
           Japonica Group]
          Length = 720

 Score =  102 bits (253), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 106/204 (51%), Gaps = 12/204 (5%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
           V   QATR  RR+++  LP LA E  +    ++ ++++   +        ++  IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
            AFVE  T E+A+ A++ DG  F G ++++RRP +Y     A + P +PS  + L +  +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
                  A+ P ++F+ G+    +   + E++ SFG L  +  + +   G +    F  Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGA--CAFLEY 632

Query: 344 QDPAVTDIACAALNGLKMGDKTLT 367
            D ++T  ACA LNG+K+G   LT
Sbjct: 633 IDHSITSKACAGLNGMKLGGGILT 656


>gi|82794077|ref|XP_728296.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23484570|gb|EAA19861.1| KED [Plasmodium yoelii yoelii]
          Length = 858

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 81/334 (24%), Positives = 145/334 (43%), Gaps = 50/334 (14%)

Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
           FG  +LG      +  +   A + AR +YVG +P   + Q I  F +  +  +       
Sbjct: 396 FGIPELG------LSTIDANAEKTARELYVGNIPQNIDIQEIVKFLNTCLLILYNKENEN 449

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRR----PTDYNPTLAA 265
            +  +   I  + ++AFVE R++++ SN M L+GI F    +R+ R    P +Y   +  
Sbjct: 450 ENICLKACIRGDTRYAFVEFRSLQDTSNCMLLNGIYFYTNNLRIGRPKTFPIEYTKLIPP 509

Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPD----------------RVFVGGLPYYFTET 309
           A  P   +  L+   VG+ + AI      +                ++ V  +       
Sbjct: 510 ATIPTIDTYYLSQGLVGIKAFAIFHQNKDENKNEYHHLPVDMIKLQKLCVSNISKNNETN 569

Query: 310 QIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVR 369
           +IKELLE+FG +  F+  +  +  NS  Y            I     N ++   +   + 
Sbjct: 570 KIKELLEAFGDIQTFEFFEGEE--NSDTY------------ICLVEYNNIENAIQAHKIL 615

Query: 370 RATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL-AKVLCLTEAI 428
               S + + E E IL     +  ++K  +Q           S+  + +  KV+ L++  
Sbjct: 616 NQNTSYKIQFEYE-ILNDPTINQLVKKKYMQNKN--------SILSQQIPTKVIVLSKIA 666

Query: 429 TADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
           T D L++ E+Y+EI ED++ EC KYG+++ VV+P
Sbjct: 667 TFDELSNPEDYKEISEDIKIECEKYGSVIEVVLP 700



 Score = 39.3 bits (90), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 25/47 (53%)

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +G  F+ +    G    +  LSGRKFG N + A YY E K+  K++
Sbjct: 779 SIGCAFIYFETIEGATKTRKELSGRKFGANIIEANYYSEKKFIMKNF 825


>gi|170083917|ref|XP_001873182.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164650734|gb|EDR14974.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 448

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/350 (26%), Positives = 152/350 (43%), Gaps = 41/350 (11%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
            AR V+V  L      + +  FF      +G  +      V +      K   +VE RT+
Sbjct: 116 EARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDSRIVTDRLSRRSKGIGYVEFRTI 172

Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
           +    A+AL G +  G+ + V+       +    L PG  S NL    V  + GAI    
Sbjct: 173 DHVEKALALSGTVVMGLPIMVQ----LTESERNKLHPGDGSLNLP-PGVTASHGAI---- 223

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
               ++VG L +  TE+ IK++ E FG L   DL +D  TG SKGY F  Y+      +A
Sbjct: 224 ----LYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRSEDARMA 279

Query: 353 CAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGM 404
              + G ++  +TL V      G ++ T+Q+S+       L  A +   +QK+A +T   
Sbjct: 280 LEQMEGFELAGRTLRVNTVHEKGTARYTQQDSLDEAGGGNLNAASRQALMQKLA-RTEAP 338

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREEC-GKYGTLVN 458
            T    ++     + + +     +  +    +EE E     E+ +D++ EC  KYG +  
Sbjct: 339 PTFTEPVAR--PNIPQAMQSRSVLLKNMFDPEEETERDWDKELADDVKVECENKYGKVEA 396

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           + + R  Q        G+++L++        A   L+GR FGG  V+A +
Sbjct: 397 IKVERETQ--------GEIYLKFDSIESAKQAIQGLNGRWFGGRQVSAAF 438


>gi|209881578|ref|XP_002142227.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
 gi|209557833|gb|EEA07878.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
          Length = 533

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 127/500 (25%), Positives = 203/500 (40%), Gaps = 93/500 (18%)

Query: 76  KERRH--RHRSRSHSSDRF----RNRSKSLSPSRS--PSKSKRRSGFDMAPPAAAMLPGA 127
           KE  H  R RSRS S+D       N  +S+SP RS   S+ KRR G            G 
Sbjct: 69  KEIYHIPRRRSRSPSTDNNDKEESNTGRSISPLRSDDSSRKKRRRGSS----------GW 118

Query: 128 AVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLAN 187
             P   P V   +P M Q  +      +G+  + P QV TQ A     R+YVG L    +
Sbjct: 119 DAPFN-PDVSQLMPNMKQANV---GQMMGSSNIAPRQV-TQGA-----RIYVGSLDYSLS 168

Query: 188 EQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMA-LDGIIF 246
           E  +   F    T +  +    G        N  K F F+E  T E A  A+A ++  + 
Sbjct: 169 EADLRQVFGSFGTIVNIDMPREG--------NRSKGFCFIEYTTQESAEMALATMNRFVL 220

Query: 247 EGVAVRVRRPTD----YNPTLAAALG--------PGQPSPNLNLAAVGLASGAIGGAEGP 294
           +G  ++V RPT+     N     ++G        P  P  N N     +    I      
Sbjct: 221 KGRPIKVGRPTNAIVSNNQNNNNSMGNHTGMVGMPVLPPENTN---ANIPPHQIPQNPPQ 277

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT-GNSKGYGFCVYQDPAVTDIAC 353
           +R+++G +PY FT   ++ + ++FG +    L+   +  G  +GYGF  +  P    +A 
Sbjct: 278 NRIYIGSVPYSFTPDDLRHIFKAFGVILSCQLIPSVEKPGTHRGYGFIEFGTPDQAKLAI 337

Query: 354 AALNGLKMGDKTLTVRRATA---SGQSKTEQESILAQAQQHIAIQKM--ALQTSGMNTLG 408
             +NG ++G K L V  ATA   S    + Q  I++   Q++  Q++   L       + 
Sbjct: 338 ETMNGFEVGGKQLKVNVATALKPSNSISSNQIPIVSPTLQNVMSQQIPPTLAIPPTMAIP 397

Query: 409 GGMSLFGETL-----------------------------AKVLCLTEAITADALADDEEY 439
             +S+   T                              + V+ LT  I  + + D    
Sbjct: 398 PVLSMPNVTPLPPNLYQPPNIPVPYPANSYPIIPNSTSNSNVILLTNMIGPEEVDD---- 453

Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
            E+ E+++ EC KYG + +V I   D     +  V ++F+ +        A  AL+ R F
Sbjct: 454 -ELKEEVKIECSKYGKVYDVRIHISDHVSKPSDRV-RIFVVFETNTMAQIAVPALNNRWF 511

Query: 500 GGNTVNAFYYPEDKYFNKDY 519
           GGN V    Y  +++++  Y
Sbjct: 512 GGNQVYCRLYNTERFYSSFY 531


>gi|124511860|ref|XP_001349063.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
 gi|23498831|emb|CAD50908.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
          Length = 1125

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 84/343 (24%), Positives = 146/343 (42%), Gaps = 50/343 (14%)

Query: 139 AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQV 198
           ++  + +N+L      L   P + V +      + AR +YVG +P   + Q I  + +  
Sbjct: 623 SLQNLYKNVLNINDLNL---PTIDVNI-----EKTARELYVGNIPQHIDIQEIVKYLNSC 674

Query: 199 MTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP-- 256
           +  +        +  +   I  +  +AFVE R +++ SN M L+GI F G  +R+ RP  
Sbjct: 675 LLILYNKENENENICLKACIRGDTHYAFVEFRNIQDTSNCMLLNGINFYGNNLRIGRPKT 734

Query: 257 --TDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQ---- 310
              +Y+  +  A  P   +  L+   +GL S  I   +  +++   GLP    + Q    
Sbjct: 735 FPIEYHSLIPQATIPAIDNYYLSQGLIGLRSFIIF-CKNEEKMKNDGLPVNMIKLQKLCV 793

Query: 311 -----------IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
                      IKELLE+FG +  F+            YG     + + T I+       
Sbjct: 794 SNISKNNDTSKIKELLEAFGEIKNFEFF----------YG----DETSDTYISLVEYVNT 839

Query: 360 KMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA 419
           +   +   +     S + + E E I      HI      ++   M T    +SL  +   
Sbjct: 840 ENAIQAHKILNQNTSYKIQFEHEII---NDPHIN---NIIKNKYMKTENSILSL--QVPT 891

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIP 462
           KV+ L +  T + L+D  EY++I+ED++ EC KYG  + VV+P
Sbjct: 892 KVIVLNKIATFEELSDSSEYKDIVEDIKIECDKYGKTLEVVLP 934



 Score = 39.3 bits (90), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 16/47 (34%), Positives = 26/47 (55%)

Query: 473  GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
             +G  F+ + +      A+  LSGRKFG N + A Y+ E K+  K++
Sbjct: 1046 SIGCAFIHFENIESATKARKELSGRKFGANIIEANYFSEKKFLMKNF 1092


>gi|403412344|emb|CCL99044.1| predicted protein [Fibroporia radiculosa]
          Length = 599

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 87/349 (24%), Positives = 145/349 (41%), Gaps = 40/349 (11%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           AR V+V  L      + +  FF      +G  S      V +      K   +VE R+VE
Sbjct: 267 ARSVFVSQLAARLTARDLGYFFED---KLGEGSVMDSRIVTDRISRRSKGIGYVEFRSVE 323

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
               A+ L G +  G+ ++++    +       L PG    NLNL         +    G
Sbjct: 324 LVDKALGLSGTVVMGLPIQIQ----HTEAERNRLHPG--DGNLNLPP------GVSAPHG 371

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             +++VG L +  TE+ IK++ E FG L   DL +D  TG SKGY F  Y+      +A 
Sbjct: 372 GMQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRAEDARMAL 431

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQ--------AQQHIAIQKMALQTSGMN 405
             + G ++  +TL V      G +K  Q+  L +        A +   +QK+A       
Sbjct: 432 EQMEGFELAGRTLRVNTVHEKGTTKYAQQDSLDEAGGGNLNAASRQALMQKLARTDQPAV 491

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREEC-GKYGTLVNV 459
            L     +    + + +     +  +    +EE E     ++ ED++ EC  KYG +  +
Sbjct: 492 KLP---PVTKPNIPQSMQSRSVLLKNMFNPEEETERDWDKDLAEDVKGECEDKYGKVEFI 548

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
            + R  Q        G++++++        A   L GR FGGN V+A +
Sbjct: 549 KVERESQ--------GEIYVKFDSIESAKNAIQGLHGRWFGGNQVSAAF 589


>gi|449551106|gb|EMD42070.1| hypothetical protein CERSUDRAFT_90674 [Ceriporiopsis subvermispora
           B]
          Length = 623

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 93/379 (24%), Positives = 166/379 (43%), Gaps = 49/379 (12%)

Query: 159 PLMPVQVMTQQATRH----ARRVYVGGLPPLANEQAIATFF-----------SQVMT-AI 202
           P++ V  M  +  R     AR V+V  L      + +  FF           S+++T  I
Sbjct: 255 PIVDVDPMNPEEPREDDSEARSVFVSQLAARLTARDLGYFFEDKLGEGSVMDSRIVTDRI 314

Query: 203 GGNSAGPGDAV--VNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYN 260
              S G    +  +N  +    +  +VE RTVE    A+AL G +  G+ ++++    + 
Sbjct: 315 SRRSKGLLLIISRINTSLTFCLRIGYVEFRTVELVDKAIALSGTVVMGLPIQIQ----HT 370

Query: 261 PTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGT 320
                 L PG    NLNL         +  + G  +++VG L +  TE+ IK++ E FG 
Sbjct: 371 EAERNRLHPG--DGNLNLPP------GVSASHGGMQLYVGSLHFNLTESDIKQVFEPFGE 422

Query: 321 LHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK-T 379
           L   DL +D  TG SKGY F  Y+      +A   + G ++  +TL V      G  + T
Sbjct: 423 LEFVDLHRDPMTGRSKGYAFVQYKRSEDARMALEQMEGFELAGRTLRVNTVHEKGTIRYT 482

Query: 380 EQESI-------LAQAQQHIAIQKMALQTSGMNTLGGGM--SLFGETLAKVLCLTEAITA 430
           +Q+S+       L  A +   +QK+A     + T    +  ++     ++ + L      
Sbjct: 483 QQDSLDEAGGGNLNAASRQALMQKLARTDQTVITPPPVVRPNIPQTMQSRSVLLKNMFNP 542

Query: 431 DALADDEEYEEILEDMREEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
           +   + +  +++ ED++ EC  KYG +  + + +  Q        G++++++        
Sbjct: 543 ENETERDWDKDLAEDVKYECEDKYGKVEFIKVEKDSQ--------GEIYVKFDSVESAKN 594

Query: 490 AKNALSGRKFGGNTVNAFY 508
           A   L+GR FGGN V+A +
Sbjct: 595 AIQGLNGRWFGGNQVSAGF 613


>gi|432090458|gb|ELK23883.1| Splicing factor U2AF 65 kDa subunit [Myotis davidii]
          Length = 423

 Score = 98.6 bits (244), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 46/84 (54%), Positives = 59/84 (70%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  VTD A A 
Sbjct: 263 KLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAG 322

Query: 356 LNGLKMGDKTLTVRRATASGQSKT 379
           LNG+++GDK L V+RA+   ++ T
Sbjct: 323 LNGMQLGDKKLLVQRASVGAKNAT 346



 Score = 58.5 bits (140), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 38/108 (35%), Positives = 60/108 (55%), Gaps = 17/108 (15%)

Query: 194 FFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRV 253
           FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+E + AMA       G ++++
Sbjct: 4   FFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMA-------GQSLKI 55

Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGG 301
           RRP DY P       PG  S N ++   G+ S  +   +   ++F+GG
Sbjct: 56  RRPHDYQPL------PGM-SENPSVYVPGVVSTVV--PDSAHKLFIGG 94



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 450 CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           C KYG + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF    V
Sbjct: 362 CSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAVKGLTGRKFANRVV 415


>gi|68076889|ref|XP_680364.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56501286|emb|CAI04220.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 652

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 146/332 (43%), Gaps = 48/332 (14%)

Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
           FG ++LG      +  +   A + AR +YVG +P   + Q I  F +  +  +       
Sbjct: 195 FGISELG------LSTIDANAEKTARELYVGNIPQNIDIQEIVKFLNTCLLILYNKENEN 248

Query: 210 GDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRP----TDYNPTLAA 265
            +  +   I  + ++AFVE R++++ SN M L+GI F    +R+ RP     +Y   +  
Sbjct: 249 ENICLKACIRGDTRYAFVEFRSLQDTSNCMLLNGIYFYTNNLRIGRPKTFPIEYTKLIPP 308

Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPD--------------RVFVGGLPYYFTETQI 311
           A  P   +  L+   +G+ + AI      +              ++ V  +       +I
Sbjct: 309 ATIPTIDTYYLSQGLIGIKAFAIFHQNKDETKNEYIPVDMIKLQKLCVSNISKNNETNKI 368

Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
           KELLE+FG + GF+  +  +  NS  Y            I     N ++   +   +   
Sbjct: 369 KELLEAFGEIQGFEFFEGEE--NSDTY------------ICLVEYNNVENAIQAHKILNQ 414

Query: 372 TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL-AKVLCLTEAITA 430
             S + + E E IL     +  ++K  +Q           S+  + +  KV+ L++  T 
Sbjct: 415 NTSYKIQFEYE-ILNDPIINQLVKKKYMQNKN--------SILSQQIPTKVIVLSKIATF 465

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIP 462
           + L++ E+Y+EI ED++ EC KYG ++ VV+P
Sbjct: 466 EELSNPEDYKEISEDIKIECEKYGPVLEVVLP 497


>gi|313244755|emb|CBY15469.1| unnamed protein product [Oikopleura dioica]
          Length = 2588

 Score = 98.2 bits (243), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 110/412 (26%), Positives = 169/412 (41%), Gaps = 85/412 (20%)

Query: 164  QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY---INH 220
            Q+  Q+A     ++YVG +     E  I   F          + GP  ++   Y    N 
Sbjct: 2208 QLQRQRAVAIMCKIYVGSIYYEIGEATIRQSFE---------TFGPVRSIDMSYDQGTNR 2258

Query: 221  EKKFAFVEMRTVEEASNAMA-LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
             K F F+E    E A  A+  +  I   G AV+V R ++           GQ +    +A
Sbjct: 2259 HKGFCFLEFECPEAAFLALEHMQSITIGGRAVKVGRLSNI----------GQVAAQHFIA 2308

Query: 280  AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
              G        A    RV++  +     +T IK + ESFG +    LVK+ DTG  K YG
Sbjct: 2309 QFG------NEAAKYHRVYIANIHVNIVDTDIKAVFESFGRVLSCQLVKNVDTGRHKNYG 2362

Query: 340  FCVYQDPAVTDIACAALNGLKMGDKTLTV------------------RRATASGQSKTEQ 381
            F  Y +      A +A+NG  +G + + V                    +TA   +K  Q
Sbjct: 2363 FVEYDNSQSMKEAISAMNGFDLGGQCIRVGPCVVPPSMHNIPTVAPGNASTALSGAKAVQ 2422

Query: 382  ESILAQAQQHIAIQK-----------MALQT--------------SGMNTLGGG------ 410
            E +L + ++   + +           +A++               SG   + GG      
Sbjct: 2423 E-MLKKKKKDSGVNRPSPPKDSMADVLAIREELNKKARNPTCDDDSGQMKITGGAQRNMI 2481

Query: 411  MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP-DQNGG 469
            M       + V+ L   ++AD L  DEE E    ++ +EC +YG ++ VVI +  D+   
Sbjct: 2482 MRKLMTRRSNVVVLKNMLSADDL--DEEVES---EVTQECSQYGNVLRVVIYQEVDRLAP 2536

Query: 470  ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
                + KVF+++ DA G  TAK  LSGR F G  +NA  Y E  +  KDYSA
Sbjct: 2537 GCEPIVKVFVQFTDADGAETAKKELSGRFFAGRKINAQSYDETAFEMKDYSA 2588


>gi|392597434|gb|EIW86756.1| splicing factor CC1-like protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 360

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 151/350 (43%), Gaps = 41/350 (11%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
            AR V+V  L      + +  FF      +G  +      V +      K   +VE R++
Sbjct: 28  EARSVFVSQLAARLTARDLGYFFED---KLGEGTVLDSRIVTDRISRRSKGIGYVEFRSI 84

Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
           +    A+ L G +  G+ + V+       +    L PG    NLNL         +    
Sbjct: 85  DLVEKALGLSGTVVMGLPIMVQ----LTESERNRLHPG--DGNLNLPP------GVHAPH 132

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
           G  +++VG L +  TE  IK++ E FG L   DL +D  TG SKGY F  Y+ P    +A
Sbjct: 133 GAMQLYVGSLHFNLTEADIKQVFEPFGDLEFVDLHRDSTTGRSKGYAFVQYKRPEDAKMA 192

Query: 353 CAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGM 404
              ++G ++  +TL V      G ++ T+Q+S+       L  A +   +QK+A     +
Sbjct: 193 LEQMDGFELAGRTLRVNTVHEKGTARYTQQDSLEETGGGNLNAASRQALMQKLA----RI 248

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREECG-KYGTLVN 458
            T      +   ++ + +     +  +    +EE E     E+ ED++ EC  KYG +  
Sbjct: 249 ETPTPAEPVSRPSIPQAMQSRSVLMKNMFDPEEETERDWDKELAEDVKGECQEKYGKVEA 308

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           + + +  Q        G++++++        A  AL+GR FGG  ++A +
Sbjct: 309 IKVEKETQ--------GEIYVKFATIDSAKEAVQALNGRWFGGRQISAVF 350


>gi|336389603|gb|EGO30746.1| hypothetical protein SERLADRAFT_455043 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 583

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 87/351 (24%), Positives = 143/351 (40%), Gaps = 45/351 (12%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           AR V+V  L      + +  FF      +G  +      V +      K   +VE R+++
Sbjct: 252 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDSRIVTDRLSRRSKGIGYVEFRSID 308

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
               A++L G +  G+ V V+       +    L PG    NLNL         +    G
Sbjct: 309 MVEKAISLSGTVVMGLPVMVQ----LTESERNKLHPG--DGNLNLPP------GVSAPHG 356

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             +++VG L +  TE+ IK++ E FG L   DL +D  TG SKGY F  Y+      +A 
Sbjct: 357 AMQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRAEDARMAL 416

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQ--------AQQHIAIQKMAL------ 399
             + G ++  +TL V      G ++  Q+  L +        A +   +QK+A       
Sbjct: 417 EQMEGFELAGRTLRVNTVHEKGTARYAQQDTLDEAGGGNLNAASRQALMQKLARIEPIPK 476

Query: 400 -QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLV 457
             T+   T+   M      L  +    E    D   D      + ED++ EC  KYG + 
Sbjct: 477 PPTNNKPTIPQAMQSRSVLLKNMFDPEEETERDWDKD------LAEDVKGECEDKYGQVD 530

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
            + + +  Q        GK+++++        A   L+GR FGG  V+A +
Sbjct: 531 AIKVEQETQ--------GKIYVKFNSIDSAKNAIQGLNGRWFGGRQVSAGF 573


>gi|85001331|ref|XP_955384.1| RNA splicing factor [Theileria annulata strain Ankara]
 gi|65303530|emb|CAI75908.1| RNA splicing factor, putative [Theileria annulata]
          Length = 643

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 57/314 (18%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE  T E    A+++ G+  +G  +RV   +      AA             AA 
Sbjct: 359 KGIAYVEFYTQESVIKALSMTGMSMKGQGIRVH-SSQAEKNRAAK------------AAK 405

Query: 282 GLASGAIGGAEGPDRVFVG---GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            L   A+  ++ P  + V    G+  Y  E ++ +L   FG +    L +  D G SKGY
Sbjct: 406 QLQDNALKESDNPTTIVVSNLLGVLSYLNEIELNQLFSPFGNIIDVALAR-TDNGESKGY 464

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESI-------------- 384
            +  ++       A   +NG  +  + + V  A      K+   S+              
Sbjct: 465 AYIRFKRWNEAKEALNVMNGFDINGQQIKVAYANTRKDPKSRLHSLGDLDMERLDDDDAG 524

Query: 385 -LAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEY-EEI 442
            ++ +   IA+ K   Q   +N+               L L+   T+   AD+ E+ +EI
Sbjct: 525 LISGSNVKIALMKKLQQRQPLNSSN-------------LVLSNMYTSADYADNHEFFDEI 571

Query: 443 LEDMREECGKYGTLVNVVIPR--PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
            ED++EECGKYGT+V V + R  PD         GKV++++ +     +A  +L GR F 
Sbjct: 572 EEDVKEECGKYGTVVQVFVNRRNPD---------GKVYVKFKNNDDAQSANKSLQGRYFA 622

Query: 501 GNTVNAFYYPEDKY 514
           GNT+   Y  +D+Y
Sbjct: 623 GNTIQVSYISDDQY 636


>gi|5822501|pdb|2U2F|A Chain A, Solution Structure Of The Second Rna-Binding Domain Of
           Hu2af65
          Length = 85

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 45/82 (54%), Positives = 58/82 (70%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY FC Y D  VTD A A 
Sbjct: 3   KLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAG 62

Query: 356 LNGLKMGDKTLTVRRATASGQS 377
           LNG+++GDK L V+RA+   ++
Sbjct: 63  LNGMQLGDKKLLVQRASVGAKN 84


>gi|402217675|gb|EJT97754.1| splicing factor CC1-like protein [Dacryopinax sp. DJM-731 SS1]
          Length = 640

 Score = 95.5 bits (236), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 87/350 (24%), Positives = 151/350 (43%), Gaps = 38/350 (10%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
           R V+V  L      + +  FF +    +G  S      V +      K  A+VE+ +++ 
Sbjct: 303 RSVFVSQLAARLTARDLGYFFEE---KLGEGSVRDVRIVTDRVSRRSKGIAYVELSSIDM 359

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
            S A+AL G I  G+ + V+  T+      AA G          +++ L  G      G 
Sbjct: 360 VSRAIALTGTIVMGLPIMVQL-TESERNKVAASG----------SSMHLPPGVTAPPPGS 408

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
            +++VG L +  TE+ +K++ E FG L   DL +D  TG SKG+ F  Y+      +A  
Sbjct: 409 MQLYVGSLHFNLTESDVKQVFEPFGELEFVDLHRDPLTGRSKGFAFVQYKRSEDARMALQ 468

Query: 355 ALNGLKMGDKTLTVRRATASG-------QSKTEQES---ILAQAQQHIAIQKMALQTSGM 404
           +++G  +  + L V      G       QS +  ES    L  A +   +QK+A      
Sbjct: 469 SMDGFDLAGRQLKVNTVHEKGGAIRYQSQSDSLDESGGGNLNAASRQALMQKLARIEPPK 528

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADALADDEE-----YEEILEDMREEC-GKYGTLVN 458
             +    SL    +     L   +  +   +++E      +E+ +D+++EC  KYG LV+
Sbjct: 529 PAISPMASLPKAAMQSRSVLLRNMFKEPELEEKENGPNWAKELTDDVKQECEDKYG-LVD 587

Query: 459 VVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
            +   PD         G+++L++      + A   L+GR FGG  + A +
Sbjct: 588 FIKLEPDSQ-------GEMYLKFKSIEAASKAIEGLNGRYFGGQPIQATF 630


>gi|336376609|gb|EGO04944.1| hypothetical protein SERLA73DRAFT_174031 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 583

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 86/351 (24%), Positives = 143/351 (40%), Gaps = 45/351 (12%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           AR V+V  L      + +  FF      +G  +      V +      K   +VE R+++
Sbjct: 252 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDSRIVTDRLSRRSKGIGYVEFRSID 308

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
               A++L G +  G+ V V+       +    L PG    NLNL         +    G
Sbjct: 309 MVEKAISLSGTVVMGLPVMVQ----LTESERNKLHPG--DGNLNLPP------GVSAPHG 356

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             +++VG L +  TE+ IK++ E FG L   DL +D  TG SKGY F  Y+      +A 
Sbjct: 357 AMQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRAEDARMAL 416

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQ--------AQQHIAIQKMAL------ 399
             + G ++  +TL V      G ++  Q+  L +        A +   +QK+A       
Sbjct: 417 EQMEGFELAGRTLRVNTVHEKGTARYAQQDTLDEAGGGNLNAASRQALMQKLARIEPIPK 476

Query: 400 -QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLV 457
             T+   T+   M      L  +    E    D   D      + ED++ EC  KYG + 
Sbjct: 477 PPTNNKPTIPQAMQSRSVLLKNMFDPEEETERDWDKD------LAEDVKGECEDKYGQVD 530

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
            + + +  Q        G++++++        A   L+GR FGG  V+A +
Sbjct: 531 AIKVEQETQ--------GEIYVKFNSIDSAKNAIQGLNGRWFGGRQVSAGF 573


>gi|409083550|gb|EKM83907.1| hypothetical protein AGABI1DRAFT_110515 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 563

 Score = 95.1 bits (235), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 89/347 (25%), Positives = 149/347 (42%), Gaps = 37/347 (10%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           AR V+V  L      + +  FF      +G  +      V +      K   +VE RT+E
Sbjct: 232 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDARIVTDRLSRRSKGIGYVEFRTIE 288

Query: 234 EASNAMALDGIIFEGVAVRVR-RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
               A+ L G +  G+ + V+    + N T A     G  S NL    V    GAI    
Sbjct: 289 LVEKAIGLSGTVVMGLPIMVQLTEAERNKTHA-----GDGSINLP-PGVSAPHGAI---- 338

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
               ++VG L +  TE+ IK++ E FG L   DL +D  TG SKGY F  Y+      +A
Sbjct: 339 ----LYVGSLHFNLTESDIKQVFEVFGELEFVDLHRDAMTGRSKGYAFVQYKRAEDARMA 394

Query: 353 CAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMAL--QTS 402
              + G ++  +TL V      G +K T+Q+S+       L  A +   +QK+A   Q +
Sbjct: 395 LQQMEGFELAGRTLRVNTVHEKGTTKYTQQDSLDESGGGNLNAASRQALMQKLARTDQPA 454

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLVNVVI 461
                    ++     ++ + L      D   + +   E+ +D++ EC  KYG ++ + +
Sbjct: 455 PRPEPVQRPNIPQAMQSRSVLLKNMFDPDEETEKDWDRELAQDVKGECESKYGKVLAIKV 514

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
            +  Q        G++++++        A   L+GR FGG  V+A +
Sbjct: 515 EKDSQ--------GEIYVKFDSIDYAQKAIQGLNGRWFGGRQVSAVF 553


>gi|47223170|emb|CAG11305.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 515

 Score = 94.7 bits (234), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 72/267 (26%), Positives = 113/267 (42%), Gaps = 47/267 (17%)

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G+ GP R++VG L +  TE  ++ + E FG + G  L+ D +TG SKGYGF  + D    
Sbjct: 239 GSSGPMRLYVGSLHFNITEEMLRGIFEPFGKIEGIQLMMDSETGRSKGYGFISFADAECA 298

Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQE--------------------SILAQAQ 389
             A   LNG ++  + + V   T    S T                        ++A+  
Sbjct: 299 KKALEQLNGFELAGRPMKVGHVTERSDSSTASSILDNDELERTGIDLGTTGRLQLMARLA 358

Query: 390 QHIAIQ-----KMALQTSG---MNTLGG-----------GMSLFGETLAK-VLCLTEAIT 429
           +   ++     + ALQ +G     T+GG            ++L  + LA   L L+    
Sbjct: 359 EGTGLKIPPAAQQALQMTGSMSFPTIGGPPAVPTPSPSQALNLPAQPLATHCLQLSNLFN 418

Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
             A  D     EI +D+ EEC K+G +V++ +   D+N  +    G V+++         
Sbjct: 419 PQAENDPSWAVEIQDDVIEECNKHGGVVHIYV---DKNSAQ----GNVYVKCPSIPAAMA 471

Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFN 516
             NAL GR F G  + A Y P   Y N
Sbjct: 472 TVNALHGRWFAGKMITAAYVPLPTYHN 498


>gi|392571432|gb|EIW64604.1| splicing factor CC1-like protein [Trametes versicolor FP-101664
           SS1]
          Length = 344

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 91/349 (26%), Positives = 149/349 (42%), Gaps = 40/349 (11%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           AR V+V  L      + +  FF      +G  S      V +      K   +VE RTVE
Sbjct: 12  ARSVFVSQLAARLTARDLGYFFED---KLGEGSVMDSRIVTDRISRRSKGIGYVEFRTVE 68

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
               A+ L G +  G+ ++++    +       L PG    NLNL         +    G
Sbjct: 69  LVDRAIGLSGTVVMGLPIQIQ----HTEAERNRLHPG--DGNLNLPP------GVSAPHG 116

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
             +++VG L +  TE+ IK++ E FG L   DL +D  TG SKGY F  Y+      +A 
Sbjct: 117 GMQLYVGSLHFNLTESDIKQVFEPFGELEFVDLHRDPMTGRSKGYAFVQYKRAEDAKMAL 176

Query: 354 AALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGMN 405
             + G ++  +TL V      G ++ T+Q+++       L  A +   +QK+A   S   
Sbjct: 177 EQMEGFELAGRTLRVNTVHEKGSTRYTQQDTLDEAGGGNLNAASRQALMQKLARTDSAPV 236

Query: 406 TLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREECG-KYGTLVNV 459
            L     +    + + +     +  +    +EE E     ++ +D++ EC  KYG +  +
Sbjct: 237 KLE---PVARPHIPQTMQSRSVLLKNMFNPEEETERDWDKDLADDVKSECATKYGPVQAI 293

Query: 460 VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
            + +      ET G   V  E  DA G A     L+GR FGG  ++A +
Sbjct: 294 KVEK------ETQGEIYVLFETVDAAGQAI--EGLNGRWFGGRQISAAF 334


>gi|71026268|ref|XP_762815.1| splicing factor [Theileria parva strain Muguga]
 gi|68349767|gb|EAN30532.1| splicing factor, putative [Theileria parva]
          Length = 644

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 128/302 (42%), Gaps = 33/302 (10%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE  T E    A+++ G+  +G  +RV          A A               
Sbjct: 360 KGIAYVEFYTQESVIKALSMTGMSMKGQGIRVHSSQAEKNRAAKAQKQ------------ 407

Query: 282 GLASGAIGGAEGPDRVFVG---GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
            L   A+  ++ P  + V    G+  Y  E ++ +L   FG +    L +  D GNSKGY
Sbjct: 408 -LQDNALKESDNPTTIVVSNLLGVLSYLNEIELNQLFSPFGNIIDVALAR-TDDGNSKGY 465

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
            +  ++       A   +NG  +  + + V  A     SK+   S+     + +      
Sbjct: 466 AYIRFKRWNEAKEALNVMNGFDINGQQIKVAYANTRKDSKSRLHSLGDVDMERLDDDDAG 525

Query: 399 LQTSGMNTLGGGMSLFGETL---AKVLCLTEAITADALADDEEY-EEILEDMREECGKYG 454
           L  SG N     M    +     +  L L+   T+    D+ E+ +EI ED++EECGKYG
Sbjct: 526 L-ISGSNIKIALMKKLQQRQPLNSSNLVLSNMYTSADYEDNREFFDEIEEDVKEECGKYG 584

Query: 455 TLVNVVIPR--PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPED 512
           T++ V + +  PD         GKV++++ +      A  +L GR F GNT+   Y  +D
Sbjct: 585 TVIQVFVNKRNPD---------GKVYVKFKNNDDAQAANKSLQGRYFAGNTIQVSYISDD 635

Query: 513 KY 514
           +Y
Sbjct: 636 QY 637


>gi|426201409|gb|EKV51332.1| hypothetical protein AGABI2DRAFT_189584 [Agaricus bisporus var.
           bisporus H97]
          Length = 563

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/347 (25%), Positives = 149/347 (42%), Gaps = 37/347 (10%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           AR V+V  L      + +  FF      +G  +      V +      K   +VE RT+E
Sbjct: 232 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDARIVTDRLSRRSKGIGYVEFRTIE 288

Query: 234 EASNAMALDGIIFEGVAVRVR-RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
               A+ L G +  G+ + V+    + N T A     G  S NL    V    GAI    
Sbjct: 289 LVEKAIGLSGTVVMGLPIMVQLTEAERNKTHA-----GDGSINLP-PGVSAPHGAI---- 338

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
               ++VG L +  TE+ IK++ E FG L   DL +D  TG SKGY F  Y+      +A
Sbjct: 339 ----LYVGSLHFNLTESDIKQVFEVFGELEFVDLHRDAMTGRSKGYAFVQYKRAEDARMA 394

Query: 353 CAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMAL--QTS 402
              + G ++  +TL V      G +K T+Q+S+       L  A +   +QK+A   Q +
Sbjct: 395 LQQMEGFELAGRTLRVNTVHEKGTTKYTQQDSLDESGGGNLNAASRQALMQKLARTDQPA 454

Query: 403 GMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLVNVVI 461
                    ++     ++ + L      D   + +   E+ +D++ EC  KYG ++ + +
Sbjct: 455 PRPEPVQRPNIPQAMQSRSVLLKNMFDPDEETEKDWDRELAQDVKGECESKYGKVLAIKV 514

Query: 462 PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
            +  Q        G++++++        A   L+GR FGG  V+A +
Sbjct: 515 EKDSQ--------GEIYVKFDSIDYAQKAIQGLNGRWFGGRQVSAVF 553


>gi|443921112|gb|ELU40879.1| splicing factor, CC1-like family protein [Rhizoctonia solani AG-1
           IA]
          Length = 399

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 88/363 (24%), Positives = 153/363 (42%), Gaps = 56/363 (15%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
            AR V+V  L      + +  FF      +G  +      V +      K   +VE + +
Sbjct: 63  EARSVFVSQLAARLTARDLGYFFED---KLGEGAVRDARIVTDRLSRRSKGIGYVEFKNI 119

Query: 233 EEASNAMALDGIIFEGVAVRVR----RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
           +  + A+AL G I  G+ + ++          P+ +  L PG   P+             
Sbjct: 120 DLVNKAIALSGTIVMGLPIMIQLTESERNKIGPSSSLHLPPGVSHPH------------- 166

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               G  +++VG L +  TE+ I+++ E FG L   DL +D  TG SKGY F  Y+ P  
Sbjct: 167 ---AGSMQLYVGSLHFNLTESDIRQVFEPFGELDFVDLHRDPATGKSKGYCFIQYKRPED 223

Query: 349 TDIACAALNGLKMGDKTL-----------TVRRATASGQSKTEQESILAQA-QQHIAIQK 396
             +A   + G ++  + L           TVR +TA   S  +   +L  +  +H  +QK
Sbjct: 224 ARMALEQMEGFELAGRQLRVNTVHDKGQGTVRISTAPQDSLEDTGGVLNNSTSRHQLMQK 283

Query: 397 MAL--QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE-----EILEDMREE 449
           +A   Q S  NT+     L    + + L     +  +    DEE E     ++ +D+R E
Sbjct: 284 LARTEQPSKNNTM-----LMKSNIPQTLSSRCVLLRNMFDPDEETERDWDKDLADDVRGE 338

Query: 450 C-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           C  KYG ++++ + +  +        G++++++        A   L+GR FGG  V A  
Sbjct: 339 CEEKYGKVLDLKVEKESE--------GEIYIKFESVESAEKAIKGLNGRWFGGKQVTASP 390

Query: 509 YPE 511
            P+
Sbjct: 391 IPD 393


>gi|348503003|ref|XP_003439056.1| PREDICTED: RNA-binding protein 39-like [Oreochromis niloticus]
          Length = 498

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 111/267 (41%), Gaps = 47/267 (17%)

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G+ GP R++VG L +  TE  ++ + E FG + G  L+ D +TG SKGYGF  + D    
Sbjct: 222 GSSGPMRLYVGSLHFNITEEMLRGIFEPFGKIEGIQLMMDSETGRSKGYGFISFADAECA 281

Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQE--------------------SILAQAQ 389
             A   LNG ++  + + V   T    S T                        ++A+  
Sbjct: 282 KKALEQLNGFELAGRPMKVGHVTERSDSSTASSFLDNDELERTGIDLGTTGRLQLMARLA 341

Query: 390 QHIAIQ-----KMALQTSGMNTLGG--------------GMSLFGETLAK-VLCLTEAIT 429
           +   ++     + ALQ +G    GG               ++L  + LA   L L+    
Sbjct: 342 EGTGLKIPPAAQQALQMTGSIPFGGIGAPAAVPTPAPSQALNLPSQPLATHCLQLSNLFN 401

Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
             A  D     EI +D+ EEC K+G +V++ +   D+N  +    G V+++         
Sbjct: 402 PQAENDPSWAAEIQDDVIEECNKHGGIVHIYV---DKNSPQ----GNVYVKCPSIPAAMA 454

Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFN 516
             NAL GR F G  + A Y P   Y N
Sbjct: 455 TVNALHGRWFAGKMITAAYVPLPTYHN 481


>gi|403362995|gb|EJY81233.1| hypothetical protein OXYTRI_21372 [Oxytricha trifallax]
          Length = 411

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 83/385 (21%), Positives = 150/385 (38%), Gaps = 85/385 (22%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           +H R++YVG LPP    Q +    +  +  +  N   PG  +++ +I+ +  +AFVE RT
Sbjct: 70  KHERQLYVGNLPPTITHQKLVELLNIAVCVMKLN-VKPGQPILSAWISQDGHYAFVEFRT 128

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           +EE  N   L+ I  +G  ++V + T     + +      P  N     V + S A+  +
Sbjct: 129 IEECMNGHQLNQIAIQGHPLKVGK-TRIQNQINSQNPHNFPCQNSANQQVLMLSQALSNS 187

Query: 292 EGPDRVFVGGLPYYF---TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
                + +  +P ++   TE  +K LL+ FG    + +        ++   +C Y+    
Sbjct: 188 -----IEISNIPKFYENDTEALVK-LLKMFGVYRQYQM----KALQNQIICYCEYESDEQ 237

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
           T  A    N L + +  L VRR                                     G
Sbjct: 238 TKKALNGFNDLLVKESKLQVRRLPN----------------------------------G 263

Query: 409 GGMSLFGETLAKVL-CL------------------TEAITADALADDEEYEEILEDMREE 449
               +FG T AK + C+                     +  + +   +E+ E+ +D+REE
Sbjct: 264 HASQIFGSTTAKSIDCIQGNQKSSDEPRSSRVVVLNNLLVLENMKTKQEFYEVEDDIREE 323

Query: 450 CGKYGTLVNVVIPRPD--------------QNGGE---TPGVGKVFLEYYDAVGCATAKN 492
           C KYG +  V+IP+P               Q  G      G GK+++++  +        
Sbjct: 324 CEKYGKIRQVMIPKPSHLSHRQKLPFCIQIQRYGSYLVNEGAGKIYIKFDKSEQAKKCVE 383

Query: 493 ALSGRKFGGNTVNAFYYPEDKYFNK 517
            ++ R +    V A  Y EDK++++
Sbjct: 384 QMNKRLYNQREVIASLYSEDKWYDR 408


>gi|299755304|ref|XP_002912089.1| hypothetical protein CC1G_13622 [Coprinopsis cinerea okayama7#130]
 gi|298411164|gb|EFI28595.1| hypothetical protein CC1G_13622 [Coprinopsis cinerea okayama7#130]
          Length = 580

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 82/345 (23%), Positives = 148/345 (42%), Gaps = 35/345 (10%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           AR V+V  L      + +  FF      +G  +      V +      K   +VE R+++
Sbjct: 251 ARSVFVSQLAARLTARDLGYFFED---KLGEGTVMDARIVTDRLSRRSKGIGYVEFRSID 307

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
               A+AL G I  G+ + V+          A  G      +L+L     ASGAI     
Sbjct: 308 LVEKAIALSGTIVMGLPINVQLTESERNKSHAGDG------SLHLPPGVTASGAI----- 356

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
              ++VG L +  TE+ IK++ E FG L   DL KD  TG SKGY F  Y+      +A 
Sbjct: 357 ---LYVGSLHFNLTESDIKQVFEPFGELEFVDLHKDPMTGRSKGYAFVQYKRAEDARMAL 413

Query: 354 AALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQAQQHIAIQKMALQTSGMN 405
             + G ++  +TL V      G  + T+ +S+       L  A +   +QK+A     + 
Sbjct: 414 EQMEGFELAGRTLRVNTVHEKGSVRYTQTDSLDDSGGANLNAASRQALMQKLARTEQPVV 473

Query: 406 TLGGGMSLFGETL-AKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLVNVVIPR 463
                  +  + + ++ + L      +   +    +++ +D++ EC  KYG ++ + + +
Sbjct: 474 PAEPVKPIIPQAMQSRSVLLKNMFNPEEETEQNWDKDLADDVKGECENKYGKVLAIKVEK 533

Query: 464 PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
             Q        G++++++       +A   L+GR FGG  ++A +
Sbjct: 534 DSQ--------GEIYVKFDTVDTAKSAVQGLNGRWFGGRQISANF 570


>gi|70954273|ref|XP_746191.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56526725|emb|CAH88205.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 686

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 131/314 (41%), Gaps = 46/314 (14%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           A + AR +YVG +P   + Q I  F +  +  +           +   I  + ++AFVE 
Sbjct: 249 AEKTARELYVGNIPQNIDIQEIVKFLNTCLLILYNKENENESICLKACIRGDTRYAFVEF 308

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIG 289
           R++++ SN M L+GI F    +R+ RP  +       + P    P   +    L+ G IG
Sbjct: 309 RSLQDTSNCMLLNGIYFYSNNLRIGRPKTFPAEYTKLIPPATIPP---IDTYYLSQGLIG 365

Query: 290 GA------EGPDRVFVGGLPYYFTETQ---------------IKELLESFGTLHGFDLVK 328
                   +  D      LP    + Q               IKELLE+FG +  F+  +
Sbjct: 366 IKAFVIFHQNRDETKNEYLPVDMIKLQKLCVSNISKNNETNKIKELLEAFGEIQSFEFFE 425

Query: 329 DRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
             +  NS  Y            I     N ++   +   +     S + + E E I+   
Sbjct: 426 GEE--NSDTY------------ICLVEYNNVENAIQAHKILNQNTSYRIQFEYE-IVNDP 470

Query: 389 QQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMRE 448
             +  ++K  +QT           L  +   KV+ L++  T D L++ E+Y+EI ED++ 
Sbjct: 471 TINQLVKKKYMQTK-------NAILSQQIPTKVVVLSKIATFDELSNPEDYKEISEDIKI 523

Query: 449 ECGKYGTLVNVVIP 462
           EC KYG ++ VV+P
Sbjct: 524 ECEKYGPVLEVVLP 537



 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 25/47 (53%)

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            +G  F+ +    G    +  LSGRKFG N + A YY E K+  K++
Sbjct: 607 SIGCAFIYFETIEGATKTRKELSGRKFGANIIEANYYSEKKFLMKNF 653


>gi|410899827|ref|XP_003963398.1| PREDICTED: RNA-binding protein 39-like [Takifugu rubripes]
          Length = 500

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 112/267 (41%), Gaps = 47/267 (17%)

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G+ GP R++VG L +  TE  ++ + E FG + G  L+ D +TG SKGYGF  + D    
Sbjct: 224 GSSGPMRLYVGSLHFNITEEMLRGIFEPFGKIEGIQLMMDSETGRSKGYGFISFADAECA 283

Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQE--------------------SILAQAQ 389
             A   LNG ++  + + V   T    S T                        ++A+  
Sbjct: 284 KKALEQLNGFELAGRPMKVGHVTERSDSSTASSILDNDELERTGIDLGTTGRLQLMARLA 343

Query: 390 QHIAIQ-----KMALQTSG---MNTLGG-----------GMSLFGETLAK-VLCLTEAIT 429
           +   ++     + ALQ +G     T+ G            ++L  + LA   L L+    
Sbjct: 344 EGTGLKIPPAAQQALQMTGSMSFPTISGPPAVPTPSPSQALNLPAQPLATHCLQLSNLFN 403

Query: 430 ADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
             A  D     EI +D+ EEC K+G +V++ +   D+N  +    G V+++         
Sbjct: 404 PQAENDPSWAVEIQDDVIEECNKHGGVVHIYV---DKNSTQ----GNVYVKCPSIPAAMA 456

Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFN 516
             NAL GR F G  + A Y P   Y N
Sbjct: 457 TVNALHGRWFAGKMITAAYVPLPTYHN 483


>gi|255575831|ref|XP_002528813.1| splicing factor u2af large subunit, putative [Ricinus communis]
 gi|223531725|gb|EEF33547.1| splicing factor u2af large subunit, putative [Ricinus communis]
          Length = 844

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 97/392 (24%), Positives = 162/392 (41%), Gaps = 97/392 (24%)

Query: 93  RNRSKSLSPSRSPSK---SKRRSGFDMAPPAAAMLPGAAVP------GQLPGVP-----S 138
           + RS++ + + SP+K    K+++ +D+AP  A      +VP       Q+  +      S
Sbjct: 303 KRRSEAAARTPSPTKHSPEKKKAKWDLAPEGADSTFSVSVPPIFKLSNQIASLNARATVS 362

Query: 139 AVPEMAQNMLPFGATQLGAF------PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIA 192
           AVP  +  + P                +  VQ+   QATR  RR+YV  +P  A+E+A+ 
Sbjct: 363 AVPVASIPVKPLSGVSSNILLTNKNDTIDSVQLT--QATRPMRRLYVENIPAEASEKAVL 420

Query: 193 TFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVR 252
              + ++ + G N        ++  I+ EK  A VE  T E+AS A++ DG  F G  ++
Sbjct: 421 ERLNNLLISSGVNHIQGTQPCISCIIHKEKGQALVEFLTPEDASAALSFDGSYFSGSTIK 480

Query: 253 VRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIK 312
           +RRP D+   +A+  GP        L A                       Y+F      
Sbjct: 481 IRRPKDFIMEIASTFGP--------LKA-----------------------YHF------ 503

Query: 313 ELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA- 371
              E+   ++G                F  Y D +VT  ACA LNG+K+G + ++  +  
Sbjct: 504 ---ENIDDVNG-------------PCAFVEYADQSVTFRACAGLNGMKLGGQVISAVQVI 547

Query: 372 -TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
             AS      ++      +Q   +     Q   +  L    +L   +L+++         
Sbjct: 548 PNASTLEIDGKQPFYGVPEQAKPLLDKPTQVLKLKNLFDPETL--PSLSRI--------- 596

Query: 431 DALADDEEYEEILEDMREECGKYGTL--VNVV 460
                  E EE+LED+R EC ++GT+  VNVV
Sbjct: 597 -------EIEEVLEDVRLECARFGTVKSVNVV 621


>gi|159163083|pdb|1U2F|A Chain A, Solution Structure Of The First Rna-Binding Domain Of
           Hu2af65
          Length = 90

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 62/88 (70%), Gaps = 1/88 (1%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+
Sbjct: 1   ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 59

Query: 234 EASNAMALDGIIFEGVAVRVRRPTDYNP 261
           E + AMA DGIIF+G ++++RRP DY P
Sbjct: 60  ETTQAMAFDGIIFQGQSLKIRRPHDYQP 87


>gi|428672327|gb|EKX73241.1| RNA recognition motif domain containing protein [Babesia equi]
          Length = 511

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 89/369 (24%), Positives = 157/369 (42%), Gaps = 48/369 (13%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFV 227
           ++A R    V V  L   A+E+ I   FS+               V ++     K  A+V
Sbjct: 164 EEAQRADLTVLVINLSLSADERDIYELFSE-----HAGKVRDIQCVRDLRSGKSKGIAYV 218

Query: 228 EMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPN-LNLAAVGLASG 286
           E  T E    A+++ G+  +G  ++++                Q   N    AA  L   
Sbjct: 219 EFYTQESVIKALSMTGLDLKGQRIKIQ--------------SSQAEKNRAAKAAKMLQQT 264

Query: 287 AIGGAEGPDRVFVGGLP---YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
           A+  ++ P  ++VGGL        E ++K+L   FGT+   ++ +D +TG SKGY F  +
Sbjct: 265 AMDASDSPFTIYVGGLIGALSALNEVELKQLFSPFGTIIDVEIFRDPETGESKGYAFLKF 324

Query: 344 QDPAVTDIACAALNGLKMGDKTLTV-----------RRATASGQSKTEQES-----ILAQ 387
           +  +    A   +NG  +G + + V            R ++ G    E+       +++ 
Sbjct: 325 RRSSEAKEAMNTMNGFDIGGQQIKVGYANLNTTDSKSRLSSLGDVDIERLDDDGGGLISG 384

Query: 388 AQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEE--YEEILED 445
           A   IA+ +   +T+         S         + L+   TA+    DE   + EI ED
Sbjct: 385 ATNKIALMEKLQRTTAAPISATFSSGKASGPTSNIILSNMFTANDPGADEPNFFVEIEED 444

Query: 446 MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           ++EEC KYG +V V + +   +       GKV++++ ++   +TA   L+GR F GNT+ 
Sbjct: 445 VKEECEKYGKVVAVYLNKKTID-------GKVWVKFQNSTDASTAYKGLNGRYFAGNTIK 497

Query: 506 AFYYPEDKY 514
             Y  +D +
Sbjct: 498 VEYVTDDFW 506


>gi|195996811|ref|XP_002108274.1| hypothetical protein TRIADDRAFT_37071 [Trichoplax adhaerens]
 gi|190589050|gb|EDV29072.1| hypothetical protein TRIADDRAFT_37071 [Trichoplax adhaerens]
          Length = 351

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/321 (24%), Positives = 134/321 (41%), Gaps = 53/321 (16%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE R V+    A+ L+G   EG+ + ++R               Q   N     +
Sbjct: 37  KGIAYVEFRLVDSVDKALKLNGTKVEGIPIMIQRT--------------QSEKN----KI 78

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
                     +GP R+++G L Y   E  ++ + E FG +   ++++D DT  SKGYGF 
Sbjct: 79  AALQAQQKAQQGPTRLYIGSLHYNINEDMLRAIFEPFGLVENVNIIRDSDTNVSKGYGFI 138

Query: 342 VYQDPAVTDIACAALNGLKMGDK-----TLTVRRATASGQS-----KTEQESILAQAQQH 391
            Y++P     A   LNGL++  +     T+T R A  S  S      TE+  I   +   
Sbjct: 139 QYKEPDSARRALEQLNGLEVAGRPIKVGTVTDRSADLSAMSALDDDDTERGGIEMNSLSR 198

Query: 392 IAIQKMALQTSGMNTL----------------GGGMSLFGETLAKVLCLTEAITAD-ALA 434
           +A+     QT    T+                  G+     T+    C   +   D A  
Sbjct: 199 VALMAKLSQTHNATTVPVSVPVPVPVPGPTLPATGLIPAANTVQASPCFLISNMFDPAKE 258

Query: 435 DDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNA 493
            D++++ +I +D+ EEC K+G + +V + +       T   G V+++       A A  +
Sbjct: 259 TDQDWDLDIRDDIIEECNKHGNVYHVYVDK-------TSPKGIVYVKCQTIDVAARAVKS 311

Query: 494 LSGRKFGGNTVNAFYYPEDKY 514
           L+GR F GN + A +     Y
Sbjct: 312 LNGRWFAGNMITAQFLSLASY 332


>gi|294876942|ref|XP_002767845.1| splicing factor u2af large subunit, putative [Perkinsus marinus
           ATCC 50983]
 gi|239869760|gb|EER00563.1| splicing factor u2af large subunit, putative [Perkinsus marinus
           ATCC 50983]
          Length = 220

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 99/197 (50%), Gaps = 6/197 (3%)

Query: 186 ANEQAIATFFSQVMTAIGGN--SAGPGDAVVNVYI---NHEKKFAFVEMRTVEEASNAMA 240
           +++Q++  FF   + A+ GN     P   VV+V+    +   + A VE RT   A+ AM 
Sbjct: 16  SSQQSVMDFFKGALFAVTGNGGKTTPLHPVVSVFFLISDGHSRTALVEFRTPIAATVAMR 75

Query: 241 LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD-RVFV 299
           L+GI  +G  + + RP  YN    +     +    + +  +   S     A G + ++ +
Sbjct: 76  LNGIDLDGRKLAITRPHGYNKEDPSKSITAEDIQKVTIEELCGGSSTKKTAPGSNLQLGI 135

Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
             LP   TET +++LLE FG L    L++D+ TG SKGYGFC ++DP   D    AL+  
Sbjct: 136 YHLPPVMTETYLRDLLEQFGALTMVSLIRDKTTGLSKGYGFCQFEDPNDADRCLYALDQF 195

Query: 360 KMGDKTLTVRRATASGQ 376
            +G+ +L+V R     Q
Sbjct: 196 VLGNYSLSVTRLVPDAQ 212


>gi|118788821|ref|XP_317010.3| AGAP008433-PA [Anopheles gambiae str. PEST]
 gi|116122929|gb|EAA12873.4| AGAP008433-PA [Anopheles gambiae str. PEST]
          Length = 526

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/384 (24%), Positives = 150/384 (39%), Gaps = 63/384 (16%)

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           P++ M+Q+  R AR V+   L    + + +  FFS         S G    V  +  N  
Sbjct: 159 PLEEMSQE-DRDARTVFCMQLSQRIHARDLEEFFS---------SVGKVRDVRLITCNKT 208

Query: 222 KKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
           K+F   A++E +  E  + A+ L G    G+ + V+  T       A+  P  P  N   
Sbjct: 209 KRFKGIAYIEFKDPESVALALGLSGQKLLGIPISVQH-TQAEKNRMASQPPVAPPKN--- 264

Query: 279 AAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
                         GP R++VG L +  TE  +  + E FG +    L+ D DTG SKGY
Sbjct: 265 ------------PSGPMRLYVGSLHFNITEDMLNGIFEPFGKIDNIQLIMDADTGRSKGY 312

Query: 339 GFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA 398
           GF  + +      A   LNG ++  + + V   T      T   S+         I+  A
Sbjct: 313 GFITFHNADDAKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEMDRSGIELGA 371

Query: 399 ---LQTSGMNTLGGGMS----------------------LFGETLAKVLCLTEAITADAL 433
              LQ       G G++                      +    +A    L   +   A 
Sbjct: 372 TGRLQLMFKLAEGAGLAVPRAAADALLATAPQPVPQQPIMQSPPIATQCFLLSNMFDPAT 431

Query: 434 ADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
             +  ++ EI +D+ EEC K+G + +V + +      ++P  G V+++         A N
Sbjct: 432 ETNPNWDLEIQDDVIEECNKHGGVQHVYVDK------QSPS-GNVYVKCPSIATAVLAVN 484

Query: 493 ALSGRKFGGNTVNAFYYPEDKYFN 516
           AL GR F G  + A Y P   Y+N
Sbjct: 485 ALHGRWFAGRVIGAAYVPLINYYN 508


>gi|209879137|ref|XP_002141009.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
 gi|209556615|gb|EEA06660.1| RNA recognition motif. family protein [Cryptosporidium muris RN66]
          Length = 442

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 98/424 (23%), Positives = 163/424 (38%), Gaps = 95/424 (22%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV-------------------- 214
           R V V  +P L N Q ++TF    + A+ G  A   +  +                    
Sbjct: 30  RTVIVEKVPMLFNSQTLSTFLCGAICALQGKPADSLNTFISSIKEITCEDAQLTVNDNNS 89

Query: 215 ---NVY------------INHEKKFAF-VEMRTVEEASNAMALDGIIF--EGVAVRVRRP 256
              N+Y             NH     F VE++T+      + L+GI        +  RRP
Sbjct: 90  TIHNIYSSNAGSKSGGSSFNHGISRTFRVELQTIIYTLLCLKLNGIPIGTSSSKLICRRP 149

Query: 257 TDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLE 316
            +Y P        G P     +      S  I      ++  +  LP   +E  +++ LE
Sbjct: 150 KEYIPP-----PEGDPINTFQIVLDKPESKQIST----EKCILKDLPIDISEESLRKQLE 200

Query: 317 SFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ 376
           + G +    ++ D  TG  KG G   ++D    + A    +G  +          T +G 
Sbjct: 201 TIGPIKTLVVIYDPITGVPKGVGSFEFEDSLNCNKAVEKFHGRPI--------EGTKNGV 252

Query: 377 SKTEQES-ILAQAQQHIAIQKMALQTSG--------------------MNTLGGGMSL-- 413
              +  S ILA++  ++++   +L  S                     + +L   MS   
Sbjct: 253 WNIQLSSGILAKSNNNVSLNSTSLPISSTTPSNQASVISFITPREYKPVTSLTKSMSYKL 312

Query: 414 ---------------FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVN 458
                           GET +KV+ L   I  + L DD+EY  IL+ ++ E  K+GT++ 
Sbjct: 313 LSSPIIGLILCASKKVGETPSKVVQLLNIIQPEELLDDQEYHSILDSVKTEAEKFGTILE 372

Query: 459 VVIPRP-DQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG-NTVNAFYYPEDKYFN 516
           +  PRP  +      G GK+F+ + D      A+  L+GR F    TV A ++P +KY N
Sbjct: 373 IFSPRPKSRENLYCNGAGKIFIYFADITSARRAQYQLNGRIFDHVKTVCASFFPLEKYLN 432

Query: 517 KDYS 520
           ++YS
Sbjct: 433 REYS 436


>gi|327271618|ref|XP_003220584.1| PREDICTED: RNA-binding protein 39-like [Anolis carolinensis]
          Length = 578

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 88/336 (26%), Positives = 135/336 (40%), Gaps = 65/336 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 250 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 292

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 293 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 352

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 353 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 412

Query: 382 ESILAQAQQHIAIQ-----KMALQTSG------MNTLGGGMSLFGETLAKVLCL----TE 426
             ++A+  +   +Q     + ALQ SG      +  L   +S   E LA    +    T+
Sbjct: 413 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFSAVADLQTRLSQQSEVLAAAASVQPLATQ 472

Query: 427 AITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
                 + + +  E      EI +D+ EEC K+G +V++ +   D+N  +    G V+++
Sbjct: 473 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVVHIYV---DKNSAQ----GNVYVK 525

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
                    A NAL GR F G  + A Y P   Y N
Sbjct: 526 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 561


>gi|393247915|gb|EJD55422.1| splicing factor, CC1-like protein [Auricularia delicata TFB-10046
           SS5]
          Length = 581

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 145/348 (41%), Gaps = 35/348 (10%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
            AR V+V  L      + +  FF      +G  +      V +      K   +VE+R++
Sbjct: 247 EARSVFVSQLAARLTARDLGYFFED---KLGEGAVRDARIVTDRISRRSKGIGYVELRSI 303

Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
           +  + A+ L G I  G+ + V+        + A         NLNL      S   GGA 
Sbjct: 304 DLVTKALDLSGTIVMGLPIMVQLTEAERNRVHAG-------ENLNLPPG--VSAPQGGAM 354

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
              +++VG L +  TE  IK++ E FG L   DL +D  TG SKGY F  Y+      +A
Sbjct: 355 ---QLYVGSLHFNLTEQDIKQVFEPFGELDFVDLHRDPGTGRSKGYAFVQYKRAEDAKMA 411

Query: 353 CAALNGLKMGDKTLTVRRATASGQSKTEQESI----------LAQAQQHIAIQKMA-LQT 401
              ++G ++  +TL V      G +     +I          L  A +   +QK+A +  
Sbjct: 412 LEQMDGFELAGRTLRVNSVNEKGVAVRNTTTIDSLEDSGGGNLNAASRQALMQKLARIDP 471

Query: 402 SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECG-KYGTLVNVV 460
              +       +      + + L      D   + +  +++ +D++ EC  KYG +  + 
Sbjct: 472 PKSSQPEARKHIPQNQSTRSVLLLNMFDPDEETEPDWDKDLADDVKGECASKYGPVTALK 531

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           I +  Q        G++++++        A ++L+GR FGG  VNA +
Sbjct: 532 IEKDSQ--------GEIYVQFESVDSAKKAVDSLNGRWFGGRQVNARF 571


>gi|409051610|gb|EKM61086.1| hypothetical protein PHACADRAFT_247456 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 584

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 151/363 (41%), Gaps = 45/363 (12%)

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           P+    ++    AR V+V  L      + +  FF   +    G  A     +V   I+  
Sbjct: 241 PLAEEPREDDSEARSVFVSQLAARLTARDLGYFFEDKL----GEGAVMDSRIVTDRISRR 296

Query: 222 KK-FAFVEMRTVEEASNAMALDGIIFEGVAVRVRR-PTDYNPTLAA---ALGPGQPSPNL 276
            K   +VE RT+E    A+ L G I  G+ ++V+    + N T A     L PG      
Sbjct: 297 SKGIGYVEFRTIELVEKAIGLSGTIVMGLPIQVQHTEAERNRTHAGDSLHLPPG------ 350

Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
                   S   GG +    ++VG L +  TE+ I+++ E FG L   DL +D  TG SK
Sbjct: 351 -------VSSHHGGMQ----LYVGSLHFNLTESDIRQVFEPFGELEFVDLHRDPMTGRSK 399

Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK-TEQESI-------LAQA 388
           GY F  Y+      +A   + G ++  +TL V      G  + T QES+       L  A
Sbjct: 400 GYAFVQYKRGEDAKMALEQMEGFELAGRTLRVNTVHEKGNVRYTPQESLDDTGGGNLNAA 459

Query: 389 QQHIAIQKMAL--QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
            +   +QK+A   Q +         ++     +K + L      +   + +  +E+ +D+
Sbjct: 460 SRQALMQKLARTDQPAARPQPIMKPNIPQSMQSKSVLLKNMFNPEEETERDWDKELADDV 519

Query: 447 REEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVN 505
           + E   KYG +  + + R  Q        G++++++        A   L GR FGG  V+
Sbjct: 520 KNEVEDKYGDVNFIKVERESQ--------GEIYVKFDSIESAKKAIEGLHGRWFGGRQVS 571

Query: 506 AFY 508
           A +
Sbjct: 572 AAF 574


>gi|256082940|ref|XP_002577709.1| splicing factor [Schistosoma mansoni]
 gi|360043602|emb|CCD81148.1| putative splicing factor [Schistosoma mansoni]
          Length = 463

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 91/380 (23%), Positives = 147/380 (38%), Gaps = 72/380 (18%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK---FAFVE 228
           R AR V+V  L     ++ +  FF+         S G    V  +  N  K+    A+VE
Sbjct: 101 RDARTVFVWQLSARIRQRDLEDFFT---------SVGKIRDVRLIMDNKTKRSKGIAYVE 151

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R VE A  A+ L G    GV +++++       ++A   P  P P+             
Sbjct: 152 FREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRVSAT--PSLPRPSQQ----------- 198

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP ++++G L Y  TE  +K + E FG +    L+KD  T  S+GYGF  Y +   
Sbjct: 199 --NRGPMKLYIGSLHYNITEEMLKGIFEPFGKIEDIKLIKDPTTNRSQGYGFVTYVNSDD 256

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL- 407
              A   LNG ++  + + V   T     ++E   + A          + L T+G   L 
Sbjct: 257 AKKALDQLNGFELAGRPMKVNHVT----ERSEYACLSALDNDEADRSGVDLGTTGRLALM 312

Query: 408 -----GGGMSLFGETLAKV------------------------LCLTEAITADA----LA 434
                G G+ +    LA++                        +C    + ++     +A
Sbjct: 313 AKLAEGTGLEIPKAALAQLHIGQNNPILGSAGSVSSSSAIAPPVCTQCFMLSNMFDPHVA 372

Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
               +EEI +D+ EEC K G  +++ + R       T   G V+++           N L
Sbjct: 373 THSVFEEIRDDVIEECTKAGGCLHIFVDR-------TSAQGNVYVKCPSIAVATQCVNML 425

Query: 495 SGRKFGGNTVNAFYYPEDKY 514
            GR F G  + A Y P   Y
Sbjct: 426 HGRYFSGRLITAAYVPLINY 445


>gi|62122939|ref|NP_001014392.1| RNA binding motif protein 39b [Danio rerio]
 gi|61402832|gb|AAH91794.1| RNA binding motif protein 39b [Danio rerio]
          Length = 539

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 72/281 (25%), Positives = 119/281 (42%), Gaps = 51/281 (18%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +AS    G  GP R++VG L +  TE  ++ + E FG + G  L+ D +TG SKGYG
Sbjct: 249 AAAMASMLQRGGAGPMRLYVGSLHFNITEDMLRGIFEPFGKIEGIQLMMDSETGRSKGYG 308

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R+ AS  S                 T
Sbjct: 309 FISFADAECAKKALEQLNGFELAGRPMKVGHVTERSDASSASSFLDNDELERTGIDLGTT 368

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLGG------------------GMSLFGE 416
            +  ++A+  +   +Q     K ALQ SG  + G                    M+L  +
Sbjct: 369 GRLQLMARLAEGTGLQIPAAAKQALQMSGSVSFGNMPNASATPPLIPNPGMNQAMNLPTQ 428

Query: 417 TLAKVLCLTEAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
            LA        +    + ++  ++ EI +D+ EEC K+G ++++ +   D+N  +    G
Sbjct: 429 PLATHCLQLSNMFNPQMENEPGWDIEIRDDVIEECRKHGGVIHIYV---DKNSAQ----G 481

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            V+++           ++L GR F G  + A Y P   Y N
Sbjct: 482 NVYVKCPTIPVAMAVVSSLHGRWFAGKMITAAYVPLPTYHN 522


>gi|432865706|ref|XP_004070573.1| PREDICTED: RNA-binding protein 39-like [Oryzias latipes]
          Length = 516

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 69/266 (25%), Positives = 109/266 (40%), Gaps = 46/266 (17%)

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G+ GP R++VG L +  TE  ++ + E FG + G  L+ D +TG SKGYGF  + D    
Sbjct: 241 GSAGPMRLYVGSLHFNITEEMLRGIFEPFGKIEGIQLMMDSETGRSKGYGFISFADAECA 300

Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKTEQE--------------------SILAQAQ 389
             A   LNG ++  + + V   T    S T                        ++A+  
Sbjct: 301 KKALEQLNGFELAGRPMKVGHVTERSDSSTASSFLDNDELERTGIDLGTTGRLQLMARLA 360

Query: 390 QHIAIQ-----KMALQTSGMNTLGG-------------GMSLFGETLA-KVLCLTEAITA 430
           +   ++     + ALQ +G    G               ++L  + LA   L L+     
Sbjct: 361 EGTGLKIPPAAQQALQMTGSIPFGNMAAPAIPTPAPSQALNLPSQPLATHCLQLSNLFDP 420

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATA 490
            A  D     EI +D+ EEC K+G +V++ +   D+N  +    G V+++          
Sbjct: 421 QAENDPAWASEIQDDVIEECNKHGGVVHIYV---DKNSPQ----GNVYVKCPSIPAAMAT 473

Query: 491 KNALSGRKFGGNTVNAFYYPEDKYFN 516
            NAL GR F    + A Y P   Y N
Sbjct: 474 VNALHGRWFARKMITAAYVPLPTYHN 499


>gi|326931688|ref|XP_003211958.1| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein 39-like
           [Meleagris gallopavo]
          Length = 571

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/334 (26%), Positives = 131/334 (39%), Gaps = 65/334 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 243 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 285

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 286 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 345

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 346 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 405

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA-- 434
             ++A+  +   +Q     + ALQ SG    G    L      +   L  A +   LA  
Sbjct: 406 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQNEVLAAAASVQPLATQ 465

Query: 435 -----------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
                       +EE     EI +D+ EEC K+G ++++ +   D+N  +    G V+++
Sbjct: 466 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVK 518

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                    A NAL GR F G  + A Y P   Y
Sbjct: 519 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTY 552


>gi|226469236|emb|CAX70097.1| RNA-binding protein 39 [Schistosoma japonicum]
          Length = 463

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 91/380 (23%), Positives = 148/380 (38%), Gaps = 72/380 (18%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK---FAFVE 228
           R AR V+V  L     ++ +  FF+         S G    V  +  N  K+    A+VE
Sbjct: 101 RDARTVFVWQLSARIRQRDLEDFFT---------SVGKIRDVRLIMDNKTKRSKGIAYVE 151

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R VE A  A+ L G    GV +++++       ++A   P  P P+             
Sbjct: 152 FREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRVSAT--PSLPRPSQQ----------- 198

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
              +GP ++++G L Y  TE  +K + E FG +    L+KD  T  S+GYGF  Y +   
Sbjct: 199 --NKGPMKLYIGSLHYNITEEMLKGIFEPFGKIEDIKLIKDPATNRSQGYGFVTYVNSDD 256

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL- 407
              A   LNG ++  + + V   T     ++E   + A          + L T+G   L 
Sbjct: 257 AKKALDQLNGFELAGRPMKVNHVT----ERSEYACLSALDNDEADRSGVDLGTTGRLALM 312

Query: 408 -----GGGMSLFGETLAKV------------------------LCLTEAITADA----LA 434
                G G+ +    LA++                        +C    + ++     +A
Sbjct: 313 AKLAEGTGLEIPKAALAQLHIGQNNPILGSAGSVSSSSAIAPPVCTQCFMLSNMFDPHVA 372

Query: 435 DDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNAL 494
               +EEI +D+ EEC K G  +++ + R       T   G V+++           N L
Sbjct: 373 THSVFEEIRDDVIEECTKAGGCLHIFVDR-------TSAQGNVYVKCPSIAVATQCVNML 425

Query: 495 SGRKFGGNTVNAFYYPEDKY 514
            GR F G  + A Y P   Y
Sbjct: 426 HGRYFSGRLITAAYVPLINY 445


>gi|126291195|ref|XP_001371651.1| PREDICTED: RNA-binding protein 39 isoform 1 [Monodelphis domestica]
          Length = 524

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/338 (26%), Positives = 135/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFG--ETLAKVLCLTEAITADALA 434
             ++A+  +   +Q     + ALQ SG    G    L       ++V  L  A +   LA
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQSEVTALAAAASVQPLA 416

Query: 435 ---------------DDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
                          D+  ++ EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 417 TQCFQLSNMFNPQTEDELGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 469

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 470 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 507


>gi|340500276|gb|EGR27170.1| splicing factor u2af large subunit, putative [Ichthyophthirius
           multifiliis]
          Length = 201

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 82/149 (55%), Gaps = 12/149 (8%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGG--NSAGPGDAVVNVYINHEKKFAFVEM 229
           RHARR+Y+G +P   N++ ++ +  + + A GG  +S    + ++   I+ + KFAF+E+
Sbjct: 22  RHARRLYIGNIPDSINQEYLSEWLYRSLEAAGGLVDSLPNENPIIKCEIDSKGKFAFIEI 81

Query: 230 RTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL---AAVGLASG 286
           RT+EE +  + LDGII     +R+RRPT+Y       + P      LNL     +G+   
Sbjct: 82  RTIEETTTLLQLDGIILWHRQLRIRRPTEYEK--FPQIYPNYNVKKLNLDLFKTIGIVII 139

Query: 287 AIGGAEGPDRVFVGGLPYYFTETQIKELL 315
                +GP+++F+  LP     TQ+ EL+
Sbjct: 140 PTVVDDGPNKIFLANLP-----TQMDELM 163


>gi|47212427|emb|CAF93583.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 500

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 73/265 (27%), Positives = 112/265 (42%), Gaps = 46/265 (17%)

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G  GP R++VG L +  TE  ++ + E FG +    L+ D DTG SKGYGF  + D    
Sbjct: 226 GMMGPLRLYVGSLHFNITEEMLRGIFEPFGRIENIQLMVDSDTGRSKGYGFITFADAECA 285

Query: 350 DIACAALNGLKMGDKTLTVRRATASGQSKT-------EQE------------SILAQ--- 387
             A   LNG ++  + + V   T    +         EQE             ++AQ   
Sbjct: 286 KKALEQLNGFELAGRPMKVGHVTDRSDAVAPPFPDGEEQERAGADLGSTGRLQLMAQLSE 345

Query: 388 ---------AQQ------HIAIQKMALQTSGMNTLGGGMSLFGETLA-KVLCLTEAITAD 431
                    AQQ       IA+  MA  ++ MN  G  MS+  + LA     L+   + +
Sbjct: 346 GTGLPMPPSAQQALQMSGAIALGAMAAVSAAMNP-GLNMSIPSQPLATHCFQLSNMFSPN 404

Query: 432 ALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAK 491
           +        +I  ++ EEC K+G +V++ +   D++  E    G V+++         A 
Sbjct: 405 SELPPGWELDIQHNVIEECNKHGGVVHIYV---DKDSAE----GNVYIKCPTIPAAMAAV 457

Query: 492 NALSGRKFGGNTVNAFYYPEDKYFN 516
           N L GR F G  + A Y P   Y N
Sbjct: 458 NVLHGRFFNGKLITAAYVPLPTYHN 482


>gi|145502691|ref|XP_001437323.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404473|emb|CAK69926.1| unnamed protein product [Paramecium tetraurelia]
          Length = 438

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 66/110 (60%), Gaps = 1/110 (0%)

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
           M+ + +    +L +   +T + +  DEE+ +I+ED+REEC K+GT+ NV+IPRP+  G  
Sbjct: 294 MARYVQIPTNILVIKNVLTLEDVTIDEEFNDIMEDIREECSKFGTVKNVIIPRPE-FGKI 352

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
             GVGK+F+EY       TA+  L+GR +G  TV   Y   +K+  + ++
Sbjct: 353 IVGVGKIFVEYEKTQEARTARRYLAGRMYGDKTVECEYLSREKWAKRQFT 402



 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 56/223 (25%), Positives = 98/223 (43%), Gaps = 41/223 (18%)

Query: 81  RHRSRSHSSDRFRNR---SKSLSPSRSPSKSKRRSGFDMAPPAAAM-----------LPG 126
           + RSR+ S ++ + R    K  +P+++ ++  R   FD +PP   +           L  
Sbjct: 41  KKRSRNVSKEKDKKREEFQKPKAPTKANAEQSRGFRFD-SPPKDPLQNTPFSNFKSKLID 99

Query: 127 AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ---QATRHA-RRVYVGGL 182
               G+   +  A P   QN L          PL+ +Q + Q   QA   A R++YVG L
Sbjct: 100 QVSLGEFETILPANP--LQNPLASLEALQAMTPLIQMQRLQQLRAQADVKADRKLYVGNL 157

Query: 183 PPLANEQAIAT----------FFSQVMTAIGGNSAGPGDAVVNVYINHEKK--------- 223
           PP +  + +            F +Q +  +G +S   G ++ N +I+   +         
Sbjct: 158 PPNSQPKEVEMVMDILNQLQDFLNQTLLKMGVSSEHAG-SICNCWIDSNGQILRLIYLGH 216

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
           F F+E R+ EEA+    L  +IF+G  +++ RP  +  +LAA 
Sbjct: 217 FGFIEFRSPEEATQGFILKDVIFKGHQLKIGRPKSFLTSLAAV 259


>gi|395505312|ref|XP_003756986.1| PREDICTED: RNA-binding protein 39 [Sarcophilus harrisii]
          Length = 557

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/338 (26%), Positives = 135/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 227 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 269

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 270 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 329

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 330 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 389

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFG--ETLAKVLCLTEAITADALA 434
             ++A+  +   +Q     + ALQ SG    G    L       ++V  L  A +   LA
Sbjct: 390 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQSEVTALAAAASVQPLA 449

Query: 435 ---------------DDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
                          D+  ++ EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 450 TQCFQLSNMFNPQTEDELGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 502

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 503 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 540


>gi|224077247|ref|XP_002192236.1| PREDICTED: RNA-binding protein 39 [Taeniopygia guttata]
          Length = 522

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/334 (26%), Positives = 131/334 (39%), Gaps = 65/334 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA-- 434
             ++A+  +   +Q     + ALQ SG    G    L      +   L  A +   LA  
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVTDLQTRLSQQNEVLAAAASVQPLATQ 416

Query: 435 -----------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
                       +EE     EI +D+ EEC K+G ++++ +   D+N  +    G V+++
Sbjct: 417 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVK 469

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                    A NAL GR F G  + A Y P   Y
Sbjct: 470 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTY 503


>gi|384252245|gb|EIE25721.1| hypothetical protein COCSUDRAFT_64802 [Coccomyxa subellipsoidea
           C-169]
          Length = 581

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/92 (43%), Positives = 55/92 (59%), Gaps = 5/92 (5%)

Query: 422 LCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE-----TPGVGK 476
           L +T  +T D L DDEEY E+++D++EEC KYG ++ V++PRP           +   GK
Sbjct: 480 LQVTGMVTPDVLVDDEEYSEVIQDLQEECSKYGQVLRVLVPRPPNPAASNELFGSNNYGK 539

Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
            F E+ D  GC+ AK A+ GR F G TV A Y
Sbjct: 540 AFAEFADVSGCSAAKAAIHGRLFAGETVQATY 571



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/88 (39%), Positives = 50/88 (56%), Gaps = 3/88 (3%)

Query: 175 RRVYVGGLPP-LANEQAIATFFSQVMTAIGGNSAGPG-DAVVNVYINHEKKFAFVEMRTV 232
           R VYVG L   L  E A+   F+  M A   N    G +AVV+V ++ E ++AFVE+RT 
Sbjct: 294 REVYVGNLVAGLVTEDALRQLFNSTMAAAFPNLLAQGLEAVVSVSMHSEGRYAFVELRTP 353

Query: 233 EEASNAMAL-DGIIFEGVAVRVRRPTDY 259
           E AS A+ L + +   G ++ V RP+ Y
Sbjct: 354 EMASAALQLSNQVQLLGQSISVGRPSGY 381


>gi|403222792|dbj|BAM40923.1| RNA splicing factor [Theileria orientalis strain Shintoku]
          Length = 649

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 76/305 (24%), Positives = 138/305 (45%), Gaps = 40/305 (13%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL-AA 280
           K  A+VE  T E    A++++G+  +G  +RV+                Q   N    AA
Sbjct: 367 KGIAYVEFYTQESVIKALSMNGMSLKGQGIRVQ--------------SSQAEKNRAARAA 412

Query: 281 VGLASGAIGGAEGPDRVFVG---GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
             L   A+  A+ P  V V    G+    +E  +++L   FG +    + ++ D G SKG
Sbjct: 413 KQLQENALKEADNPTTVMVSNLVGVLSNLSEGDLQQLFAPFGNVAEVAVARN-DLGLSKG 471

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           Y +  ++       A   +NG  +  + + V   T++ + +  + + L      + I+++
Sbjct: 472 YAYVRFKRWTEAREALNVMNGFDISGQPIKVSYVTSNKRGRGSRLNELG----DLDIERL 527

Query: 398 ALQTSGM-----NTLGGGMSLFGETLAKVLCLTEAITADALADDEEY-EEILEDMREECG 451
             + +G+     N +     L     A  + L+   T++  AD+ ++ +EI +D+REEC 
Sbjct: 528 DDEEAGLISGSSNKIALMKKLQQRVNAANIVLSNMYTSEDYADNNDFFDEIEDDVREECK 587

Query: 452 KYGTLVNVVIPR--PDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
           KYG +V V + R  PD         GKV++++       TA  +L GR F GNT+   Y 
Sbjct: 588 KYGEVVKVYLNRRKPD---------GKVYVKFRSNTDAQTAHKSLQGRYFAGNTIQVGYL 638

Query: 510 PEDKY 514
            +D++
Sbjct: 639 SDDQF 643


>gi|302695543|ref|XP_003037450.1| hypothetical protein SCHCODRAFT_80935 [Schizophyllum commune H4-8]
 gi|300111147|gb|EFJ02548.1| hypothetical protein SCHCODRAFT_80935 [Schizophyllum commune H4-8]
          Length = 409

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 151/357 (42%), Gaps = 52/357 (14%)

Query: 173 HARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTV 232
            AR V+V  L      + +  FF      +G N+      V +      K   +VE RTV
Sbjct: 74  EARSVFVSQLAARLTARDLGYFFED---KLGENTVMDARIVTDRISRRSKGIGYVEFRTV 130

Query: 233 EEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAE 292
           +    A+ L G +  G+ + V+        L A  G      NLNL      S   GGA 
Sbjct: 131 DLVDKALDLSGTVVMGLPIMVQLTEAERNRLHAGDG------NLNLPPG--VSAPHGGAM 182

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
              +++VG L +  TE  IK++ E FG L   DL +D  TG SKGY F  Y+      +A
Sbjct: 183 ---QLYVGSLHFNLTEADIKQVFEPFGELEFVDLHRDPTTGRSKGYAFVQYKRAEDARMA 239

Query: 353 CAALNGLKMGDKTLTVRRATASG------QSKTEQES---ILAQAQQHIAIQKMALQTSG 403
              + G ++  + L V      G      Q+++  +S    L  A +   +QK+A   S 
Sbjct: 240 MEQMEGFELAGRQLKVNTVHDKGGVVRYAQTESLDDSGGGNLNAASRQALMQKLARTDSA 299

Query: 404 MNTLGGGMSLFGETLA----------KVLCLTEAITADALADDEEYEEILEDMREEC-GK 452
                    L  E +A          + + L      +  + D+  +E+ +D++ EC  K
Sbjct: 300 --------PLLPEPVARPNIPQTMESRSVLLKNMFDPEEESGDDWDKELADDVKGECESK 351

Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA-TAKNALSGRKFGGNTVNAFY 508
           YG +  + + +  Q        G+++++ +DAV  A  A   L+GR FGG  V+A +
Sbjct: 352 YGKVSAIKVEKETQ--------GEIYVK-FDAVDAARKAVQGLNGRWFGGKQVSAAF 399


>gi|118100450|ref|XP_425690.2| PREDICTED: RNA-binding protein 39 [Gallus gallus]
 gi|363741409|ref|XP_003642487.1| PREDICTED: RNA-binding protein 39-like [Gallus gallus]
          Length = 522

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 87/334 (26%), Positives = 131/334 (39%), Gaps = 65/334 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA-- 434
             ++A+  +   +Q     + ALQ SG    G    L      +   L  A +   LA  
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQNEVLAAAASVQPLATQ 416

Query: 435 -----------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
                       +EE     EI +D+ EEC K+G ++++ +   D+N  +    G V+++
Sbjct: 417 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVK 469

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                    A NAL GR F G  + A Y P   Y
Sbjct: 470 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTY 503


>gi|325191168|emb|CCA25956.1| Poly(U)bindingsplicing factor PUF60 putative [Albugo laibachii
           Nc14]
          Length = 454

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 145/359 (40%), Gaps = 56/359 (15%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE------KKFAFV 227
           A+R+YVG L     E  I   F+            P  A+ ++ ++ E      K F F+
Sbjct: 135 AKRLYVGNLYYELKEDDIRNVFA------------PFGAIHSIDLSMEPGTGRSKGFCFL 182

Query: 228 EMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           E   V  A +A+  L+G      A++V RP             G  +P  + AAV +   
Sbjct: 183 EFNDVLAAESAVQVLNGSTMANRAIKVGRPHR-----------GNQNPKDSEAAVNIGKE 231

Query: 287 AIGGAEGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDL--VKDRDTGNSKGYGFCVY 343
           AI     P + V++GG+        I+ +   FG +    +  V   ++G  +GYGF  +
Sbjct: 232 AIRNV--PTKCVYIGGVRTELNSRHIESIFAPFGEIKHCVMTAVSSSESGVHRGYGFIEF 289

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASG---QSKTEQESILAQAQQHIAIQKMALQ 400
            D      A   +NG ++  +TL V +A+A       K   + ++        I  ++  
Sbjct: 290 GDEICAMNAIQHMNGFELAGQTLKVGKASAVALLVNLKISNDKVVD------GIHSLSDA 343

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVV 460
                 +   + L  +     LCL   I    +      E +  ++  EC KYG +  VV
Sbjct: 344 KQRRKIIEPILELEEKEEQICLCLLNLIKPGDVD-----ENLRGEVASECSKYGDIAQVV 398

Query: 461 IPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           I        E     ++F++Y D  G   AK AL GR FGGN V A +YP   +  K Y
Sbjct: 399 I-------HELSSHVRIFVQYEDEAGALRAKGALHGRYFGGNAVKAHFYPIQMFLEKKY 450



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 60/136 (44%), Gaps = 4/136 (2%)

Query: 257 TDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLE 316
           T  +P +A A    Q     NL +  L S      E   R++VG L Y   E  I+ +  
Sbjct: 102 TALDPEIAKARALAQA----NLLSQSLPSTLFNPIEFAKRLYVGNLYYELKEDDIRNVFA 157

Query: 317 SFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ 376
            FG +H  DL  +  TG SKG+ F  + D    + A   LNG  M ++ + V R     Q
Sbjct: 158 PFGAIHSIDLSMEPGTGRSKGFCFLEFNDVLAAESAVQVLNGSTMANRAIKVGRPHRGNQ 217

Query: 377 SKTEQESILAQAQQHI 392
           +  + E+ +   ++ I
Sbjct: 218 NPKDSEAAVNIGKEAI 233


>gi|432101442|gb|ELK29624.1| RNA-binding protein 39 [Myotis davidii]
          Length = 491

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 86/327 (26%), Positives = 132/327 (40%), Gaps = 56/327 (17%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 172 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 214

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 215 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 274

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 275 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 334

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCL-TEAITADALAD 435
             ++A+  +   +Q     + ALQ SG    G   +      A V  L T+      + +
Sbjct: 335 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAASALAAAASVQPLATQCFQLSNMFN 394

Query: 436 DEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT 489
            +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+++         
Sbjct: 395 PQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVKCPSIAAAIA 447

Query: 490 AKNALSGRKFGGNTVNAFYYPEDKYFN 516
           A NAL GR F G  + A Y P   Y N
Sbjct: 448 AVNALHGRWFAGKMITAAYVPLPTYHN 474


>gi|449265754|gb|EMC76900.1| RNA-binding protein 39, partial [Columba livia]
          Length = 423

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 87/334 (26%), Positives = 131/334 (39%), Gaps = 65/334 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 95  KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 137

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 138 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 197

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 198 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 257

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA-- 434
             ++A+  +   +Q     + ALQ SG    G    L      +   L  A +   LA  
Sbjct: 258 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQNEVLAAAASVQPLATQ 317

Query: 435 -----------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
                       +EE     EI +D+ EEC K+G ++++ +   D+N  +    G V+++
Sbjct: 318 CFQLSNMFNPQTEEEAGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVYVK 370

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                    A NAL GR F G  + A Y P   Y
Sbjct: 371 CPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTY 404


>gi|346465875|gb|AEO32782.1| hypothetical protein [Amblyomma maculatum]
          Length = 558

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 87/327 (26%), Positives = 128/327 (39%), Gaps = 54/327 (16%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   VE    AM L+G    G+ + V+ PT      AAA      S  L     
Sbjct: 238 KGIAYVEFLDVESVPLAMGLNGQKLFGIPIVVQ-PTQAERNRAAAQNASTSSSTLQ---- 292

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
                   G  GP R++VG L +  TE  +K + E FG +   +L+KD +T  SKGYGF 
Sbjct: 293 -------RGNVGPMRLYVGSLHFNITEDMLKGIFEPFGKIDKIELIKDMETNRSKGYGFI 345

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQE-------------- 382
            + D      A   LNG ++  + + V     R   +   S  + E              
Sbjct: 346 TFHDSEDAKKALEQLNGFELAGRPMKVGHVTERTDVSQAPSFLDSEELDRSGIDLGATGR 405

Query: 383 -SILAQAQQHIAIQ--KMALQTSGMNTLG----------GGMSLFGETLAKVLCLTEAIT 429
             ++A+  +    Q  + A+    MNT G             +    T+A   C   +  
Sbjct: 406 LQLMAKLAEGTGFQIPQAAVNALQMNTTGLPGQPQAAAVAAAAAAAPTIA-TQCFLLSNM 464

Query: 430 ADALADDEEY--EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
            D L +      EEI  D+ EEC K+G  ++V + R           G V+++       
Sbjct: 465 FDPLTETNPSWDEEIRRDVIEECRKHGGALHVYVDRASPE-------GHVYVKCPTIASA 517

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKY 514
             + NAL GR F G  + A Y P   Y
Sbjct: 518 VASVNALHGRWFAGRIITAAYVPVMSY 544


>gi|125584846|gb|EAZ25510.1| hypothetical protein OsJ_09334 [Oryza sativa Japonica Group]
          Length = 942

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 142/350 (40%), Gaps = 80/350 (22%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQ-VMTAIGGNSAGPGDAVVNVYINHEKK 223
           V   QATR  RR+++  LP LA E  +    ++ ++++   +        ++  IN +K+
Sbjct: 464 VQLTQATRPLRRLHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKR 523

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
            AFVE  T E+A+ A++ DG  F G ++++RRP +Y     A + P +PS  + L +  +
Sbjct: 524 QAFVEFLTPEDATAALSFDGRSFGGSSLKIRRPKEY--VEMAHVAPKKPSEEIKLISDVV 581

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
                  A+ P ++F+ G+    +                                  +Y
Sbjct: 582 -------ADSPHKIFIAGISGVISSE--------------------------------MY 602

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
            D  +T  ACA LNG+K+G   LT   A     + TEQ     +A     I   A     
Sbjct: 603 IDHPITSKACAGLNGMKLGGGILT---AVNVFPNSTEQ--AFNEASPFYGIPDSA----- 652

Query: 404 MNTLGGGMSLFGETLAKVLCLTEAITADA--LADDEEYEEILEDMREECG--------KY 453
                   SL  E   KVL L      +   L    E EEILED+R EC         ++
Sbjct: 653 -------KSLLEEP-TKVLQLKNVFDQEEYLLLSKSELEEILEDVRVECASLHYGQDDRF 704

Query: 454 GTLVNV-VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           G + ++ V+  P  +   T   G    E      C      +  +++GGN
Sbjct: 705 GAVKSINVVKYPASSDNTT---GDTITE------CEDGSTKIEPKEYGGN 745


>gi|66358384|ref|XP_626370.1| splicing factor U2AF U2 SnRNP auxiliary factor large subunit, RRM
           domain [Cryptosporidium parvum Iowa II]
 gi|46227901|gb|EAK88821.1| splicing factor U2AF U2 SnRNP auxiliary factor large subunit, RRM
           domain [Cryptosporidium parvum Iowa II]
          Length = 438

 Score = 85.1 bits (209), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 81/316 (25%), Positives = 133/316 (42%), Gaps = 47/316 (14%)

Query: 238 AMALDGIIFEGVAVRV--RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPD 295
            + LDG+I +   +++  RRP  Y+              NLN   V L +  I      D
Sbjct: 127 CLKLDGLIIDSQNIKLFCRRPNKYS--------------NLNNEKV-LDTFIIPRISQHD 171

Query: 296 ------RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
                 +  +  LP    E +I++ LES G L    ++ D  TG  KG G   +++ ++ 
Sbjct: 172 NFKENEKCILKNLPTDINEEKIRQHLESIGKLKSLTIIYDPITGIPKGVGSFEFEESSLC 231

Query: 350 DIACAALNG------------LKMGDKTLTVRRATAS--GQSKTEQESILAQAQQHIAIQ 395
             A A L+G            + +G  T+T  ++      QS     S + Q  +++ I 
Sbjct: 232 KKAIAILHGKPIESTKNGIWNIYLGSGTITNYKSNKGQFNQSNFSVNSNIIQNSEYLHIT 291

Query: 396 K----MALQTSGMNTLGGGMSL---FGETLAKVLCLTEAITADALADDEEYEEILEDMRE 448
           +    M         LG  M      GET ++++ L      + L D+E Y   L+ +R 
Sbjct: 292 EIPTSMTYNIFSNPVLGLMMKYSKQVGETPSQIIQLLNIFLPEELVDNEIYNSTLDSVRS 351

Query: 449 ECGKYGTLVNVVIPRPD--QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG-NTVN 505
           E   YGT++ +  PRP   +      G GKVF+ + D      A+   +GR F    TV+
Sbjct: 352 EAEVYGTILEIFCPRPKVIEEFHSCSGAGKVFIYFSDITAARRAQYQFNGRVFDNIKTVS 411

Query: 506 AFYYPEDKYFNKDYSA 521
           A ++P +KY   +YS 
Sbjct: 412 ATFFPLEKYLKHEYSV 427


>gi|224003373|ref|XP_002291358.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973134|gb|EED91465.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
          Length = 99

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 47/101 (46%), Positives = 59/101 (58%), Gaps = 6/101 (5%)

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
           +V+ L   +T   L DD EY EILED R+EC  +GTL N++IPR   NG   PG  K+FL
Sbjct: 4   RVVELKHMLTQQDLEDDNEYNEILEDTRDECSSFGTLKNIIIPR---NG---PGATKIFL 57

Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           EY      A A   L+GR F G  V A  + E K+ N+DYS
Sbjct: 58  EYMTNEDAAKAIAGLAGRTFDGRQVTAVCFDEIKFANEDYS 98


>gi|294878000|ref|XP_002768233.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239870430|gb|EER00951.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 638

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 131/562 (23%), Positives = 216/562 (38%), Gaps = 141/562 (25%)

Query: 48  RRDKNYKYDREGIRDHDRTDRHRDYNRDKERRHRHRSRSHSSD----------------- 90
           R D N +  R+  RD    + HR  N D E     + +  + D                 
Sbjct: 48  REDSNARSQRDNPRD----EPHRSRNDDPEDSPTRKRKEDTGDDGDRKSRERSRRRSPIR 103

Query: 91  -RFRNRSKSLSPSRSPS-KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNML 148
              R R +    SRSPS K K+   FD  P   A         QL G  +++  + Q ++
Sbjct: 104 RDRRPRDRRKRWSRSPSEKQKKPFKFDSPPKELA--------AQLDGSGTSLLGLPQTVV 155

Query: 149 PFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG 208
              +T   AF       +  +  + AR +Y+G +PP  +   +    +  +  +G N A 
Sbjct: 156 SSSSTIKEAFN----ATLAAERQKIARELYIGQIPPGISAAHLIDVLNDSLMNMGAN-AM 210

Query: 209 PGDAVVNVYINHEKKFAFVEMRTVEEASNAMA-LDGIIFE--GVAVRVRRPTDYNPTLAA 265
           PG  +V+ ++  +  FAFVE RT EEAS A+  L+G   +  GV+++V RP  Y      
Sbjct: 211 PGRPIVHGWLGGDGLFAFVEFRTAEEASIALERLNGHQLKSYGVSIKVGRPKGY------ 264

Query: 266 ALGPGQPSPNLNLAAVG-----LASGAIGG-------AEGPDRVFVGGLPYYFTETQIKE 313
            +GP  P  ++N    G      +S AI G       A    R+ + G P   +E  IK 
Sbjct: 265 -MGPA-PEDSVNAYTAGGNTASSSSSAIPGGISAAEVASDTSRLCLIGFPLKASEHSIKR 322

Query: 314 LLESF--GTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
            L S   G +   +++K   T N +     V++   + D                     
Sbjct: 323 ALRSAAKGEIRHLEILK--HTWNDEQIVLAVFECVNIED--------------------- 359

Query: 372 TASGQSKTEQESILAQAQQHIAIQKMALQTSGMN----TLGGGMSLFGETL-AKVLCLTE 426
               + K + E  +   +  I   K A+    MN     +  GM L  E + ++VL +T 
Sbjct: 360 --EHRLKKKGEVEIQGVKARIINPKDAIVKGYMNFDGDIMKKGMGL--EVVPSRVLVMTN 415

Query: 427 -AITADALADDEEYEEILEDMREECGKY---GTLVNVVIPRPDQNGG------------- 469
            A + + L DD  Y ++++D++ EC        + +++IPRP+ N               
Sbjct: 416 FAGSVEELLDDINYSDLMDDIKVECKSITGGADVRSIIIPRPETNTTIPTVNDVNTPNGD 475

Query: 470 -------------------------------ETPGVGKVFLEYYDAVGCATAKNALSGRK 498
                                          + PG+G  F+E+         K  L GR 
Sbjct: 476 AHHHDSATMEDSHQTTVQGNTSTAAVPAVDMQVPGLGCCFIEFRSVEEAGQVKRILDGRI 535

Query: 499 FGGNTVNAFYYPEDKYFNKDYS 520
           FGG+ V   Y+ E ++   D++
Sbjct: 536 FGGHEVFVTYFSETRFQRGDFA 557


>gi|449689952|ref|XP_004212193.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial [Hydra
           magnipapillata]
          Length = 210

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 60/170 (35%), Positives = 83/170 (48%), Gaps = 29/170 (17%)

Query: 102 SRSPSKSKRRSGFDMAPPA---------AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGA 152
           S S  KSKR + +D+ P            A+     V    P   SAVP ++   LP GA
Sbjct: 46  SHSVPKSKRNTLWDVPPKGYEDITPVQFKALRAAGKVEVANPVCGSAVPAVS---LPQGA 102

Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA 212
                           Q T  ARR+Y+G +P   +E  +  FF+  M      +  PG+ 
Sbjct: 103 ----------------QTTWQARRIYLGNIPFGISEDLMVDFFNAKMRE-SDIARQPGNP 145

Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT 262
           V+   IN EK FAF+E R+VEE + AMA DGI+ +G A+++RRP DY P 
Sbjct: 146 VLACQINLEKNFAFLEFRSVEETTLAMAFDGIMLQGQALKIRRPKDYQPI 195


>gi|215820610|ref|NP_001135964.1| RNA binding motif protein 39 [Nasonia vitripennis]
          Length = 516

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 146/376 (38%), Gaps = 69/376 (18%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR ++   L      + +  FFS         S G    V  +  N  ++F   A+VE
Sbjct: 154 RDARTIFCMQLSQRIRARDLEEFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 204

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +  E  + A+ L G    GV + V+       T A     G   PNL            
Sbjct: 205 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 248

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            G  GP R++VG L +  TE  +K + E FG +    L+ D +TG SKGYGF  +++   
Sbjct: 249 KGQTGPMRLYVGSLLFNITEEMLKGIFEPFGKIENIQLIMDPETGRSKGYGFLTFRNADD 308

Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
              A   LNG ++  + + V     R     G S  + + +        A  ++ L    
Sbjct: 309 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQL---- 364

Query: 404 MNTLGGGMSL-FGETLAKVLCLTEAITADALAD-----------------DEEYE----- 440
           M  L  G  L      A  L +T  +TA  +                   D + E     
Sbjct: 365 MFKLAEGTGLEIPPAAANALNMTPVVTAPQINQQTAPPIATQCFMLSNMFDPQNENNSLW 424

Query: 441 --EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
             EI +D+ EEC K+G +++V +   DQ   +    G V+++         A N+L GR 
Sbjct: 425 VKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRW 477

Query: 499 FGGNTVNAFYYPEDKY 514
           F G  + A Y P   Y
Sbjct: 478 FAGRVITAAYVPVVNY 493


>gi|145523992|ref|XP_001447829.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415351|emb|CAK80432.1| unnamed protein product [Paramecium tetraurelia]
          Length = 419

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 66/110 (60%), Gaps = 1/110 (0%)

Query: 411 MSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
           M+ + +    +L +   +T + +  DEE+ +I++D++EEC K+GT+ N++IPRP+  G  
Sbjct: 275 MARYVQIPTNILVIKNVLTLEDVTIDEEFNDIMDDIKEECSKFGTVKNIIIPRPE-FGKI 333

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
             GVGK+F+EY       TA+  L+GR +G  TV   Y   +K+  + ++
Sbjct: 334 IIGVGKIFVEYEKTQEARTARRYLAGRMYGDKTVECEYLSREKWAKRQFT 383



 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 56/204 (27%), Positives = 96/204 (47%), Gaps = 22/204 (10%)

Query: 81  RHRSRSHSSDRFRNR---SKSLSPSRSPSKSKRRSGFDMAPPAAAM-----------LPG 126
           + RSR+ S ++ + R    K  +P++  ++  R   FD +PP   +           L  
Sbjct: 41  KKRSRNVSKEKEKKRDEFQKPKAPTKQNAEQSRGFRFD-SPPKDPLQNTPFSNFKSKLID 99

Query: 127 AAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQ---QATRHA-RRVYVGGL 182
               G+   +  A P   QN L          PL+ +Q + Q   QA   A R++YVG L
Sbjct: 100 QVSLGEFETILPANP--LQNPLASLEALQAMTPLIQMQRLQQLRAQADVKADRKLYVGNL 157

Query: 183 PPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALD 242
           PP +  + +  F +Q +  +G +S   G ++ N +I+    F F+E R+ EEA+    L 
Sbjct: 158 PPNSQPKELQDFLNQTLLKMGVSSEHAG-SICNCWIDSNGHFGFIEFRSPEEATQGFILK 216

Query: 243 GIIFEGVAVRVRRPTDYNPTLAAA 266
            +IF+G  +++ RP  +  +LAA 
Sbjct: 217 DVIFKGHQLKIGRPKSFLTSLAAV 240


>gi|242015973|ref|XP_002428613.1| RNA-binding region-containing protein, putative [Pediculus humanus
           corporis]
 gi|212513276|gb|EEB15875.1| RNA-binding region-containing protein, putative [Pediculus humanus
           corporis]
          Length = 593

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 118/521 (22%), Positives = 195/521 (37%), Gaps = 80/521 (15%)

Query: 27  RTGERGRDRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDRHRDYNRDKER-RHRHRSR 85
           +  +R RD+H +  ++      RDK+    RE  RD DR  R RD +  +ER R+RH   
Sbjct: 26  KRSKRDRDKHDKRSRNSRSRHSRDKDRHSSRE--RDRDRHSRERDRHSSRERDRNRH--- 80

Query: 86  SHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQ 145
           S   DR+ ++ +          S+ R  +            +  P  L       P +  
Sbjct: 81  SRDRDRY-SKDRDRRSRDRDRHSRERDRYSRERDRYRSRRRSISPNNL------APHLLN 133

Query: 146 NMLPFGA----TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTA 201
           N   +       +   F  +P+  +T +  R  R V+   L      + +  FFS     
Sbjct: 134 NEYAYKKYASYRKSPTFSKLPIDDLTPEE-RDQRTVFCMQLSQRIRGRDLEEFFS----- 187

Query: 202 IGGNSAGPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTD 258
               S G    V  +  N  ++F   A+VE +  E    AM L G    G+ + V+    
Sbjct: 188 ----SVGKVRDVKLITCNKTRRFKGIAYVEFKDPESVPLAMGLTGQKLLGIPISVQ---- 239

Query: 259 YNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESF 318
             PT A     G  +  + + +            GP R++VG L +  TE  ++ + E F
Sbjct: 240 --PTQAEKNRQGNSTAPMMMPS---------DMRGPMRLYVGSLHFNITEDMLRGIFEPF 288

Query: 319 GTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATA 373
           G +    L+ D +TG SKGYGF  +        A   LNG ++  + + V     R    
Sbjct: 289 GKIDSIQLIMDPETGRSKGYGFITFHSADDAKKALEQLNGFELAGRPMKVGNVQERTDNI 348

Query: 374 SGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAK------------- 420
           +G S  + + +        A  ++ L        G GM +                    
Sbjct: 349 AGTSILDTDELDRSGIDLGATGRLQLMYKLAE--GTGMQIPPAAATALNLANALPQAVQP 406

Query: 421 -----VLCLTEAITAD-ALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
                  C   A   D A   +  ++ EI +D+ EEC K+G +++V + +          
Sbjct: 407 APPIATQCFMLANMFDPATETNPTWDVEIRDDVIEECNKHGGVLHVYVDKTSN------- 459

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
            G V+++         + N+L GR F G  + A Y P   Y
Sbjct: 460 -GNVYVKCPTIATAVASVNSLHGRWFAGRIITAAYVPLLNY 499


>gi|114794658|pdb|2HZC|A Chain A, Crystal Structure Of The N-terminal Rrm Of The U2af Large
           Subunit
          Length = 87

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/83 (48%), Positives = 59/83 (71%), Gaps = 1/83 (1%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           ARR+YVG +P    E+A+  FF+  M  +GG +  PG+ V+ V IN +K FAF+E R+V+
Sbjct: 6   ARRLYVGNIPFGITEEAMMDFFNAQMR-LGGLTQAPGNPVLAVQINQDKNFAFLEFRSVD 64

Query: 234 EASNAMALDGIIFEGVAVRVRRP 256
           E + AMA DGIIF+G ++++RRP
Sbjct: 65  ETTQAMAFDGIIFQGQSLKIRRP 87


>gi|59858555|ref|NP_001012304.1| RNA-binding protein 39 [Danio rerio]
 gi|27882534|gb|AAH44487.1| RNA binding motif protein 39a [Danio rerio]
 gi|182892014|gb|AAI65689.1| Rbm39a protein [Danio rerio]
          Length = 523

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 118/283 (41%), Gaps = 53/283 (18%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  LA+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 231 AAALANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIDSIQLMMDSETGRSKGYG 290

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 291 FITFSDAECAKKALEQLNGFELAGRPMKVGHVTERTDASTASSFLDNDELERTGIDLGTT 350

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSG-------------------MNT-LGGGMSLF 414
            +  ++A+  +   +Q     + ALQ SG                   +N      ++L 
Sbjct: 351 GRLQLMARLAEGTGLQIPPAAQQALQMSGSMVAMAAATAAMNPGLSFNINVPTNQALNLP 410

Query: 415 GETLA-KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
            + +A     L+     ++  D     EI +D+ EEC K+G ++++ +   D+   E   
Sbjct: 411 SQPIATHCFQLSNMFNPNSENDHGWEIEIQDDVIEECNKHGGVIHIYV---DKKSAE--- 464

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            G V+++         A +AL GR FGG  + A Y P   Y N
Sbjct: 465 -GNVYVKCPTIPAAMAAVSALHGRWFGGKMITAAYVPLPTYHN 506


>gi|126291198|ref|XP_001371677.1| PREDICTED: RNA-binding protein 39 isoform 2 [Monodelphis domestica]
          Length = 533

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/289 (25%), Positives = 120/289 (41%), Gaps = 59/289 (20%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 235 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 294

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 295 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 354

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLGG-GMSLFGETL----------AKVLC 423
            +  ++A+  +   +Q     + ALQ SG    G     +F              ++V  
Sbjct: 355 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAAKIFFPFFIDLQTRLSQQSEVTA 414

Query: 424 LTEAITADALA---------------DDEEYE-EILEDMREECGKYGTLVNVVIPRPDQN 467
           L  A +   LA               D+  ++ EI +D+ EEC K+G ++++ +   D+N
Sbjct: 415 LAAAASVQPLATQCFQLSNMFNPQTEDELGWDTEIKDDVIEECNKHGGVIHIYV---DKN 471

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             +    G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 472 SAQ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 516


>gi|321472566|gb|EFX83536.1| hypothetical protein DAPPUDRAFT_194972 [Daphnia pulex]
          Length = 366

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 84/374 (22%), Positives = 141/374 (37%), Gaps = 60/374 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS V         G    V  +  N  ++F    +VE
Sbjct: 6   RDARTVFCMQLSQRIRARDLEEFFSAV---------GKVRDVRLITCNKTRRFKGLCYVE 56

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E    A+AL G    GV + V+        LA +  P   S N             
Sbjct: 57  FAEPESVPLAIALTGQRLCGVPIVVQPTQAEKNRLAGSNMPAMSSFN------------- 103

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  +++   
Sbjct: 104 KGPNGPMRLYVGSLHFNITEDMLRSIFEPFGKIEHMQLMIDTETGRSKGYGFITFRNAED 163

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSK------------------TEQESILAQAQQ 390
              A   LNG ++  + + +   T                        T +  ++A+  Q
Sbjct: 164 AKKAMEQLNGFELAGRPMKINHVTEHFTGNHTYLDSDEMDRAGIDLGATGRLQLMAKLAQ 223

Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETL----------AKVLCLTEAITADALADDEEYE 440
              ++  A   S +N      +   + L           +   L+    + +      ++
Sbjct: 224 GTGLEIPAAAQSALNLQASIQAAQQQALPVASVAPPIATQCFMLSNMFDSSSETHPLWHQ 283

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
           EI +D+ +EC K+G ++++ + +           G V+++         A NAL GR F 
Sbjct: 284 EICDDVMDECNKHGGVLHIYVDKASPQ-------GNVYVKCPSVTVAVNAVNALHGRWFA 336

Query: 501 GNTVNAFYYPEDKY 514
           G  + A Y P   Y
Sbjct: 337 GRIITAAYVPLINY 350


>gi|297801306|ref|XP_002868537.1| hypothetical protein ARALYDRAFT_355725 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297314373|gb|EFH44796.1| hypothetical protein ARALYDRAFT_355725 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1370

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 70/289 (24%), Positives = 121/289 (41%), Gaps = 40/289 (13%)

Query: 96   SKSLSPSRSPSKSKRRSGFDMAPPA-AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQ 154
            +K++SP  + S  K+ + +D+AP   AAM  G+   G      +A P  ++  L      
Sbjct: 815  AKAVSPP-NLSSEKKSAKWDLAPAVTAAMFSGSVFSGLQAAAQTAYPTNSEASLTLLKPL 873

Query: 155  LGAFPLMPV--------QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
            + A    P          V   ++TR  RR+Y   +   A+E+++   F+  M + G N 
Sbjct: 874  MEAPFRTPSAREITSVDSVQLTESTRRMRRLYAENVSDSASEKSLIECFNSYMLSSGSNH 933

Query: 207  AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
                +  ++  IN EK  A VE  T  +AS A++LDG  F G+ +++RRP  Y  T    
Sbjct: 934  IKGSEPCISCIINKEKSQALVEFLTPHDASAALSLDGCSFAGLNLKIRRPKGYVETTGVY 993

Query: 267  LG-------PGQPS----------PNLNLAAVGLASGAIGGAE------------GPDRV 297
            +G        G  +                A+ + SG +   E              +++
Sbjct: 994  VGYVIIHIQEGDEAVCYVMVTIHEAGFQTVAIFMQSGELAKKEPATNAISDNVKDSSNKI 1053

Query: 298  FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDP 346
            F+GG P   +   + E++  FG L  +  V + D  N +     V ++P
Sbjct: 1054 FIGGFPKSISSEMLMEIVSVFGPLKAYRFVINNDL-NKRCAFLEVNENP 1101


>gi|209155056|gb|ACI33760.1| RNA-binding protein 39 [Salmo salar]
          Length = 535

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 115/288 (39%), Gaps = 58/288 (20%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G  GP R++VG L +  TE  ++ + E FG +    L+ D +T  SKGYG
Sbjct: 238 AAAMANNLQKGNAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETARSKGYG 297

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE----------------- 382
           F  + D      A   LNG ++  + + V   T    S T                    
Sbjct: 298 FISFADAECAKKALEQLNGFELAGRPMKVGNVTERTDSSTASSFLDNDELERTGIDLGTT 357

Query: 383 ---SILAQAQQHIAIQ-----KMALQTSG--------MNTLGGGMSL------FGETLAK 420
               ++A+  +   +Q     + ALQ SG           +  G ++       G ++ +
Sbjct: 358 GRLQLMARLAEGTGLQIPPAAQQALQMSGSMHSSSIHFGNMAAGTAIANPALNLGPSMNQ 417

Query: 421 VLCL-TEAITADALADDEEYE-----------EILEDMREECGKYGTLVNVVIPRPDQNG 468
            + L T+ +    L     +            EI +D+ EEC K+G +V++ +   D+N 
Sbjct: 418 AMNLPTQPLATHCLQLSNMFSPQSENEPGWDIEIQDDVMEECNKHGGIVHIYV---DKNS 474

Query: 469 GETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            +    G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 475 PQ----GNVYVKCPTIPTAMAAVNALHGRWFAGKMITAAYVPLPTYHN 518


>gi|417411155|gb|JAA52027.1| Putative transcriptional coactivator caper rrm superfamily, partial
           [Desmodus rotundus]
          Length = 491

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 161 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 203

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 204 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 263

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 264 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 323

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
             ++A+  +   +Q     + ALQ SG    G    L      +                
Sbjct: 324 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 383

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 384 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 436

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 437 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 474


>gi|67617167|ref|XP_667532.1| U2 snRNP auxiliary factor [Cryptosporidium hominis TU502]
 gi|54658687|gb|EAL37312.1| U2 snRNP auxiliary factor [Cryptosporidium hominis]
          Length = 438

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 91/401 (22%), Positives = 161/401 (40%), Gaps = 69/401 (17%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT--V 232
           R V  G  P + N + +  F + V+  I G ++   + +  +    E  F+ +   T  +
Sbjct: 42  RTVEFGSDPHVFNSETVEIFLTGVILTILGKASNDSEKLKLIEEVVESDFSSLSCSTGLI 101

Query: 233 EEASNAMALDGI--------IFEGVAVRV--------------RRPTDYNPTLAAALGPG 270
               N++ +D I        +F  + +++              RRP  Y+          
Sbjct: 102 ANLENSIKIDNIFCVTFTSSLFSLICLKLDGHIIDSQNIKLFCRRPNKYS---------- 151

Query: 271 QPSPNLNLAAVGLASGAI------GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
               NLN   V L +  I         +  ++  +  LP    E +I++ LE+ G L   
Sbjct: 152 ----NLNNEKV-LDTFIIPRISQYDNFKENEKCILKNLPTDINEEKIRQHLENIGKLKSL 206

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG------------LKMGDKTLTVRRAT 372
            ++ D  TG  KG G   +++ ++   A A L+G            + +G  T+T  ++ 
Sbjct: 207 TIIYDPITGIPKGVGSFEFEESSLCKKAIAILHGKPIESTKNGIWNIYLGSGTITNYKSN 266

Query: 373 AS--GQSKTEQESILAQAQQHIAIQK----MALQTSGMNTLGGGMSL---FGETLAKVLC 423
                QS     S + Q  +++ I +    M         LG  M      GET ++++ 
Sbjct: 267 KGQFNQSNFSANSSIIQNSEYLHITEIPTSMTYNIFSNPVLGLMMKYSKQVGETPSQIIQ 326

Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD--QNGGETPGVGKVFLEY 481
           L      + L D+E Y   L+ +R E   YGT++ +  PRP   +      G GKVF+ +
Sbjct: 327 LLNIFLPEELVDNEIYNSTLDSVRSEAEVYGTILEIFCPRPKVIEEFHSCSGAGKVFIYF 386

Query: 482 YDAVGCATAKNALSGRKFGG-NTVNAFYYPEDKYFNKDYSA 521
            D      A+   +GR F    TV+A ++P +KY   +YS 
Sbjct: 387 SDITAARRAQYQFNGRVFDNIKTVSATFFPLEKYLKHEYSV 427


>gi|395863336|ref|XP_003803852.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Otolemur
           garnettii]
 gi|395863340|ref|XP_003803854.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Otolemur
           garnettii]
          Length = 514

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 100/403 (24%), Positives = 149/403 (36%), Gaps = 79/403 (19%)

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           P+  +T +  R AR V+   L      + +  FFS V              + +      
Sbjct: 126 PIDNLTPE-ERDARTVFCMQLAARIRPRDLEAFFSTV------GKVRDVRMISDRNARRS 178

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + VR                Q   N    A 
Sbjct: 179 KGIAYVEFVDVSSVPLAIGLTGQRVFGVPILVR--------------ASQAEKN---RAA 221

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L    TE  ++ + E FG L    L+KD +TG SKGYGF 
Sbjct: 222 AMANNLQKGRAGPMRLYVGSLHLNITEAMLRGIFEPFGRLESIQLMKDSETGRSKGYGFI 281

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI------- 394
            + D      A   LNGL++  + + V   T    + T    + +   +   I       
Sbjct: 282 TFSDSECAKKALEQLNGLELAGRPMKVGHVTEGTDASTASSFLNSDELERTGIDLGTAGG 341

Query: 395 -QKMALQTSG--------------MNT--LGGGMSLFGETLAKVLCLTEAITADALA--- 434
            Q MA    G              MN+    G  + F   L  +  L++   A  LA   
Sbjct: 342 LQFMARLAEGTGLQIPPAAQQALQMNSPLAFGATAEFSFRLDLLTRLSQQTQASDLAAAA 401

Query: 435 ------------------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
                               EE     EI +D+ EEC K+G ++++ +   D+N  +   
Sbjct: 402 SVQPLATQCFQLSNMFNPQTEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ--- 455

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            G V+++         A +AL GR F G  + A Y P   Y N
Sbjct: 456 -GNVYVKCPSIAAAVAAVSALHGRWFAGKMITAAYVPLPTYHN 497


>gi|328865553|gb|EGG13939.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
           fasciculatum]
          Length = 949

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 126/318 (39%), Gaps = 43/318 (13%)

Query: 224 FAFVEMRTVEEASNAMA-LDGIIFEGVAVRVRRPTD--YNPTLAAA------------LG 268
           F F+E    E A NA+  ++     G  ++VR+P+    NP                 L 
Sbjct: 646 FCFIEYTYPEAAINAIQNMNQKTISGRQIKVRQPSIPVINPAATGVSVGMGGGGMSEILQ 705

Query: 269 PGQPSPNLNLAAVG----------LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESF 318
           P     N  L++            L +  +   +  +RV+VG +P+  TE QIK +  S 
Sbjct: 706 PNIIPSNTFLSSTSVASSFSSQALLNNTPVKERDNDNRVYVGSVPWNATEDQIKTIFSSI 765

Query: 319 GTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK 378
           G +    L  + +TG   GYGF  Y +P   + A +  NG  +  + L VR+   +    
Sbjct: 766 GNVVSCSLKPNLETGRHMGYGFIDYDNPKSAEDAISTFNGYDINGRQLKVRKPVRNAPKV 825

Query: 379 TEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEE 438
              +  L +    +  ++  L T  +        L     A   C+           DE 
Sbjct: 826 NNNDGNLLEDNISLNNEQKILLTQKL--------LAASEPATNRCMVMRNLGSPAELDEY 877

Query: 439 YEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
           +E   E+++ EC  +G +  VVI     N G +    K ++ + DA  CA   +  +GR 
Sbjct: 878 FE---EEIKNECSSFGAVEKVVI----TNEGTS---VKAYVLFRDAPSCAMCLSKQNGRY 927

Query: 499 FGGNTVNAFYYPEDKYFN 516
           F G  V A YY  + + N
Sbjct: 928 FSGYLVKAEYYNVNLFLN 945


>gi|148674239|gb|EDL06186.1| RNA binding motif protein 39, isoform CRA_d [Mus musculus]
          Length = 507

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 177 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 219

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 220 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 279

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 280 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 339

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 340 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 399

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 400 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 452

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 453 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 490


>gi|88682991|gb|AAI05542.1| RBM39 protein [Bos taurus]
          Length = 528

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 198 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 240

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 241 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 300

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 301 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 360

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 361 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 420

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 421 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 473

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 474 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 511


>gi|226469234|emb|CAX70096.1| RNA-binding protein 39 [Schistosoma japonicum]
          Length = 327

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 79/327 (24%), Positives = 129/327 (39%), Gaps = 60/327 (18%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE R VE A  A+ L G    GV +++++       ++A   P  P P+      
Sbjct: 9   KGIAYVEFREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRVSAT--PSLPRPSQQ---- 62

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
                     +GP ++++G L Y  TE  +K + E FG +    L+KD  T  S+GYGF 
Sbjct: 63  ---------NKGPMKLYIGSLHYNITEEMLKGIFEPFGKIEDIKLIKDPATNRSQGYGFV 113

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            Y +      A   LNG ++  + + V   T     ++E   + A          + L T
Sbjct: 114 TYVNSDDAKKALDQLNGFELAGRPMKVNHVT----ERSEYACLSALDNDEADRSGVDLGT 169

Query: 402 SGMNTL------GGGMSLFGETLAKV------------------------LCLTEAITAD 431
           +G   L      G G+ +    LA++                        +C    + ++
Sbjct: 170 TGRLALMAKLAEGTGLEIPKAALAQLHIGQNNPILGSAGSVSSSSAIAPPVCTQCFMLSN 229

Query: 432 A----LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
                +A    +EEI +D+ EEC K G  +++ + R       T   G V+++       
Sbjct: 230 MFDPHVATHSVFEEIRDDVIEECTKAGGCLHIFVDR-------TSAQGNVYVKCPSIAVA 282

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKY 514
               N L GR F G  + A Y P   Y
Sbjct: 283 TQCVNMLHGRYFSGRLITAAYVPLINY 309


>gi|74179655|dbj|BAE22477.1| unnamed protein product [Mus musculus]
          Length = 521

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 191 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 233

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 234 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 294 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 353

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 354 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 413

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 414 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 466

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 467 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 504


>gi|395863338|ref|XP_003803853.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Otolemur
           garnettii]
 gi|395863342|ref|XP_003803855.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Otolemur
           garnettii]
          Length = 487

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 100/403 (24%), Positives = 149/403 (36%), Gaps = 79/403 (19%)

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
           P+  +T +  R AR V+   L      + +  FFS V              + +      
Sbjct: 99  PIDNLTPEE-RDARTVFCMQLAARIRPRDLEAFFSTV------GKVRDVRMISDRNARRS 151

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + VR                Q   N    A 
Sbjct: 152 KGIAYVEFVDVSSVPLAIGLTGQRVFGVPILVR--------------ASQAEKN---RAA 194

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L    TE  ++ + E FG L    L+KD +TG SKGYGF 
Sbjct: 195 AMANNLQKGRAGPMRLYVGSLHLNITEAMLRGIFEPFGRLESIQLMKDSETGRSKGYGFI 254

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI------- 394
            + D      A   LNGL++  + + V   T    + T    + +   +   I       
Sbjct: 255 TFSDSECAKKALEQLNGLELAGRPMKVGHVTEGTDASTASSFLNSDELERTGIDLGTAGG 314

Query: 395 -QKMALQTSG--------------MNT--LGGGMSLFGETLAKVLCLTEAITADALA--- 434
            Q MA    G              MN+    G  + F   L  +  L++   A  LA   
Sbjct: 315 LQFMARLAEGTGLQIPPAAQQALQMNSPLAFGATAEFSFRLDLLTRLSQQTQASDLAAAA 374

Query: 435 ------------------DDEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETPG 473
                               EE     EI +D+ EEC K+G ++++ +   D+N  +   
Sbjct: 375 SVQPLATQCFQLSNMFNPQTEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ--- 428

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            G V+++         A +AL GR F G  + A Y P   Y N
Sbjct: 429 -GNVYVKCPSIAAAVAAVSALHGRWFAGKMITAAYVPLPTYHN 470


>gi|256082942|ref|XP_002577710.1| splicing factor [Schistosoma mansoni]
 gi|360043601|emb|CCD81147.1| putative splicing factor [Schistosoma mansoni]
          Length = 327

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 79/327 (24%), Positives = 128/327 (39%), Gaps = 60/327 (18%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE R VE A  A+ L G    GV +++++       ++A   P  P P+      
Sbjct: 9   KGIAYVEFREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRVSAT--PSLPRPSQQ---- 62

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
                      GP ++++G L Y  TE  +K + E FG +    L+KD  T  S+GYGF 
Sbjct: 63  ---------NRGPMKLYIGSLHYNITEEMLKGIFEPFGKIEDIKLIKDPTTNRSQGYGFV 113

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            Y +      A   LNG ++  + + V   T     ++E   + A          + L T
Sbjct: 114 TYVNSDDAKKALDQLNGFELAGRPMKVNHVT----ERSEYACLSALDNDEADRSGVDLGT 169

Query: 402 SGMNTL------GGGMSLFGETLAKV------------------------LCLTEAITAD 431
           +G   L      G G+ +    LA++                        +C    + ++
Sbjct: 170 TGRLALMAKLAEGTGLEIPKAALAQLHIGQNNPILGSAGSVSSSSAIAPPVCTQCFMLSN 229

Query: 432 A----LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
                +A    +EEI +D+ EEC K G  +++ + R       T   G V+++       
Sbjct: 230 MFDPHVATHSVFEEIRDDVIEECTKAGGCLHIFVDR-------TSAQGNVYVKCPSIAVA 282

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKY 514
               N L GR F G  + A Y P   Y
Sbjct: 283 TQCVNMLHGRYFSGRLITAAYVPLINY 309


>gi|330688445|ref|NP_001193433.1| RNA-binding protein 39 [Bos taurus]
          Length = 530

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 200 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 242

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 243 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 302

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 303 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 362

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 363 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 422

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 423 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 475

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 476 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513


>gi|157108428|ref|XP_001650224.1| splicing factor [Aedes aegypti]
 gi|108879330|gb|EAT43555.1| AAEL005046-PA [Aedes aegypti]
          Length = 544

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 89/372 (23%), Positives = 140/372 (37%), Gaps = 62/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 186 RDARTVFCMQLSQRIRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 236

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +  E  + A+ L G    G+ + V+        +A+   P QP P +            
Sbjct: 237 FKDPESVALALGLSGQRLLGIPISVQHTQAEKNRMAST--PPQPPPKV------------ 282

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D DTG SKGYGF  + +   
Sbjct: 283 --TSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDSDTGRSKGYGFITFHNADD 340

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
              A   LNG ++  + + V   T      T   S+         I   A   LQ     
Sbjct: 341 AKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEMDRSGIDLGATGRLQLMFKL 399

Query: 406 TLGGGMSL----------------------FGETLAKVLCLTEAITADALADDEEYEEIL 443
             G G+++                          +A    L   +       +  ++  +
Sbjct: 400 AEGAGLAVPRAAADALLATAPQPAPQQPVAPSPPIATQCFLLSNMFDPTTETNPTWDTEI 459

Query: 444 ED-MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           ED + EEC K+G +++V + +      E P  G V+++         A NAL GR F G 
Sbjct: 460 EDDVIEECNKHGGVLHVYVDK------ENPA-GNVYVKCPSIATAVLAVNALHGRWFAGR 512

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 513 IITAAYVPLVNY 524


>gi|47215490|emb|CAG01598.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 463

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   I  D L +DEEYE+I++DM+EEC KYG++V+++IPR      E PG G+V++E
Sbjct: 369 VLRLINLIDDDHLNNDEEYEDIMDDMKEECQKYGSVVSLLIPR------ENPGKGQVYVE 422

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y ++     A+  L GR F G  V A +YP   Y
Sbjct: 423 YANSSDSKEAQRLLMGRTFDGKFVVATFYPLSAY 456


>gi|61557287|ref|NP_001013225.1| RNA-binding protein 39 [Rattus norvegicus]
 gi|392346874|ref|XP_003749654.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Rattus
           norvegicus]
 gi|60552170|gb|AAH91394.1| RNA binding motif protein 39 [Rattus norvegicus]
 gi|74196119|dbj|BAE32977.1| unnamed protein product [Mus musculus]
 gi|149030834|gb|EDL85861.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_f [Rattus
           norvegicus]
          Length = 524

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 416

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 417 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 469

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 470 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 507


>gi|4757926|ref|NP_004893.1| RNA-binding protein 39 isoform b [Homo sapiens]
 gi|197097940|ref|NP_001125339.1| RNA-binding protein 39 [Pongo abelii]
 gi|149733223|ref|XP_001501876.1| PREDICTED: RNA-binding protein 39 isoform 2 [Equus caballus]
 gi|194044529|ref|XP_001925282.1| PREDICTED: RNA-binding protein 39 isoform 2 [Sus scrofa]
 gi|296199701|ref|XP_002747278.1| PREDICTED: RNA-binding protein 39 isoform 1 [Callithrix jacchus]
 gi|301762104|ref|XP_002916459.1| PREDICTED: RNA-binding protein 39-like [Ailuropoda melanoleuca]
 gi|332858226|ref|XP_514808.3| PREDICTED: uncharacterized protein LOC458443 isoform 5 [Pan
           troglodytes]
 gi|344279921|ref|XP_003411734.1| PREDICTED: RNA-binding protein 39 isoform 2 [Loxodonta africana]
 gi|345789988|ref|XP_865124.2| PREDICTED: RNA-binding protein 39 isoform 12 [Canis lupus
           familiaris]
 gi|354477984|ref|XP_003501197.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Cricetulus
           griseus]
 gi|426391509|ref|XP_004062115.1| PREDICTED: RNA-binding protein 39 isoform 2 [Gorilla gorilla
           gorilla]
 gi|75070825|sp|Q5RC80.1|RBM39_PONAB RecName: Full=RNA-binding protein 39; AltName: Full=RNA-binding
           motif protein 39
 gi|405192|gb|AAA16346.1| splicing factor [Homo sapiens]
 gi|55727753|emb|CAH90627.1| hypothetical protein [Pongo abelii]
 gi|119596568|gb|EAW76162.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_c [Homo
           sapiens]
 gi|119596569|gb|EAW76163.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_c [Homo
           sapiens]
 gi|296480931|tpg|DAA23046.1| TPA: RNA binding motif protein 39 [Bos taurus]
 gi|307686241|dbj|BAJ21051.1| RNA binding motif protein 39 [synthetic construct]
 gi|344246681|gb|EGW02785.1| RNA-binding protein 39 [Cricetulus griseus]
 gi|380783275|gb|AFE63513.1| RNA-binding protein 39 isoform b [Macaca mulatta]
 gi|383408127|gb|AFH27277.1| RNA-binding protein 39 isoform b [Macaca mulatta]
 gi|384939256|gb|AFI33233.1| RNA-binding protein 39 isoform b [Macaca mulatta]
 gi|410218748|gb|JAA06593.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|410255438|gb|JAA15686.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|410292902|gb|JAA25051.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|410350855|gb|JAA42031.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|410350863|gb|JAA42035.1| RNA binding motif protein 39 [Pan troglodytes]
          Length = 524

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 416

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 417 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 469

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 470 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 507


>gi|449458894|ref|XP_004147181.1| PREDICTED: uncharacterized protein LOC101213128 [Cucumis sativus]
          Length = 910

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 75/302 (24%), Positives = 119/302 (39%), Gaps = 71/302 (23%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
           V   QATR  RR+Y+  LP  A+E+AI    +  + + G N        ++  I+ ++  
Sbjct: 453 VQLTQATRPMRRLYIENLPHSASEKAIIDCLNGFLMSSGVNHIEGTQPCISCIIHKDRGQ 512

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           A VE  T E+AS A+  DG  F G  +++RRP DY  TL                     
Sbjct: 513 ALVEFLTPEDASAALLFDGSDFSGSTLKIRRPKDYIETL--------------------- 551

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
                                      ++++ +FG L  +    + D        F  Y 
Sbjct: 552 ---------------------------RDVVTAFGRLKAYHFEINDDLNGP--CAFLEYV 582

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM 404
           D +V   ACA LNG+K+G + L V  A     ++           +H+   K  LQ   +
Sbjct: 583 DESVVSKACAGLNGMKIGGQVLKVFPAVPFPLTERTGCQPCYGIPEHV---KPLLQRPSV 639

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADAL--ADDEEYEEILEDMREECGKYGTLVNVVIP 462
                           VL +     AD L    + + +E+LED+R EC ++GT+ +V   
Sbjct: 640 ----------------VLKINNVFNADVLPVLSESDIDEVLEDIRFECARFGTVKSVNFV 683

Query: 463 RP 464
           +P
Sbjct: 684 KP 685


>gi|410953912|ref|XP_003983612.1| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein 39 [Felis
           catus]
          Length = 523

 Score = 82.8 bits (203), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 74/280 (26%), Positives = 116/280 (41%), Gaps = 50/280 (17%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 234 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 293

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 294 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 353

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLC 423
            +  ++A+  +   +Q     + ALQ SG    G    L                A V  
Sbjct: 354 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQP 413

Query: 424 L-TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGK 476
           L T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G 
Sbjct: 414 LATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GN 466

Query: 477 VFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 467 VYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 506


>gi|449498643|ref|XP_004160593.1| PREDICTED: uncharacterized LOC101213128 [Cucumis sativus]
          Length = 918

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 75/302 (24%), Positives = 119/302 (39%), Gaps = 71/302 (23%)

Query: 165 VMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF 224
           V   QATR  RR+Y+  LP  A+E+AI    +  + + G N        ++  I+ ++  
Sbjct: 461 VQLTQATRPMRRLYIENLPHSASEKAIIDCLNGFLMSSGVNHIEGTQPCISCIIHKDRGQ 520

Query: 225 AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
           A VE  T E+AS A+  DG  F G  +++RRP DY  TL                     
Sbjct: 521 ALVEFLTPEDASAALLFDGSDFSGSTLKIRRPKDYIETL--------------------- 559

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
                                      ++++ +FG L  +    + D        F  Y 
Sbjct: 560 ---------------------------RDVVTAFGRLKAYHFEINDDLNGP--CAFLEYV 590

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGM 404
           D +V   ACA LNG+K+G + L V  A     ++           +H+   K  LQ   +
Sbjct: 591 DESVVSKACAGLNGMKIGGQVLKVFPAVPFPLTERTGCQPCYGIPEHV---KPLLQRPSV 647

Query: 405 NTLGGGMSLFGETLAKVLCLTEAITADAL--ADDEEYEEILEDMREECGKYGTLVNVVIP 462
                           VL +     AD L    + + +E+LED+R EC ++GT+ +V   
Sbjct: 648 ----------------VLKINNVFNADVLPVLSESDIDEVLEDIRFECARFGTVKSVNFV 691

Query: 463 RP 464
           +P
Sbjct: 692 KP 693


>gi|432858814|ref|XP_004068952.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Oryzias latipes]
          Length = 502

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 69/270 (25%), Positives = 113/270 (41%), Gaps = 52/270 (19%)

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  + D    
Sbjct: 220 GLTGPMRLYVGSLHFNITEDMLRGIFEPFGRIENIQLMMDSETGRSKGYGFITFSDAECA 279

Query: 350 DIACAALNGLKMGDKTLTVRRAT-----ASGQSKTEQE---------------SILAQAQ 389
             A   LNG ++  + + V   T     +S  S  + +                ++A+  
Sbjct: 280 KKALEQLNGFELAGRPMKVGHVTERTDPSSAPSILDNDELERSGIDLGTTGRLQLMARLA 339

Query: 390 QHIAIQ-----KMALQTS-------------------GMNTLGGGMSLFGETLAKVLCLT 425
           +   +Q     + ALQ S                    +N   G ++L  + LA      
Sbjct: 340 EGTGLQIPPAAQQALQMSGAIAIGAMAAVSAAMNPSLNVNMNSGALNLPSQPLATHCFQL 399

Query: 426 EAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
             +   +  +   +E +I  D+ EEC K+G +V++ +   D+N  E    G V+++    
Sbjct: 400 SNMFNPSSENTFGWEVDIQRDVIEECNKHGGVVHIYV---DKNSAE----GNVYVKCPSI 452

Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
               +A NAL GR FGG  + A Y P   Y
Sbjct: 453 PAAMSAVNALHGRFFGGKMITAAYVPLPTY 482


>gi|426241410|ref|XP_004014584.1| PREDICTED: RNA-binding protein 39 isoform 5 [Ovis aries]
          Length = 530

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 200 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 242

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 243 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 302

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 303 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 362

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
             ++A+  +   +Q     + ALQ SG    G    L      +                
Sbjct: 363 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 422

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 423 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 475

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 476 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513


>gi|426241404|ref|XP_004014581.1| PREDICTED: RNA-binding protein 39 isoform 2 [Ovis aries]
          Length = 524

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 416

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 417 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 469

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 470 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 507


>gi|431894348|gb|ELK04148.1| RNA-binding protein 39 [Pteropus alecto]
          Length = 601

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 271 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 313

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 314 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 373

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 374 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 433

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
             ++A+  +   +Q     + ALQ SG    G    L      +                
Sbjct: 434 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 493

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 494 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 546

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 547 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 584


>gi|332249061|ref|XP_003273679.1| PREDICTED: RNA-binding protein 39 [Nomascus leucogenys]
          Length = 432

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 118/287 (41%), Gaps = 50/287 (17%)

Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
           SP     A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +T
Sbjct: 136 SPAEKNRAAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSET 195

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK---------- 378
           G SKGYGF  + D      A   LNG ++  + + V     R  AS  S           
Sbjct: 196 GRSKGYGFITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERT 255

Query: 379 ------TEQESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGE 416
                 T +  ++A+  +   +Q     + ALQ SG    G    L              
Sbjct: 256 GIDLGTTGRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALA 315

Query: 417 TLAKVLCL-TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGG 469
             A V  L T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  
Sbjct: 316 AAASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSA 372

Query: 470 ETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           +    G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 373 Q----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 415


>gi|417411216|gb|JAA52053.1| Putative transcriptional coactivator caper rrm superfamily, partial
           [Desmodus rotundus]
          Length = 499

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 163 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 205

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 206 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 265

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 266 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 325

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 326 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 385

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 386 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 440

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 441 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 482


>gi|296199709|ref|XP_002747282.1| PREDICTED: RNA-binding protein 39 isoform 5 [Callithrix jacchus]
          Length = 504

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 168 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 210

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 211 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 270

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 271 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 330

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 331 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 390

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 391 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 445

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 446 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 487


>gi|194386036|dbj|BAG59582.1| unnamed protein product [Homo sapiens]
          Length = 503

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 167 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 209

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 210 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 269

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 270 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 329

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 330 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 389

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 390 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 444

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 445 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 486


>gi|194384132|dbj|BAG64839.1| unnamed protein product [Homo sapiens]
          Length = 521

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 185 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 227

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 228 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 287

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 288 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 347

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 348 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 407

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 408 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 462

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 463 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 504


>gi|336176066|ref|NP_001229529.1| RNA-binding protein 39 isoform d [Homo sapiens]
 gi|73991836|ref|XP_865202.1| PREDICTED: RNA-binding protein 39 isoform 16 [Canis lupus
           familiaris]
 gi|296199705|ref|XP_002747280.1| PREDICTED: RNA-binding protein 39 isoform 3 [Callithrix jacchus]
 gi|332858230|ref|XP_003316933.1| PREDICTED: uncharacterized protein LOC458443 isoform 3 [Pan
           troglodytes]
 gi|335304749|ref|XP_003360015.1| PREDICTED: RNA-binding protein 39 [Sus scrofa]
 gi|338719245|ref|XP_003363967.1| PREDICTED: RNA-binding protein 39 [Equus caballus]
 gi|426391513|ref|XP_004062117.1| PREDICTED: RNA-binding protein 39 isoform 4 [Gorilla gorilla
           gorilla]
          Length = 502

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 172 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 214

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 215 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 274

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 275 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 334

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
             ++A+  +   +Q     + ALQ SG    G    L      +                
Sbjct: 335 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 394

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 395 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 447

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 448 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 485


>gi|194386804|dbj|BAG61212.1| unnamed protein product [Homo sapiens]
          Length = 502

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 172 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 214

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 215 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 274

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 275 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 334

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
             ++A+  +   +Q     + ALQ SG    G    L      +                
Sbjct: 335 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 394

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 395 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 447

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 448 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 485


>gi|391336770|ref|XP_003742751.1| PREDICTED: RNA-binding protein 39-like [Metaseiulus occidentalis]
          Length = 520

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 134/330 (40%), Gaps = 58/330 (17%)

Query: 213 VVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQP 272
           +V+      K  A+VE   +E    AM L+G    GV + V+              P Q 
Sbjct: 207 IVDNKTRKSKGIAYVEFFDLESVPLAMGLNGQKLFGVPIIVQ--------------PTQA 252

Query: 273 SPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
             N        A+      +GP R++VG L +  +E  +KE+ E FG L   +L+K+ DT
Sbjct: 253 ERNRQ------ANQTAASTKGPMRLYVGSLHFDISEQMLKEIFEPFGRLDRVELIKE-DT 305

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ-----SKTEQE----- 382
           G SKGYGF  + +      A   LNG ++  + + V   T  G      S  + E     
Sbjct: 306 GKSKGYGFVTFHEADAAKKAMEQLNGFELAGRPMKVGNVTERGMDGSAPSILDNEELDRT 365

Query: 383 ----------SILAQAQQHIAIQ-----KMAL-QTSGMNTLGGGMSLFGETLA-KVLCLT 425
                     +++A+  +   IQ     K AL Q     + G   +   E++A +   L+
Sbjct: 366 GIELGAHGRLALMAKLAEGTGIQLPDAAKTALQQMQSAPSFGQTNNAQQESIATQCFLLS 425

Query: 426 EAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
               A     +++++ E+ ED+ +EC K+G  V+  + +   N         V+++    
Sbjct: 426 NMFDAAEAHQEKDWDLELREDVLQECRKHGGAVHCFVDKEAAN---------VYVKCPSI 476

Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                A   L GR F G  + A Y P   Y
Sbjct: 477 ATAVAAVGVLHGRFFAGRVITAAYVPVMTY 506


>gi|281346065|gb|EFB21649.1| hypothetical protein PANDA_004543 [Ailuropoda melanoleuca]
          Length = 497

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 161 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 203

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 204 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 263

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 264 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 323

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 324 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 383

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 384 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 438

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 439 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 480


>gi|52545994|emb|CAH18281.2| hypothetical protein [Homo sapiens]
          Length = 513

 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 177 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 219

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 220 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 279

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 280 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 339

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 340 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 399

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 400 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 454

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 455 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 496


>gi|344298613|ref|XP_003420986.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Loxodonta
           africana]
          Length = 434

 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 121/318 (38%), Gaps = 62/318 (19%)

Query: 61  RDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
           RD DR  +    +R +ER+HRHRSRS    R        S SRS  + +       +PP 
Sbjct: 66  RDRDRHRQRNSLSRSRERQHRHRSRSWDHQRS-------SESRSWDRRREDRVRYRSPPL 118

Query: 121 AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVG 180
           A                S V E   N+ P                      R AR V+  
Sbjct: 119 ATGRRYGHSKSPHFREKSPVREPVDNLSP--------------------EERDARTVFCM 158

Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI------NHEKKFAFVEMRTVEE 234
            L      + +  FFS    A+G         V +V I         K  A+VE   ++ 
Sbjct: 159 QLAARIRPRDLEDFFS----AVG--------KVRDVRIISDRNSRRSKGIAYVEFCEIQS 206

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
              A+ L G    GV + V+        LAA                 +A+    G+ GP
Sbjct: 207 VPLAIGLTGQWLLGVPIIVQASQAEKNRLAA-----------------MANNLQKGSGGP 249

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
            R++VG L +  TE  ++ + E FG +    L KD DTG+SKGYGF  + D      A  
Sbjct: 250 MRLYVGSLHFNITEDMLRGIFEPFGKIDDILLTKDSDTGHSKGYGFITFSDSECARRALE 309

Query: 355 ALNGLKMGDKTLTVRRAT 372
            LNG ++  + + V  AT
Sbjct: 310 QLNGFELAGRPMRVGHAT 327


>gi|410930137|ref|XP_003978455.1| PREDICTED: serine/threonine-protein kinase Kist-like [Takifugu
           rubripes]
          Length = 433

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 58/94 (61%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   I    L +DEEYE+I++DM+EEC KYG++V+++IPR      E PG G+VF+E
Sbjct: 336 VLRLINLIDDSHLNNDEEYEDIMDDMKEECQKYGSVVSLLIPR------ENPGKGQVFVE 389

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y ++     A+  L GR F G  V A +YP   Y
Sbjct: 390 YANSGDSKEAQRLLMGRTFDGKFVVATFYPSSAY 423


>gi|301088364|ref|XP_002996880.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110854|gb|EEY68906.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 96

 Score = 81.6 bits (200), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 37/91 (40%), Positives = 58/91 (63%), Gaps = 1/91 (1%)

Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
           +   ++ D L DDEEY ++ ED+ EEC ++G +  + IPRP ++G E PG+G +++ +  
Sbjct: 1   MANMVSIDELRDDEEYADLAEDVEEECKRFGGVTGMEIPRP-KDGEEVPGLGCIYVRFGK 59

Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                +A  AL+GRKFGGN V   Y+P DK+
Sbjct: 60  EEDAVSALKALNGRKFGGNIVKVTYFPVDKF 90


>gi|17063213|gb|AAL32373.1| transcription coactivator CAPER [Mus musculus]
          Length = 530

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 235 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 294

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 295 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 354

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
            +  ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE      
Sbjct: 355 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 414

Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                  A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +
Sbjct: 415 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 471

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 472 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513


>gi|334349754|ref|XP_001379564.2| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Monodelphis
           domestica]
          Length = 348

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 73/251 (29%), Positives = 104/251 (41%), Gaps = 52/251 (20%)

Query: 103 RSP---SKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFP 159
           RSP    K K R  +D+ PP    +     P Q   + +A    A  +LP       A  
Sbjct: 78  RSPRHEKKKKIRKYWDVPPPGFEHI----TPMQYKAMQAAGQIPATALLPTMTPDGLAVT 133

Query: 160 LMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYIN 219
             PV V+  Q TR ARR+YVG +P    E  +A   +++  A+ G  A            
Sbjct: 134 PTPVPVVGSQMTRQARRLYVGNIPFGITEGQVAISAARLPAALPGPPA------------ 181

Query: 220 HEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLA 279
                           +  + + G     +A +++ P  + P                  
Sbjct: 182 ------------CRPRAGNLGVGGAGPRPLAAQLKWPFAFPPA----------------- 212

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
             G+ S  +   +   ++F+GGLP Y  + Q+KELL SFG L  F+LVKD  TG SKGY 
Sbjct: 213 --GVVSTVV--PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 268

Query: 340 FCVYQDPAVTD 350
           FC Y D  VTD
Sbjct: 269 FCEYVDINVTD 279


>gi|344298617|ref|XP_003420988.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Loxodonta
           africana]
          Length = 416

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 85/318 (26%), Positives = 120/318 (37%), Gaps = 80/318 (25%)

Query: 61  RDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
           RD DR  +    +R +ER+HRHRSRS    R        S SRS  + +       +PP 
Sbjct: 66  RDRDRHRQRNSLSRSRERQHRHRSRSWDHQRS-------SESRSWDRRREDRVRYRSPPL 118

Query: 121 AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVG 180
           A   P          V +  PE                             R AR V+  
Sbjct: 119 ATGEP----------VDNLSPE----------------------------ERDARTVFCM 140

Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI------NHEKKFAFVEMRTVEE 234
            L      + +  FFS    A+G         V +V I         K  A+VE   ++ 
Sbjct: 141 QLAARIRPRDLEDFFS----AVG--------KVRDVRIISDRNSRRSKGIAYVEFCEIQS 188

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
              A+ L G    GV + V+        LAA                 +A+    G+ GP
Sbjct: 189 VPLAIGLTGQWLLGVPIIVQASQAEKNRLAA-----------------MANNLQKGSGGP 231

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
            R++VG L +  TE  ++ + E FG +    L KD DTG+SKGYGF  + D      A  
Sbjct: 232 MRLYVGSLHFNITEDMLRGIFEPFGKIDDILLTKDSDTGHSKGYGFITFSDSECARRALE 291

Query: 355 ALNGLKMGDKTLTVRRAT 372
            LNG ++  + + V  AT
Sbjct: 292 QLNGFELAGRPMRVGHAT 309


>gi|344298615|ref|XP_003420987.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Loxodonta
           africana]
          Length = 450

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 121/318 (38%), Gaps = 62/318 (19%)

Query: 61  RDHDRTDRHRDYNRDKERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPA 120
           RD DR  +    +R +ER+HRHRSRS    R        S SRS  + +       +PP 
Sbjct: 82  RDRDRHRQRNSLSRSRERQHRHRSRSWDHQRS-------SESRSWDRRREDRVRYRSPPL 134

Query: 121 AAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVG 180
           A                S V E   N+ P                      R AR V+  
Sbjct: 135 ATGRRYGHSKSPHFREKSPVREPVDNLSP--------------------EERDARTVFCM 174

Query: 181 GLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI------NHEKKFAFVEMRTVEE 234
            L      + +  FFS    A+G         V +V I         K  A+VE   ++ 
Sbjct: 175 QLAARIRPRDLEDFFS----AVG--------KVRDVRIISDRNSRRSKGIAYVEFCEIQS 222

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGP 294
              A+ L G    GV + V+        LAA                 +A+    G+ GP
Sbjct: 223 VPLAIGLTGQWLLGVPIIVQASQAEKNRLAA-----------------MANNLQKGSGGP 265

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
            R++VG L +  TE  ++ + E FG +    L KD DTG+SKGYGF  + D      A  
Sbjct: 266 MRLYVGSLHFNITEDMLRGIFEPFGKIDDILLTKDSDTGHSKGYGFITFSDSECARRALE 325

Query: 355 ALNGLKMGDKTLTVRRAT 372
            LNG ++  + + V  AT
Sbjct: 326 QLNGFELAGRPMRVGHAT 343


>gi|432855875|ref|XP_004068316.1| PREDICTED: serine/threonine-protein kinase Kist-like [Oryzias
           latipes]
          Length = 435

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   I    L ++EEYE+I+EDM+EEC KYG++V+++IP+      E PG G+VF+E
Sbjct: 338 VLRLLNLIDDSHLHNEEEYEDIMEDMKEECQKYGSVVSLLIPK------ENPGKGQVFVE 391

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y ++     A+  L+GR F G  V A +YP   Y
Sbjct: 392 YANSSDSKEAQRLLTGRTFDGKFVVATFYPLSAY 425


>gi|118403314|ref|NP_573505.2| RNA-binding protein 39 [Mus musculus]
 gi|392346872|ref|XP_003749653.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Rattus
           norvegicus]
 gi|341941811|sp|Q8VH51.2|RBM39_MOUSE RecName: Full=RNA-binding protein 39; AltName: Full=Coactivator of
           activating protein 1 and estrogen receptors;
           Short=Coactivator of AP-1 and ERs; AltName:
           Full=RNA-binding motif protein 39; AltName:
           Full=RNA-binding region-containing protein 2; AltName:
           Full=Transcription coactivator CAPER
 gi|55991480|gb|AAH86645.1| RNA binding motif protein 39 [Mus musculus]
 gi|74151058|dbj|BAE27657.1| unnamed protein product [Mus musculus]
 gi|148674237|gb|EDL06184.1| RNA binding motif protein 39, isoform CRA_b [Mus musculus]
          Length = 530

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 416

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 417 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 471

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 472 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513


>gi|35493811|ref|NP_909122.1| RNA-binding protein 39 isoform a [Homo sapiens]
 gi|281182530|ref|NP_001162566.1| RNA-binding protein 39 [Papio anubis]
 gi|284004921|ref|NP_001164806.1| RNA-binding protein 39 [Oryctolagus cuniculus]
 gi|149733225|ref|XP_001501869.1| PREDICTED: RNA-binding protein 39 isoform 1 [Equus caballus]
 gi|296199703|ref|XP_002747279.1| PREDICTED: RNA-binding protein 39 isoform 2 [Callithrix jacchus]
 gi|332858224|ref|XP_003316931.1| PREDICTED: uncharacterized protein LOC458443 isoform 1 [Pan
           troglodytes]
 gi|335304742|ref|XP_003360012.1| PREDICTED: RNA-binding protein 39 [Sus scrofa]
 gi|344279919|ref|XP_003411733.1| PREDICTED: RNA-binding protein 39 isoform 1 [Loxodonta africana]
 gi|345789986|ref|XP_864959.2| PREDICTED: RNA-binding protein 39 isoform 3 [Canis lupus
           familiaris]
 gi|354477982|ref|XP_003501196.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Cricetulus
           griseus]
 gi|397523808|ref|XP_003831910.1| PREDICTED: RNA-binding protein 39 [Pan paniscus]
 gi|426391507|ref|XP_004062114.1| PREDICTED: RNA-binding protein 39 isoform 1 [Gorilla gorilla
           gorilla]
 gi|28201880|sp|Q14498.2|RBM39_HUMAN RecName: Full=RNA-binding protein 39; AltName: Full=Hepatocellular
           carcinoma protein 1; AltName: Full=RNA-binding motif
           protein 39; AltName: Full=RNA-binding region-containing
           protein 2; AltName: Full=Splicing factor HCC1
 gi|405194|gb|AAA16347.1| splicing factor [Homo sapiens]
 gi|119596565|gb|EAW76159.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_a [Homo
           sapiens]
 gi|119596567|gb|EAW76161.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_a [Homo
           sapiens]
 gi|146327034|gb|AAI41836.1| RNA binding motif protein 39 [Homo sapiens]
 gi|164623752|gb|ABY64678.1| RNA binding motif protein 39, isoform 1 (predicted) [Papio anubis]
 gi|165971473|gb|AAI58173.1| RNA binding motif protein 39 [Homo sapiens]
 gi|166831598|gb|ABY90123.1| RNA binding motif protein 39 isoform a (predicted) [Callithrix
           jacchus]
 gi|169731519|gb|ACA64891.1| RNA binding motif protein 39 isoform a (predicted) [Callicebus
           moloch]
 gi|197215647|gb|ACH53039.1| RNA binding motif protein 39 isoform a (predicted) [Otolemur
           garnettii]
 gi|217038339|gb|ACJ76632.1| RNA binding motif protein 39 isoform a (predicted) [Oryctolagus
           cuniculus]
 gi|229368730|gb|ACQ63013.1| RNA binding motif protein 39 isoform a (predicted) [Dasypus
           novemcinctus]
 gi|351702535|gb|EHB05454.1| RNA-binding protein 39 [Heterocephalus glaber]
 gi|380783277|gb|AFE63514.1| RNA-binding protein 39 isoform a [Macaca mulatta]
 gi|383408125|gb|AFH27276.1| RNA-binding protein 39 isoform a [Macaca mulatta]
 gi|384939254|gb|AFI33232.1| RNA-binding protein 39 isoform a [Macaca mulatta]
 gi|410218746|gb|JAA06592.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|410255434|gb|JAA15684.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|410292900|gb|JAA25050.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|410292904|gb|JAA25052.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|410350859|gb|JAA42033.1| RNA binding motif protein 39 [Pan troglodytes]
 gi|440902514|gb|ELR53299.1| RNA-binding protein 39 [Bos grunniens mutus]
          Length = 530

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 416

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 417 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 471

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 472 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513


>gi|355784531|gb|EHH65382.1| RNA-binding motif protein 39 [Macaca fascicularis]
          Length = 530

 Score = 81.6 bits (200), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 235 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 294

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 295 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 354

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
            +  ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE      
Sbjct: 355 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 414

Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                  A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +
Sbjct: 415 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 471

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 472 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513


>gi|432858816|ref|XP_004068953.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Oryzias latipes]
          Length = 529

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 69/270 (25%), Positives = 113/270 (41%), Gaps = 52/270 (19%)

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  + D    
Sbjct: 247 GLTGPMRLYVGSLHFNITEDMLRGIFEPFGRIENIQLMMDSETGRSKGYGFITFSDAECA 306

Query: 350 DIACAALNGLKMGDKTLTVRRAT-----ASGQSKTEQE---------------SILAQAQ 389
             A   LNG ++  + + V   T     +S  S  + +                ++A+  
Sbjct: 307 KKALEQLNGFELAGRPMKVGHVTERTDPSSAPSILDNDELERSGIDLGTTGRLQLMARLA 366

Query: 390 QHIAIQ-----KMALQTS-------------------GMNTLGGGMSLFGETLAKVLCLT 425
           +   +Q     + ALQ S                    +N   G ++L  + LA      
Sbjct: 367 EGTGLQIPPAAQQALQMSGAIAIGAMAAVSAAMNPSLNVNMNSGALNLPSQPLATHCFQL 426

Query: 426 EAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDA 484
             +   +  +   +E +I  D+ EEC K+G +V++ +   D+N  E    G V+++    
Sbjct: 427 SNMFNPSSENTFGWEVDIQRDVIEECNKHGGVVHIYV---DKNSAE----GNVYVKCPSI 479

Query: 485 VGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
               +A NAL GR FGG  + A Y P   Y
Sbjct: 480 PAAMSAVNALHGRFFGGKMITAAYVPLPTY 509


>gi|427792527|gb|JAA61715.1| Putative transcriptional coactivator caper rrm superfamily, partial
           [Rhipicephalus pulchellus]
          Length = 497

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 130/328 (39%), Gaps = 58/328 (17%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE + VE    AM L+G    G+ + V+ PT      AAA      +  L    V
Sbjct: 179 KGIAYVEFQDVESVPLAMGLNGQKLFGIPIVVQ-PTQAERNRAAAQNASTSNSTLQRGNV 237

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           G           P R++VG L +  TE  +K + E FG +   +L+KD +T  SKGYGF 
Sbjct: 238 G-----------PMRLYVGSLHFNITEEMLKGIFEPFGKIDKIELIKDMETNRSKGYGFI 286

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQ 390
            + D      A   LNG ++  + + V     R   +   S  + E +      L    +
Sbjct: 287 TFHDSEDAKKALEQLNGFELAGRPMKVGHVTERTDVSQAPSFLDSEELDRSGIDLGATGR 346

Query: 391 HIAIQKMA------LQTSGMNTLGGGMSLF-------------GETLAKVLCLTEAITAD 431
              + K+A      +  + +N L    ++                T+A   C   +   D
Sbjct: 347 LQLMAKLAEGTGFQIPQAAVNALQMNPAVLPGQPQAAAVAAAAAPTIA-TQCFLLSNMFD 405

Query: 432 ALAD-----DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
            L +     DEE   I  D+ EEC K+G  ++V + R           G V+++      
Sbjct: 406 PLTETNPSWDEE---IRRDVIEECRKHGGALHVYVDRASPE-------GHVYVKCPTIAS 455

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKY 514
              + NAL GR F G  + A Y P   Y
Sbjct: 456 AVASVNALHGRWFAGRIITAAYVPVMSY 483


>gi|427794973|gb|JAA62938.1| Putative transcriptional coactivator caper rrm superfamily, partial
           [Rhipicephalus pulchellus]
          Length = 509

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 130/328 (39%), Gaps = 58/328 (17%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE + VE    AM L+G    G+ + V+ PT      AAA      +  L    V
Sbjct: 191 KGIAYVEFQDVESVPLAMGLNGQKLFGIPIVVQ-PTQAERNRAAAQNASTSNSTLQRGNV 249

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           G           P R++VG L +  TE  +K + E FG +   +L+KD +T  SKGYGF 
Sbjct: 250 G-----------PMRLYVGSLHFNITEEMLKGIFEPFGKIDKIELIKDMETNRSKGYGFI 298

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQ 390
            + D      A   LNG ++  + + V     R   +   S  + E +      L    +
Sbjct: 299 TFHDSEDAKKALEQLNGFELAGRPMKVGHVTERTDVSQAPSFLDSEELDRSGIDLGATGR 358

Query: 391 HIAIQKMA------LQTSGMNTLGGGMSLF-------------GETLAKVLCLTEAITAD 431
              + K+A      +  + +N L    ++                T+A   C   +   D
Sbjct: 359 LQLMAKLAEGTGFQIPQAAVNALQMNPAVLPGQPQAAAVAAAAAPTIA-TQCFLLSNMFD 417

Query: 432 ALAD-----DEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
            L +     DEE   I  D+ EEC K+G  ++V + R           G V+++      
Sbjct: 418 PLTETNPSWDEE---IRRDVIEECRKHGGALHVYVDRASPE-------GHVYVKCPTIAS 467

Query: 487 CATAKNALSGRKFGGNTVNAFYYPEDKY 514
              + NAL GR F G  + A Y P   Y
Sbjct: 468 AVASVNALHGRWFAGRIITAAYVPVMSY 495


>gi|326924922|ref|XP_003208671.1| PREDICTED: serine/threonine-protein kinase Kist-like [Meleagris
           gallopavo]
          Length = 550

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   ++  +L  +EEYE+ILED+REEC KYG +V+++IP+      E PG G+VF+E
Sbjct: 453 VLRLLNVLSDASLQSEEEYEDILEDIREECQKYGPVVSLLIPK------ENPGKGQVFVE 506

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+G+ F G  V A +YP   Y
Sbjct: 507 YANAGDSKAAQKMLTGKIFDGKFVVATFYPLSAY 540


>gi|193785136|dbj|BAG54289.1| unnamed protein product [Homo sapiens]
          Length = 506

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 170 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 212

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 213 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 272

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 273 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 332

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 333 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 392

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 393 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 447

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 448 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 489


>gi|194758325|ref|XP_001961412.1| GF14957 [Drosophila ananassae]
 gi|190615109|gb|EDV30633.1| GF14957 [Drosophila ananassae]
          Length = 594

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/370 (22%), Positives = 130/370 (35%), Gaps = 59/370 (15%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 234 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLILCNKTKRFKGIAYIE 284

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L  A    QP  +             
Sbjct: 285 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 332

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 333 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 388

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTL 407
              A   LNG ++  + + V   T      T         +  I +     LQ       
Sbjct: 389 AKKALEQLNGFELAGRLMKVGNVTERLDMNTSSLDTDEMDRTGIDLGATGRLQLMFKLAE 448

Query: 408 GGGMSL-----------------------FGETLAKVLCLTEAITADALADDEEYEEILE 444
           G G+++                             +   L+         +     EI +
Sbjct: 449 GAGLAVPQAAANALLATAPQPAPVQQQQQAPSIATQCFILSNMFDPRTETNPTWDAEIRD 508

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D+ EEC K+G ++++ +             G V+++         A NAL GR F G  +
Sbjct: 509 DVLEECAKHGGVLHIHV-------DTVSPTGTVYVKCPSTTTAVLAVNALHGRWFAGRVI 561

Query: 505 NAFYYPEDKY 514
            A Y P   Y
Sbjct: 562 TAAYVPLVNY 571


>gi|215820612|ref|NP_001135965.1| RNA binding motif protein 39 [Acyrthosiphon pisum]
          Length = 501

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 99/389 (25%), Positives = 150/389 (38%), Gaps = 65/389 (16%)

Query: 161 MPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH 220
           +P   M     R AR V+   L      + +  FFS V         G    V  +  N 
Sbjct: 126 VPPASMLTPEERDARTVFCMQLSKTIRARDLEEFFSSV---------GKVRDVRMITCNK 176

Query: 221 EKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLN 277
            ++F   A++E +  E    AM L+G    GV + V+      PT A         PN+ 
Sbjct: 177 TRRFKGIAYIEFKDPESVPLAMGLNGQKLLGVPIVVQ------PTQAEKNRMANSMPNM- 229

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
                          GP +++VG L Y  TE  ++ + E FG +    L+ D +TG SKG
Sbjct: 230 ---------VQRTHYGPMKLYVGSLHYNITEEMLRGIFEPFGHVDNIQLMMDTETGRSKG 280

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA--SGQSKTEQE-SILAQAQQHIA- 393
           YGF  Y++      A   LNG ++  + + V   T   S   KT  E   L +A   +  
Sbjct: 281 YGFLTYRNAEDAKKALEHLNGFEIAGRPMKVGHVTENHSVYDKTAFEVDELDRAGYDLGA 340

Query: 394 ---IQKM-----------------ALQT-SGMNTLGGGMSLFGETLAKVLCLTEAITADA 432
              +Q M                 ALQ  SG+       ++         C   A   D 
Sbjct: 341 TGRLQLMYKLAEGTGFPIPQAAANALQVASGVQAAPAAPTVQVTPPIATQCFLLANMFDP 400

Query: 433 LADDEE----YE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGC 487
             +D +    +E EI +D+ EEC K+G +++V + +    G        V+++       
Sbjct: 401 NKEDVDSNTTWETEIRDDVIEECNKHGGVLHVYVDKASPQGN-------VYVKCTTIETA 453

Query: 488 ATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             +  AL GR FGG  + A Y P   Y N
Sbjct: 454 LASVAALHGRWFGGRVITAAYVPVTNYHN 482


>gi|350410158|ref|XP_003488966.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Bombus impatiens]
          Length = 532

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 149/396 (37%), Gaps = 64/396 (16%)

Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
           LPFG    G  PL           R AR V+   L      + +  FFS         S 
Sbjct: 149 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 196

Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
           G    V  +  N  ++F   A+VE +  E  + A+ L G    GV + V+       T A
Sbjct: 197 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 250

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
                G   PNL             G  GP R++VG L +  TE  ++ + E FG +   
Sbjct: 251 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 300

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
            L+ D +TG SKGYGF  +++      A   LNG ++  + + V     R     G S  
Sbjct: 301 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 360

Query: 380 EQESI------LAQAQQHIAIQKMALQT---------------SGMNTLGGGMSLFGETL 418
           + + +      L    +   + K+A  T                 M+T            
Sbjct: 361 DTDELDRSGIDLGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIA 420

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
            +   L+         +    +EI +D+ EEC K+G +++V +   DQ   +    G V+
Sbjct: 421 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 473

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++         A N+L GR F G  + A Y P   Y
Sbjct: 474 VKCPSIATAVAAVNSLHGRWFAGRVITAAYVPVVNY 509


>gi|426241406|ref|XP_004014582.1| PREDICTED: RNA-binding protein 39 isoform 3 [Ovis aries]
          Length = 502

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/338 (24%), Positives = 130/338 (38%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 172 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 214

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 215 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 274

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 275 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 334

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSLFGETLAKVL------------CL 424
             ++A+  +   +Q     + ALQ SG    G    L      +                
Sbjct: 335 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 394

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 395 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 447

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 448 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 485


>gi|270007747|gb|EFA04195.1| hypothetical protein TcasGA2_TC014444 [Tribolium castaneum]
          Length = 522

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 94/400 (23%), Positives = 159/400 (39%), Gaps = 67/400 (16%)

Query: 149 PFGA---TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN 205
           PFG    + LG     PV+ ++ +  R AR V+V  L      + +  FFS         
Sbjct: 136 PFGRRNRSPLGLRSNSPVEELSPE-ERDARTVFVMQLSQRIRARDLEEFFS--------- 185

Query: 206 SAGPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT 262
           S G    V  +  N  ++F   A++E +  E  + A+ L G    GV + V+       T
Sbjct: 186 SVGKVRDVRLIVCNKTRRFKGIAYIEFKDPESVTLALGLSGQKLLGVPIIVQH------T 239

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
            A     G   PNL                GP R++VG L +  TE  ++ + E FG + 
Sbjct: 240 QAEKNRMGNSMPNL----------MPKNMTGPMRLYVGSLHFNITEDMLRSIFEPFGKID 289

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
              L+ D +TG SKGYGF  +++      A   LNG ++  + + V   T     + +  
Sbjct: 290 NIQLIMDPETGRSKGYGFIAFRNCEDAKKALEQLNGFELAGRPMKVGNVTERLDLQQQGP 349

Query: 383 SILAQ-----------AQQHIAIQKMALQTSGMN---------TLGGGMSLFGET----- 417
           SIL             A   + +     + +GM          ++  G  +  +      
Sbjct: 350 SILDSDELDRSGIDLGATGRLQLMFKLAEGAGMQVPQAAANALSIATGQPVVPQVQTNST 409

Query: 418 --LAKVLCLTEAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
             +A    +   +   A    + ++ EI +D+ EEC K+G +++V +   D+   +    
Sbjct: 410 PPIATQCFMLSNMFDPATESTQTWDVEIRDDVIEECNKHGGVLHVYV---DKGSPQ---- 462

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           G V+++         + N+L GR F G  + A Y P   Y
Sbjct: 463 GNVYVKCPSIATAVASVNSLHGRWFAGRVITAAYVPLLNY 502


>gi|426241402|ref|XP_004014580.1| PREDICTED: RNA-binding protein 39 isoform 1 [Ovis aries]
          Length = 530

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 297 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 356

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 357 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 416

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 417 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 471

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 472 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 513


>gi|428182175|gb|EKX51036.1| hypothetical protein GUITHDRAFT_134574 [Guillardia theta CCMP2712]
          Length = 458

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 97/414 (23%), Positives = 169/414 (40%), Gaps = 74/414 (17%)

Query: 150 FGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP 209
            G T +    L+ +  M +   R+  R+YVG L     E  +   F            GP
Sbjct: 75  LGLTVIAQLALVTIAGMVKPPQRN--RLYVGSLHFDLKEADVRAIFQPF---------GP 123

Query: 210 GDAVVNVY---INHEKKFAFVE-MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA 265
              +   Y       K +AF+E M   +  +   A+DG +  G  ++V RP  +N   A 
Sbjct: 124 IKTIEMSYEPTTGKSKGYAFIEYMNDAQADACEKAMDGFMIAGRPIKVGRP--HNTVSAN 181

Query: 266 A----------------LGPGQPSPNLNLAAVGLASGAIGG----AEGPDRVFVGGLPYY 305
           A                L P  PS     A     + A       A  P R+++G + + 
Sbjct: 182 APVHRRLFFLLNFSSVDLQPWPPSLPQQAALAAQKAQAQPLNTPVAGPPARIYIGSVLFD 241

Query: 306 FTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ--DPAVTDIACAALNGLKMG- 362
             E+++K++ + FG++    ++ + + G  KGYGF  Y+  D AV   A  A+NG ++  
Sbjct: 242 VKESEVKQIFQVFGSIKQISMIPNPENGKHKGYGFIEYEKHDDAVQ--AIQAMNGFQLAG 299

Query: 363 -----DKT----------LTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTL 407
                DKT              + T S  + +  +    + ++++++  +  +   M  L
Sbjct: 300 RPLKEDKTSNPIIVAAANAIADKVTTSLVTSSSNDITTVEDEENLSVSSVLQRKEIMCKL 359

Query: 408 GGGMSLFGETLAKVLCLTEAITADALADDEEYEEILE-DMREECGKYGTLVNVVIPRPDQ 466
               S       +V+ L        + + E+ + +LE ++ EEC K+G +  V+I    +
Sbjct: 360 ANRPS-------RVVLLKN------MVEPEDVDPLLEQEIAEECSKFGKVNKVLIVTMVE 406

Query: 467 NGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
            G     + KVF+E+ D      A   L  R FGG  VNA  Y E+++  +D S
Sbjct: 407 QGSR---LVKVFVEFGDQEAATKAVARLDKRWFGGKIVNASTYEEERFVRQDLS 457


>gi|384252120|gb|EIE25597.1| splicing factor, CC1-like protein [Coccomyxa subellipsoidea C-169]
          Length = 497

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 94/346 (27%), Positives = 142/346 (41%), Gaps = 70/346 (20%)

Query: 34  DRHHRDFKSGGDDRRRDKNYKYDREGIRDHDRTDR-----HRDYNRDKERRHRHRSRSHS 88
           D  H+  K    DR RD++   +RE  RD DR  +     HRD + D+ER+  H S  H 
Sbjct: 25  DGTHKREKREKKDRTRDRDS--ERERTRDQDRDRKSSKREHRDKSPDRERKRHHSSHDHH 82

Query: 89  SDRFRNRSKSLSPSRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNML 148
               R+R  S  P RS  K + R+                        P  V E  +   
Sbjct: 83  RSE-RDRKHSSRP-RSLEKRRERT------------------------PPEVREQREK-- 114

Query: 149 PFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAG 208
                          +   ++  R  R V+   LP  A E+ +  FFS+         AG
Sbjct: 115 ---------------ERELKELDRDIRTVFAYNLPLKAEERDLFEFFSK---------AG 150

Query: 209 PGDAVVNVYINHEKK---FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA 265
           P + V  +   + +K   FA++E     +   AMAL G I  G AV V+  ++    LA 
Sbjct: 151 PIEDVKIIMDRNTRKSKGFAYIEYTNKADIVTAMALTGQILMGQAVMVK-SSEAEKNLAW 209

Query: 266 ALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD 325
                Q +  L ++ +G A        GP ++++G L     E  +K++ E+FG +    
Sbjct: 210 EAAQAQNASMLQMSTIGNA------GTGPCKLYIGNLHPNIQEQDLKQVFEAFGAVEYIT 263

Query: 326 LVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRA 371
           L KD  TG S+GYGF  YQ       A   L+GL +    ++V+ A
Sbjct: 264 LQKD-PTGRSQGYGFVQYQTTPDATKAMQQLDGLDIAGSQISVKIA 308



 Score = 42.0 bits (97), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 36/69 (52%), Gaps = 8/69 (11%)

Query: 440 EEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKF 499
           +EI  D+ EEC KYG + +  +   D+N       G V+L++    G A A+ AL GR F
Sbjct: 425 QEIATDVTEECSKYGPVSHTHV---DKNSK-----GFVYLKFVTVEGSAAAQKALHGRWF 476

Query: 500 GGNTVNAFY 508
            G  V A +
Sbjct: 477 AGRQVVAEF 485


>gi|115495227|ref|NP_001070127.1| serine/threonine-protein kinase Kist [Danio rerio]
 gi|115313546|gb|AAI24287.1| Zgc:153241 [Danio rerio]
 gi|182890586|gb|AAI64787.1| Zgc:153241 protein [Danio rerio]
          Length = 410

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   I    L +++EYE+I+EDM+EEC KYGT+V+++IP+      E PG G+VF+E
Sbjct: 313 VLRLLNVIDDSHLYNEDEYEDIIEDMKEECQKYGTVVSLLIPK------ENPGKGQVFVE 366

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 367 YANAGDSKEAQRLLTGRTFDGKFVVATFYPLGAY 400


>gi|350410161|ref|XP_003488967.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Bombus impatiens]
          Length = 508

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 149/396 (37%), Gaps = 64/396 (16%)

Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
           LPFG    G  PL           R AR V+   L      + +  FFS         S 
Sbjct: 125 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 172

Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
           G    V  +  N  ++F   A+VE +  E  + A+ L G    GV + V+       T A
Sbjct: 173 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 226

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
                G   PNL             G  GP R++VG L +  TE  ++ + E FG +   
Sbjct: 227 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 276

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
            L+ D +TG SKGYGF  +++      A   LNG ++  + + V     R     G S  
Sbjct: 277 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 336

Query: 380 EQESI------LAQAQQHIAIQKMALQT---------------SGMNTLGGGMSLFGETL 418
           + + +      L    +   + K+A  T                 M+T            
Sbjct: 337 DTDELDRSGIDLGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIA 396

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
            +   L+         +    +EI +D+ EEC K+G +++V +   DQ   +    G V+
Sbjct: 397 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 449

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++         A N+L GR F G  + A Y P   Y
Sbjct: 450 VKCPSIATAVAAVNSLHGRWFAGRVITAAYVPVVNY 485


>gi|340718900|ref|XP_003397900.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Bombus
           terrestris]
          Length = 508

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 149/396 (37%), Gaps = 64/396 (16%)

Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
           LPFG    G  PL           R AR V+   L      + +  FFS         S 
Sbjct: 125 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 172

Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
           G    V  +  N  ++F   A+VE +  E  + A+ L G    GV + V+       T A
Sbjct: 173 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 226

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
                G   PNL             G  GP R++VG L +  TE  ++ + E FG +   
Sbjct: 227 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 276

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
            L+ D +TG SKGYGF  +++      A   LNG ++  + + V     R     G S  
Sbjct: 277 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 336

Query: 380 EQESI------LAQAQQHIAIQKMALQT---------------SGMNTLGGGMSLFGETL 418
           + + +      L    +   + K+A  T                 M+T            
Sbjct: 337 DTDELDRSGIDLGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIA 396

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
            +   L+         +    +EI +D+ EEC K+G +++V +   DQ   +    G V+
Sbjct: 397 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 449

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++         A N+L GR F G  + A Y P   Y
Sbjct: 450 VKCPSIATAVAAVNSLHGRWFAGRVITAAYVPVVNY 485


>gi|336176064|ref|NP_001229528.1| RNA-binding protein 39 isoform c [Homo sapiens]
 gi|296199707|ref|XP_002747281.1| PREDICTED: RNA-binding protein 39 isoform 4 [Callithrix jacchus]
 gi|332858228|ref|XP_003316932.1| PREDICTED: uncharacterized protein LOC458443 isoform 2 [Pan
           troglodytes]
 gi|335304745|ref|XP_003360013.1| PREDICTED: RNA-binding protein 39 [Sus scrofa]
 gi|338719242|ref|XP_003363966.1| PREDICTED: RNA-binding protein 39 [Equus caballus]
 gi|345789990|ref|XP_003433300.1| PREDICTED: RNA-binding protein 39 [Canis lupus familiaris]
 gi|426391511|ref|XP_004062116.1| PREDICTED: RNA-binding protein 39 isoform 3 [Gorilla gorilla
           gorilla]
 gi|124297482|gb|AAI31544.1| RBM39 protein [Homo sapiens]
 gi|194389138|dbj|BAG61586.1| unnamed protein product [Homo sapiens]
          Length = 508

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 213 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 272

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 273 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 332

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
            +  ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE      
Sbjct: 333 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 392

Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                  A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +
Sbjct: 393 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 449

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 450 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 491


>gi|340718898|ref|XP_003397899.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Bombus
           terrestris]
          Length = 520

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 149/396 (37%), Gaps = 64/396 (16%)

Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
           LPFG    G  PL           R AR V+   L      + +  FFS         S 
Sbjct: 137 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 184

Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
           G    V  +  N  ++F   A+VE +  E  + A+ L G    GV + V+       T A
Sbjct: 185 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 238

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
                G   PNL             G  GP R++VG L +  TE  ++ + E FG +   
Sbjct: 239 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 288

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
            L+ D +TG SKGYGF  +++      A   LNG ++  + + V     R     G S  
Sbjct: 289 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 348

Query: 380 EQESI------LAQAQQHIAIQKMALQT---------------SGMNTLGGGMSLFGETL 418
           + + +      L    +   + K+A  T                 M+T            
Sbjct: 349 DTDELDRSGIDLGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIA 408

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
            +   L+         +    +EI +D+ EEC K+G +++V +   DQ   +    G V+
Sbjct: 409 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 461

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++         A N+L GR F G  + A Y P   Y
Sbjct: 462 VKCPSIATAVAAVNSLHGRWFAGRVITAAYVPVVNY 497


>gi|383864352|ref|XP_003707643.1| PREDICTED: RNA-binding protein 39-like isoform 1 [Megachile
           rotundata]
          Length = 530

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 142/372 (38%), Gaps = 61/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  ++F   A+VE
Sbjct: 168 RDARTVFCMQLSQRIRARDLEDFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 218

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +  E  + A+ L G    GV + V+       T A     G   PNL            
Sbjct: 219 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 262

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  +++   
Sbjct: 263 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 322

Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQHIAIQKM 397
              A   LNG ++  + + V     R     G S  + + +      L    +   + K+
Sbjct: 323 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQLMFKL 382

Query: 398 ALQT---------------SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
           A  T                 M+T             +   L+         +    +EI
Sbjct: 383 AEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIATQCFMLSNMFDPQNETNPNWAKEI 442

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+ EEC K+G +++V +   DQ   +    G V+++         A N+L GR F G 
Sbjct: 443 RDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRWFAGR 495

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 496 VITAAYVPVVNY 507


>gi|383864354|ref|XP_003707644.1| PREDICTED: RNA-binding protein 39-like isoform 2 [Megachile
           rotundata]
          Length = 507

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 142/372 (38%), Gaps = 61/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  ++F   A+VE
Sbjct: 145 RDARTVFCMQLSQRIRARDLEDFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 195

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +  E  + A+ L G    GV + V+       T A     G   PNL            
Sbjct: 196 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 239

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  +++   
Sbjct: 240 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 299

Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQHIAIQKM 397
              A   LNG ++  + + V     R     G S  + + +      L    +   + K+
Sbjct: 300 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQLMFKL 359

Query: 398 ALQT---------------SGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEI 442
           A  T                 M+T             +   L+         +    +EI
Sbjct: 360 AEGTGLEIPPAAANALNMAPVMSTPQPPPQAAPPIATQCFMLSNMFDPQNETNPNWAKEI 419

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+ EEC K+G +++V +   DQ   +    G V+++         A N+L GR F G 
Sbjct: 420 RDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRWFAGR 472

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 473 VITAAYVPVVNY 484


>gi|410055060|ref|XP_003953766.1| PREDICTED: uncharacterized protein LOC458443 [Pan troglodytes]
 gi|426391517|ref|XP_004062119.1| PREDICTED: RNA-binding protein 39 isoform 6 [Gorilla gorilla
           gorilla]
 gi|20988961|gb|AAH30493.1| Rbm39 protein [Mus musculus]
 gi|34364789|emb|CAE45833.1| hypothetical protein [Homo sapiens]
 gi|111598490|gb|AAH82607.1| Rbm39 protein [Mus musculus]
 gi|119596570|gb|EAW76164.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_d [Homo
           sapiens]
 gi|149030835|gb|EDL85862.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_g [Rattus
           norvegicus]
          Length = 367

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 37  KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 79

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 80  AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 139

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 140 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 199

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 200 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 259

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 260 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 312

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 313 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 350


>gi|13278367|gb|AAH04000.1| Rbm39 protein [Mus musculus]
          Length = 429

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 134 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 193

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 194 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 253

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
            +  ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE      
Sbjct: 254 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 313

Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                  A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +
Sbjct: 314 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 370

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 371 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 412


>gi|328781105|ref|XP_624668.3| PREDICTED: RNA-binding protein 39-like [Apis mellifera]
          Length = 506

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 148/396 (37%), Gaps = 64/396 (16%)

Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
           LPFG    G  PL           R AR V+   L      + +  FFS         S 
Sbjct: 123 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 170

Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
           G    V  +  N  ++F   A+VE +  E  + A+ L G    GV + V+       T A
Sbjct: 171 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 224

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
                G   PNL             G  GP R++VG L +  TE  ++ + E FG +   
Sbjct: 225 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 274

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
            L+ D +TG SKGYGF  +++      A   LNG ++  + + V     R     G S  
Sbjct: 275 QLIMDPETGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 334

Query: 380 EQESILAQAQQHIAIQKMAL---------------------QTSGMNTLGGGMSLFGETL 418
           + + +     +  A  ++ L                         M+T      +     
Sbjct: 335 DTDELDRSGIELGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQVAPPIA 394

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
            +   L+         +    +EI +D+ EEC K+G +++V +   DQ   +    G V+
Sbjct: 395 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 447

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++         A N+L GR F G  + A Y P   Y
Sbjct: 448 VKCPSIGTAVAAVNSLHGRWFAGRVITAAYVPVVNY 483


>gi|449509171|ref|XP_002189260.2| PREDICTED: serine/threonine-protein kinase Kist [Taeniopygia
           guttata]
          Length = 593

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   ++  +L  +EEYE+ILED+REEC KYG +V+++IP+      E PG G+VF+E
Sbjct: 496 VLRLLNVLSDASLQCEEEYEDILEDIREECQKYGPVVSLLIPK------ENPGKGQVFVE 549

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+G+ F G  V A +YP   Y
Sbjct: 550 YANAGDSKAAQKMLTGKIFDGKFVVATFYPLSAY 583


>gi|426241408|ref|XP_004014583.1| PREDICTED: RNA-binding protein 39 isoform 4 [Ovis aries]
          Length = 508

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 213 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 272

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 273 FITFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 332

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
            +  ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE      
Sbjct: 333 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 392

Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                  A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +
Sbjct: 393 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 449

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 450 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 491


>gi|224110606|ref|XP_002315575.1| predicted protein [Populus trichocarpa]
 gi|222864615|gb|EEF01746.1| predicted protein [Populus trichocarpa]
          Length = 251

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 68/242 (28%), Positives = 105/242 (43%), Gaps = 31/242 (12%)

Query: 284 ASGAIGGA--EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
           A  AIG    + P ++F+GG+    +   + E+  +FG L  +     +D    + + F 
Sbjct: 27  AIDAIGDIVKDSPHKIFIGGISKVLSSKMLMEIASAFGPLKAYQFENSKDP--DEPFAFL 84

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSK-TEQESILAQAQQHIAIQKMALQ 400
            Y D +VT  ACA LNG+K+G + +T  +A  +  S  ++  S   Q  QH    K  L 
Sbjct: 85  EYADESVTFKACAGLNGMKLGGQVITAIQAVPNASSSGSDGNSQFGQISQH---AKALL- 140

Query: 401 TSGMNTLGGGMSLFGETLAKVLCLTEAITADALA--DDEEYEEILEDMREECGKYGTLVN 458
                          E   +VL L     +++L+   + E EE+LED+R EC +Y  +  
Sbjct: 141 ---------------EKPTEVLKLKNVFDSESLSSLSNTEVEEVLEDVRLECARYYNVDK 185

Query: 459 V-----VIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
           V     +    D N G     G VF+E+        A + L GR F    V   Y+P D 
Sbjct: 186 VTDDIEIEEVDDCNLGLIFERGCVFVEFRRTEAACMAAHCLHGRLFDDRAVVVEYFPLDI 245

Query: 514 YF 515
           Y 
Sbjct: 246 YL 247


>gi|219112083|ref|XP_002177793.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410678|gb|EEC50607.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 109

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 43/105 (40%), Positives = 63/105 (60%), Gaps = 5/105 (4%)

Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
           + +++V+ L   ++ + L D++ Y+E+LED REEC ++G L++VVIP+     GET G G
Sbjct: 10  QVVSRVVELQNMLSDEDLVDEQAYQEVLEDTREECSQFGKLISVVIPKK----GET-GEG 64

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           K+FLEY      A A  AL GR F G  V A    E K+   DY+
Sbjct: 65  KIFLEYETTNDAAQAIQALEGRTFDGRRVQATSCAEAKFVAMDYA 109


>gi|189237575|ref|XP_974855.2| PREDICTED: similar to splicing factor [Tribolium castaneum]
          Length = 501

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 94/400 (23%), Positives = 159/400 (39%), Gaps = 67/400 (16%)

Query: 149 PFGA---TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN 205
           PFG    + LG     PV+ ++ +  R AR V+V  L      + +  FFS         
Sbjct: 115 PFGRRNRSPLGLRSNSPVEELSPE-ERDARTVFVMQLSQRIRARDLEEFFS--------- 164

Query: 206 SAGPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPT 262
           S G    V  +  N  ++F   A++E +  E  + A+ L G    GV + V+       T
Sbjct: 165 SVGKVRDVRLIVCNKTRRFKGIAYIEFKDPESVTLALGLSGQKLLGVPIIVQH------T 218

Query: 263 LAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLH 322
            A     G   PNL                GP R++VG L +  TE  ++ + E FG + 
Sbjct: 219 QAEKNRMGNSMPNL----------MPKNMTGPMRLYVGSLHFNITEDMLRSIFEPFGKID 268

Query: 323 GFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
              L+ D +TG SKGYGF  +++      A   LNG ++  + + V   T     + +  
Sbjct: 269 NIQLIMDPETGRSKGYGFIAFRNCEDAKKALEQLNGFELAGRPMKVGNVTERLDLQQQGP 328

Query: 383 SILAQ-----------AQQHIAIQKMALQTSGMN---------TLGGGMSLFGET----- 417
           SIL             A   + +     + +GM          ++  G  +  +      
Sbjct: 329 SILDSDELDRSGIDLGATGRLQLMFKLAEGAGMQVPQAAANALSIATGQPVVPQVQTNST 388

Query: 418 --LAKVLCLTEAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
             +A    +   +   A    + ++ EI +D+ EEC K+G +++V +   D+   +    
Sbjct: 389 PPIATQCFMLSNMFDPATESTQTWDVEIRDDVIEECNKHGGVLHVYV---DKGSPQ---- 441

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           G V+++         + N+L GR F G  + A Y P   Y
Sbjct: 442 GNVYVKCPSIATAVASVNSLHGRWFAGRVITAAYVPLLNY 481


>gi|195397963|ref|XP_002057597.1| GJ18017 [Drosophila virilis]
 gi|194141251|gb|EDW57670.1| GJ18017 [Drosophila virilis]
          Length = 599

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/370 (22%), Positives = 132/370 (35%), Gaps = 59/370 (15%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  K+F   A++E
Sbjct: 239 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 289

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L +A  P QP  +             
Sbjct: 290 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQSAPPPFQPKAHT------------ 337

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 338 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDVIQLIMDTETGRSKGYGFITYHNADD 393

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTL 407
              A   LNG ++  + + V   T      T         +  I +     LQ       
Sbjct: 394 AKKALEQLNGFELAGRPMKVGNVTERLDMNTSSLDTDEMDRSGIDLGATGRLQLMFKLAE 453

Query: 408 GGGMSL-----------------------FGETLAKVLCLTEAITADALADDEEYEEILE 444
           G G+++                             +   L+         +     ++ E
Sbjct: 454 GAGLAVPQAAANALLATAPQPAPLQQQQQTPSIATQCFILSNMFDPRTETNPTWDTDVRE 513

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D+ +EC K+G ++++ +             G V+++         A NAL GR F G  +
Sbjct: 514 DVLDECAKHGGVLHIHV-------DTVSPTGTVYVKCPSTTTAVLAVNALHGRWFAGRVI 566

Query: 505 NAFYYPEDKY 514
            A Y P   Y
Sbjct: 567 TAAYVPVINY 576


>gi|426241414|ref|XP_004014586.1| PREDICTED: RNA-binding protein 39 isoform 7 [Ovis aries]
          Length = 367

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 87/338 (25%), Positives = 132/338 (39%), Gaps = 67/338 (19%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 37  KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 79

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 80  AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 139

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 140 TFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 199

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMSL-----------FGETLAKVLCL- 424
             ++A+  +   +Q     + ALQ SG    G    L                A V  L 
Sbjct: 200 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQPLA 259

Query: 425 TEAITADALADDEEYE------EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           T+      + + +  E      EI +D+ EEC K+G ++++ +   D+N  +    G V+
Sbjct: 260 TQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ----GNVY 312

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
           ++         A NAL GR F G  + A Y P   Y N
Sbjct: 313 VKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 350


>gi|363736469|ref|XP_422213.3| PREDICTED: serine/threonine-protein kinase Kist [Gallus gallus]
          Length = 388

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   ++  +L  +EEYE+ILED+REEC KYG +V+++IP+      E PG G+VF+E
Sbjct: 291 VLRLLNVLSDASLQSEEEYEDILEDIREECQKYGPVVSLLIPK------ENPGKGQVFVE 344

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+G+ F G  V A +YP   Y
Sbjct: 345 YANAGDSKAAQKMLTGKIFDGKFVVATFYPLSAY 378


>gi|307180960|gb|EFN68748.1| RNA-binding protein 39 [Camponotus floridanus]
          Length = 529

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 144/372 (38%), Gaps = 61/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  ++F   A+VE
Sbjct: 167 RDARTVFCMQLSQRIRARDLEEFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 217

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +  E  + A+ L G    GV + V+       T A     G   PNL            
Sbjct: 218 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 261

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  +++   
Sbjct: 262 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 321

Query: 349 TDIACAALNGLKMGDKTLTVRRAT-----ASGQSKTEQESI------LAQAQQHIAIQKM 397
              A   LNG ++  + + V   T       G S  + + +      L    +   + K+
Sbjct: 322 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQLMFKL 381

Query: 398 --------------ALQTSGMNTLGGGMSLFGETLA-KVLCLTEAITADALADDEEYEEI 442
                         AL  + + T           +A +   L+         +    +EI
Sbjct: 382 AEGTGLEIPPAAANALNMAPVMTQPQPPPQAAPPIATQCFMLSNMFDPQNETNPNWAKEI 441

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+ EEC K+G +++V +   DQ   +    G V+++         A N+L GR F G 
Sbjct: 442 RDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRWFAGR 494

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 495 VITAAYVPVVNY 506


>gi|300121045|emb|CBK21427.2| unnamed protein product [Blastocystis hominis]
          Length = 457

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 157/384 (40%), Gaps = 45/384 (11%)

Query: 151 GATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPG 210
           GAT LGA  +     M+  +T     + + G+P       I    + +M ++   + GPG
Sbjct: 70  GATGLGAAMMTIADSMSAMST-GVTSLAITGVPATITPDEICNSINILMKSLKL-TTGPG 127

Query: 211 DAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPG 270
           +    V +    + A  EMR+  EA+N +ALDG+      + V RP +Y        GP 
Sbjct: 128 NPCSGVGMEANGQTAIAEMRSPLEATNGLALDGLTVFNHVMHVNRPDNYT-------GPD 180

Query: 271 QPSPNL--NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFD--- 325
            P P L  NL  + + +G+    E  + V VG           ++LLE++      +   
Sbjct: 181 TPPPKLDPNLL-LQICTGSYAEKEYEEIVRVG-----------RKLLETYKEPDPENDAD 228

Query: 326 --LVKDRDTGNSKGYGF--CVYQ-------DPAVTDIACAALNGLK----MGDKTLTVRR 370
              +     G   GY    CV         + A     C    GLK    + DK      
Sbjct: 229 KPTISSLKDGEKTGYDIEKCVLMHNIPRELEEAEIHTFCEPFGGLKKIYMLKDKNCRFLG 288

Query: 371 ATASGQSKTEQESILAQAQQHIAI-QKMALQTSGMNTLGGGMSLFGETLAK---VLCLTE 426
              +    T    I  +  Q + I   + ++    +    G      T++    VL ++ 
Sbjct: 289 DAVAEYRDTLNYEIAMEGLQDLPIFNDIVIKVEKPDPKWPGFPQRVNTISNPSPVLRMSN 348

Query: 427 AITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVG 486
            I+ + L +DE+YE +LED+RE C K GT++N+ +PR      E PG+G  F++Y   + 
Sbjct: 349 IISLEDLEEDEDYEALLEDLREGCEKLGTVLNMHVPRIHSGEKEIPGLGFAFVQYSSVIE 408

Query: 487 CATAKNALSGRKFGGNTVNAFYYP 510
            A A   L    F G  V   YYP
Sbjct: 409 AAQAAKQLRLLTFNGKQVQVDYYP 432


>gi|357612395|gb|EHJ67964.1| putative RNA-binding region-containing protein [Danaus plexippus]
          Length = 536

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 87/370 (23%), Positives = 139/370 (37%), Gaps = 59/370 (15%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R  R V+   L      + +  FFS V         G    V  +  N  ++F   A++E
Sbjct: 179 RDLRTVFCMQLSQRIRAKDLEEFFSSV---------GKVRDVRLITCNKTRRFKGIAYIE 229

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +  E    A+ L G    GV + V+       T A     G   PNL          A 
Sbjct: 230 FKDAESVPLALGLTGQKLLGVPIIVQH------TQAEKNRVGNTLPNL----------AP 273

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
             + GP R++VG L +  TE  ++ + E FG +    L+ D DTG SKGYGF  +     
Sbjct: 274 KTSNGPTRLYVGSLHFNITEDMLRGIFEPFGKIDHIQLMTDPDTGKSKGYGFLTFHHATD 333

Query: 349 TDIACAALNGLKMGDKTLTVRRAT--ASGQSKT-------EQESILAQAQQHIAIQKMAL 399
              A   LNG ++  + + V   T  A G S T       ++  +   A   + +     
Sbjct: 334 AKKAMEQLNGFELAGRPMKVGNVTERADGGSSTRFDADELDRAGVDLGATGRLQLMFKLA 393

Query: 400 QTSGMNTLGGGMSLF---GETLA------------KVLCLTEAITADALADDEEYEEILE 444
           + +G+       S+    G TL             +   L       + ++     EI +
Sbjct: 394 EGTGLQIPPAAASVLMGAGSTLVAPQPQVAPPIATQCFMLNNMFDPSSESNPSWDIEIRD 453

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D+  EC K+G +++V + +    G        V+ +         + N+L GR F G  +
Sbjct: 454 DVISECNKHGGVLHVYVDKASPQGN-------VYCKCPTIATAVASVNSLHGRWFAGRVI 506

Query: 505 NAFYYPEDKY 514
            A Y P   Y
Sbjct: 507 TAAYVPLVNY 516


>gi|323456301|gb|EGB12168.1| hypothetical protein AURANDRAFT_8852, partial [Aureococcus
           anophagefferens]
          Length = 98

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 57/97 (58%), Gaps = 1/97 (1%)

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE-TPGVGKV 477
           +KVL L   +T   L DDE Y ++++D+ +ECG YG + NV IPRP+       PG G V
Sbjct: 2   SKVLQLRHMVTDADLIDDEAYADVVDDVLQECGSYGDVENVEIPRPEPGTTRPAPGQGSV 61

Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           F+ + DA     A+ A  GR F G T+ A YYP+D +
Sbjct: 62  FVAFGDAFFAQAAREAFEGRAFDGKTIIAGYYPQDLF 98


>gi|340370502|ref|XP_003383785.1| PREDICTED: RNA-binding protein 39-like [Amphimedon queenslandica]
          Length = 497

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 75/315 (23%), Positives = 131/315 (41%), Gaps = 46/315 (14%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE +       A++  G    G+ + ++      PT+A             LAA 
Sbjct: 195 KGIAYVEFQEESSVFTALSFSGQKVHGIPIMIQ------PTMAE---------KNRLAA- 238

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
             A+  +  AEGP +++VG L Y  TE  ++ +   FG +    +++D  T  S+GY F 
Sbjct: 239 --AAENLKKAEGPKKLYVGSLHYNITEDMLQGIFSPFGNVERVSIMRDTATNVSRGYAFV 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            ++D    + A A LNG ++  + + V   T    S    +S+  +            + 
Sbjct: 297 EFRDSDSAERAMANLNGFELAGRPMKVNYGTVD-TSLVNIDSLDGEDMDVGVGMTPQSRV 355

Query: 402 SGMNTLGGG----MSLFGETL--------AKVLCLTEAITADALADDEEYE-------EI 442
           + M+ L  G    MS+ G  +            C+T          D   E       EI
Sbjct: 356 ALMHKLAAGHNADMSIPGVQVPPPPFAVPTMPTCITSCCFVIGNMFDPSKETGSDWDKEI 415

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            ED+ EEC K+G + ++ + +  Q        GKV+++       + A  + +GR++ GN
Sbjct: 416 REDVLEECVKFGNIFHIHVDKFSQ--------GKVYIKSQTPQTASAAVGSFNGRRYAGN 467

Query: 503 TVNAFYYPEDKYFNK 517
            ++A   PE+ Y  K
Sbjct: 468 VIHAELVPENTYHLK 482


>gi|195338839|ref|XP_002036031.1| GM16278 [Drosophila sechellia]
 gi|194129911|gb|EDW51954.1| GM16278 [Drosophila sechellia]
          Length = 596

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 89/372 (23%), Positives = 135/372 (36%), Gaps = 63/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 236 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 286

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L  A    QP  +             
Sbjct: 287 FDDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNATPAFQPKSHT------------ 334

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 335 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 390

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
              A   LNG ++  + + V   T      T   S+         I   A   LQ     
Sbjct: 391 AKKALEQLNGFELAGRLMKVGNVTERLDMNT--TSLDTDEMDRTGIDLGATGRLQLMFKL 448

Query: 406 TLGGGMS------------------LFGETLAKVLCLTEAITADALADDEEYE-----EI 442
             G G++                  L  + +A  +     I ++      E       EI
Sbjct: 449 AEGAGLAVPQAAANALLATAPQPAPLQQQEVAPSIATQCFILSNMFDPRTETNPTWDVEI 508

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+ EEC K+G ++++ +             G V+++         A NAL GR F G 
Sbjct: 509 RDDVLEECAKHGGVLHIHV-------DTISHTGTVYVKCPSTTTAVLAVNALHGRWFAGR 561

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 562 VITAAYLPVINY 573


>gi|66475436|ref|XP_627534.1| splicing factor U2AF U2 snRNP auxiliary factor large subunit; 3 RRM
           domains [Cryptosporidium parvum Iowa II]
 gi|32398751|emb|CAD98711.1| splicing factor, possible [Cryptosporidium parvum]
 gi|46228987|gb|EAK89836.1| splicing factor U2AF U2 snRNP auxiliary factor large subunit; 3 RRM
           domains [Cryptosporidium parvum Iowa II]
          Length = 492

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 86/375 (22%), Positives = 155/375 (41%), Gaps = 47/375 (12%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIAT--FFSQVMTAIGGNSAG--PGDAVVNVYINHEK 222
           T   ++  R VYVG LP     Q I        +  +I  NS     G+ VV+ +IN + 
Sbjct: 108 TSFTSKPLREVYVGNLP-----QGITVTELLEYINRSIIKNSVSHTNGNPVVSAWINSDG 162

Query: 223 KFAFVEMRTVEEASNAMALDGII-FEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K+AF E R++EEA+  + L+ ++ F+G  +R+ +     P ++  +   QPS N  L   
Sbjct: 163 KYAFCECRSIEEANTLLRLNNLLSFKGNLLRIGK-----PKVSENIIGDQPSNNSTLINQ 217

Query: 282 GLASGAIGG---------AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
              S +I            +  + + + G+   F    IKE+L S   +   +L+  R+ 
Sbjct: 218 ITQSTSIISPYFNNIPLVLKKKETILITGINKKFVLEDIKEML-SIKNIEILELIDYRN- 275

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATA----SGQSKTEQESILAQA 388
                Y   + +    TDI    +N L    K L ++   +    +  +  +  S + + 
Sbjct: 276 ----KYKIAICEGDLNTDITDKVVNKLGTEIKILRMKNCNSKVIHAVNNHLKNLSCIVRE 331

Query: 389 QQHIAIQKMALQTSGMNTLGGGMS---LFGETLAKVLCLTEAITADALADDEEYEEILED 445
                  K+ L+T     +        L  +   + + L+  +T + L     Y  I E+
Sbjct: 332 S-----NKLLLKTEKFENIQSKNVISLLLPQKPCRCILLSNILTVEELLIPSTYSSIHEE 386

Query: 446 MREECGKYGTLVNVVIPRPD-----QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
           + E+C KYG +    IP P+     ++    P  G+ F+ +Y+      AK  L   +F 
Sbjct: 387 IHEKCLKYGEIYKTTIPIPERALSNKDQFNDPYFGRAFIFFYNVESAIKAKLDLFKMRFL 446

Query: 501 GNTVNAFYYPEDKYF 515
           G  +   YY E ++ 
Sbjct: 447 GRNMKISYYCEHEFL 461


>gi|19920866|ref|NP_609095.1| CG11266, isoform B [Drosophila melanogaster]
 gi|24582412|ref|NP_723243.1| CG11266, isoform A [Drosophila melanogaster]
 gi|7297213|gb|AAF52478.1| CG11266, isoform A [Drosophila melanogaster]
 gi|15292031|gb|AAK93284.1| LD35730p [Drosophila melanogaster]
 gi|22945834|gb|AAN10614.1| CG11266, isoform B [Drosophila melanogaster]
 gi|220946034|gb|ACL85560.1| CG11266-PA [synthetic construct]
 gi|220955788|gb|ACL90437.1| CG11266-PA [synthetic construct]
          Length = 594

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 89/372 (23%), Positives = 135/372 (36%), Gaps = 63/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  K+F   A++E
Sbjct: 234 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 284

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L  A    QP  +             
Sbjct: 285 FDDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 332

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 333 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 388

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
              A   LNG ++  + + V   T      T   S+         I   A   LQ     
Sbjct: 389 AKKALEQLNGFELAGRLMKVGNVTERLDMNT--TSLDTDEMDRTGIDLGATGRLQLMFKL 446

Query: 406 TLGGGMS------------------LFGETLAKVLCLTEAITADALADDEEYE-----EI 442
             G G++                  L  + +A  +     I ++      E       EI
Sbjct: 447 AEGAGLAVPQAAANALLATAPQPAPLQQQEVAPSIATQCFILSNMFDPRTETNPTWDVEI 506

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+ EEC K+G ++++ +             G V+++         A NAL GR F G 
Sbjct: 507 RDDVLEECAKHGGVLHIHV-------DTISHTGTVYVKCPSTTTAVLAVNALHGRWFAGR 559

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 560 VITAAYVPVINY 571


>gi|380012525|ref|XP_003690330.1| PREDICTED: LOW QUALITY PROTEIN: RNA-binding protein 39-like [Apis
           florea]
          Length = 506

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 95/396 (23%), Positives = 147/396 (37%), Gaps = 64/396 (16%)

Query: 148 LPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSA 207
           LPFG    G  PL           R AR V+   L      + +  FFS         S 
Sbjct: 123 LPFGK---GVSPLGIRNDELTPEERDARTVFCMQLSQRIRARDLEEFFS---------SV 170

Query: 208 GPGDAVVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLA 264
           G    V  +  N  ++F   A+VE +  E  + A+ L G    GV + V+       T A
Sbjct: 171 GKVQDVRLITCNKTRRFKGIAYVEFKDPESVTLALGLSGQKLLGVPIVVQH------TQA 224

Query: 265 AALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGF 324
                G   PNL             G  GP R++VG L +  TE  ++ + E FG +   
Sbjct: 225 EKNRMGNSMPNL----------MPKGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNI 274

Query: 325 DLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV-----RRATASGQSKT 379
            L+ D  TG SKGYGF  +++      A   LNG ++  + + V     R     G S  
Sbjct: 275 QLIMDPXTGRSKGYGFLTFRNADDAKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLL 334

Query: 380 EQESILAQAQQHIAIQKMAL---------------------QTSGMNTLGGGMSLFGETL 418
           + + +     +  A  ++ L                         M+T      +     
Sbjct: 335 DTDELDRSGIELGATGRLQLMFKLAEGTGLEIPPAAANALNMAPVMSTPQPPPQVAPPIA 394

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
            +   L+         +    +EI +D+ EEC K+G +++V +   DQ   +    G V+
Sbjct: 395 TQCFMLSNMFDPQNETNPNWAKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVY 447

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           ++         A N+L GR F G  + A Y P   Y
Sbjct: 448 VKCPSIGTAVAAVNSLHGRWFAGRVITAAYVPVVNY 483


>gi|195577213|ref|XP_002078467.1| GD23448 [Drosophila simulans]
 gi|194190476|gb|EDX04052.1| GD23448 [Drosophila simulans]
          Length = 608

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 87/370 (23%), Positives = 134/370 (36%), Gaps = 59/370 (15%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  K+F   A++E
Sbjct: 248 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 298

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L  A    QP  +             
Sbjct: 299 FDDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNATPAFQPKSH------------- 345

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 346 ---TGPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 402

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTL 407
              A   LNG ++  + + V   T      T         +  I +     LQ       
Sbjct: 403 AKKALEQLNGFELAGRLMKVGNVTERLDMNTTSLDTDEMDRTGIDLGATGRLQLMFKLAE 462

Query: 408 GGGMS------------------LFGETLAKVLCLTEAITADALADDEEYE-----EILE 444
           G G++                  L  + +A  +     I ++      E       EI +
Sbjct: 463 GAGLAVPQAAANALLATAPQPAPLQQQEVAPSIATQCFILSNMFDPRTETNPTWDVEIRD 522

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D+ EEC K+G ++++ +             G V+++         A NAL GR F G  +
Sbjct: 523 DVLEECAKHGGVLHIHV-------DTISHTGTVYVKCPSTTTAVLAVNALHGRWFAGRVI 575

Query: 505 NAFYYPEDKY 514
            A Y P   Y
Sbjct: 576 TAAYLPVINY 585


>gi|34365067|emb|CAE45890.1| hypothetical protein [Homo sapiens]
          Length = 373

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 136/344 (39%), Gaps = 73/344 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V+                Q   N    A 
Sbjct: 37  KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIVVQ--------------ASQAEKN---RAA 79

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 80  AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 139

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------TEQ 381
            + D      A   LNG ++  + + V     R  AS  S                 T +
Sbjct: 140 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTTGR 199

Query: 382 ESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE-------- 426
             ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE        
Sbjct: 200 LQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAAAA 259

Query: 427 -----AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +  
Sbjct: 260 SVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ-- 314

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
             G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 315 --GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 356


>gi|31873732|emb|CAD97833.1| hypothetical protein [Homo sapiens]
          Length = 373

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 78  AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 137

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 138 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDGLERTGIDLGTT 197

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
            +  ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE      
Sbjct: 198 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 257

Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                  A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +
Sbjct: 258 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 314

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 315 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 356


>gi|413920209|gb|AFW60141.1| hypothetical protein ZEAMMB73_955987, partial [Zea mays]
          Length = 72

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 41/66 (62%), Positives = 45/66 (68%)

Query: 454 GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDK 513
           G LV V+IPRPD +G    GVGKVFLEY D  G A AK AL GRKFGGN V A  Y EDK
Sbjct: 5   GNLVKVIIPRPDPSGQPVVGVGKVFLEYADIDGAAKAKTALHGRKFGGNPVVAVCYAEDK 64

Query: 514 YFNKDY 519
           + N +Y
Sbjct: 65  FANGEY 70


>gi|410055058|ref|XP_003316934.2| PREDICTED: uncharacterized protein LOC458443 isoform 4 [Pan
           troglodytes]
 gi|410055062|ref|XP_003953767.1| PREDICTED: uncharacterized protein LOC458443 [Pan troglodytes]
 gi|426391515|ref|XP_004062118.1| PREDICTED: RNA-binding protein 39 isoform 5 [Gorilla gorilla
           gorilla]
 gi|426391519|ref|XP_004062120.1| PREDICTED: RNA-binding protein 39 isoform 7 [Gorilla gorilla
           gorilla]
 gi|119596566|gb|EAW76160.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_b [Homo
           sapiens]
 gi|119596571|gb|EAW76165.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_b [Homo
           sapiens]
          Length = 373

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 78  AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 137

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 138 FITFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 197

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
            +  ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE      
Sbjct: 198 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 257

Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                  A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +
Sbjct: 258 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 314

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 315 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 356


>gi|294905728|ref|XP_002777665.1| Splicing factor U2AF 65 kDa subunit, putative [Perkinsus marinus
           ATCC 50983]
 gi|239885556|gb|EER09481.1| Splicing factor U2AF 65 kDa subunit, putative [Perkinsus marinus
           ATCC 50983]
          Length = 680

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 98/385 (25%), Positives = 164/385 (42%), Gaps = 69/385 (17%)

Query: 102 SRSPSKSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGATQLGAFPLM 161
           SRSPS+ ++   FD  P   A         QL    S +P   Q ++   +T   AF   
Sbjct: 184 SRSPSEKRKPFKFDSPPKELA--------AQLAAGTSMLP---QTVVSSSSTIKEAFNA- 231

Query: 162 PVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE 221
               +  +  + AR +Y+G +PP  +   +    +  +  +G N A PG  +V+ ++  +
Sbjct: 232 ---TLAAERQKIARELYIGQIPPGISAAELIDVLNDGLMNMGAN-AMPGRPIVHGWLGGD 287

Query: 222 KKFAFVEMRTVEEASNAMA-LDGIIFE--GVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
             FAFVE RT EEAS A+  L+G   +  GV+++V RP  Y       +GP  P  ++N 
Sbjct: 288 GLFAFVEFRTPEEASIALERLNGHQLKSYGVSIKVGRPKGY-------MGPAAPDDSVNA 340

Query: 279 AAVGLAS------GAIGGAE---GPDRVFVGGLPYYFTETQIKELLE--SFGTLHGFDLV 327
              G A+      G I  AE      R+ + G P   +E  IK  L   S G +   +L+
Sbjct: 341 YTAGHAATSSTTPGGISAAEVSSDTSRLCLIGFPLKASEHSIKRALRNASKGEIRHLELL 400

Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           K   T N +     V++   + D                         + K + E  +  
Sbjct: 401 K--HTWNDEEIVMAVFECVNIED-----------------------EHRLKKKGEVEVQG 435

Query: 388 AQQHIAIQKMALQTSGMNTLGGGM-SLFGETL--AKVLCLTE-AITADALADDEEYEEIL 443
            +  I   K A+    MN  G  M    G  +  +++L +T  A + + L DD  Y +++
Sbjct: 436 VKARIINPKDAIVKGYMNFDGDIMKKAMGLEIVPSRILVMTNFAGSVEELLDDINYSDLM 495

Query: 444 EDMREECGKY---GTLVNVVIPRPD 465
           +D++ EC        + +++IPRP+
Sbjct: 496 DDIKVECKSITAGADVRSIIIPRPE 520


>gi|312374824|gb|EFR22303.1| hypothetical protein AND_15459 [Anopheles darlingi]
          Length = 560

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 113/513 (22%), Positives = 184/513 (35%), Gaps = 79/513 (15%)

Query: 50  DKNYKYDREG-------IRDHDRTDRHRDYNRDKERRH-RHRSRSHSSDRFRNRSKSLSP 101
           D++ + DR+G        +D  R+ R RD  R+K+RR  + RS+S S  R R++      
Sbjct: 63  DRDKERDRDGGEGRSRKDKDRSRSPRPRDKEREKDRRKSKERSKSRSPRRERSKDHKEKD 122

Query: 102 SRSPS---------KSKRRSGFDMAPPAAAMLPGAAVPGQLPGVPSAVPEMAQNMLPFGA 152
            RS +         +S+ R G  +     +         +          M+      G 
Sbjct: 123 HRSKNDHHRSVEKRRSRERGGGMIDHRRKSRERDHRRRSRSRDAGRRRRSMSPRHYRRGR 182

Query: 153 TQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDA 212
              G++          Q  R AR V+   L      + +  FFS         S G    
Sbjct: 183 GGYGSYRDRTPGDEVSQEDRDARTVFCMQLSQRIRARDLEEFFS---------SVGKVRD 233

Query: 213 VVNVYINHEKKF---AFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
           V  +  N  K+F   A++E +  E  + A+ L G    G+ + V+               
Sbjct: 234 VRLITCNKTKRFKGIAYIEFKDPESVALALGLSGQRLLGIPISVQ--------------- 278

Query: 270 GQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKD 329
                  N  A            GP R++VG L +  TE  ++ + E FG +    L+ D
Sbjct: 279 -HTQAEKNRMANQPPPAPPKNPAGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMD 337

Query: 330 RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
            DTG SKGYGF  + +      A   LNG ++  + + V   T      T   S+     
Sbjct: 338 TDTGRSKGYGFITFHNADDAKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEM 396

Query: 390 QHIAIQKMA---LQTSGMNTLGGGMSL-----------------------FGETLAKVLC 423
               I+  A   LQ       G G+++                             +   
Sbjct: 397 DRSGIELGATGRLQLMFKLAEGAGLAVPRAAADALLATAPQPIPQQPLQQSPPIATQCFL 456

Query: 424 LTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
           L+         +     EI +D+ EEC K+G +++V + +   +G        V+++  +
Sbjct: 457 LSNMFDPSTETNPNWDVEIQDDVIEECNKHGGVLHVYVDKLSPSGN-------VYVKCPN 509

Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
                 A NAL GR F G  + A Y P   Y+N
Sbjct: 510 VATAVLAVNALHGRWFAGRVIGAAYVPLVNYYN 542


>gi|291397536|ref|XP_002715130.1| PREDICTED: kinase interacting stathmin [Oryctolagus cuniculus]
          Length = 419

 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 40/94 (42%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L ++EEYE+I+ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEEEYEDIVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409


>gi|159482188|ref|XP_001699155.1| hypothetical protein CHLREDRAFT_106436 [Chlamydomonas reinhardtii]
 gi|158273218|gb|EDO99010.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 80

 Score = 79.0 bits (193), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 51/76 (67%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++FVGGLP  ++E  +KELL  +GTL  F+LV D+ TG SKGY FC Y + +  D+    
Sbjct: 2   KLFVGGLPCEWSEDMVKELLAPYGTLKSFNLVMDKSTGKSKGYAFCEYSEESSADLLIKN 61

Query: 356 LNGLKMGDKTLTVRRA 371
           L+  ++G K LTV+RA
Sbjct: 62  LHMRRVGSKALTVKRA 77


>gi|209154564|gb|ACI33514.1| RNA-binding protein 39 [Salmo salar]
          Length = 525

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 74/281 (26%), Positives = 116/281 (41%), Gaps = 51/281 (18%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 234 AAAMANNLQKGNAGPMRLYVGSLHFNITEEMLRGIFEPFGKIESIQLMMDSETGRSKGYG 293

Query: 340 FCVYQDPAVTDIACAALNG-------LKMGDKTLTVRRATASG-------------QSKT 379
           F  + D      A   LNG       +K+G  T     +TAS                 T
Sbjct: 294 FITFSDTECAKKALDQLNGFELAGRPMKVGHVTERTDASTASSFLDSDELERTGIDLGTT 353

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLGGGMS------------------LFGE 416
            +  ++A+  +   +Q     + ALQ SG   +G   +                  L  +
Sbjct: 354 GRLQLMARLAEGTGLQIPPAAQQALQMSGAIAIGAMAAVSAAMNPAMNMNMNTAMNLPSQ 413

Query: 417 TLAKVLCLTEAITADALADDEEYE-EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
            LA        +      D+ +++ +I  D+ EEC K+G +V++ +   D+N  E    G
Sbjct: 414 PLATHCFQLSNMFNPQSEDNPDWDVDIQHDVIEECNKHGGVVHIYV---DKNSTE----G 466

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
            V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 467 NVYVKCPSIPAAMAAVNALHGRYFAGKMITAAYVPLPTYHN 507


>gi|307195359|gb|EFN77277.1| RNA-binding protein 39 [Harpegnathos saltator]
          Length = 370

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 144/372 (38%), Gaps = 61/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  ++F   A+VE
Sbjct: 8   RDARTVFCMQLSQRIRARDLEEFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 58

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +  E  + A+ L G    GV + V+       T A     G   PNL            
Sbjct: 59  FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 102

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  +++   
Sbjct: 103 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 162

Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESI------LAQAQQHIAIQKM 397
              A   LNG ++  + + V     R     G S  + + +      L    +   + K+
Sbjct: 163 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQLMFKL 222

Query: 398 --------------ALQTSGMNTLGGGMSLFGETLA-KVLCLTEAITADALADDEEYEEI 442
                         AL  + + T           +A +   L+         +    +EI
Sbjct: 223 AEGTGLEIPPAAANALNMAPVMTAPQPPPQAAPPIATQCFMLSNMFDPQNETNPNWAKEI 282

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+ EEC K+G +++V +   DQ   +    G V+++         A N+L GR F G 
Sbjct: 283 RDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRWFAGR 335

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 336 VITAAYVPVVNY 347


>gi|67593828|ref|XP_665753.1| splicing factor [Cryptosporidium hominis TU502]
 gi|54656571|gb|EAL35522.1| splicing factor [Cryptosporidium hominis]
          Length = 491

 Score = 78.6 bits (192), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 86/375 (22%), Positives = 153/375 (40%), Gaps = 45/375 (12%)

Query: 167 TQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP--GDAVVNVYINHEKKF 224
           T  A++  R VYVG LP       +A     +  +I  NS     G+ VV+ +IN + K+
Sbjct: 107 TSFASKPLREVYVGNLPQGI---TVAELLEYINRSIIKNSVSHTHGNPVVSAWINSDGKY 163

Query: 225 AFVEMRTVEEASNAMALDGII-FEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           AF E R++EEA+  + L+ ++ F+G  +R+ +     P ++  +   QPS N  L     
Sbjct: 164 AFCECRSIEEANALLRLNNLLSFKGNLLRIGK-----PKVSENIIGDQPSNNSTLINQIS 218

Query: 284 ASGAIGG---------AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGN 334
            S AI            +  + + + G+   F    IKE+  S   +   +L+  R+   
Sbjct: 219 QSTAIISPYFNNIPLVLKKKETILITGINKKFVLEDIKEMF-SIKNIEILELIDYRN--- 274

Query: 335 SKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT-----ASGQSKTEQESILAQAQ 389
              Y   + +    TDI    +N L    K L ++        A          I+ ++ 
Sbjct: 275 --KYKIAICEGDLNTDITDKVVNKLGTEIKILRMKSCNSKVIHAVNNHLKNMSCIVRES- 331

Query: 390 QHIAIQKMALQTSGMNTLGGGMS---LFGETLAKVLCLTEAITADALADDEEYEEILEDM 446
                 K+ L+    N +        L  +   + + L+  +  + L     Y  I +++
Sbjct: 332 -----NKLLLKREKFNNIQNKNVISLLLPQKPCRCILLSNILAVEELLIPSTYSSIHKEI 386

Query: 447 REECGKYGTLVNVVIPRPD-----QNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
            E+C KYG +    IP P+     ++    P  G+ F+ +Y+      AK  L   +F G
Sbjct: 387 HEKCLKYGEIYKTTIPIPERALSSKDQFNDPYFGRAFIFFYNVESAIKAKLDLFRMRFLG 446

Query: 502 NTVNAFYYPEDKYFN 516
             +   YY E ++ N
Sbjct: 447 RNIKISYYCEHEFLN 461


>gi|426241412|ref|XP_004014585.1| PREDICTED: RNA-binding protein 39 isoform 6 [Ovis aries]
 gi|426241416|ref|XP_004014587.1| PREDICTED: RNA-binding protein 39 isoform 8 [Ovis aries]
          Length = 373

 Score = 78.6 bits (192), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 75/286 (26%), Positives = 119/286 (41%), Gaps = 56/286 (19%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 78  AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 137

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTV----RRATASGQSK----------------T 379
           F  + D      A   LNG ++  + + V     R  AS  S                 T
Sbjct: 138 FITFSDSECAKKALEQLNGFELTGRPMKVGHVTERTDASSASSFLDSDELERTGIDLGTT 197

Query: 380 EQESILAQAQQHIAIQ-----KMALQTSGMNTLG--GGMSLFGETLAKVLCLTE------ 426
            +  ++A+  +   +Q     + ALQ SG    G     S   +   ++   TE      
Sbjct: 198 GRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVAEFSFVIDLQTRLSQQTEASALAA 257

Query: 427 -------AITADALAD------DEEY---EEILEDMREECGKYGTLVNVVIPRPDQNGGE 470
                  A     L++      +EE     EI +D+ EEC K+G ++++ +   D+N  +
Sbjct: 258 AASVQPLATQCFQLSNMFNPQTEEEVGWDTEIKDDVIEECNKHGGVIHIYV---DKNSAQ 314

Query: 471 TPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFN 516
               G V+++         A NAL GR F G  + A Y P   Y N
Sbjct: 315 ----GNVYVKCPSIAAAIAAVNALHGRWFAGKMITAAYVPLPTYHN 356


>gi|449268165|gb|EMC79035.1| Serine/threonine-protein kinase Kist, partial [Columba livia]
          Length = 331

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 41/94 (43%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   ++  +L  +EEYE+ILED+REEC KYG +V+++IP+      E PG G+VF+E
Sbjct: 234 VLRLLNVLSDASLQCEEEYEDILEDIREECQKYGPVVSLLIPK------ENPGKGQVFVE 287

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+G+ F G  V A +YP   Y
Sbjct: 288 YANAGDSKAAQKMLTGKIFDGKFVVATFYPLSAY 321


>gi|432107103|gb|ELK32526.1| Splicing factor U2AF 65 kDa subunit [Myotis davidii]
          Length = 243

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 53/84 (63%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++F+GGLP Y  + Q+KELL  FG L  F+LVKD  TG SKG  F  Y D ++ D A A 
Sbjct: 128 KLFMGGLPNYMKDDQVKELLTWFGPLKAFNLVKDSTTGLSKGCAFYEYVDISIRDQAMAG 187

Query: 356 LNGLKMGDKTLTVRRATASGQSKT 379
            NG+++G K L V+RA    ++ T
Sbjct: 188 PNGMQLGVKKLLVQRAGVGAKNAT 211


>gi|170071297|ref|XP_001869868.1| splicing factor [Culex quinquefasciatus]
 gi|167867202|gb|EDS30585.1| splicing factor [Culex quinquefasciatus]
          Length = 524

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/372 (23%), Positives = 139/372 (37%), Gaps = 62/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R  R V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 166 RDMRTVFCMQLSQRIRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 216

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R  E  + A+ L G    G+ + V+        LA            N+         +
Sbjct: 217 FRDPESVALALGLSGQRLLGIPISVQHTQAEKNRLA------------NIPPPPPPKVIV 264

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           G    P R++VG L +  TE  ++ + E FG +    L+ D DTG SKGYGF  + +   
Sbjct: 265 G----PMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDSDTGRSKGYGFITFHNADD 320

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
              A   LNG ++  + + V   T      T   S+         I   A   LQ     
Sbjct: 321 AKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEMDRSGIDLGATGRLQLMFKL 379

Query: 406 TLGGGMSL----------------------FGETLAKVLCLTEAITADALADDEEYE-EI 442
             G G+++                          +A    L   +   A   +  ++ EI
Sbjct: 380 AEGAGLAVPRAAADALLATAPQPAPNQPVQDSPAIATQCFLLSNMFDPATETNPSWDVEI 439

Query: 443 LEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            +D+ EEC K+G +++V + +      ++P  G V+++         A NAL GR F G 
Sbjct: 440 EDDVIEECNKHGGVLHVYVDK------QSPA-GNVYVKCPSIATAVLAVNALHGRWFAGR 492

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 493 VIAAAYVPLVNY 504


>gi|294659352|ref|XP_461720.2| DEHA2G04004p [Debaryomyces hansenii CBS767]
 gi|199433897|emb|CAG90172.2| DEHA2G04004p [Debaryomyces hansenii CBS767]
          Length = 636

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/307 (22%), Positives = 128/307 (41%), Gaps = 29/307 (9%)

Query: 220 HEKKFAFVEMRTVEEASNAMALDGI---IFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL 276
           + +K +F EM+ V+       +  I     + + + + RP +Y   +   L P       
Sbjct: 346 NSRKLSFNEMKLVKNDDEGNPIHQIGQDTGDDIVLDISRPGEY---VVQCLPP------- 395

Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
             + +          + P ++ V  +P    ET++ + ++  GT+ GF ++++  T  S 
Sbjct: 396 -YSEIKEDEIEESVTDSPRKITVL-VPSTLDETELIKNIKEVGTIKGFQMLREIGTKKSL 453

Query: 337 GYGFC-VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQ 395
           G  F   Y DP        A+  ++   + L          S  E          H +IQ
Sbjct: 454 GIAFLEFYIDPTKYQKTINAIPVIQTLVEDL-------KQSSFIEDAFFSCIIPDHTSIQ 506

Query: 396 KMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGT 455
              +  S +  L     +     ++V+ L   +TA  L DD  ++ I +D+++E  K+G 
Sbjct: 507 DCPIDLSTLKKLVKNEHVTTHPSSRVIQLINIVTAKDLMDDASFKFIQKDIQQEVSKFGN 566

Query: 456 LVNVVIPRP--DQNGGET----PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYY 509
           L  + IPRP  D   G +    PG+GK+++E+ D      A   L+GR +   TV   +Y
Sbjct: 567 LKTIKIPRPANDYTPGISQFTQPGLGKIYIEFDDEETALNAIMGLAGRMYNDRTVLCSFY 626

Query: 510 PEDKYFN 516
             D + N
Sbjct: 627 DYDDFKN 633


>gi|301101828|ref|XP_002900002.1| Poly(U)-binding-splicing factor PUF60, putative [Phytophthora
           infestans T30-4]
 gi|262102577|gb|EEY60629.1| Poly(U)-binding-splicing factor PUF60, putative [Phytophthora
           infestans T30-4]
          Length = 444

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 155/371 (41%), Gaps = 63/371 (16%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGN-SAGPGDAVVNVYINHEKKFAFVEM 229
           T  ARR+Y+G L     E+ I+  F+   T    + S  PG        +  K F F+E 
Sbjct: 114 TDLARRLYIGNLYYDLKEEDISNVFAPFGTIRSIDLSLEPG-------ASRSKGFCFLEY 166

Query: 230 RTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
             V  A +A+  L+G      A+RV RP   N     +L  GQ               AI
Sbjct: 167 EDVLAAESAVQVLNGTPLANRAIRVGRPHRGNTNSNDSLSIGQE--------------AI 212

Query: 289 GGAEGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDL--VKDRDTGNSKGYGFCVYQD 345
                P + ++V  +        ++ +   FG +H   +  V   ++G+ +GYGF  + +
Sbjct: 213 KNV--PTKCIYVANVRVELNSQHLESIFSPFGAIHSCVMTAVSPLESGH-RGYGFMRFVE 269

Query: 346 PAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMN 405
            +    A   +NG ++  + L V +A+ +          LA +Q  +        TSG N
Sbjct: 270 ESCALSAIQHMNGFELAGQALKVGKASEAAMLIN-----LATSQDKVVRDGSGATTSGAN 324

Query: 406 TLGG-GMSLFGE-----------TLAKV-LCLTEAITADALADDEEYEEILEDMREECGK 452
            +       FGE           T AK  LCL   +    + D     E+ +++R ECGK
Sbjct: 325 VIAAPEKKPFGEDDVEKVKDTTETDAKCCLCLVNLVNCGEVDD-----ELEDEVRGECGK 379

Query: 453 YGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYP-- 510
           +G++  V I        E     +VF+ + DA G A AK AL GR FGGN V A YYP  
Sbjct: 380 FGSVNKVDI-------HELADHVRVFVLFDDAAGAAKAKQALHGRFFGGNQVQAHYYPLR 432

Query: 511 --EDKYFNKDY 519
             E K +  D+
Sbjct: 433 ELEQKRYTSDF 443


>gi|384498450|gb|EIE88941.1| hypothetical protein RO3G_13652 [Rhizopus delemar RA 99-880]
          Length = 454

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/353 (24%), Positives = 140/353 (39%), Gaps = 49/353 (13%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           R  R V+V  L      +    FFSQ               + +      K   +VE   
Sbjct: 121 RDRRTVFVTQLAARLTTREFDAFFSQ------AGRVREAKIITDRNSRKSKGCGYVEFYD 174

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
                NA+AL G    G+ V V+        L+ A          N  A+     A+G  
Sbjct: 175 ETSVQNALALSGQKLLGIPVLVQ--------LSEA--------EKNRLAMAAQRNAMGVT 218

Query: 292 EGP--DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
             P   R+++G L +  TE  ++++ E FG L   +L KD +TG SKG+GF  Y++    
Sbjct: 219 TEPLYQRLYIGSLHFSLTENDVRQIFEPFGPLDFVNLHKDPETGRSKGFGFIQYKNANDA 278

Query: 350 DIACAALNGLKMGDKTLTV--------RRATASGQSKTEQESI----LAQAQQHIAIQKM 397
             A   +NG ++  + L V           +  G    E E +    L++A+    +   
Sbjct: 279 KQALEKMNGFELAGRNLKVGLVSEKSGTTMSTFGLDDEETEGLALNSLSRAELMAKLAAR 338

Query: 398 ALQTSGMNTLGGGMSL---FGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYG 454
             Q S  +       L         + + L      +   D +   ++  D++ EC KYG
Sbjct: 339 DPQNSPPSRHAPAPVLKPNIPTASTRYVMLNNMFNPNEETDPDWVSDLEADIKIECEKYG 398

Query: 455 TLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCA-TAKNALSGRKFGGNTVNA 506
            + ++ +         +  +G+VFL++ D VG A  A +AL+GR FGG  + A
Sbjct: 399 RVEHIKV--------NSDSMGEVFLKF-DRVGSAEKAISALNGRWFGGKQITA 442


>gi|21726713|emb|CAA71714.2| KIS protein kinase [Mus musculus]
 gi|117616788|gb|ABK42412.1| Kist [synthetic construct]
          Length = 414

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409


>gi|12850652|dbj|BAB28802.1| unnamed protein product [Mus musculus]
          Length = 419

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409


>gi|149058099|gb|EDM09256.1| rCG46339, isoform CRA_a [Rattus norvegicus]
          Length = 251

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 154 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 207

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 208 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 241


>gi|8393668|ref|NP_058989.1| serine/threonine-protein kinase Kist [Rattus norvegicus]
 gi|24211854|sp|Q63285.1|UHMK1_RAT RecName: Full=Serine/threonine-protein kinase Kist; AltName:
           Full=Kinase interacting with stathmin; AltName: Full=PAM
           COOH-terminal interactor protein 2; Short=P-CIP2;
           AltName: Full=U2AF homology motif kinase 1
 gi|1403532|emb|CAA67021.1| KIS [Rattus norvegicus]
 gi|5821768|gb|AAC53031.2| PAM COOH-terminal interactor protein 2 [Rattus norvegicus]
 gi|149058100|gb|EDM09257.1| rCG46339, isoform CRA_b [Rattus norvegicus]
          Length = 419

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409


>gi|40254330|ref|NP_034763.3| serine/threonine-protein kinase Kist [Mus musculus]
 gi|57015387|sp|P97343.3|UHMK1_MOUSE RecName: Full=Serine/threonine-protein kinase Kist; AltName:
           Full=Kinase interacting with stathmin; AltName: Full=PAM
           COOH-terminal interactor protein 2; Short=P-CIP2;
           AltName: Full=U2AF homology motif kinase 1
 gi|27501712|gb|AAO13515.1| KIS kinase [Mus musculus]
 gi|37194893|gb|AAH58732.1| U2AF homology motif (UHM) kinase 1 [Mus musculus]
          Length = 419

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409


>gi|293354575|ref|XP_574033.2| PREDICTED: serine/threonine-protein kinase Kist-like [Rattus
           norvegicus]
          Length = 419

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 322 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 375

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 376 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 409


>gi|344250069|gb|EGW06173.1| Serine/threonine-protein kinase Kist [Cricetulus griseus]
          Length = 375

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 278 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 331

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 332 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 365


>gi|354487456|ref|XP_003505889.1| PREDICTED: serine/threonine-protein kinase Kist-like [Cricetulus
           griseus]
          Length = 461

 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 364 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 417

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 418 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 451


>gi|148707213|gb|EDL39160.1| U2AF homology motif (UHM) kinase 1 [Mus musculus]
          Length = 232

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 135 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 188

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 189 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 222


>gi|442570696|pdb|4FXW|A Chain A, Structure Of Phosphorylated Sf1 Complex With U2af65-uhm
           Domain
 gi|442570698|pdb|4FXW|C Chain C, Structure Of Phosphorylated Sf1 Complex With U2af65-uhm
           Domain
          Length = 106

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
           +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+
Sbjct: 7   EVLCLXNXVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFV 65

Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 66  EFTSVFDCQKAXQGLTGRKFANRVVVTKYCDPDSYHRRDF 105


>gi|195434196|ref|XP_002065089.1| GK15272 [Drosophila willistoni]
 gi|194161174|gb|EDW76075.1| GK15272 [Drosophila willistoni]
          Length = 612

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 86/372 (23%), Positives = 134/372 (36%), Gaps = 63/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 252 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRMITCNKTKRFKGIAYIE 302

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  S A+ L G    GV + V+        L  A    QP  +             
Sbjct: 303 FEDPESVSLALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 350

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 351 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDVIQLIMDTETGRSKGYGFITYHNADD 406

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
              A   LNG ++  + + V   T      T   S+         I   A   LQ     
Sbjct: 407 AKKALEQLNGFELAGRPMKVGNVTERLDMNT--TSLDTDEMDRTGIDLGATGRLQLMFKL 464

Query: 406 TLGGGMSL----------------------FGETLAKVLCLTEAITADALADDEEYEEIL 443
             G G+++                         ++A    +   +   A   +  ++  +
Sbjct: 465 AEGAGLAVPQAAANALLATAPQPAPVQQQQQTPSIATQCFILSNMFDPATETNTTWDSEI 524

Query: 444 ED-MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
            D + EEC K+G ++++ +             G V+++         A NAL GR F G 
Sbjct: 525 RDDVLEECAKHGGVLHIHV-------DTASSTGTVYVKCPSTTTAVLAVNALHGRWFAGR 577

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 578 VITAAYVPLINY 589


>gi|159162769|pdb|1O0P|A Chain A, Solution Structure Of The Third Rna Recognition Motif
           (Rrm) Of U2af65 In Complex With An N-Terminal Sf1
           Peptide
          Length = 104

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
           +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+
Sbjct: 5   EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFV 63

Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 64  EFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 103


>gi|390980893|pdb|3V4M|A Chain A, Crystal Structure Of A Rna Binding Domain Of A U2 Small
           Nuclear Ribonucleoprotein Auxiliary Factor 2 (U2af) From
           Mus Musculus At 1.80 A Resolution
 gi|390980894|pdb|3V4M|B Chain B, Crystal Structure Of A Rna Binding Domain Of A U2 Small
           Nuclear Ribonucleoprotein Auxiliary Factor 2 (U2af) From
           Mus Musculus At 1.80 A Resolution
          Length = 105

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
           +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+
Sbjct: 6   EVLCLXNXVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFV 64

Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 65  EFTSVFDCQKAXQGLTGRKFANRVVVTKYCDPDSYHRRDF 104


>gi|159162801|pdb|1OPI|A Chain A, Solution Structure Of The Third Rna Recognition Motif
           (Rrm) Of U2af65 In Complex With An N-Terminal Sf1
           Peptide
 gi|444302011|pdb|2M0G|B Chain B, Structure, Phosphorylation And U2af65 Binding Of The
           Nterminal Domain Of Splicing Factor 1 During 3 Splice
           Site Recognition
          Length = 104

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 45/100 (45%), Positives = 61/100 (61%), Gaps = 1/100 (1%)

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFL 479
           +VLCL   +  + L DDEEYEEI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+
Sbjct: 5   EVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFV 63

Query: 480 EYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           E+     C  A   L+GRKF    V   Y   D Y  +D+
Sbjct: 64  EFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDF 103


>gi|26326137|dbj|BAC26812.1| unnamed protein product [Mus musculus]
          Length = 330

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 59/94 (62%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L +++EYE+++ED++EEC KYG +V++++P+      E PG G+VF+E
Sbjct: 233 VLRLLNVLDDDYLENEDEYEDVVEDVKEECQKYGPVVSLLVPK------ENPGRGQVFVE 286

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 287 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 320


>gi|388583572|gb|EIM23873.1| splicing factor, CC1-like protein [Wallemia sebi CBS 633.66]
          Length = 459

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 80/374 (21%), Positives = 145/374 (38%), Gaps = 38/374 (10%)

Query: 159 PLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI 218
           P +P++          R V+V  L    N   +  FF      +G  S      V++   
Sbjct: 82  PEIPIEDNVDSLESEQRSVFVSQLSTRTNSSDLRRFFQD---RLGERSIVDARIVMDKNS 138

Query: 219 NHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNL 278
              K   +VE++T      A+ L G +  G+ + V + ++ +    A       + ++  
Sbjct: 139 RRSKGIGYVEVKTASLIDKALELTGELLNGIPMIVTQ-SEADKNRQAKASSSLQTQSVQA 197

Query: 279 AAVGLA---------SGAIGGAEGPD--RVFVGGLPYYFTETQIKELLESFGTLHGFDLV 327
             V  +         S  I  A  P   +V+VG L Y   E  ++ + E FG +   +L 
Sbjct: 198 EEVRRSTKSRDYDNRSSTINPANDPTLYKVYVGSLSYTLKEYDVRSVFEPFGEIEDVELS 257

Query: 328 KDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQ-SKTEQESILA 386
            D D   SKGY +  Y+    + +AC  +N  ++  +TL V+     G   +  ++SI  
Sbjct: 258 VD-DQNRSKGYAYVKYKRMEDSRMACEQMNRFELAGRTLKVQLVNYYGDPVRMPEQSIEN 316

Query: 387 QAQQHIAIQKMALQTSGMNTLGGGMSLFGETLA-------------KVLCLTEAITADAL 433
           +     ++ +  L  + M +         E  A             K + L     A   
Sbjct: 317 EGLNLNSVSRHELMKTLMRSHDPNAQFEQELAAREKEKKVQERMKTKGVLLKYMFKASEE 376

Query: 434 ADDEEYEEILEDMREEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKN 492
            +    +E+ ED++ EC  KYG +  + + +  +        G++ +++Y       A N
Sbjct: 377 TEAGWEKELAEDVKTECENKYGKVQEIGVDKESEE-------GEIVVKFYTIESAEDAIN 429

Query: 493 ALSGRKFGGNTVNA 506
            L+GR FGG  V A
Sbjct: 430 GLNGRWFGGRQVKA 443


>gi|302838915|ref|XP_002951015.1| hypothetical protein VOLCADRAFT_48801 [Volvox carteri f.
           nagariensis]
 gi|300263710|gb|EFJ47909.1| hypothetical protein VOLCADRAFT_48801 [Volvox carteri f.
           nagariensis]
          Length = 82

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           ++FVGGLP  + E  +KELL  FGTL  F+LV D+ TG SKGY FC Y + +  ++    
Sbjct: 2   KLFVGGLPCEWGEDMVKELLIPFGTLKSFNLVMDKSTGKSKGYAFCEYVEDSSAEVLIKN 61

Query: 356 LNGLKMGDKTLTVRRA 371
           L+  ++G K LTV+RA
Sbjct: 62  LHMRRIGSKALTVKRA 77


>gi|414591752|tpg|DAA42323.1| TPA: hypothetical protein ZEAMMB73_939656 [Zea mays]
          Length = 270

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 43/88 (48%), Positives = 51/88 (57%), Gaps = 5/88 (5%)

Query: 112 SGFDMAPP--AAAMLPGAAVPGQLPGVPSAVPEMA--QNMLPFGATQLGAFPLMPVQVMT 167
           SGFD AP   A  ++    +PGQLPGV + +P +    N+    A Q     + P Q MT
Sbjct: 184 SGFDQAPTQQAVPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPHVIQP-QAMT 242

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFF 195
           QQATRHAR VYVGGLPP ANEQ I    
Sbjct: 243 QQATRHARPVYVGGLPPTANEQVITWIV 270


>gi|357154605|ref|XP_003576839.1| PREDICTED: splicing factor U2af large subunit A-like [Brachypodium
           distachyon]
          Length = 177

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 65/104 (62%), Gaps = 2/104 (1%)

Query: 420 KVLCLTEAITADA--LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKV 477
           KV+CL + I+ADA  L DDE YE++++++ +E  K+G L++VVIPRP        GVG+V
Sbjct: 66  KVVCLAQMISADAEDLRDDELYEDLVDEVEDEAWKFGHLMSVVIPRPGHAPAAAAGVGRV 125

Query: 478 FLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYSA 521
           FLEY D  G    K  L  R FGG  + A +YP+DK+   DY  
Sbjct: 126 FLEYADLEGSDRCKTKLHWRWFGGRRIVAAFYPKDKFAGGDYDV 169


>gi|323451698|gb|EGB07574.1| hypothetical protein AURANDRAFT_64670 [Aureococcus anophagefferens]
          Length = 214

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 62/110 (56%)

Query: 164 QVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK 223
           Q+ +  ATR  RR+YVG LP   + + +  F ++ + A G   AG  + VV+ +++ +KK
Sbjct: 43  QMPSNPATRKERRLYVGNLPQTFDSEQLRIFLNEALRACGAIPAGVDEVVVSSWVSPDKK 102

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPS 273
           FAFVE+ TVE A+ ++ L GI   G  +++  P +Y        GP  P+
Sbjct: 103 FAFVELSTVEAATTSLGLSGITCMGCQLKICHPNNYVVGALPGTGPTLPA 152


>gi|300637966|gb|ADK26147.1| kinase-interacting stathmin [Oryctolagus cuniculus]
          Length = 254

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 58/94 (61%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L ++EEYE+I+ED++EEC KYG +V++++P+      E PG  +VF+E
Sbjct: 165 VLRLLNVLDDDYLENEEEYEDIVEDVKEECQKYGPVVSLLVPK------ENPGRRQVFVE 218

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 219 YANAGDSKAAQKLLTGRMFDGKFVVATFYPLSAY 252


>gi|403281374|ref|XP_003932163.1| PREDICTED: RNA-binding protein 39 [Saimiri boliviensis boliviensis]
          Length = 502

 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 122/315 (38%), Gaps = 43/315 (13%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQT 401
            + D      A   LNG ++  + + V   T    + +    + +   +   I    L T
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVTERTDASSASSFLDSDELERTGID---LGT 353

Query: 402 SG----MNTLGGGMSL-FGETLAKVLCLTEAITADALAD-----DEEYE-------EILE 444
           +G    M  L  G  L       + L ++ ++   A+AD      ++ E         ++
Sbjct: 354 TGRLQLMARLAEGTGLQIPPAAQQALQMSGSLAFGAVADLQTRLSQQTEASALAAAASVQ 413

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGV---GKVFLEYYDAVGCATAKNALSGRKFGG 501
            +  +C +   + N   P+   N    P     G V+++         A NAL GR F G
Sbjct: 414 PLATQCFQLSNMFN---PQTQWNNYHKPFCGYRGNVYVKCPSIAAAIAAVNALHGRWFAG 470

Query: 502 NTVNAFYYPEDKYFN 516
             + A Y P   Y N
Sbjct: 471 KMITAAYVPLPTYHN 485


>gi|60599450|gb|AAX26270.1| unknown [Schistosoma japonicum]
          Length = 156

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 89/164 (54%), Gaps = 11/164 (6%)

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHI-AIQKMALQTSGMNTLGGGMSLF--G 415
           +++GDK L V+RA+   +  T    +L Q    +  ++   +Q    NT G G      G
Sbjct: 1   MQLGDKKLIVQRASVGAKHTT---GVLPQTLLSLPGLEDGTVQ----NTTGSGNITIRSG 53

Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVG 475
               +VLCL   I    L DDEEYE+I+ED+R EC KYG + ++ IPRP   G E PGVG
Sbjct: 54  GPPTEVLCLMNMIETSELEDDEEYEDIVEDVRAECSKYGVVRSLEIPRPIP-GVEVPGVG 112

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
           K+++E+   + C  A  AL+GRKF    V   ++  D Y  +++
Sbjct: 113 KIYVEFASLIDCQKAATALTGRKFNQRLVVTSFFSPDNYHRREF 156


>gi|307108143|gb|EFN56384.1| expressed protein [Chlorella variabilis]
          Length = 404

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 54/101 (53%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           Q TR  RR+YVGGLP    +  + TF +Q + A+G         ++   +  E+ FAF+E
Sbjct: 246 QMTRPMRRLYVGGLPQPCYDFMLTTFLNQALMALGICQVAGKAPIIACQVTPERNFAFIE 305

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGP 269
                +A+ A+ LDGI F G  ++++RP DY P   A   P
Sbjct: 306 FGDTSDATAALQLDGIPFRGNTLKIKRPKDYTPPFGAPPDP 346


>gi|340506971|gb|EGR33003.1| u2 small nuclear ribonucleoprotein auxiliary factor 2, putative
           [Ichthyophthirius multifiliis]
          Length = 302

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 5/106 (4%)

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRP---DQNGGE--TPG 473
            +VL L   I    L  DEEY++I ED+++EC K+G +V++ IPRP   D   G+    G
Sbjct: 192 TQVLVLKNMINDGELIIDEEYKQIEEDVKDECSKHGKVVSIAIPRPSVDDVKAGKEHVLG 251

Query: 474 VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            GK+++EY        A+  L+GR F   TV   Y+   KY  +DY
Sbjct: 252 KGKIYVEYESIEAAREARRYLNGRLFSNRTVQVSYFNYQKYLEQDY 297


>gi|358342556|dbj|GAA49996.1| RNA-binding protein 39 [Clonorchis sinensis]
          Length = 730

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 88/208 (42%), Gaps = 27/208 (12%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK---FAFVE 228
           R AR V+V  L     ++ +  FF+ V         G    V  +  N  K+    A+VE
Sbjct: 76  RDARTVFVWQLSARIRQRDLEDFFTSV---------GKIRDVRLIMDNKTKRSKGIAYVE 126

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R VE A  A+ L G    GV +++++       + A   P  P P              
Sbjct: 127 FREVESAQLALGLTGTRLLGVPIQIQQSHAEKNRMNAI--PSVPKPTQQ----------- 173

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP ++++G L Y  TE  +K + E FG +    L+KD  TG S+GYGF  Y +   
Sbjct: 174 --NRGPMKLYIGSLHYNITEEMLKGIFEPFGKIDDIKLIKDPATGRSQGYGFVTYANSDD 231

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQ 376
              A   LNG ++  + + V   T  G+
Sbjct: 232 AKKALDQLNGFELAGRPMKVNHVTERGE 259


>gi|345496803|ref|XP_003427819.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 2
           [Nasonia vitripennis]
          Length = 425

 Score = 75.1 bits (183), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|156553552|ref|XP_001601896.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 1
           [Nasonia vitripennis]
          Length = 434

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|195116809|ref|XP_002002944.1| GI10246 [Drosophila mojavensis]
 gi|193913519|gb|EDW12386.1| GI10246 [Drosophila mojavensis]
          Length = 617

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 87/370 (23%), Positives = 134/370 (36%), Gaps = 59/370 (15%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 257 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 307

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L +A  P QP  +             
Sbjct: 308 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQSAPPPFQPKAHT------------ 355

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 356 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 411

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA-LQTSGMNTL 407
              A   LNG ++  + + V   T      T         +  I +     LQ       
Sbjct: 412 AKKALEQLNGFELAGRPMKVGNVTERLDMNTSSLDTDEMDRSGIDLGATGRLQLMFKLAE 471

Query: 408 GGGMSLFGETL----------AKVL-------CLTEAITADALAD--DEEYEEILEDMRE 448
           G G+++               A VL         T+      + D   E       D+R+
Sbjct: 472 GAGLAVPQAAANALLATAPQPAPVLQQQQTPSIATQCFILSNMFDPRTETNPTWATDVRD 531

Query: 449 E----CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           +    C K+G ++++ +             G V+++         A NAL GR F G  +
Sbjct: 532 DVLDECAKHGGVLHIHV-------DTVSPTGTVYVKCPSTTTAVLAVNALHGRWFAGRVI 584

Query: 505 NAFYYPEDKY 514
            A Y P   Y
Sbjct: 585 TAAYVPVINY 594


>gi|307202383|gb|EFN81811.1| Cleavage stimulation factor 64 kDa subunit [Harpegnathos saltator]
          Length = 439

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|350401751|ref|XP_003486249.1| PREDICTED: cleavage stimulation factor subunit 2-like [Bombus
           impatiens]
          Length = 441

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|170058744|ref|XP_001865056.1| splicing factor [Culex quinquefasciatus]
 gi|167877732|gb|EDS41115.1| splicing factor [Culex quinquefasciatus]
          Length = 546

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 138/372 (37%), Gaps = 62/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R  R V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 188 RDMRTVFCMQLSQRIRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 238

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            R  E  + A+ L G    G+ + V+        LA            N+         +
Sbjct: 239 FRDPESVALALGLSGQRLLGIPISVQHTQAEKNRLA------------NIPPPPPPKVIV 286

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           G    P R++VG L +  TE  ++ + E FG +    L+ D DTG SKGYGF  + +   
Sbjct: 287 G----PMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDSDTGRSKGYGFITFHNADD 342

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
              A   LNG ++  + + V   T      T   S+         I   A   LQ     
Sbjct: 343 AKKALEQLNGFELAGRPMKVGNVTER-LDVTTHASLDTDEMDRSGIDLGATGRLQLMFKL 401

Query: 406 TLGGGMSL----------------------FGETLAKVLCLTEAITADALADDEEYEEIL 443
             G G+++                          +A    L   +   A   +  ++  +
Sbjct: 402 AEGAGLAVPRAAADALLATAPQPAPNQPVQDSPAIATQCFLLSNMFDPATETNPSWDVEI 461

Query: 444 E-DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           E D+ EEC K+G +++V + +      ++P  G V+++         A NAL GR F G 
Sbjct: 462 EDDVIEECNKHGGVLHVYVDK------QSPA-GNVYVKCPSIATAVLAVNALHGRWFAGR 514

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 515 VIAAAYVPLVNY 526


>gi|66516308|ref|XP_623321.1| PREDICTED: cleavage stimulation factor subunit 2 [Apis mellifera]
          Length = 441

 Score = 75.1 bits (183), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|432103844|gb|ELK30681.1| Serine/threonine-protein kinase Kist [Myotis davidii]
          Length = 435

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 39/94 (41%), Positives = 57/94 (60%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +  D L  +E YE+ +ED+REEC KYG +V++++P+      E+PG G+VF+E
Sbjct: 338 VLRLLNVLDGDYLESEEGYEDAVEDVREECQKYGPVVSLLVPK------ESPGRGQVFVE 391

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+  L+GR F G  V A +YP   Y
Sbjct: 392 YANAGDSKAAQKLLTGRLFDGKFVVATFYPLSAY 425


>gi|193598819|ref|XP_001951232.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 1
           [Acyrthosiphon pisum]
 gi|328706164|ref|XP_003243012.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 386

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/89 (41%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 14  VFVGNIPYEATEEKLKDIFSEVGPVISFKLVYDRETGKPKGYGFCEYKDQETALSAMRNL 73

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S++
Sbjct: 74  NGYEIGGRTLRVDNA-CTEKSRLEMQSLM 101


>gi|380028061|ref|XP_003697730.1| PREDICTED: cleavage stimulation factor subunit 2-like [Apis florea]
          Length = 441

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|116793682|gb|ABK26841.1| unknown [Picea sitchensis]
          Length = 347

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 98/209 (46%), Gaps = 29/209 (13%)

Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY---INHEKKFAFVEMRTV 232
           ++YVG LP   + + +A  F +         +G  + V  +Y       + FAFV M TV
Sbjct: 160 KLYVGNLPFDIDSEGLAKMFDE---------SGVVEMVEVIYDRSSGRSRGFAFVTMSTV 210

Query: 233 EEASNAMA-LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           EEA  A+   +G   +G ++RV  P    P L     P +   N             G  
Sbjct: 211 EEAEAAIKKFNGFEIDGRSLRVNFPE--VPRLQNGRSPARSPSNFG-----------GFV 257

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           + P +V+VG L +  T   ++E L   G + G  +++DR+TG S+G+GF  +   A  + 
Sbjct: 258 DSPHKVYVGNLAWSVTSETLREALNGKGNVLGAKVIQDRETGRSRGFGFVSFSSEAEVEA 317

Query: 352 ACAALNGLKMGDKTLTVRRA---TASGQS 377
           A + ++GL++  +++ V  A   +  GQS
Sbjct: 318 AVSEMDGLEVEGRSIRVNVAKSRSTEGQS 346


>gi|281209343|gb|EFA83511.1| RNA-binding region RNP-1 domain-containing protein [Polysphondylium
            pallidum PN500]
          Length = 1109

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 102/236 (43%), Gaps = 25/236 (10%)

Query: 295  DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
            +R++VG +P+   E QIK +  S G +    L+ + ++G  KG+GF  Y +    + A A
Sbjct: 873  NRIYVGSIPWNVNEDQIKVIFSSIGNVVSCSLMPNLESGRHKGFGFIDYDNSKSAEDAIA 932

Query: 355  ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
             LNG  +G + L V R   +    +  ES          +  ++  + G +++   ++L 
Sbjct: 933  TLNGYDIGGRQLKVGRPIKNASISSSNESKQTTPLSTPMVPTLS-SSVGTDSIEDDVTLS 991

Query: 415  GETLAKVLCLTEAITADA------------LADDEEYEEIL-EDMREECGKYGTLVNVVI 461
             E   ++L   + +  D             L   +E ++   E+++ EC  +G +   V+
Sbjct: 992  TE--QRILLTQKLLRQDISRTSNRCLVLRNLGSPKEIDDFFEEEIKAECMSFGQVEKFVL 1049

Query: 462  PRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
                          K F+ + +   CAT  N  +GR F G  V A YY +   FNK
Sbjct: 1050 TH--------DASVKAFILFKEPAACATCFNKQNGRYFSGYIVKAEYY-DISLFNK 1096



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 37/75 (49%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           R+++G + +  TET +  +   FG +    L KD  TG SKGY F  Y  P   + A + 
Sbjct: 686 RIYIGNIHFNLTETDLTSIFSPFGPIKSLSLSKDPATGKSKGYCFIEYSYPEAANNAISH 745

Query: 356 LNGLKMGDKTLTVRR 370
           +N   +  + + V R
Sbjct: 746 MNHQSLAGRQIKVGR 760


>gi|383860217|ref|XP_003705587.1| PREDICTED: cleavage stimulation factor subunit 2-like [Megachile
           rotundata]
          Length = 441

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|307172466|gb|EFN63915.1| Cleavage stimulation factor 64 kDa subunit [Camponotus floridanus]
          Length = 438

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|322788027|gb|EFZ13868.1| hypothetical protein SINV_14012 [Solenopsis invicta]
          Length = 291

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|193671655|ref|XP_001946102.1| PREDICTED: cleavage stimulation factor subunit 2-like
           [Acyrthosiphon pisum]
          Length = 388

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/89 (41%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 14  VFVGNIPYEATEEKLKDIFNEVGPVISFKLVYDRETGKPKGYGFCEYKDQETALSAMRNL 73

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S++
Sbjct: 74  NGYEIGGRTLRVDNA-CTEKSRLEMQSLM 101


>gi|332026262|gb|EGI66401.1| Cleavage stimulation factor 64 kDa subunit [Acromyrmex echinatior]
          Length = 480

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 53/89 (59%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEENLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E +S+L
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQSLL 106


>gi|242018247|ref|XP_002429590.1| A-kinase anchor protein, putative [Pediculus humanus corporis]
 gi|212514557|gb|EEB16852.1| A-kinase anchor protein, putative [Pediculus humanus corporis]
          Length = 408

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/96 (38%), Positives = 58/96 (60%), Gaps = 2/96 (2%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +P+  TE ++KE+    G +  F LV DR+ G  KGYGFC Y+D    + A   L
Sbjct: 16  VFVGNIPFDLTEEKLKEIFSEVGPVLSFKLVYDRENGKPKGYGFCEYKDIETANSAMRNL 75

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
           NG ++G + L V  A A+ +++ E ++++ QA + I
Sbjct: 76  NGFEIGGRVLKVDNA-ANEKTRMEMQNMI-QANEPI 109


>gi|195471585|ref|XP_002088083.1| GE14328 [Drosophila yakuba]
 gi|194174184|gb|EDW87795.1| GE14328 [Drosophila yakuba]
          Length = 590

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/372 (23%), Positives = 135/372 (36%), Gaps = 63/372 (16%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 230 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 280

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L  A    QP  +             
Sbjct: 281 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 328

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 329 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 384

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMA---LQTSGMN 405
              A   LNG ++  + + V   T      T   S+         I   A   LQ     
Sbjct: 385 AKKALEQLNGFELAGRLMKVGNVTERLDMNT--TSLDTDEMDRTGIDLGATGRLQLMFKL 442

Query: 406 TLGGGMSL-----------------FGETLAKVLCLTEAITADALAD-----DEEYEEIL 443
             G G+++                   +  A     T+      + D     +  ++  +
Sbjct: 443 AEGAGLAVPQAAANALLATAPQPAPLQQQEAAPSIATQCFILSNMFDPRTETNPTWDVEI 502

Query: 444 ED-MREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGN 502
           +D + EEC K+G ++++ +             G V+++         A NAL GR F G 
Sbjct: 503 KDDVLEECAKHGGVLHIHV-------DTISPTGTVYVKCPSTTTAVLAVNALHGRWFAGR 555

Query: 503 TVNAFYYPEDKY 514
            + A Y P   Y
Sbjct: 556 VITAAYVPVINY 567


>gi|116781814|gb|ABK22250.1| unknown [Picea sitchensis]
          Length = 355

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 98/209 (46%), Gaps = 29/209 (13%)

Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVY---INHEKKFAFVEMRTV 232
           ++YVG LP   + + +A  F +         +G  + V  +Y       + FAFV M TV
Sbjct: 168 KLYVGNLPFDIDSEGLAKMFDE---------SGVVEMVEVIYDRSSGRSRGFAFVTMSTV 218

Query: 233 EEASNAMA-LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           EEA  A+   +G   +G ++RV  P    P L     P +   N             G  
Sbjct: 219 EEAEAAIKKFNGFEIDGRSLRVNFPE--VPRLQNGRSPARSPSNFG-----------GFV 265

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           + P +V+VG L +  T   ++E L   G + G  +++DR+TG S+G+GF  +   A  + 
Sbjct: 266 DSPHKVYVGNLAWSVTSETLREALNGKGNVLGAKVIQDRETGRSRGFGFVSFSSEAEVEA 325

Query: 352 ACAALNGLKMGDKTLTVRRA---TASGQS 377
           A + ++GL++  +++ V  A   +  GQS
Sbjct: 326 AVSEMDGLEVEGRSIRVNVAKSRSTEGQS 354


>gi|2459426|gb|AAB80661.1| putative splicing factor U2AF large chain [Arabidopsis thaliana]
          Length = 475

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 122/557 (21%), Positives = 204/557 (36%), Gaps = 141/557 (25%)

Query: 2   SGRDIRYDAVGEGSRHKSSWV--SGRSRTGERGRDRHHRDFKSGGDDRRRDKNYKYDREG 59
           S + +R   V +  R +SS    +G  R  + G  + +R+       +R D    +  E 
Sbjct: 9   SKKRLRSLVVADVPRDESSIKPDNGDKRKNQNGNHKKNREINMS---KRHDPGKVHSVEV 65

Query: 60  IRDHDRTDRHRDYNRD-KERRHRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSGFDMAP 118
               +R ++ +   RD +E+R R RSR H  DR ++ SKS     SP K + ++    A 
Sbjct: 66  SERWERREQPKSRQRDLREKRRRSRSRDHGQDRQKSASKSELGGYSPRKRREQASTKAAS 125

Query: 119 P------------------AAAMLPGAAVPGQLPGVPSAVPEMAQNML-----------P 149
           P                   A M   +   G      +A P +++  L           P
Sbjct: 126 PPNLSSEKKSAKWGLAATVTAGMFSDSVFSGLQAATQTAYPTISEASLTLLKPLMVMDAP 185

Query: 150 F---GATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNS 206
           F    A Q  +F      V   ++TR  RR+Y   +P  A+E+++   F+  M + G N 
Sbjct: 186 FRTPPARQTTSFD----SVQLTESTRRMRRLYAENVPDSASEKSLIECFNGYMLSSGSNH 241

Query: 207 AGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAA 266
               +  ++               T ++AS A++LDG  F G  +++RRP DY   + + 
Sbjct: 242 IKGSEPCISCI-----------FLTPQDASAALSLDGCSFAGSNLKIRRPKDYLMEIVSV 290

Query: 267 LGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDL 326
            GP                                                   L  +  
Sbjct: 291 FGP---------------------------------------------------LKAYRF 299

Query: 327 VKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQES--- 383
           V + D   ++   F  Y D +VT  ACA LNG+++G   +T   A     S    E+   
Sbjct: 300 VSNNDL--NQRCAFLEYTDGSVTLKACAGLNGMRLGGSVITAVCAFPDASSVAVNENPPF 357

Query: 384 --ILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALA--DDEEY 439
             I + A+  +   K                        +L L   +  + L    ++E 
Sbjct: 358 YGIPSHAKPLLGKPK-----------------------NILKLKNVVDPEDLTSFSEQEV 394

Query: 440 EEILEDMREECGKY--GTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGR 497
           +EILED+R EC ++  G  +        ++  ET   G +F+EY        A ++L GR
Sbjct: 395 KEILEDVRLECARWDAGDKIEEEQEEDPEDVFET---GCIFIEYRRPEATCDAAHSLHGR 451

Query: 498 KFGGNTVNAFYYPEDKY 514
            +    V A Y  ++ Y
Sbjct: 452 LYDNRIVKAEYVSKELY 468


>gi|391347243|ref|XP_003747874.1| PREDICTED: cleavage stimulation factor subunit 2-like [Metaseiulus
           occidentalis]
          Length = 422

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 54/91 (59%), Gaps = 1/91 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K + E  G +  F LV DR+TG  KGYGFC ++D      A   L
Sbjct: 20  VFVGNIPYDATEEQLKTIFEEVGPVVNFRLVYDRETGKPKGYGFCEFKDQETAMSAMRNL 79

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           N  ++G + L V  A AS ++K E +++ AQ
Sbjct: 80  NSFEIGGRALRVDHA-ASERNKEELKALYAQ 109


>gi|330793087|ref|XP_003284617.1| hypothetical protein DICPUDRAFT_148416 [Dictyostelium purpureum]
 gi|325085416|gb|EGC38823.1| hypothetical protein DICPUDRAFT_148416 [Dictyostelium purpureum]
          Length = 829

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/223 (25%), Positives = 108/223 (48%), Gaps = 19/223 (8%)

Query: 295 DRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA 354
           +R+++G + +  TE QI+ +   FG +    L+++ +TG  KGYGF  +++    D A  
Sbjct: 622 NRIYIGSINWNVTEEQIRGIFSQFGKIISCFLMQNTETGKHKGYGFIDFENKKSADDAL- 680

Query: 355 ALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
           A+NG ++  + + V R T    + T    I   +    +++  A+ T+    +     L 
Sbjct: 681 AMNGFELLGRAMKVGRPTKGASANT----ISNGSIDKTSLEGEAMLTTSDQRIQLTQKLL 736

Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
           G    K L L  A + D +  D  +E   ED+R  C ++G +  +VI + D +       
Sbjct: 737 GNE-NKCLVLRNAGSPDDI--DPSFE---EDIRSGCNEFGEIEKLVI-KTDSS------T 783

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
            +V++ + +A  C   ++ L+G+ F  + + A +Y +   FNK
Sbjct: 784 VRVYIVFKEAPSCVACQSKLNGKYFSYHCIKAEFY-DINLFNK 825



 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           RV++G L +   E  I +    FGT+    L KD + G SKGY F  Y+ P     A  +
Sbjct: 464 RVYIGNLHFSLAEDAIIQAFSQFGTVKSILLGKDAN-GKSKGYAFIEYESPDSATKAIES 522

Query: 356 LNGLKMGDKTLTVRRATASGQ 376
           ++   M  + + V R  A GQ
Sbjct: 523 MSNYVMAGRVIKVNRPLAGGQ 543


>gi|194862772|ref|XP_001970115.1| GG10454 [Drosophila erecta]
 gi|190661982|gb|EDV59174.1| GG10454 [Drosophila erecta]
          Length = 593

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 86/374 (22%), Positives = 133/374 (35%), Gaps = 67/374 (17%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  K+F   A++E
Sbjct: 233 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 283

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L  A    QP  +             
Sbjct: 284 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNAAPAFQPKSHT------------ 331

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 332 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDAIQLIMDTETGRSKGYGFITYHNADD 387

Query: 349 TDIACAALNGLKMGDKTLTVRRAT-----------------------ASGQS----KTEQ 381
              A   LNG ++  + + V   T                       A+G+     K  +
Sbjct: 388 AKKALEQLNGFELAGRLMKVGNVTERLDMNTTSLDTDEMDRTGIDLGATGRLQLMFKLAE 447

Query: 382 ESILAQAQQHIAIQKMAL-QTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYE 440
            + LA  Q           Q + M       S+  +              +   D E  +
Sbjct: 448 GAGLAVPQAAANALLATAPQPAPMQQQEAAPSIATQCFILSNMFDPRTETNPTWDVEIRD 507

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
           ++LE    EC K+G ++++ +             G V+++         A NAL GR F 
Sbjct: 508 DVLE----ECAKHGGVLHIHV-------DTISPTGTVYVKCPSTTTAVLAVNALHGRWFA 556

Query: 501 GNTVNAFYYPEDKY 514
           G  + A Y P   Y
Sbjct: 557 GRVITAAYVPVINY 570


>gi|30685698|ref|NP_850209.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
 gi|330253740|gb|AEC08834.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
          Length = 979

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 76/165 (46%), Gaps = 18/165 (10%)

Query: 117 APPAAAMLPGAAVPGQLPGVPSAVPEMAQNML-----------PFG---ATQLGAFPLMP 162
           A   A M   +   G      +A P +++  L           PF    A Q  +F    
Sbjct: 810 ATVTAGMFSDSVFSGLQAATQTAYPTISEASLTLLKPLMVMDAPFRTPPARQTTSFD--- 866

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
             V   ++TR  RR+Y   +P  A+E+++   F+  M + G N     +  ++  IN EK
Sbjct: 867 -SVQLTESTRRMRRLYAENVPDSASEKSLIECFNGYMLSSGSNHIKGSEPCISCIINKEK 925

Query: 223 KFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAAL 267
             A VE  T ++AS A++LDG  F G  +++RRP DY  T  +++
Sbjct: 926 SQALVEFLTPQDASAALSLDGCSFAGSNLKIRRPKDYVRTTVSSI 970


>gi|262348230|gb|ACY56333.1| putative splicing factor u2af large subunit, partial [Monascus
           ruber]
          Length = 90

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 35/75 (46%), Positives = 54/75 (72%), Gaps = 2/75 (2%)

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCAT-AKNALSGRKF 499
           EI++D+R+EC KYGT++ + IPRP   G ++PGVGK++++ +D+V  AT A  AL+GRKF
Sbjct: 12  EIMDDVRDECSKYGTILELKIPRPTTGGRQSPGVGKIYVK-FDSVKSATEALKALAGRKF 70

Query: 500 GGNTVNAFYYPEDKY 514
              TV   Y+ E+ +
Sbjct: 71  SDRTVVTTYFSEENF 85


>gi|426252723|ref|XP_004020052.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           isoform 1 [Ovis aries]
          Length = 572

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|126342916|ref|XP_001364467.1| PREDICTED: cleavage stimulation factor subunit 2-like [Monodelphis
           domestica]
          Length = 551

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|156363387|ref|XP_001626026.1| predicted protein [Nematostella vectensis]
 gi|156212886|gb|EDO33926.1| predicted protein [Nematostella vectensis]
          Length = 468

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 105/248 (42%), Gaps = 37/248 (14%)

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
           GP R++VG L +  TE  +K + E FGT+    L+ D +T  SKGYGF  +++      A
Sbjct: 213 GPTRLYVGSLHFNITEAMVKAVFEPFGTVDSVQLIYDSETNRSKGYGFVQFREAEAAKRA 272

Query: 353 CAALNGLKMGDKTLTVRRATASGQS-------------------KTEQESILAQAQQ-HI 392
              +NG ++  + L +   T  G S                    + + +++A+  Q H 
Sbjct: 273 MEQMNGFELAGRPLKIGPVTERGDSSAYSFLDDEEYEKGGVELNSSARAALMAKLSQGHS 332

Query: 393 AIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALAD-----DEEYE-EILEDM 446
           A     L   G   +  G+     T   V   T       + D     D  ++ +I +D+
Sbjct: 333 A----GLSVPGAPPIVSGVQQALATPVAVSLPTPCFMLTNMFDPTKERDAGWDLDIRDDV 388

Query: 447 REECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNA 506
            EEC K+G +V++ +   D+N  +    G V+++        +A  +L GR F G  + A
Sbjct: 389 LEECNKFGPIVHIHV---DKNSPQ----GIVYVKCATPDIAISASKSLHGRWFAGKQIIA 441

Query: 507 FYYPEDKY 514
              P   Y
Sbjct: 442 APVPLSNY 449


>gi|88682979|gb|AAI05553.1| CSTF2 protein [Bos taurus]
          Length = 632

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|22478042|gb|AAH36719.1| Cstf2 protein [Mus musculus]
          Length = 510

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|332372985|gb|AEE61634.1| unknown [Dendroctonus ponderosae]
          Length = 409

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 55/91 (60%), Gaps = 1/91 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 16  VFVGNIPYEATEEKLKDIFGEVGQVLSFKLVFDRETGKPKGYGFCEYRDQETALSAMRNL 75

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           NG ++G ++L V  A  + +S+ E +++L Q
Sbjct: 76  NGYEIGGRSLRVDNA-CTEKSRMEMQNLLNQ 105


>gi|157821159|ref|NP_001101056.1| cleavage stimulation factor subunit 2 tau variant [Rattus
           norvegicus]
 gi|149062701|gb|EDM13124.1| rCG47773 [Rattus norvegicus]
          Length = 629

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|449498377|ref|XP_002191180.2| PREDICTED: cleavage stimulation factor subunit 2 [Taeniopygia
           guttata]
          Length = 575

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|344275007|ref|XP_003409305.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           [Loxodonta africana]
          Length = 609

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|301788792|ref|XP_002929813.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
           [Ailuropoda melanoleuca]
          Length = 552

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|197246895|gb|AAI69065.1| Cleavage stimulation factor, 3' pre-RNA subunit 2, tau [Rattus
           norvegicus]
          Length = 629

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|196115100|ref|NP_001124486.1| cleavage stimulation factor, 3' pre-RNA subunit 2 [Rattus
           norvegicus]
 gi|195539770|gb|AAI68251.1| Cstf2 protein [Rattus norvegicus]
          Length = 575

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|336088638|ref|NP_001229511.1| cleavage stimulation factor subunit 2 tau variant [Bos taurus]
 gi|296472872|tpg|DAA14987.1| TPA: CSTF2 protein-like [Bos taurus]
          Length = 642

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|426252727|ref|XP_004020054.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           isoform 3 [Ovis aries]
          Length = 623

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|426223062|ref|XP_004005698.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
           [Ovis aries]
          Length = 607

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|426252729|ref|XP_004020055.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           isoform 4 [Ovis aries]
          Length = 646

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|47225344|emb|CAG09844.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 570

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 28  VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 87

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 88  NGREFSGRALRVDNA-ASEKNKEELKSL 114


>gi|194332803|ref|NP_001123707.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Xenopus
           (Silurana) tropicalis]
 gi|189442601|gb|AAI67314.1| LOC100170457 protein [Xenopus (Silurana) tropicalis]
          Length = 498

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|431895723|gb|ELK05144.1| Cleavage stimulation factor 64 kDa subunit [Pteropus alecto]
          Length = 577

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|50510589|dbj|BAD32280.1| mKIAA0689 protein [Mus musculus]
          Length = 643

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 29  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 88

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 89  NGREFSGRALRVDNA-ASEKNKEELKSL 115


>gi|20072518|gb|AAH26995.1| Cstf2t protein [Mus musculus]
          Length = 637

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|26328597|dbj|BAC28037.1| unnamed protein product [Mus musculus]
          Length = 580

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|68481591|ref|XP_715304.1| potential spliceosomal U2AF large subunit [Candida albicans SC5314]
 gi|46436920|gb|EAK96275.1| potential spliceosomal U2AF large subunit [Candida albicans SC5314]
          Length = 719

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 90/214 (42%), Gaps = 30/214 (14%)

Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
           I+ L ++ G L G   ++ + T N  G  F  ++D   ++     L  L        + R
Sbjct: 524 IESLEKNVGALQGIQFLRQKGTKNLLGLVFVEFKD--SSNDVIGKLRRLPF------ITR 575

Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
           A  S         IL        IQK  +    +  L    ++     ++V+ L  A+T 
Sbjct: 576 AFHS--------CILPNK---TPIQKGPIDFHSLKNLVENKNVAPHPSSRVIRLLNAVTE 624

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV---------GKVFLEY 481
             LADD  Y  I  DM  E  KYG +VNV IPRP +N   TPG+         G +++E+
Sbjct: 625 SELADDATYSFIRNDMYNEASKYGEVVNVRIPRPSRN--HTPGILQFNTSTGLGTIYIEF 682

Query: 482 YDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
            D      A   L+G+ +   TV A +Y  D + 
Sbjct: 683 KDEKIALAAMMELAGKSYNDRTVLATFYDFDDFL 716


>gi|18875338|ref|NP_573459.1| cleavage stimulation factor subunit 2 [Mus musculus]
 gi|71153229|sp|Q8BIQ5.2|CSTF2_MOUSE RecName: Full=Cleavage stimulation factor subunit 2; AltName:
           Full=CF-1 64 kDa subunit; AltName: Full=Cleavage
           stimulation factor 64 kDa subunit; Short=CSTF 64 kDa
           subunit; Short=CstF-64
 gi|11139720|gb|AAG31814.1|AF317552_1 polyadenylation protein CSTF64 [Mus musculus]
 gi|26353226|dbj|BAC40243.1| unnamed protein product [Mus musculus]
          Length = 580

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|148709763|gb|EDL41709.1| cleavage stimulation factor, 3' pre-RNA subunit 2, tau [Mus
           musculus]
          Length = 644

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 30  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 89

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 90  NGREFSGRALRVDNA-ASEKNKEELKSL 116


>gi|426252725|ref|XP_004020053.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           isoform 2 [Ovis aries]
          Length = 612

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|270000744|gb|EEZ97191.1| hypothetical protein TcasGA2_TC004378 [Tribolium castaneum]
          Length = 409

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 1/91 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 16  VFVGNIPYEATEEKLKDIFGEVGQVLSFKLVFDRETGKPKGYGFCEYRDQETALSAMRNL 75

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           NG ++G + L V  A  + +S+ E +++L Q
Sbjct: 76  NGYEIGGRNLRVDNA-CTEKSRMEMQNLLNQ 105


>gi|148277061|ref|NP_112539.2| cleavage stimulation factor subunit 2 tau variant [Mus musculus]
 gi|71153235|sp|Q8C7E9.2|CSTFT_MOUSE RecName: Full=Cleavage stimulation factor subunit 2 tau variant;
           AltName: Full=CF-1 64 kDa subunit tau variant; AltName:
           Full=Cleavage stimulation factor 64 kDa subunit tau
           variant; Short=CSTF 64 kDa subunit tau variant; AltName:
           Full=TauCstF-64
 gi|26330250|dbj|BAC28855.1| unnamed protein product [Mus musculus]
 gi|26350087|dbj|BAC38683.1| unnamed protein product [Mus musculus]
          Length = 632

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|26341156|dbj|BAC34240.1| unnamed protein product [Mus musculus]
          Length = 632

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|410043862|ref|XP_003951699.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Pan
           troglodytes]
          Length = 576

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|301788318|ref|XP_002929575.1| PREDICTED: cleavage stimulation factor subunit 2-like [Ailuropoda
           melanoleuca]
          Length = 582

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|192764314|gb|ACF05701.1| alphaCstF-64 variant 4 [Mus musculus]
          Length = 554

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|194373969|dbj|BAG62297.1| unnamed protein product [Homo sapiens]
          Length = 553

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|194042423|ref|XP_001926989.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           isoform 1 [Sus scrofa]
          Length = 615

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|351715564|gb|EHB18483.1| Cleavage stimulation factor 64 kDa subunit, tau variant
           [Heterocephalus glaber]
          Length = 642

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|348570430|ref|XP_003471000.1| PREDICTED: cleavage stimulation factor subunit 2-like [Cavia
           porcellus]
          Length = 577

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|291408103|ref|XP_002720432.1| PREDICTED: cleavage stimulation factor subunit 2 [Oryctolagus
           cuniculus]
          Length = 576

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|57530169|ref|NP_001006433.1| cleavage stimulation factor subunit 2 [Gallus gallus]
 gi|53128673|emb|CAG31323.1| hypothetical protein RCJMB04_5b8 [Gallus gallus]
          Length = 475

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|51258747|gb|AAH80037.1| Cstf-64-prov protein [Xenopus laevis]
          Length = 498

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|74007924|ref|XP_861405.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 5 [Canis
           lupus familiaris]
          Length = 577

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|26351239|dbj|BAC39256.1| unnamed protein product [Mus musculus]
          Length = 642

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|221111756|ref|XP_002159647.1| PREDICTED: RNA-binding protein 39-like [Hydra magnipapillata]
          Length = 528

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 89/363 (24%), Positives = 144/363 (39%), Gaps = 48/363 (13%)

Query: 169 QATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVE 228
           Q  R +R V++  L      + I  FFS+V              + +      K   +VE
Sbjct: 178 QEERDSRTVFIMQLAKQVTIRDIQDFFSKV------GQVRDVRLISDRNSRRSKGIGYVE 231

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
                  + A+ L G    GV + V      +PT+A            N  A   A  A+
Sbjct: 232 FTDASAVTLAIKLSGQKLLGVPIMV------SPTMAEK----------NRYAA--AQAAL 273

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
              +GP +++VG L Y  TE  ++ + E FGT+    L  D +T  SKG+GF  +++   
Sbjct: 274 VKPQGPMKLYVGSLHYNITEPMLRAIFEPFGTVESVQLQYDSETNRSKGFGFVNFREAGA 333

Query: 349 TDIACAALNGLKMGDKTL---TVRRATASGQS-----KTEQESILAQAQQHIAI-QKMA- 398
              A   +NG ++  + +   TV   T    S     +TE+  I   AQ   ++ QK+A 
Sbjct: 334 AKRAMEQMNGFELAGRPMKVNTVSERTDGSMSFLDDEETEKGGIEMNAQSRASLMQKLAQ 393

Query: 399 -----LQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDE-EYE-EILEDMREECG 451
                LQ      +   +      +A   CL  +   D   + E ++E +I  D+ EE  
Sbjct: 394 THGSGLQVPTAPIIPAMLPTPMMNVAGSTCLILSNLFDPRKETESDWELDIRNDVLEEVT 453

Query: 452 KYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPE 511
           K G +V++ I   D+   E    G V+++              +GR F G T+ A   P 
Sbjct: 454 KMGIVVHISI---DKISAE----GNVYIKTLIPDTAQKILQTFNGRWFAGRTIRAVAIPV 506

Query: 512 DKY 514
             Y
Sbjct: 507 ANY 509


>gi|410927616|ref|XP_003977237.1| PREDICTED: cleavage stimulation factor subunit 2-like [Takifugu
           rubripes]
          Length = 497

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 31  VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 90

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 91  NGREFSGRALRVDNA-ASEKNKEELKSL 117


>gi|348515337|ref|XP_003445196.1| PREDICTED: cleavage stimulation factor subunit 2-like [Oreochromis
           niloticus]
          Length = 478

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 31  VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 90

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 91  NGREFSGRALRVDNA-ASEKNKEELKSL 117


>gi|27807239|ref|NP_777110.1| cleavage stimulation factor subunit 2 [Bos taurus]
 gi|71153228|sp|Q8HXM1.1|CSTF2_BOVIN RecName: Full=Cleavage stimulation factor subunit 2; AltName:
           Full=CF-1 64 kDa subunit; AltName: Full=Cleavage
           stimulation factor 64 kDa subunit; Short=CSTF 64 kDa
           subunit; Short=CstF-64
 gi|24416593|gb|AAN05427.1| CstF-64 [Bos taurus]
 gi|296470997|tpg|DAA13112.1| TPA: cleavage stimulation factor 64 kDa subunit [Bos taurus]
          Length = 572

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|68481460|ref|XP_715369.1| potential spliceosomal U2AF large subunit [Candida albicans SC5314]
 gi|46436988|gb|EAK96342.1| potential spliceosomal U2AF large subunit [Candida albicans SC5314]
 gi|238882073|gb|EEQ45711.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 717

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 90/214 (42%), Gaps = 30/214 (14%)

Query: 311 IKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRR 370
           I+ L ++ G L G   ++ + T N  G  F  ++D   ++     L  L        + R
Sbjct: 522 IESLEKNVGALQGIQFLRQKGTKNLLGLVFVEFKD--SSNDVIGKLRRLPF------ITR 573

Query: 371 ATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITA 430
           A  S         IL        IQK  +    +  L    ++     ++V+ L  A+T 
Sbjct: 574 AFHS--------CILPNK---TPIQKGPIDFHSLKNLVENKNVAPHPSSRVIRLLNAVTE 622

Query: 431 DALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV---------GKVFLEY 481
             LADD  Y  I  DM  E  KYG +VNV IPRP +N   TPG+         G +++E+
Sbjct: 623 SELADDATYSFIRNDMYNEASKYGEVVNVRIPRPSRN--HTPGILQFNTSTGLGTIYIEF 680

Query: 482 YDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
            D      A   L+G+ +   TV A +Y  D + 
Sbjct: 681 KDEKIALAAMMELAGKSYNDRTVLATFYDFDDFL 714


>gi|426257829|ref|XP_004022524.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Ovis
           aries]
          Length = 572

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|344296752|ref|XP_003420068.1| PREDICTED: cleavage stimulation factor subunit 2-like [Loxodonta
           africana]
          Length = 582

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|632500|gb|AAB50269.1| polyadenylation factor 64 kDa subunit [Xenopus laevis]
          Length = 497

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|260790683|ref|XP_002590371.1| hypothetical protein BRAFLDRAFT_216236 [Branchiostoma floridae]
 gi|229275563|gb|EEN46382.1| hypothetical protein BRAFLDRAFT_216236 [Branchiostoma floridae]
          Length = 222

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 55/103 (53%), Gaps = 1/103 (0%)

Query: 278 LAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKG 337
           +AAVGL             VFVG +PY  TE Q+K++    G +  F LV DR+TG  KG
Sbjct: 1   MAAVGLQGLNPAQDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVISFRLVYDRETGKPKG 60

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
           YGFC Y+D      A   LNG ++  + L V  A AS +SK E
Sbjct: 61  YGFCEYKDQETALSAMRNLNGHELNGRQLRVDNA-ASEKSKEE 102


>gi|147898871|ref|NP_001080179.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Xenopus
           laevis]
 gi|27735464|gb|AAH41291.1| Cstf-64 protein [Xenopus laevis]
          Length = 518

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|395820735|ref|XP_003783716.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           [Otolemur garnettii]
          Length = 601

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|332254742|ref|XP_003276491.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1
           [Nomascus leucogenys]
          Length = 577

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|21619877|gb|AAH33135.1| Similar to cleavage stimulation factor, 3' pre-RNA, subunit 2,
           64kD, partial [Homo sapiens]
          Length = 559

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 17  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 76

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 77  NGREFSGRALRVDNA-ASEKNKEELKSL 103


>gi|4557493|ref|NP_001316.1| cleavage stimulation factor subunit 2 [Homo sapiens]
 gi|332861154|ref|XP_003317595.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Pan
           troglodytes]
 gi|397478196|ref|XP_003810439.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Pan
           paniscus]
 gi|426396653|ref|XP_004064546.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Gorilla
           gorilla gorilla]
 gi|461847|sp|P33240.1|CSTF2_HUMAN RecName: Full=Cleavage stimulation factor subunit 2; AltName:
           Full=CF-1 64 kDa subunit; AltName: Full=Cleavage
           stimulation factor 64 kDa subunit; Short=CSTF 64 kDa
           subunit; Short=CstF-64
 gi|181139|gb|AAA35724.1| cleavage stimulation factor [Homo sapiens]
 gi|17389334|gb|AAH17712.1| Cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Homo
           sapiens]
 gi|32879899|gb|AAP88780.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Homo
           sapiens]
 gi|61359609|gb|AAX41742.1| cleavage stimulation factor 3' pre-RNA subunit 2 [synthetic
           construct]
 gi|61359616|gb|AAX41743.1| cleavage stimulation factor 3' pre-RNA subunit 2 [synthetic
           construct]
 gi|119623223|gb|EAX02818.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, isoform
           CRA_a [Homo sapiens]
 gi|123981258|gb|ABM82458.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa
           [synthetic construct]
 gi|123996091|gb|ABM85647.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa
           [synthetic construct]
 gi|261860120|dbj|BAI46582.1| Cleavage stimulation factor 64 kDa subunit [synthetic construct]
 gi|410256936|gb|JAA16435.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Pan
           troglodytes]
 gi|410289934|gb|JAA23567.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Pan
           troglodytes]
          Length = 577

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|194205912|ref|XP_001917732.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
           [Equus caballus]
          Length = 619

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|444707906|gb|ELW49054.1| Cleavage stimulation factor subunit 2 tau variant [Tupaia
           chinensis]
          Length = 654

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|431839028|gb|ELK00957.1| Cleavage stimulation factor 64 kDa subunit, tau variant [Pteropus
           alecto]
          Length = 601

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|402910773|ref|XP_003918026.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Papio
           anubis]
 gi|402910775|ref|XP_003918027.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Papio
           anubis]
          Length = 577

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|102269210|gb|ABF55966.2| cleavage stimulation factor 64-kDa subunit [Bombyx mori]
          Length = 326

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 55/93 (59%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 20  VFVGNIPYEATEEKLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 79

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G ++L V  A  + +S+ E ++++   Q
Sbjct: 80  NGYEIGGRSLRVDNA-CTEKSRMEMQALMQGPQ 111


>gi|74007936|ref|XP_549135.2| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Canis
           lupus familiaris]
          Length = 597

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|417412278|gb|JAA52529.1| Putative mrna cleavage and polyadenylation factor i complex subunit
           rna15, partial [Desmodus rotundus]
          Length = 678

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 52  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 111

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 112 NGREFSGRALRVDNA-ASEKNKEELKSL 138


>gi|355681345|gb|AER96778.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Mustela
           putorius furo]
          Length = 582

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|197100099|ref|NP_001125111.1| cleavage stimulation factor subunit 2 [Pongo abelii]
 gi|71153230|sp|Q5RDA3.1|CSTF2_PONAB RecName: Full=Cleavage stimulation factor subunit 2; AltName:
           Full=CF-1 64 kDa subunit; AltName: Full=Cleavage
           stimulation factor 64 kDa subunit; Short=CSTF 64 kDa
           subunit; Short=CstF-64
 gi|55726993|emb|CAH90254.1| hypothetical protein [Pongo abelii]
          Length = 577

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|410988989|ref|XP_004000752.1| PREDICTED: LOW QUALITY PROTEIN: cleavage stimulation factor subunit
           2 [Felis catus]
          Length = 577

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|348576422|ref|XP_003473986.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
           [Cavia porcellus]
          Length = 630

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|335306285|ref|XP_003360436.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 2
           [Sus scrofa]
          Length = 592

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|440901693|gb|ELR52585.1| Cleavage stimulation factor subunit 2 [Bos grunniens mutus]
          Length = 619

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|395850633|ref|XP_003797884.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 2
           [Otolemur garnettii]
          Length = 596

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|311276614|ref|XP_003135279.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 1
           [Sus scrofa]
          Length = 572

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|380798899|gb|AFE71325.1| cleavage stimulation factor subunit 2, partial [Macaca mulatta]
          Length = 575

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 16  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 75

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 76  NGREFSGRALRVDNA-ASEKNKEELKSL 102


>gi|332254744|ref|XP_003276492.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2
           [Nomascus leucogenys]
          Length = 597

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|119224067|gb|AAI26544.1| CSTF2 protein [Bos taurus]
          Length = 592

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|403298766|ref|XP_003940178.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 3 [Saimiri
           boliviensis boliviensis]
          Length = 597

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|291404363|ref|XP_002718535.1| PREDICTED: cleavage stimulation factor, 3' pre-RNA, subunit 2,
           64kDa, tau variant [Oryctolagus cuniculus]
          Length = 601

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|192764310|gb|ACF05699.1| betaCstF-64 variant 2 [Homo sapiens]
          Length = 597

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|444518829|gb|ELV12414.1| Cleavage stimulation factor subunit 2 [Tupaia chinensis]
          Length = 409

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|332861156|ref|XP_529072.3| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Pan
           troglodytes]
 gi|397478198|ref|XP_003810440.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Pan
           paniscus]
 gi|426396655|ref|XP_004064547.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Gorilla
           gorilla gorilla]
          Length = 597

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|296235968|ref|XP_002763125.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1
           [Callithrix jacchus]
          Length = 597

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|281348298|gb|EFB23882.1| hypothetical protein PANDA_020101 [Ailuropoda melanoleuca]
          Length = 612

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|410353769|gb|JAA43488.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa [Pan
           troglodytes]
          Length = 577

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|390480015|ref|XP_003735829.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2
           [Callithrix jacchus]
          Length = 577

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|432877328|ref|XP_004073146.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
           isoform 1 [Oryzias latipes]
          Length = 494

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 31  VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 90

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 91  NGREFSGRALRVDNA-ASEKNKEELKSL 117


>gi|11762098|gb|AAG40327.1|AF322194_1 variant polyadenylation protein CSTF-64 [Mus musculus]
          Length = 630

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|395850631|ref|XP_003797883.1| PREDICTED: cleavage stimulation factor subunit 2-like isoform 1
           [Otolemur garnettii]
          Length = 576

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|192764312|gb|ACF05700.1| betaCstF-64 variant 3 [Mus musculus]
          Length = 630

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|297304342|ref|XP_001089558.2| PREDICTED: cleavage stimulation factor subunit 2-like [Macaca
           mulatta]
 gi|402910777|ref|XP_003918028.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 3 [Papio
           anubis]
 gi|355704983|gb|EHH30908.1| hypothetical protein EGK_20728 [Macaca mulatta]
 gi|355757534|gb|EHH61059.1| hypothetical protein EGM_18986 [Macaca fascicularis]
          Length = 597

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|432877330|ref|XP_004073147.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
           isoform 2 [Oryzias latipes]
          Length = 479

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 31  VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 90

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 91  NGREFSGRALRVDNA-ASEKNKEELKSL 117


>gi|426257831|ref|XP_004022525.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Ovis
           aries]
          Length = 592

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|403298762|ref|XP_003940176.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403298764|ref|XP_003940177.1| PREDICTED: cleavage stimulation factor subunit 2 isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 577

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|351709510|gb|EHB12429.1| Cleavage stimulation factor 64 kDa subunit [Heterocephalus glaber]
          Length = 597

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|402880821|ref|XP_003903988.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Papio
           anubis]
          Length = 620

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|432111364|gb|ELK34639.1| Cleavage stimulation factor subunit 2 tau variant [Myotis davidii]
          Length = 641

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|417403608|gb|JAA48603.1| Putative mrna cleavage and polyadenylation factor i complex subunit
           rna15 [Desmodus rotundus]
          Length = 647

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|332212188|ref|XP_003255200.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           [Nomascus leucogenys]
          Length = 622

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|326673799|ref|XP_003199996.1| PREDICTED: cleavage stimulation factor subunit 2-like [Danio rerio]
          Length = 488

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 26  VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 85

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 86  NGREFSGRALRVDNA-ASEKNKEELKSL 112


>gi|192764316|gb|ACF05702.1| betaCstF-64 variant 1 [Mus musculus]
          Length = 604

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|76253771|ref|NP_956408.2| cleavage stimulation factor subunit 2 [Danio rerio]
 gi|41107668|gb|AAH65442.1| Cleavage stimulation factor, 3' pre-RNA, subunit 2 [Danio rerio]
          Length = 488

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 26  VFVGNIPYEATEEQLKDIFSEVGLVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 85

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 86  NGREFSGRALRVDNA-ASEKNKEELKSL 112


>gi|20380061|gb|AAH28239.1| Cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
           variant [Homo sapiens]
 gi|325463311|gb|ADZ15426.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
           variant [synthetic construct]
          Length = 616

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|62896707|dbj|BAD96294.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
           variant variant [Homo sapiens]
          Length = 616

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGSIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|354476121|ref|XP_003500273.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           isoform 1 [Cricetulus griseus]
          Length = 614

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|355562595|gb|EHH19189.1| hypothetical protein EGK_19854 [Macaca mulatta]
          Length = 610

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 8   VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 67

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 68  NGREFSGRALRVDNA-ASEKNKEELKSL 94


>gi|14149675|ref|NP_056050.1| cleavage stimulation factor subunit 2 tau variant [Homo sapiens]
 gi|71153234|sp|Q9H0L4.1|CSTFT_HUMAN RecName: Full=Cleavage stimulation factor subunit 2 tau variant;
           AltName: Full=CF-1 64 kDa subunit tau variant; AltName:
           Full=Cleavage stimulation factor 64 kDa subunit tau
           variant; Short=CSTF 64 kDa subunit tau variant; AltName:
           Full=TauCstF-64
 gi|12053011|emb|CAB66681.1| hypothetical protein [Homo sapiens]
 gi|24416591|gb|AAN05429.1| tCstF-64 [Homo sapiens]
 gi|119574527|gb|EAW54142.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
           variant [Homo sapiens]
 gi|189067256|dbj|BAG36966.1| unnamed protein product [Homo sapiens]
          Length = 616

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|410974961|ref|XP_003993907.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Felis
           catus]
          Length = 613

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|388452994|ref|NP_001253727.1| cleavage stimulation factor subunit 2 tau variant [Macaca mulatta]
 gi|355758732|gb|EHH61510.1| hypothetical protein EGM_21244 [Macaca fascicularis]
 gi|383416951|gb|AFH31689.1| cleavage stimulation factor subunit 2 tau variant [Macaca mulatta]
 gi|384946038|gb|AFI36624.1| cleavage stimulation factor subunit 2 tau variant [Macaca mulatta]
 gi|387541538|gb|AFJ71396.1| cleavage stimulation factor subunit 2 tau variant [Macaca mulatta]
          Length = 620

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|397469483|ref|XP_003806381.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Pan
           paniscus]
          Length = 615

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|358421036|ref|XP_001254105.2| PREDICTED: cleavage stimulation factor subunit 2-like [Bos taurus]
          Length = 331

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|410043860|ref|XP_001163035.2| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           isoform 4 [Pan troglodytes]
 gi|410335357|gb|JAA36625.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
           variant [Pan troglodytes]
 gi|410335359|gb|JAA36626.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
           variant [Pan troglodytes]
 gi|410335361|gb|JAA36627.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
           variant [Pan troglodytes]
 gi|410335363|gb|JAA36628.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa, tau
           variant [Pan troglodytes]
          Length = 615

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|426364779|ref|XP_004049473.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
           [Gorilla gorilla gorilla]
 gi|426364781|ref|XP_004049474.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant-like
           [Gorilla gorilla gorilla]
          Length = 617

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|354476123|ref|XP_003500274.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           isoform 2 [Cricetulus griseus]
          Length = 623

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|344234537|gb|EGV66405.1| hypothetical protein CANTEDRAFT_91566 [Candida tenuis ATCC 10573]
          Length = 641

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 60/225 (26%), Positives = 100/225 (44%), Gaps = 17/225 (7%)

Query: 308 ETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ----DP--AVTDIACAALNGLKM 361
           E ++++ +E    +  F  +K++    S G  F  +Q    DP  A+T +    LN L  
Sbjct: 413 EDEVRKSIEEHIPIKQFQFLKEKYNKESMGIAFANFQLQSYDPTSAITVVQQVLLN-LTQ 471

Query: 362 GDKTLTVRRATASGQSKT---EQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
           G    +         ++T   EQ S +A  Q  +  Q ++   +  N       +     
Sbjct: 472 GSSIFSKADFACIVPNQTSIQEQPSNMASLQSFVKNQLISTTVTDNNPNIVSEVVRDNRK 531

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN------GGETP 472
           +KV+ +  A+T   L DDE Y  I  D+ +E  K+G ++ V IPRP  +         TP
Sbjct: 532 SKVIQIINAVTTKDLKDDETYGFISSDVEQEVKKFGEVIRVKIPRPANDFTPGLTESSTP 591

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNK 517
           G+G++F+E+ +      A   L+GR +   TV   Y+  D  FNK
Sbjct: 592 GLGRIFVEFSNEDSAFKAILGLAGRMYNDRTVLCSYFDVDD-FNK 635


>gi|297686917|ref|XP_002820977.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant [Pongo
           abelii]
          Length = 625

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|344241215|gb|EGV97318.1| Cleavage stimulation factor 64 kDa subunit, tau variant [Cricetulus
           griseus]
          Length = 645

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|357627631|gb|EHJ77269.1| hypothetical protein KGM_03087 [Danaus plexippus]
          Length = 425

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 8   VFVGNIPYEATEEKLKDIFSEVGPVLSFKLVFDRETGKPKGYGFCEYKDQETALSAMRNL 67

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G ++L V  A  + +S+ E ++++
Sbjct: 68  NGYEIGGRSLRVDNA-CTEKSRMEMQALM 95


>gi|403260042|ref|XP_003922497.1| PREDICTED: cleavage stimulation factor subunit 2 tau variant
           [Saimiri boliviensis boliviensis]
          Length = 621

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 52/88 (59%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|149055454|gb|EDM07038.1| rCG38164 [Rattus norvegicus]
          Length = 363

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|91094479|ref|XP_970762.1| PREDICTED: similar to cleavage stimulation factor 64-kDa subunit
           [Tribolium castaneum]
          Length = 424

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 54/91 (59%), Gaps = 1/91 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 16  VFVGNIPYEATEEKLKDIFGEVGQVLSFKLVFDRETGKPKGYGFCEYRDQETALSAMRNL 75

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           NG ++G + L V  A  + +S+ E +++L Q
Sbjct: 76  NGYEIGGRNLRVDNA-CTEKSRMEMQNLLNQ 105


>gi|59808930|gb|AAH89996.1| U2af2 protein, partial [Rattus norvegicus]
          Length = 88

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 34/79 (43%), Positives = 47/79 (59%), Gaps = 1/79 (1%)

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
           EI+ED+R+EC KYG + ++ IPRP  +G E PG GK+F+E+     C  A   L+GRKF 
Sbjct: 10  EIVEDVRDECSKYGLVKSIEIPRP-VDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKFA 68

Query: 501 GNTVNAFYYPEDKYFNKDY 519
              V   Y   D Y  +D+
Sbjct: 69  NRVVVTKYCDPDSYHRRDF 87


>gi|294936223|ref|XP_002781665.1| Heterogeneous nuclear ribonucleoprotein A1, putative [Perkinsus
           marinus ATCC 50983]
 gi|239892587|gb|EER13460.1| Heterogeneous nuclear ribonucleoprotein A1, putative [Perkinsus
           marinus ATCC 50983]
          Length = 482

 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 100/212 (47%), Gaps = 18/212 (8%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP-GDAVVNV--YINHEKKFAFVEMRT 231
           ++V+VGGLP  A++ A+  +FSQ          GP  D+VV +  +    + F FV   T
Sbjct: 171 KKVFVGGLPREADKPALDAYFSQF---------GPVEDSVVMMDRFTGRSRGFGFVTFET 221

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA----ALGPGQPSPNLNLAAVGLASGA 287
            E+    +A    +  G +V VRR  + + T  A    + G G  +P  N    G   G 
Sbjct: 222 KEQMLGCVAAAPHVIMGKSVEVRRSINDDGTSTAHERRSAGKGAGAPR-NYDDYGSGKGG 280

Query: 288 IGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPA 347
               + P+++FVGGLP   T   +++    +G L    ++ DR TG S+G+G+  Y+D +
Sbjct: 281 HRD-QNPNKLFVGGLPREITSEALRDFFIQYGNLVDCTVITDRMTGQSRGFGYVTYEDSS 339

Query: 348 VTDIACAALNGLKMGDKTLTVRRATASGQSKT 379
             + A +      +  K + V+  T  G  ++
Sbjct: 340 AAEAAISNSANNIIDGKWVDVKHTTREGPRRS 371



 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 1/87 (1%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           +VFVGGLP    +  +      FG +    ++ DR TG S+G+GF  ++         AA
Sbjct: 172 KVFVGGLPREADKPALDAYFSQFGPVEDSVVMMDRFTGRSRGFGFVTFETKEQMLGCVAA 231

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQE 382
              + MG K++ VRR+     + T  E
Sbjct: 232 APHVIMG-KSVEVRRSINDDGTSTAHE 257


>gi|195050249|ref|XP_001992854.1| GH13506 [Drosophila grimshawi]
 gi|193899913|gb|EDV98779.1| GH13506 [Drosophila grimshawi]
          Length = 628

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 55/200 (27%), Positives = 82/200 (41%), Gaps = 28/200 (14%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS V         G    V  +  N  K+F   A++E
Sbjct: 268 RDARTVFCIQLSQRVRARDLEEFFSSV---------GKVRDVRLITCNKTKRFKGIAYIE 318

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L +A  P QP  +             
Sbjct: 319 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQSAPPPFQPKLHT------------ 366

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  Y +   
Sbjct: 367 ----GPMRLYVGSLHFNITEDMLRGIFEPFGKIDVIQLIMDNETGRSKGYGFITYHNADD 422

Query: 349 TDIACAALNGLKMGDKTLTV 368
              A   LNG ++  + + V
Sbjct: 423 AKKALEQLNGFELAGRPMKV 442


>gi|443694236|gb|ELT95429.1| hypothetical protein CAPTEDRAFT_160825 [Capitella teleta]
          Length = 548

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 49/88 (55%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  +E Q+KE+ +  G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 26  VFVGNIPYEASEEQLKEVFQQAGPVISFRLVYDRETGKPKGYGFCEYQDVETAQSAMRNL 85

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           N      + L V  A A  QSK E +S+
Sbjct: 86  NNYDYNGRPLRVGVA-AGEQSKDENKSM 112


>gi|147852616|emb|CAN81690.1| hypothetical protein VITISV_009755 [Vitis vinifera]
          Length = 544

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 88/190 (46%), Gaps = 35/190 (18%)

Query: 97  KSLSPS-RSPSKSKRRSGFDMAPPAAAMLPGAAVPGQL----PGV-------PSAVPEMA 144
           K+ SP+ RSP K  + +G+D+ P     +   +V   L    P V       PSAVP   
Sbjct: 343 KTPSPTNRSPEK--KNAGWDLPPSRTDGMNAGSVLSSLQVLKPTVSSNADELPSAVPVA- 399

Query: 145 QNMLPFGATQLGAFPLMPV---------------QVMTQQATRHARRVYVGGLPPLANEQ 189
              +P  AT   A P +P                 +   QATR  RR+YV  LP  ++E+
Sbjct: 400 ---VPVTATT--AKPPLPRIYSDAVSKNKNVSIDSIQLTQATRPMRRLYVENLPVSSSEK 454

Query: 190 AIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGV 249
           A+    +  + + G N        ++  I+ EK  A VE  T E+AS A++ DGI F G 
Sbjct: 455 ALMECLNNFLLSSGINHVQGTPPCISCIIHKEKGQALVEFLTPEDASAALSFDGISFSGS 514

Query: 250 AVRVRRPTDY 259
            +++RRP D+
Sbjct: 515 ILKIRRPKDF 524


>gi|321479154|gb|EFX90110.1| hypothetical protein DAPPUDRAFT_299933 [Daphnia pulex]
          Length = 381

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 1/91 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F +V DR+TG  KGYGFC Y+D      A   L
Sbjct: 16  VFVGNIPYDVTEEKLKDIFSEAGPVVSFKIVYDRETGKPKGYGFCEYRDQETALCAMRNL 75

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQ 387
           NG ++  +TL V  A  + +S+ E +S++ +
Sbjct: 76  NGYEIAGRTLRVDNA-CTEKSRLEMQSLMQE 105


>gi|148688462|gb|EDL20409.1| cleavage stimulation factor, 3' pre-RNA subunit 2 [Mus musculus]
          Length = 363

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|303272315|ref|XP_003055519.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463493|gb|EEH60771.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 394

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 74/284 (26%), Positives = 109/284 (38%), Gaps = 61/284 (21%)

Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNL----------------------AAVGLAS 285
           G   R+RRP+ +  T  ++  PG     LNL                      A V  AS
Sbjct: 126 GYVPRIRRPSKHGITQPSS-APGASDAALNLVDPAERQRQAQQWILQQQAGSLARVQEAS 184

Query: 286 G-AIGGAEGPDRVFVGGLPYYFTETQIKELLE-----SFGTLHGFDL--VKDRDTGNSKG 337
             A+GG      +FVG +    T+  +  L +     +F      D   V +   G+   
Sbjct: 185 TVALGGPRKNREIFVGNIDAVVTKQALTALFDDALAVAFPNATSGDAKPVVNIQLGDQAT 244

Query: 338 YGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKM 397
           YGF       +   A A LNGL    + LT+ R T               A +       
Sbjct: 245 YGFVELLSEELATAAIAGLNGLVFCGRPLTIARPTGWVDPAAAATITARAAAER------ 298

Query: 398 ALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLV 457
                            GE  + ++CL+  +T   LAD+E Y E+L D+R EC K G + 
Sbjct: 299 -----------------GEEHSTIVCLSNIVTESDLADEEAYAELLADVRTECAKCGEVK 341

Query: 458 NVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGG 501
           ++ IPR    GG    VG VF++  D  G    +  ++GR+F G
Sbjct: 342 DIRIPR----GGP---VGSVFVKMGDESGANKVQTEMAGRRFDG 378



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 49/90 (54%), Gaps = 2/90 (2%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVM-TAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           R  R ++VG +  +  +QA+   F   +  A    ++G    VVN+ +  +  + FVE+ 
Sbjct: 192 RKNREIFVGNIDAVVTKQALTALFDDALAVAFPNATSGDAKPVVNIQLGDQATYGFVELL 251

Query: 231 TVEEASNAMA-LDGIIFEGVAVRVRRPTDY 259
           + E A+ A+A L+G++F G  + + RPT +
Sbjct: 252 SEELATAAIAGLNGLVFCGRPLTIARPTGW 281


>gi|384247050|gb|EIE20538.1| hypothetical protein COCSUDRAFT_54348 [Coccomyxa subellipsoidea
           C-169]
          Length = 213

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 61/104 (58%), Gaps = 3/104 (2%)

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREEC-GKYGTLVNVVIPRPDQNG--GETPGVG 475
           ++VL L   +T + L D EEY +I++D+  E   KYGTL ++VIP+P Q G   +  GVG
Sbjct: 110 SRVLRLANMVTREELLDPEEYSDIVDDITSELESKYGTLSSLVIPQPSQKGPASDPSGVG 169

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDY 519
            VF+++        A+  L+GRKFG   +++ ++ E  +  K +
Sbjct: 170 LVFVQFPKLSDAVKAQEKLNGRKFGAGNIHSEFFDEGLFQRKHF 213


>gi|343429703|emb|CBQ73275.1| related to Cleavage stimulation factor [Sporisorium reilianum SRZ2]
          Length = 391

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 48/83 (57%), Gaps = 4/83 (4%)

Query: 293 GPDR----VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GP R    VFVG +PY  +E Q+ ++    G + GF LV DRDTG  KGYGFC ++DP  
Sbjct: 3   GPQRGSRVVFVGNIPYDMSEEQLTDVFREVGKVVGFRLVNDRDTGKFKGYGFCEFEDPET 62

Query: 349 TDIACAALNGLKMGDKTLTVRRA 371
              A   LN +++G + L +  A
Sbjct: 63  AASAVRNLNEVEVGGRPLRISFA 85


>gi|237829727|ref|XP_002364161.1| U2 small nuclear ribonucleoprotein auxiliary factor U2AF
           [Toxoplasma gondii ME49]
 gi|211961825|gb|EEA97020.1| U2 small nuclear ribonucleoprotein auxiliary factor U2AF
           [Toxoplasma gondii ME49]
 gi|221481075|gb|EEE19483.1| U2 small nuclear ribonucleoprotein auxiliary factor U2AF, putative
           [Toxoplasma gondii GT1]
 gi|221507020|gb|EEE32624.1| U2 small nuclear ribonucleoprotein auxiliary factor U2AF, putative
           [Toxoplasma gondii VEG]
          Length = 704

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 116/285 (40%), Gaps = 75/285 (26%)

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
           P+R+ +  LP   +E ++++L+E+FG ++ F L+K  D   S+      Y D    + A 
Sbjct: 302 PERLCILDLPPLMSEEKVRQLVETFGPVNAFHLLKKDD--GSEMVCIVEYVDLESQEQAM 359

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
             L+     +    +  A  + Q +           +H+   K   +    + L  G  +
Sbjct: 360 DILHS----NSPYRILLAEEAIQQEVIAPFFKKAKAKHL---KTEDEEEEEDDLADGERM 412

Query: 414 FGETL------AKVLCLTEAITADALADDEEYEEILEDMR---EECGKYGTLVNVVIPRP 464
             ++L       +VL L+  +  + L DD+EYE+I+ED+R   EECG  G +++V IPRP
Sbjct: 413 NIQSLLRPQVCTRVLLLSNIVEVEDLLDDKEYEDIVEDIRLECEECG--GPVLSVNIPRP 470

Query: 465 ----------------------------------DQNGGETPG----------------- 473
                                             +Q  GE                    
Sbjct: 471 VRGFEHESKPEFQQQQEREALAKKEEVTVKQEVTEQTAGEEDAGEKGRQEKEKAKSSEEQ 530

Query: 474 ----VGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
               +G  ++E+ D    A A+ AL+GRKFGG  V A Y+ E K+
Sbjct: 531 KPATIGFAYVEFEDCEWSAKARKALNGRKFGGKIVEAHYFSEVKF 575



 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 48/85 (56%), Gaps = 1/85 (1%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
           R +YVG LPP      +  F +  M A+GG +  PG   V  + + +  +AFVE RT+EE
Sbjct: 113 RELYVGNLPPSLEVPQLMEFLNAAMAAVGG-ALLPGPPAVKAWRSTDGHYAFVEFRTMEE 171

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDY 259
           ASN M L+G+   G  +R+ RP  Y
Sbjct: 172 ASNGMQLNGLNCMGFNLRIGRPKTY 196


>gi|430811846|emb|CCJ30702.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 486

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 64/263 (24%), Positives = 105/263 (39%), Gaps = 62/263 (23%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           R++VG + +  TE  ++++ E FG L    L K+ DTG S+GYGF  Y+DPA    A   
Sbjct: 227 RLYVGNIHFNLTEDDLRQIFEPFGELEFVQLQKEPDTGRSRGYGFVQYRDPAQARDALEK 286

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTS---GMNTLG---- 408
           +NG ++  + + V      G  K   ES  A   +       A +     G   +G    
Sbjct: 287 MNGFELAGRAIRV----GLGNDKFTPESTSAVLARFSGFTGSAFENKNRGGTERIGGPRD 342

Query: 409 ------------GGMSLFG---ETLAKVLCLTEAITADALAD------------------ 435
                       GG+S      + L + L   E   +  LA                   
Sbjct: 343 GSSSVSLDDNEAGGVSFNNISRDALMRKLAREEGKESIGLAPPKPTPAVQMTSRCVLLKN 402

Query: 436 ---------DEEYEEILEDMREEC-GKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
                    D    E+ +D++ EC  KYG ++++ +        E    G V++++ + V
Sbjct: 403 MFNPQEESGDNWIRELEDDVKAECENKYGKVLHIHV--------EENSPGDVYIKFDNVV 454

Query: 486 GCATAKNALSGRKFGGNTVNAFY 508
               A   L+GR FGG T++A +
Sbjct: 455 AGERAIQGLNGRWFGGRTISASF 477


>gi|125774537|ref|XP_001358527.1| GA20525 [Drosophila pseudoobscura pseudoobscura]
 gi|54638266|gb|EAL27668.1| GA20525 [Drosophila pseudoobscura pseudoobscura]
          Length = 418

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E + +L
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105


>gi|34810648|pdb|1P1T|A Chain A, Nmr Structure Of The N-Terminal Rrm Domain Of Cleavage
           Stimulation Factor 64 Kda Subunit
          Length = 104

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/88 (44%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 11  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 70

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 71  NGREFSGRALRVDNA-ASEKNKEELKSL 97


>gi|195145742|ref|XP_002013849.1| GL24357 [Drosophila persimilis]
 gi|194102792|gb|EDW24835.1| GL24357 [Drosophila persimilis]
          Length = 418

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E + +L
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105


>gi|327284085|ref|XP_003226769.1| PREDICTED: serine/threonine-protein kinase Kist-like [Anolis
           carolinensis]
          Length = 217

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 55/94 (58%), Gaps = 6/94 (6%)

Query: 421 VLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLE 480
           VL L   +   +   +EE+E+I++D++EEC KYG +V++ +P+      E PG G VF+E
Sbjct: 120 VLRLLNILNDASFQSEEEFEDIVDDIKEECSKYGQIVSLFVPK------ENPGKGHVFVE 173

Query: 481 YYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           Y +A     A+ AL+G++F    V   +YP   Y
Sbjct: 174 YTNAGDSKAAQQALTGKRFDCKFVVTTFYPLSAY 207


>gi|222619898|gb|EEE56030.1| hypothetical protein OsJ_04814 [Oryza sativa Japonica Group]
          Length = 658

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 102/244 (41%), Gaps = 46/244 (18%)

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           + P ++F+ G+P   +   +++++ SFG L  +  + + D G +    F  Y D ++T  
Sbjct: 375 DSPHKIFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGA--CAFLEYIDHSITSK 432

Query: 352 ACAALNGLKMGDKTLTVRRATAS--GQSKTEQ---ESILAQAQQHIAIQKMALQTSGMNT 406
           ACA LNG+K+G   +T         GQ+  E      I A  +  +A+    LQ      
Sbjct: 433 ACAGLNGMKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLK---- 488

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL--VNVVIPRP 464
                ++F +               +L    E + +LED+R +C +YG +  +NVV    
Sbjct: 489 -----NVFDQ------------EEYSLLSKYEVDAVLEDVRVKCARYGAVKSINVVEYPA 531

Query: 465 DQNGGETPGV----------------GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
             +  + P V                G + +E+        A ++L GR FG   V+A Y
Sbjct: 532 GSDNTKAPAVDARDNALASNNTALEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGY 591

Query: 509 YPED 512
            P D
Sbjct: 592 APYD 595


>gi|241997552|ref|XP_002433425.1| RNA recognition motif-containing protein [Ixodes scapularis]
 gi|215490848|gb|EEC00489.1| RNA recognition motif-containing protein [Ixodes scapularis]
          Length = 466

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/90 (42%), Positives = 51/90 (56%), Gaps = 1/90 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILA 386
           N   +  + L V  A AS +SK E +++ A
Sbjct: 78  NAFDLNGRPLRVDNA-ASEKSKEELKNLQA 106


>gi|198424504|ref|XP_002131946.1| PREDICTED: similar to poly-U binding splicing factor 60KDa [Ciona
           intestinalis]
          Length = 491

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 60/106 (56%), Gaps = 6/106 (5%)

Query: 416 ETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGE-TPGV 474
           E  +KV+ L   +  + + DD E E     + EECGK+G++  VVI +  Q+  E  P  
Sbjct: 390 EEASKVMVLHNMVDVEEIDDDLESE-----VTEECGKFGSVSRVVIYQEKQSEAEDAPVT 444

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
            K+++E+ D+V C  A  +L+GR FGG  + A  YP+ K+ + D +
Sbjct: 445 VKIYVEFTDSVFCKKAVESLNGRWFGGRKIEAIIYPQHKFNHNDLT 490



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 53/210 (25%), Positives = 87/210 (41%), Gaps = 31/210 (14%)

Query: 168 QQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAV---VNVYINHEKKF 224
           QQA     RVYVG +     E+ I   FS           GP  ++    +      K F
Sbjct: 91  QQALVLMCRVYVGSIYYDLKEEIIRNAFSPF---------GPFKSINMSFDPITGKHKGF 141

Query: 225 AFVEMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
           AF+E  T E A  ++  + G++  G +++V RP +            Q  P ++L     
Sbjct: 142 AFIEYETPEAAQLSLDQMGGVMLGGRSIKVGRPANM----------PQSHPVIDLL---- 187

Query: 284 ASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVY 343
               +  ++   R+++  +        +K +  +FG +    LV D  TG  KGYGF  Y
Sbjct: 188 ----LDESKMQKRIYISSVHTDLNTEDLKSVFSAFGNILSCALVPDVLTGKHKGYGFIEY 243

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATA 373
                 + A A++N   +G + L V +A A
Sbjct: 244 DTLQAANDAVASMNLFDLGGQYLRVGKAIA 273



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/130 (26%), Positives = 58/130 (44%), Gaps = 3/130 (2%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           RV+VG + Y   E  I+     FG     ++  D  TG  KG+ F  Y+ P    ++   
Sbjct: 99  RVYVGSIYYDLKEEIIRNAFSPFGPFKSINMSFDPITGKHKGFAFIEYETPEAAQLSLDQ 158

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNT--LGGGMSL 413
           + G+ +G +++ V R     QS    + +L +++    I   ++ T  +NT  L    S 
Sbjct: 159 MGGVMLGGRSIKVGRPANMPQSHPVIDLLLDESKMQKRIYISSVHTD-LNTEDLKSVFSA 217

Query: 414 FGETLAKVLC 423
           FG  L+  L 
Sbjct: 218 FGNILSCALV 227


>gi|17137710|ref|NP_477453.1| cleavage stimulation factor 64 kilodalton subunit [Drosophila
           melanogaster]
 gi|5713194|gb|AAD47839.1|AF170082_1 cleavage stimulation factor 64 kilodalton subunit [Drosophila
           melanogaster]
 gi|23171661|gb|AAF55577.2| cleavage stimulation factor 64 kilodalton subunit [Drosophila
           melanogaster]
 gi|205360993|gb|ACI03573.1| FI01908p [Drosophila melanogaster]
          Length = 419

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G +TL V  A  + +S+ E + +L   Q
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109


>gi|150864148|ref|XP_001382862.2| hypothetical protein PICST_42021 [Scheffersomyces stipitis CBS
           6054]
 gi|149385404|gb|ABN64833.2| predicted protein, partial [Scheffersomyces stipitis CBS 6054]
          Length = 533

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 58/219 (26%), Positives = 91/219 (41%), Gaps = 22/219 (10%)

Query: 307 TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC-VYQDPAV---TDIACAALNGLKMG 362
           TETQI   L  +  +  F + ++  T  S G  F   + DPA    TD     L  L   
Sbjct: 321 TETQIITELNLYSPVRAFQMFREVGTKVSLGMAFVEFFIDPASYKHTDQVIERLQEL--- 377

Query: 363 DKTLTVRRATASGQSKTEQESILAQAQQH-IAIQKMALQTSGMNTLGGGMSLFGETLAKV 421
                        QS+   E+  +    H  +IQ   +    +  L    ++     ++V
Sbjct: 378 --------LQKLDQSQIIDEAFFSCIIPHKTSIQDCQINFDSLKHLVRNENVSTHPKSRV 429

Query: 422 LCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG------ETPGVG 475
           + L   +T   L +D  Y+ IL+D++ E  + GT+V++ IPRP             PG+G
Sbjct: 430 IQLLNVVTPKDLVEDSNYQFILKDIKREASRIGTVVSIKIPRPANEFTPGLAQFSVPGLG 489

Query: 476 KVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           KVF+E+ D      A   L+GR +    V   +Y  D Y
Sbjct: 490 KVFIEFEDEEVAFRAIMELAGRSYNDRCVICAFYNVDDY 528


>gi|195343246|ref|XP_002038209.1| GM17877 [Drosophila sechellia]
 gi|194133059|gb|EDW54627.1| GM17877 [Drosophila sechellia]
          Length = 419

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G +TL V  A  + +S+ E + +L   Q
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109


>gi|195395200|ref|XP_002056224.1| GJ10336 [Drosophila virilis]
 gi|194142933|gb|EDW59336.1| GJ10336 [Drosophila virilis]
          Length = 427

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G +TL V  A  + +S+ E + +L   Q
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109


>gi|194900156|ref|XP_001979623.1| GG22991 [Drosophila erecta]
 gi|190651326|gb|EDV48581.1| GG22991 [Drosophila erecta]
          Length = 416

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G +TL V  A  + +S+ E + +L   Q
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109


>gi|195497709|ref|XP_002096214.1| GE25546 [Drosophila yakuba]
 gi|194182315|gb|EDW95926.1| GE25546 [Drosophila yakuba]
          Length = 414

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E + +L
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105


>gi|332019312|gb|EGI59819.1| RNA-binding protein 39 [Acromyrmex echinatior]
          Length = 528

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 90/376 (23%), Positives = 144/376 (38%), Gaps = 69/376 (18%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  ++F   A+VE
Sbjct: 166 RDARTVFCMQLSQRIRARDLEEFFS---------SVGKVQDVRLITCNKTRRFKGIAYVE 216

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            +  E  + A+ L G    GV + V+       T A     G   PNL            
Sbjct: 217 FKDPESVTLALGLSGQKLLGVPIVVQH------TQAEKNRMGNSMPNL----------MP 260

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
            G  GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF  +++   
Sbjct: 261 KGQTGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIQLIMDPETGRSKGYGFLTFRNADD 320

Query: 349 TDIACAALNGLKMGDKTLTV-----RRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
              A   LNG ++  + + V     R     G S  + + +        A  ++ L    
Sbjct: 321 AKKALEQLNGFELAGRPMKVGNVTERTDLIQGPSLLDTDELDRSGIDLGATGRLQL---- 376

Query: 404 MNTLGGGMSL-------FGETLAKVL------------CLTEAITADALAD--DEEYEEI 442
           M  L  G  L           +A V+              T+      + D  +E     
Sbjct: 377 MFKLAEGTGLEIPPAAANALNMAPVMAQPQPPPQAAPPIATQCFMLSNMFDPQNETNPNW 436

Query: 443 LEDMREE----CGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRK 498
            +++R++    C K+G +++V +   DQ   +    G V+++         A N+L GR 
Sbjct: 437 AKEIRDDVIEECNKHGGVLHVYV---DQASPQ----GNVYVKCPSIATAVAAVNSLHGRW 489

Query: 499 FGGNTVNAFYYPEDKY 514
           F G  + A Y P   Y
Sbjct: 490 FAGRVITAAYVPVVNY 505


>gi|195569859|ref|XP_002102926.1| GD19237 [Drosophila simulans]
 gi|194198853|gb|EDX12429.1| GD19237 [Drosophila simulans]
          Length = 419

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G +TL V  A  + +S+ E + +L   Q
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109


>gi|403364994|gb|EJY82272.1| Snrnp splicing factor (U2AF), putative [Oxytricha trifallax]
          Length = 411

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 102/219 (46%), Gaps = 20/219 (9%)

Query: 80  HRHRSRSHSSDRFRNRSKSLSPSRSPSKSKRRSG-----FDMAPPAAAMLPGAAVPGQLP 134
           HR +SR  + +R R+R +     R    +K+  G     FD +PP    L          
Sbjct: 48  HREKSRDRNGNRGRDREREKDRDRGGRDNKKGGGRDDFRFD-SPPKDHELT--------K 98

Query: 135 GVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATF 194
           G+ +A   +    +    + L +   M +     Q  +  R++YVG LPP   ++ +   
Sbjct: 99  GIMAAAASIGGGTIANAQSILQSIHSMSMA----QTAKIDRKLYVGNLPPGITQRMLIDV 154

Query: 195 FSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEASNAMALDGIIFEGVAVRVR 254
            ++ M ++      PG+ VV+ +I+ +  +AFVE RT EEA++   L G+  +   +++ 
Sbjct: 155 VNEAMLSLNVIEE-PGNPVVSAWISSDSHYAFVEFRTAEEANHGFNLQGMNIQNNEIKIG 213

Query: 255 RPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEG 293
           RP  Y+ T+  A+G    +  +N+     A+ A+ G +G
Sbjct: 214 RPKAYSGTM-NAIGLMASAGGMNVQGGSFANAALMGMKG 251


>gi|195108753|ref|XP_001998957.1| GI23336 [Drosophila mojavensis]
 gi|193915551|gb|EDW14418.1| GI23336 [Drosophila mojavensis]
          Length = 428

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G +TL V  A  + +S+ E + +L   Q
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109


>gi|218189760|gb|EEC72187.1| hypothetical protein OsI_05261 [Oryza sativa Indica Group]
          Length = 485

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 102/244 (41%), Gaps = 46/244 (18%)

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           + P ++F+ G+P   +   +++++ SFG L  +  + + D G +    F  Y D ++T  
Sbjct: 259 DSPHKIFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGA--CAFLEYIDHSITSK 316

Query: 352 ACAALNGLKMGDKTLTVRRATAS--GQSKTEQ---ESILAQAQQHIAIQKMALQTSGMNT 406
           ACA LNG+K+G   +T         GQ+  E      I A  +  +A+    LQ      
Sbjct: 317 ACAGLNGMKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLK---- 372

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL--VNVVIPRP 464
                ++F +    +L               E + +LED+R +C +YG +  +NVV    
Sbjct: 373 -----NVFDQEEYSLL------------SKYEVDAVLEDVRVKCARYGAVKSINVVEYPA 415

Query: 465 DQNGGETPGV----------------GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
             +  + P V                G + +E+        A ++L GR FG   V+A Y
Sbjct: 416 GSDNTKAPAVDARDNALASNNTALEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGY 475

Query: 509 YPED 512
            P D
Sbjct: 476 APYD 479


>gi|195452858|ref|XP_002073531.1| GK14167 [Drosophila willistoni]
 gi|194169616|gb|EDW84517.1| GK14167 [Drosophila willistoni]
          Length = 401

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 19  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 78

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G +TL V  A  + +S+ E + +L   Q
Sbjct: 79  NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 110


>gi|194743216|ref|XP_001954096.1| GF16912 [Drosophila ananassae]
 gi|190627133|gb|EDV42657.1| GF16912 [Drosophila ananassae]
          Length = 415

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E + +L
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105


>gi|195037535|ref|XP_001990216.1| GH18352 [Drosophila grimshawi]
 gi|193894412|gb|EDV93278.1| GH18352 [Drosophila grimshawi]
          Length = 430

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 52/89 (58%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++G +TL V  A  + +S+ E + +L
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLL 105


>gi|28557621|gb|AAO45216.1| RE27227p [Drosophila melanogaster]
          Length = 437

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 53/93 (56%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++KE+    G +    LV DR++G  KG+GFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEKLKEIFSEVGPVLSLKLVFDRESGKPKGFGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G +TL V  A  + +S+ E + +L   Q
Sbjct: 78  NGYEIGGRTLRVDNA-CTEKSRMEMQQLLQGPQ 109


>gi|115442323|ref|NP_001045441.1| Os01g0956600 [Oryza sativa Japonica Group]
 gi|57900079|dbj|BAD88141.1| splicing factor family protein-like [Oryza sativa Japonica Group]
 gi|57900192|dbj|BAD88277.1| splicing factor family protein-like [Oryza sativa Japonica Group]
 gi|113534972|dbj|BAF07355.1| Os01g0956600 [Oryza sativa Japonica Group]
 gi|215736836|dbj|BAG95765.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 608

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 102/244 (41%), Gaps = 46/244 (18%)

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           + P ++F+ G+P   +   +++++ SFG L  +  + + D G +    F  Y D ++T  
Sbjct: 375 DSPHKIFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGA--CAFLEYIDHSITSK 432

Query: 352 ACAALNGLKMGDKTLTVRRATAS--GQSKTEQ---ESILAQAQQHIAIQKMALQTSGMNT 406
           ACA LNG+K+G   +T         GQ+  E      I A  +  +A+    LQ      
Sbjct: 433 ACAGLNGMKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLK---- 488

Query: 407 LGGGMSLFGETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTL--VNVVIPRP 464
                ++F +               +L    E + +LED+R +C +YG +  +NVV    
Sbjct: 489 -----NVFDQ------------EEYSLLSKYEVDAVLEDVRVKCARYGAVKSINVVEYPA 531

Query: 465 DQNGGETPGV----------------GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
             +  + P V                G + +E+        A ++L GR FG   V+A Y
Sbjct: 532 GSDNTKAPAVDARDNALASNNTALEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGY 591

Query: 509 YPED 512
            P D
Sbjct: 592 APYD 595


>gi|301781268|ref|XP_002926045.1| PREDICTED: probable RNA-binding protein 23-like isoform 1
           [Ailuropoda melanoleuca]
          Length = 446

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 209 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 252

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 253 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 311

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V + T
Sbjct: 312 TFSDSECARRALEQLNGFELAGRPMRVGQVT 342


>gi|291228918|ref|XP_002734426.1| PREDICTED: cleavage stimulation factor subunit 2-like [Saccoglossus
           kowalevskii]
          Length = 220

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 40/99 (40%), Positives = 55/99 (55%), Gaps = 5/99 (5%)

Query: 286 GAIGGAEGPDR----VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            A+G +   DR    VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC
Sbjct: 2   SAVGQSAATDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFC 61

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
            Y+D      A   L+G ++  + L V  A AS ++K E
Sbjct: 62  EYKDQETALSAMRNLSGYELNGRQLRVDNA-ASEKNKEE 99


>gi|301100496|ref|XP_002899338.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262104255|gb|EEY62307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 414

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 47/85 (55%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +KE+    G++  F LV DR+TG  KGYGFC Y D A    A   L
Sbjct: 15  VFVGNIPYDVTEDMLKEIFSEAGSVMNFRLVTDRETGKPKGYGFCEYADGATALSAMRNL 74

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQ 381
           NG ++  + L V  A    +S + +
Sbjct: 75  NGYEINGRNLRVDFADGGDKSNSAE 99


>gi|427798067|gb|JAA64485.1| Putative mrna cleavage and polyadenylation factor i complex subunit
           rna15, partial [Rhipicephalus pulchellus]
          Length = 377

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 38/90 (42%), Positives = 51/90 (56%), Gaps = 1/90 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYKDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILA 386
           N   +  + L V  A AS +SK E +++ A
Sbjct: 78  NAFDLNGRPLRVDNA-ASEKSKEELKNLQA 106


>gi|255089613|ref|XP_002506728.1| predicted protein [Micromonas sp. RCC299]
 gi|226522001|gb|ACO67986.1| predicted protein [Micromonas sp. RCC299]
          Length = 317

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 48/81 (59%)

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
           P +VFVG +PY  TE +++++    G +H F LV DR+TG  KGYGFC Y D A  + A 
Sbjct: 5   PYQVFVGNVPYDATEERLRDMFSEVGPVHDFRLVTDRETGKLKGYGFCEYMDLATAESAK 64

Query: 354 AALNGLKMGDKTLTVRRATAS 374
             LNG +   + L V  A A+
Sbjct: 65  RNLNGREYNGRNLRVDFADAA 85


>gi|124360616|gb|ABN08615.1| hypothetical protein MtrDRAFT_AC157507g26v2 [Medicago truncatula]
          Length = 64

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 32/39 (82%), Positives = 36/39 (92%)

Query: 248 GVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           GVAVRVRRPTDYNP+LAA LGP QPS NLNL+AVGL++G
Sbjct: 26  GVAVRVRRPTDYNPSLAAVLGPCQPSANLNLSAVGLSAG 64


>gi|301781270|ref|XP_002926046.1| PREDICTED: probable RNA-binding protein 23-like isoform 2
           [Ailuropoda melanoleuca]
          Length = 430

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 193 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 237 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 295

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V + T
Sbjct: 296 TFSDSECARRALEQLNGFELAGRPMRVGQVT 326


>gi|402875680|ref|XP_003901625.1| PREDICTED: probable RNA-binding protein 23 [Papio anubis]
          Length = 497

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 262 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 305

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG+SKGYGF 
Sbjct: 306 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 364

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 365 TFSDSECARRALEQLNGFELAGRPMRVGHVT 395


>gi|73962357|ref|XP_537365.2| PREDICTED: probable RNA-binding protein 23 isoform 1 [Canis lupus
           familiaris]
          Length = 445

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 208 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 251

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 252 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 310

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 311 TFSDSECARRALEQLNGFELAGRPMRVGHVT 341


>gi|395859409|ref|XP_003802032.1| PREDICTED: probable RNA-binding protein 23 [Otolemur garnettii]
          Length = 449

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 212 KGIAYVEFCEIQSVPLAIGLTGQQLLGVPIIVQASQAEKNRLAA---------------- 255

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP  +FVG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 256 -MANNLQKGSSGPMHLFVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 314

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V + T
Sbjct: 315 TFSDSECARRALEQLNGFELAGRPMKVGQVT 345


>gi|281343373|gb|EFB18957.1| hypothetical protein PANDA_015655 [Ailuropoda melanoleuca]
          Length = 369

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 132 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 175

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 176 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 234

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V + T
Sbjct: 235 TFSDSECARRALEQLNGFELAGRPMRVGQVT 265


>gi|388853962|emb|CCF52460.1| related to Cleavage stimulation factor [Ustilago hordei]
          Length = 402

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 4/83 (4%)

Query: 293 GPDR----VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           GP R    VFVG +PY  +E Q+ ++    G + GF LV DR+TG  KGYGFC ++DP  
Sbjct: 3   GPQRGSRVVFVGNIPYDMSEEQLTDVFREVGKVVGFRLVNDRETGKFKGYGFCEFEDPET 62

Query: 349 TDIACAALNGLKMGDKTLTVRRA 371
              A   LN +++G + L +  A
Sbjct: 63  AASAVRNLNEVEVGGRALRISFA 85


>gi|320582580|gb|EFW96797.1| splicing factor u2af large subunit [Ogataea parapolymorpha DL-1]
          Length = 281

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 60/219 (27%), Positives = 96/219 (43%), Gaps = 25/219 (11%)

Query: 301 GLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF--CVYQDPAVTDIACAALNG 358
            +PY     ++ ELL+  G +     V D+ +  SKG  F   V +D  V       LN 
Sbjct: 75  NVPYGTPREKLLELLQPLGKVRSLAQVLDKLSYESKGVAFFEMVSEDSEVL----TKLNE 130

Query: 359 LKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETL 418
           L + D+ L + RA  + + K EQ  IL+      A+Q   +                   
Sbjct: 131 LAIDDQELQIFRACENPERKYEQAVILSAETLFGALQSDKISPHAP-------------- 176

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPD---QNGGET-PGV 474
           ++++     +  + L D  +Y +I      EC  +G    V+IPRP+   + G  T P V
Sbjct: 177 SEIVQFLNCVAVEDLVDSVKYNDIKVAFEAECSCHGHPEKVLIPRPEGDFRPGMPTKPEV 236

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFGGNTV-NAFYYPED 512
           G++F+++  +        AL+GRKF G T+  AFY  ED
Sbjct: 237 GRIFVKFATSEEAQKCAEALAGRKFNGRTILAAFYETED 275


>gi|405963791|gb|EKC29337.1| RNA-binding protein 39 [Crassostrea gigas]
          Length = 557

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 58/272 (21%), Positives = 103/272 (37%), Gaps = 54/272 (19%)

Query: 290 GAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVT 349
           G  GP R++VG L +  TE  ++ + E FG +    L++D +T  S+GYGF  + D    
Sbjct: 255 GNIGPMRLYVGSLHFNITEEMLRGIFEPFGKIDDIKLIRDHETNRSQGYGFITFHDSEDA 314

Query: 350 DIACAALNGLKMGDKTLTV-----RRATASGQSKTEQE---------------SILAQAQ 389
             A   LNG ++  + + V     R+    G S  + +                ++A+  
Sbjct: 315 KKALEQLNGFELAGRPMKVGHVTERQGEIQGASMLDSDEMDRAGIDLGATGRLQLMAKLA 374

Query: 390 QHIAIQKMALQTSGMNTLGGG----------------MSLFGETLA-----------KVL 422
           +    Q      S +N                      ++ G   A           +  
Sbjct: 375 EGTGFQIPEYAVSALNITQQAPGVASAAPPAGPAPNVSAILGSQAAGQDNTAPPIATQCF 434

Query: 423 CLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYY 482
            L+     +A +     +EI +D+ EEC K+G ++++ + +    G        V+++  
Sbjct: 435 MLSNMFDPNAESRSSWDQEIRDDVIEECNKHGGVLHLYVDKASPQGN-------VYVKCP 487

Query: 483 DAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                  +  AL GR FGG  + A Y P   Y
Sbjct: 488 TISAAVASVRALHGRYFGGKMITAAYVPLPNY 519


>gi|73962355|ref|XP_848788.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Canis lupus
           familiaris]
          Length = 429

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 192 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 235

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 236 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 294

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 295 TFSDSECARRALEQLNGFELAGRPMRVGHVT 325


>gi|444728803|gb|ELW69245.1| putative RNA-binding protein 23 [Tupaia chinensis]
          Length = 450

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 69/159 (43%), Gaps = 17/159 (10%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 195 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 238

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 239 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 297

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTE 380
            + D      A   LNG ++  + + V   T      T+
Sbjct: 298 TFSDSECARRALEQLNGFELAGRPMRVGHVTERADGSTD 336


>gi|88911212|gb|ABD58896.1| chloroplast single strand DNA binding protein [Mesostigma viride]
          Length = 299

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 57/214 (26%), Positives = 97/214 (45%), Gaps = 8/214 (3%)

Query: 170 ATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEM 229
           +T  + ++YVG L    +++ +   FSQ             + V++      + FAFV M
Sbjct: 85  STAASTKLYVGNLAWSCDDEMLNQAFSQF------GEVKAAEVVLDRESGRSRGFAFVTM 138

Query: 230 RTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
            + + A  A   LDG    G A+RV  P         A    +   +          G  
Sbjct: 139 ASPDAAEKARRGLDGTELAGRAIRVNFPQPKGERAPRAERGERSERSERSERTYTPRGD- 197

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           G A   +R++VG LP+   +  +++L   FGT++   +V DRD+G S+G+ F     P  
Sbjct: 198 GEAGDANRLYVGNLPWSMDDGMLEDLFMEFGTVNYARVVMDRDSGRSRGFAFVALSTPEE 257

Query: 349 TDIACAALNGLKMGDKTLTVRRATASGQSKTEQE 382
            + A A L+G ++G +T+ V  AT S  ++  +E
Sbjct: 258 ANEAMANLDGEEIGGRTIRVNLATKSSGNREGRE 291


>gi|301781272|ref|XP_002926047.1| PREDICTED: probable RNA-binding protein 23-like isoform 3
           [Ailuropoda melanoleuca]
          Length = 412

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 68/151 (45%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 175 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 218

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 219 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 277

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V + T
Sbjct: 278 TFSDSECARRALEQLNGFELAGRPMRVGQVT 308


>gi|325189600|emb|CCA24085.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 358

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 32/72 (44%), Positives = 41/72 (56%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  ++E+    G +  F LV DRD+G  KGYGFC Y D A    A   L
Sbjct: 17  VFVGNIPYDVTEEMLREIFSEAGAVMNFRLVTDRDSGKPKGYGFCEYADGATALSAMRNL 76

Query: 357 NGLKMGDKTLTV 368
           NG ++  + L V
Sbjct: 77  NGYEINGRNLRV 88


>gi|302757135|ref|XP_002961991.1| hypothetical protein SELMODRAFT_9008 [Selaginella moellendorffii]
 gi|302775356|ref|XP_002971095.1| hypothetical protein SELMODRAFT_9010 [Selaginella moellendorffii]
 gi|300161077|gb|EFJ27693.1| hypothetical protein SELMODRAFT_9010 [Selaginella moellendorffii]
 gi|300170650|gb|EFJ37251.1| hypothetical protein SELMODRAFT_9008 [Selaginella moellendorffii]
          Length = 80

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 34/75 (45%), Positives = 46/75 (61%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           ++VG LP  + +  +K L  SFG +    ++KDR TG SKGYGF  + DPA    A  ++
Sbjct: 6   LYVGYLPATYDDESLKRLFSSFGQIEEVKVIKDRTTGASKGYGFVKFTDPAAASQAVFSM 65

Query: 357 NGLKMGDKTLTVRRA 371
           NG K+ DKTL VR A
Sbjct: 66  NGWKIEDKTLAVRIA 80


>gi|90085597|dbj|BAE91539.1| unnamed protein product [Macaca fascicularis]
          Length = 295

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 47/78 (60%), Gaps = 5/78 (6%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G++  F LV DR+TG  KGYGFC YQD    + A +A+
Sbjct: 18  VFVGNIPYEATEEQLKDIFSEVGSVVSFRLVYDRETGKPKGYGFCEYQD---QETALSAM 74

Query: 357 NGLKMGDKTLTVRRATAS 374
             L   D   ++ RA AS
Sbjct: 75  RNLN--DAPESITRAVAS 90


>gi|354474899|ref|XP_003499667.1| PREDICTED: cleavage stimulation factor subunit 2 [Cricetulus
           griseus]
 gi|344238061|gb|EGV94164.1| Cleavage stimulation factor 64 kDa subunit [Cricetulus griseus]
          Length = 558

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           V VG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   L
Sbjct: 18  VCVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
           NG +   + L V  A AS ++K E +S+
Sbjct: 78  NGREFSGRALRVDNA-ASEKNKEELKSL 104


>gi|294877868|ref|XP_002768167.1| Nucleolysin TIAR, putative [Perkinsus marinus ATCC 50983]
 gi|239870364|gb|EER00885.1| Nucleolysin TIAR, putative [Perkinsus marinus ATCC 50983]
          Length = 474

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 98/208 (47%), Gaps = 22/208 (10%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGP-GDAVVNV--YINHEKKFAFVEMRT 231
           ++V+VGGLP  A++ A+  +FSQ          GP  D+VV +  +    + F FV   T
Sbjct: 157 KKVFVGGLPREADKPALDEYFSQF---------GPVEDSVVMMDRFTGRSRGFGFVTFET 207

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAA----ALGPGQPSPNLNLAAVGLASGA 287
            E+    +A    +  G  V VRR  + + T  A    + G G  +P    +    +SG 
Sbjct: 208 KEQMLGCVAAAPHVIMGKTVEVRRSINDDGTSTANERRSAGKGSGAPR---SYDDYSSGK 264

Query: 288 IGGA---EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
             G    + P+++FVGGLP   T   +++    +G L    ++ DR TG S+G+G+  Y+
Sbjct: 265 GKGGHRDQNPNKLFVGGLPREVTSDVLRDFFIQYGNLVDCTVITDRMTGQSRGFGYITYE 324

Query: 345 DPAVTDIACAALNGLKMGDKTLTVRRAT 372
           D A  + A +      +  K + V+  T
Sbjct: 325 DLAAAEAAISNSANNVIDGKWVDVKHTT 352


>gi|401410983|ref|XP_003884939.1| putative U2 small nuclear ribonucleoprotein auxiliary factor U2AF
           [Neospora caninum Liverpool]
 gi|325119358|emb|CBZ54911.1| putative U2 small nuclear ribonucleoprotein auxiliary factor U2AF
           [Neospora caninum Liverpool]
          Length = 588

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 61/117 (52%), Gaps = 3/117 (2%)

Query: 175 RRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEE 234
           R +YVG LPP      +  F +  M A+GG +  PG   V  + + +  +AFVE RT+EE
Sbjct: 117 RELYVGNLPPSLEVPQLMEFLNAAMAAVGG-ALLPGPPAVKAWRSTDGHYAFVEFRTMEE 175

Query: 235 ASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           ASN M L+G+   G  +R+ RP  Y   +   + P  P+  L      + +G +GGA
Sbjct: 176 ASNGMQLNGLNCMGFNLRIGRPKTYPQDMNHLIPP--PTIPLLHPQAAMGAGIVGGA 230



 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/174 (25%), Positives = 81/174 (46%), Gaps = 11/174 (6%)

Query: 294 PDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIAC 353
           P+R+ +  LP   +E ++++L+E+FG ++ F L+K  D   S+      Y D    + A 
Sbjct: 313 PERLCILDLPPLMSEEKVRQLVETFGPVNAFHLLKKDD--GSEMVCIVEYVDLESQEQAM 370

Query: 354 AALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSL 413
             L+     +    +  A  + Q +           + +  +    +      +     L
Sbjct: 371 DILHS----NSPYRILLAEEAIQQEVIAPFFKKAKAKQMKAEDEEEEEMDGEEMSIQALL 426

Query: 414 FGETLAKVLCLTEAITADALADDEEYEEILEDMR---EECGKYGTLVNVVIPRP 464
             +   +VL L+  +  + L DD+EYEEI+ED+R   EECG  G +++V IPRP
Sbjct: 427 RPQVCTRVLLLSNIVDVEDLLDDKEYEEIVEDIRLECEECG--GPVLSVNIPRP 478


>gi|449667931|ref|XP_002156035.2| PREDICTED: poly(U)-binding-splicing factor PUF60-like [Hydra
           magnipapillata]
          Length = 597

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 96/397 (24%), Positives = 165/397 (41%), Gaps = 64/397 (16%)

Query: 163 VQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEK 222
           ++ + Q+A ++AR +YV  + P  +E  I + F +    I      P     +      K
Sbjct: 225 IEQILQEARQYAR-IYVSSIHPDLSESDIKSVF-EAFGEILSCKLAP-----DQLTGKHK 277

Query: 223 KFAFVEMRTVEEASNAM-ALDGIIFEGVAVRVRR---PTDYNPTLAAALGPGQPSPNLNL 278
            + F+E      A++A+ A++     G  +RV R   P D+      AL  G P P  +L
Sbjct: 278 GYGFIEYANQSSANDAIVAMNLFDLGGQYIRVGRAITPPDH------ALKQG-PPPAASL 330

Query: 279 AAVGLASGAIGGAEG-------------------PDRVFVGGLPYY---FTETQIKELLE 316
            A  + S +I G E                    PD   +  +P     F+ T       
Sbjct: 331 LAANVISASIQGQEAVSAHGATALHMNPVTPSLSPDPFGLPPIPLQIPGFSPT------V 384

Query: 317 SFGTLHGFDLVKDRDTG---NSKGYGFCVYQ-----DPAVTDIACAALNGLKM---GDKT 365
           S G+++               +  YG    Q      P     + ++LN       G++ 
Sbjct: 385 SNGSVYNIQTSSQAQVSYGQTTLSYGLSTMQSPFSQQPVSYSQSTSSLNNTSYPTPGNQP 444

Query: 366 LTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGET-LAKVLCL 424
           +   R T+  Q K ++      A +    ++  L+ SG +     M     +  + VL L
Sbjct: 445 VPEERLTSRQQRKKQELLDHKHAGEQNLEREEKLEISGKDARYMMMQKLARSGSSPVLVL 504

Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV-GKVFLEYYD 483
              +T+D     E  EE+  ++ EEC ++G +V VVI +  Q   +   V  K+F+E+  
Sbjct: 505 KNMVTSD-----EVDEELQTEVTEECSRFGDVVRVVIYQERQGEEDNAEVIVKIFVEFSK 559

Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
                +A++AL+GR FGGN++ A  Y EDKY ++DY+
Sbjct: 560 HSEAESAQSALNGRWFGGNSIQADIYDEDKYKSQDYT 596



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/201 (26%), Positives = 91/201 (45%), Gaps = 33/201 (16%)

Query: 176 RVYVGGLPPLANEQAI-ATF--FSQV-MTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           R Y+G +    NE+++ A+F  F  + M  +  +SA          + H K FAFVE   
Sbjct: 140 RTYIGSINFQLNEESVRASFLPFGPIKMIDLSWDSAT---------MKH-KGFAFVEYEI 189

Query: 232 VEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
            E A  A+  ++ ++  G  ++V RP+                 N+  AA  +    +  
Sbjct: 190 PEAAQLALEQMNNVLMGGRNIKVGRPS-----------------NVPQAAPWIEQ-ILQE 231

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
           A    R++V  +    +E+ IK + E+FG +    L  D+ TG  KGYGF  Y + +  +
Sbjct: 232 ARQYARIYVSSIHPDLSESDIKSVFEAFGEILSCKLAPDQLTGKHKGYGFIEYANQSSAN 291

Query: 351 IACAALNGLKMGDKTLTVRRA 371
            A  A+N   +G + + V RA
Sbjct: 292 DAIVAMNLFDLGGQYIRVGRA 312



 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 46/99 (46%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           R ++G + +   E  ++     FG +   DL  D  T   KG+ F  Y+ P    +A   
Sbjct: 140 RTYIGSINFQLNEESVRASFLPFGPIKMIDLSWDSATMKHKGFAFVEYEIPEAAQLALEQ 199

Query: 356 LNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAI 394
           +N + MG + + V R +   Q+    E IL +A+Q+  I
Sbjct: 200 MNNVLMGGRNIKVGRPSNVPQAAPWIEQILQEARQYARI 238


>gi|302834772|ref|XP_002948948.1| hypothetical protein VOLCADRAFT_89331 [Volvox carteri f.
           nagariensis]
 gi|300265693|gb|EFJ49883.1| hypothetical protein VOLCADRAFT_89331 [Volvox carteri f.
           nagariensis]
          Length = 729

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 58/104 (55%), Gaps = 9/104 (8%)

Query: 420 KVLCLTEAITADALADDEEYEEILEDMREECGKY--GTLVNVVIPRPDQ----NGGETPG 473
           +  C+   + AD L DDEEYE +++D+++EC ++  G +V V +PRP +       +  G
Sbjct: 544 RFFCVLGMLNADMLLDDEEYEAVIDDLKDECDRHAPGNVVAVKVPRPPEEVRAQTADFIG 603

Query: 474 V---GKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
           +   GK F+ + DA     A  A+ GR F GNTV   Y  E+++
Sbjct: 604 IGQYGKAFVCFKDATSAQRAHAAIHGRLFAGNTVQVQYITEEEF 647



 Score = 46.6 bits (109), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 175 RRVYVGGLPPLA-NEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVE 233
           R +Y+G L P A  + A+   F+  + A      G  + VVNV ++ + ++AFVE RT E
Sbjct: 230 RELYIGNLVPGAVTDVALRQLFNTTLVA-AFPVTGSAEPVVNVNLHSDGRYAFVEFRTPE 288

Query: 234 EASNAMALDG-IIFEGVAVRVRRPTDY 259
            A+ A+AL+  +   G  + V RP+ Y
Sbjct: 289 MATAALALNAQVQLLGQTISVGRPSGY 315


>gi|50553814|ref|XP_504318.1| YALI0E23628p [Yarrowia lipolytica]
 gi|49650187|emb|CAG79917.1| YALI0E23628p [Yarrowia lipolytica CLIB122]
          Length = 621

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 92/204 (45%), Gaps = 19/204 (9%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAV--VNVYINHEKKFAFVEM 229
           R  R VYV  + P      +  FF++         AGP   V  V    +  +  AFVE 
Sbjct: 327 RDKRTVYVQQVAPHVQSTELFDFFAE---------AGPVHDVSLVKDRSSRCRGVAFVEF 377

Query: 230 RTVEEASNAMALDGIIFEGVAVRVR-----RPTDYNPTLAAALGPGQPSPNLNLAAVGLA 284
             VE  S A+ L G    G A+ +R     R  +   + A+    G  S +  +A V  +
Sbjct: 378 EDVESVSRAIGLTGRSLHGQALLIRCTDSARNREEQQSEASFNSSGAGSTHA-VANVNAS 436

Query: 285 SGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQ 344
           + AI       R++VG + +  TE +I ++ E+FG +   DL K++ TG SKGY F  Y 
Sbjct: 437 TSAIDSVRF-HRLYVGNIYFGVTEGEIIQIFEAFGPIEFADLQKEK-TGKSKGYCFIQYV 494

Query: 345 DPAVTDIACAALNGLKMGDKTLTV 368
           +P     A   +NG ++  + L V
Sbjct: 495 NPDDAKTALEKMNGFELAGRKLRV 518


>gi|313234527|emb|CBY10484.1| unnamed protein product [Oikopleura dioica]
          Length = 333

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 6/88 (6%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA- 355
           VFVG +PY  TE QI+++    G +  F LV DR+TG  KGYGFC Y+D   TD A +A 
Sbjct: 27  VFVGNIPYEATEEQIRDIFNEVGVVLSFRLVYDRETGKPKGYGFCEYKD---TDTAMSAM 83

Query: 356 --LNGLKMGDKTLTVRRATASGQSKTEQ 381
             LN  ++  + L V  AT     + EQ
Sbjct: 84  RNLNTRELHGRNLRVDHATRDHGVEKEQ 111


>gi|383419607|gb|AFH33017.1| putative RNA-binding protein 23 isoform 1 [Macaca mulatta]
          Length = 441

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|388490330|ref|NP_001253303.1| probable RNA-binding protein 23 [Macaca mulatta]
 gi|380814244|gb|AFE78996.1| putative RNA-binding protein 23 isoform 1 [Macaca mulatta]
 gi|384947950|gb|AFI37580.1| putative RNA-binding protein 23 isoform 1 [Macaca mulatta]
          Length = 441

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|387539272|gb|AFJ70263.1| putative RNA-binding protein 23 isoform 1 [Macaca mulatta]
          Length = 439

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|355693134|gb|EHH27737.1| hypothetical protein EGK_18008 [Macaca mulatta]
          Length = 441

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|355778434|gb|EHH63470.1| hypothetical protein EGM_16442, partial [Macaca fascicularis]
          Length = 366

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 132 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 175

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 176 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 234

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 235 TFSDSECARRALEQLNGFELAGRPMRVGHVT 265


>gi|387539270|gb|AFJ70262.1| putative RNA-binding protein 23 isoform 2 [Macaca mulatta]
          Length = 423

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 235 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|149756178|ref|XP_001494868.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Equus
           caballus]
          Length = 446

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 210 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 253

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 254 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 312

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 313 TFSDSECARRALEQLNGFELAGRPMRVGHVT 343


>gi|345804022|ref|XP_003435135.1| PREDICTED: probable RNA-binding protein 23 [Canis lupus familiaris]
          Length = 411

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 174 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 217

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 218 -MANNLQKGSSGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 276

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 277 TFSDSECARRALEQLNGFELAGRPMRVGHVT 307


>gi|332222976|ref|XP_003260645.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Nomascus
           leucogenys]
          Length = 442

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 251 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|348667221|gb|EGZ07047.1| hypothetical protein PHYSODRAFT_251824 [Phytophthora sojae]
          Length = 419

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 42/72 (58%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +KE+    G++  F LV DR+TG  KGYGFC Y D A    A   L
Sbjct: 15  VFVGNIPYDVTEDMLKEIFSEAGSVVNFRLVTDRETGKPKGYGFCEYADGATALSAMRNL 74

Query: 357 NGLKMGDKTLTV 368
           NG ++  + L V
Sbjct: 75  NGYEINGRNLRV 86


>gi|390342940|ref|XP_001198098.2| PREDICTED: RNA-binding protein 39-like [Strongylocentrotus
           purpuratus]
          Length = 666

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 58/215 (26%), Positives = 86/215 (40%), Gaps = 23/215 (10%)

Query: 171 TRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMR 230
           TR AR V+V  L   A E+ +  FFS V              + +      K   +VE  
Sbjct: 204 TRDARTVFVMQLSQRAKERELKEFFSSV------GKVRTVKIITDRNSRRSKGVGYVEYD 257

Query: 231 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGG 290
             +    A+ L+     GV + V+ P+      +A    GQ   N+ L  V         
Sbjct: 258 VADSVPLALGLNNQKLLGVPIIVQ-PSHAEKNRSA----GQ---NVTLQKVN-------- 301

Query: 291 AEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
             GP R++VG L Y  TE  ++ + E FG +    L+ D D   SKGYGF  + D     
Sbjct: 302 -SGPMRLYVGSLHYNITEAMLRGIFEPFGKIDNIQLMMDTDANRSKGYGFITFHDAEDAK 360

Query: 351 IACAALNGLKMGDKTLTVRRATASGQSKTEQESIL 385
            A   LNG ++  + + V   T   +   +  S L
Sbjct: 361 RALDQLNGFELAGRPMKVNHVTERNEQGQQAPSFL 395


>gi|384947948|gb|AFI37579.1| putative RNA-binding protein 23 isoform 2 [Macaca mulatta]
          Length = 425

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 235 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|380814242|gb|AFE78995.1| putative RNA-binding protein 23 isoform 2 [Macaca mulatta]
 gi|383419605|gb|AFH33016.1| putative RNA-binding protein 23 isoform 2 [Macaca mulatta]
          Length = 425

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 235 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|224125470|ref|XP_002329813.1| predicted protein [Populus trichocarpa]
 gi|222870875|gb|EEF08006.1| predicted protein [Populus trichocarpa]
          Length = 59

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 34/58 (58%), Positives = 35/58 (60%), Gaps = 4/58 (6%)

Query: 116 MAPPAAAMLPGAAV----PGQLPGVPSAVPEMAQNMLPFGATQLGAFPLMPVQVMTQQ 169
           MAP    MLPGAAV     GQLP VP  +P M QN L FG TQ G  PLMP   MTQQ
Sbjct: 1   MAPSMVGMLPGAAVTVNDAGQLPSVPQTMPGMIQNTLQFGTTQFGVLPLMPAHAMTQQ 58


>gi|119586626|gb|EAW66222.1| RNA binding motif protein 23, isoform CRA_a [Homo sapiens]
          Length = 483

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 251 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 294

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 295 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 353

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 354 TFSDSECARRALEQLNGFELAGRPMRVGHVT 384


>gi|71017595|ref|XP_759028.1| hypothetical protein UM02881.1 [Ustilago maydis 521]
 gi|46098750|gb|EAK83983.1| hypothetical protein UM02881.1 [Ustilago maydis 521]
          Length = 403

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 47/83 (56%)

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
           G   G   VFVG +PY  +E Q+ ++    G + GF LV DR+TG  KGYGFC ++DP  
Sbjct: 3   GAQRGSRVVFVGNIPYDMSEEQLTDVFREVGKVVGFRLVNDRETGKFKGYGFCEFEDPET 62

Query: 349 TDIACAALNGLKMGDKTLTVRRA 371
              A   LN +++G + L +  A
Sbjct: 63  AASAVRNLNEVEVGGRPLRISFA 85


>gi|298710792|emb|CBJ32209.1| RNA-binding protein SiahBP [Ectocarpus siliculosus]
          Length = 696

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 82/315 (26%), Positives = 129/315 (40%), Gaps = 62/315 (19%)

Query: 111 RSGFDMAPPAAAMLPG---------------AAVPGQLPGVPSAVPEMAQNMLPFGATQL 155
           + G+  APP A ++P                AA  GQ P V S     A   L  GA   
Sbjct: 209 KKGWGDAPPVAPVVPMTPLQELQRKLAEEQVAAAMGQAPPVTSVSAGAAALGLTMGAAMQ 268

Query: 156 GAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVN 215
                   Q    Q + + RR+YVG L     E  I + F+               A+  
Sbjct: 269 APGAGTRTQTAPAQPS-NPRRIYVGSLHYELKESDITSIFANF------------GALKL 315

Query: 216 VYINHE------KKFAFVEMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAAL- 267
           V ++H+      K F F+E   V+ A  A+ A++G    G A++V RP + +  +A  + 
Sbjct: 316 VDMSHDSSTGRHKGFCFIEYVDVKSADAALRAMNGFELAGRAIKVGRPLNTDSGVAGGIE 375

Query: 268 GPGQPS----PNLNLAAVGLASGAIGGAEGP-------------------DRVFVGGLPY 304
           G G P+    P +       ++ A GGA+G                     +++VG +  
Sbjct: 376 GMGLPAAMQLPGMAAFMAQHSTSAAGGAQGSGVAAEQLKAAMGMTTAPAQTKIYVGNVEP 435

Query: 305 YFTETQIKELLESFGTLHGFDLVKD-RDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGD 363
           + T   IK + E FG + G ++V+D  + GN KG+GF  Y   +V       ++  ++  
Sbjct: 436 HITTEMIKTVFEPFGMVVGAEMVQDPSNPGNHKGFGFIQYAQESVARTVIDTMSSFELAG 495

Query: 364 KTLTVRRATASGQSK 378
           +TL  R A A  QSK
Sbjct: 496 RTL--RVAWAQDQSK 508



 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 70/267 (26%), Positives = 108/267 (40%), Gaps = 24/267 (8%)

Query: 263 LAAALGPGQP--SPNLNLAAVGLASGAIGGAEG--------------PDRVFVGGLPYYF 306
           +AAA+G   P  S +   AA+GL  GA   A G              P R++VG L Y  
Sbjct: 239 VAAAMGQAPPVTSVSAGAAALGLTMGAAMQAPGAGTRTQTAPAQPSNPRRIYVGSLHYEL 298

Query: 307 TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTL 366
            E+ I  +  +FG L   D+  D  TG  KG+ F  Y D    D A  A+NG ++  + +
Sbjct: 299 KESDITSIFANFGALKLVDMSHDSSTGRHKGFCFIEYVDVKSADAALRAMNGFELAGRAI 358

Query: 367 TVRRA--TASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGG--GMSLFGETLAKVL 422
            V R   T SG +   +   L  A Q   +     Q S  +  GG  G  +  E L   +
Sbjct: 359 KVGRPLNTDSGVAGGIEGMGLPAAMQLPGMAAFMAQHS-TSAAGGAQGSGVAAEQLKAAM 417

Query: 423 CLTEAITADAL-ADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEY 481
            +T A     +   + E     E ++     +G +V   + +   N G   G G  F++Y
Sbjct: 418 GMTTAPAQTKIYVGNVEPHITTEMIKTVFEPFGMVVGAEMVQDPSNPGNHKGFG--FIQY 475

Query: 482 YDAVGCATAKNALSGRKFGGNTVNAFY 508
                  T  + +S  +  G T+   +
Sbjct: 476 AQESVARTVIDTMSSFELAGRTLRVAW 502


>gi|410961880|ref|XP_003987506.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Felis catus]
          Length = 445

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 208 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 251

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 252 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 310

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 311 TFSDSECARRALEQLNGFELAGRPMRVGHVT 341


>gi|281354415|gb|EFB29999.1| hypothetical protein PANDA_019778 [Ailuropoda melanoleuca]
          Length = 568

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 49/87 (56%), Gaps = 1/87 (1%)

Query: 298 FVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALN 357
            VG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   LN
Sbjct: 5   IVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNLN 64

Query: 358 GLKMGDKTLTVRRATASGQSKTEQESI 384
           G +   + L V  A AS ++K E +S+
Sbjct: 65  GREFSGRALRVDNA-ASEKNKEELKSL 90


>gi|343427636|emb|CBQ71163.1| related to HRP1-subunit of cleavage factor I [Sporisorium reilianum
           SRZ2]
          Length = 588

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 66/237 (27%), Positives = 100/237 (42%), Gaps = 18/237 (7%)

Query: 176 RVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRTVEEA 235
           ++++GGL     E ++  +FSQ          G    + +      + FAF+     +  
Sbjct: 168 KMFIGGLNWDTTEDSLRRYFSQF------GEVGHCTVMRDNMTGRSRGFAFLNFVNPKAV 221

Query: 236 SNAMA----LDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           +  +     LDG + +      R   D N       G GQ S N N    G   G    +
Sbjct: 222 NTVVVREHYLDGKVIDPKRAIPRPQRDSNFNAHHNGGQGQASYNNNGGGAGGGGGYNAQS 281

Query: 292 EGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDI 351
           +   ++FVGGLP   T    +   E FGTL     + DR+TGN +G+GF  YQD A    
Sbjct: 282 Q---KLFVGGLPASVTPASFRMFFEQFGTLAECTCMMDRETGNPRGFGFLTYQDDAALQH 338

Query: 352 ACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLG 408
             +    L    K + V+RA    QSK + +S+  + QQ I   +MA+   GM   G
Sbjct: 339 VLST-RPLVFDGKEVDVKRA----QSKNDPQSLQIRRQQRIDNPEMAMGGMGMQQPG 390


>gi|332222974|ref|XP_003260644.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Nomascus
           leucogenys]
          Length = 426

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 235 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|114652057|ref|XP_001159475.1| PREDICTED: probable RNA-binding protein 23 isoform 10 [Pan
           troglodytes]
          Length = 442

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG+SKGYGF 
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|410218752|gb|JAA06595.1| RNA binding motif protein 23 [Pan troglodytes]
 gi|410307724|gb|JAA32462.1| RNA binding motif protein 23 [Pan troglodytes]
 gi|410307728|gb|JAA32464.1| RNA binding motif protein 23 [Pan troglodytes]
          Length = 442

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG+SKGYGF 
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|116734696|ref|NP_001070819.1| probable RNA-binding protein 23 isoform 1 [Homo sapiens]
 gi|34925229|sp|Q86U06.1|RBM23_HUMAN RecName: Full=Probable RNA-binding protein 23; AltName:
           Full=RNA-binding motif protein 23; AltName:
           Full=RNA-binding region-containing protein 4; AltName:
           Full=Splicing factor SF2
 gi|28071058|emb|CAD61910.1| unnamed protein product [Homo sapiens]
 gi|119586629|gb|EAW66225.1| RNA binding motif protein 23, isoform CRA_d [Homo sapiens]
          Length = 439

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|28059803|gb|AAO30095.1| splicing factor-like protein [Arabidopsis thaliana]
          Length = 527

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 91/389 (23%), Positives = 149/389 (38%), Gaps = 78/389 (20%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           R  R V+   +P  A E+ +  FFS+               +++      K   ++E   
Sbjct: 165 RDQRTVFAYQMPLKATERDVYEFFSK------AGKVRDVRLIMDRNSRRSKGVGYIEFYD 218

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           V     A+AL G +F G  V V+ P++    LA +                  S  +GG 
Sbjct: 219 VMSVPMAIALSGQLFLGQPVMVK-PSEAEKNLAQS-----------------NSTTVGGT 260

Query: 292 EGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
              DR ++VG L +  +E Q++++ E+FG +    L  D +TG  KG+GF  +     + 
Sbjct: 261 GPADRKLYVGNLHFNMSELQLRQIFEAFGPVELVQLPLDPETGQCKGFGFIQFVQLEHSK 320

Query: 351 IACAALNG-LKMGDKTLTVR----------RATASGQSKTEQESILAQAQQHIAIQKMAL 399
            A  ALNG L++  +T+ V            A  S     +    LA   Q  A+    L
Sbjct: 321 AAQIALNGKLEIAGRTIKVSSVSDHIGTQDSAPKSADFDDDDGGGLALNAQSRAMLMQKL 380

Query: 400 QTSGMNT----------LGG------GM---------------SLFGETL---AKVLCLT 425
             SG+ T          L G      GM               S   E +   ++ L L 
Sbjct: 381 DRSGIATSIVGSLGVPGLNGAAFNQPGMNPSFPTSVLPTTAIPSFVNEHVGLPSECLLLK 440

Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
                    +     EI +D+ +EC KYG + ++ +   D+N       G V+L +    
Sbjct: 441 NMFDPATETEPNFDLEIRDDVADECSKYGPVNHIYV---DKNSA-----GFVYLRFQSVE 492

Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKY 514
             A A+ A+  R F    ++A + P  +Y
Sbjct: 493 AAAAAQRAMHMRWFAQKMISATFMPPHEY 521


>gi|410961882|ref|XP_003987507.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Felis catus]
          Length = 429

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 192 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 235

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 236 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 294

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 295 TFSDSECARRALEQLNGFELAGRPMRVGHVT 325


>gi|397473313|ref|XP_003808159.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Pan paniscus]
          Length = 442

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|296214508|ref|XP_002753659.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Callithrix
           jacchus]
          Length = 439

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 206 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 249

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 250 -MANNLQKGTGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 308

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 309 TFSDSECARRAMDQLNGFELAGRPMRVGHVT 339


>gi|149756180|ref|XP_001494897.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Equus
           caballus]
          Length = 430

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 194 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 237

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 238 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 297 TFSDSECARRALEQLNGFELAGRPMRVGHVT 327


>gi|410261842|gb|JAA18887.1| RNA binding motif protein 23 [Pan troglodytes]
 gi|410349771|gb|JAA41489.1| RNA binding motif protein 23 [Pan troglodytes]
          Length = 442

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG+SKGYGF 
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|12803481|gb|AAH02566.1| RNA binding motif protein 23 [Homo sapiens]
 gi|189055004|dbj|BAG37988.1| unnamed protein product [Homo sapiens]
 gi|312151810|gb|ADQ32417.1| RNA binding motif protein 23 [synthetic construct]
          Length = 424

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|119586634|gb|EAW66230.1| RNA binding motif protein 23, isoform CRA_h [Homo sapiens]
          Length = 467

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 235 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 278

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 279 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 337

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 338 TFSDSECARRALEQLNGFELAGRPMRVGHVT 368


>gi|116734694|ref|NP_060577.3| probable RNA-binding protein 23 isoform 2 [Homo sapiens]
 gi|18848317|gb|AAH24208.1| RNA binding motif protein 23 [Homo sapiens]
 gi|119586627|gb|EAW66223.1| RNA binding motif protein 23, isoform CRA_b [Homo sapiens]
 gi|119586633|gb|EAW66229.1| RNA binding motif protein 23, isoform CRA_b [Homo sapiens]
          Length = 423

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|410218750|gb|JAA06594.1| RNA binding motif protein 23 [Pan troglodytes]
 gi|410307726|gb|JAA32463.1| RNA binding motif protein 23 [Pan troglodytes]
          Length = 426

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG+SKGYGF 
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|119586630|gb|EAW66226.1| RNA binding motif protein 23, isoform CRA_e [Homo sapiens]
          Length = 449

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 217 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 260

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 261 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 319

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 320 TFSDSECARRALEQLNGFELAGRPMRVGHVT 350


>gi|18416114|ref|NP_568220.1| RNA-binding protein 39 [Arabidopsis thaliana]
 gi|15451046|gb|AAK96794.1| splicing factor-like protein [Arabidopsis thaliana]
 gi|332004077|gb|AED91460.1| RNA-binding protein 39 [Arabidopsis thaliana]
          Length = 527

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/389 (23%), Positives = 149/389 (38%), Gaps = 78/389 (20%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           R  R V+   +P  A E+ +  FFS+               +++      K   ++E   
Sbjct: 165 RDQRTVFAYQMPLKATERDVYEFFSK------AGKVRDVRLIMDRNSRRSKGVGYIEFYD 218

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           V     A+AL G +F G  V V+ P++    LA +                  S  +GG 
Sbjct: 219 VMSVPMAIALSGQLFLGQPVMVK-PSEAEKNLAQS-----------------NSTTVGGT 260

Query: 292 EGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
              DR ++VG L +  +E Q++++ E+FG +    L  D +TG  KG+GF  +     + 
Sbjct: 261 GPADRKLYVGNLHFNMSELQLRQIFEAFGPVELVQLPLDPETGQCKGFGFIQFVQLEHSK 320

Query: 351 IACAALNG-LKMGDKTLTVR----------RATASGQSKTEQESILAQAQQHIAIQKMAL 399
            A  ALNG L++  +T+ V            A  S     +    LA   Q  A+    L
Sbjct: 321 AAQIALNGKLEIAGRTIKVSSVSDHIGTQDSAPKSADFDDDDGGGLALNAQSRAMLMQKL 380

Query: 400 QTSGMNT----------LGG------GM---------------SLFGETL---AKVLCLT 425
             SG+ T          L G      GM               S   E +   ++ L L 
Sbjct: 381 DRSGIATSIVGSLGVPGLNGAAFNQPGMNPSFPTSVLPTTAIPSFVNEHVGLPSECLLLK 440

Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
                    +     EI +D+ +EC KYG + ++ +   D+N       G V+L +    
Sbjct: 441 NMFDPATETEPNFDLEIRDDVADECSKYGPVNHIYV---DKNSA-----GFVYLRFQSVE 492

Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKY 514
             A A+ A+  R F    ++A + P  +Y
Sbjct: 493 AAAAAQRAMHMRWFAQKMISATFMPPHEY 521


>gi|114652069|ref|XP_001159523.1| PREDICTED: probable RNA-binding protein 23 isoform 11 [Pan
           troglodytes]
          Length = 426

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG+SKGYGF 
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|410261840|gb|JAA18886.1| RNA binding motif protein 23 [Pan troglodytes]
 gi|410349769|gb|JAA41488.1| RNA binding motif protein 23 [Pan troglodytes]
 gi|410349773|gb|JAA41490.1| RNA binding motif protein 23 [Pan troglodytes]
          Length = 426

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG+SKGYGF 
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|255085602|ref|XP_002505232.1| predicted protein [Micromonas sp. RCC299]
 gi|226520501|gb|ACO66490.1| predicted protein [Micromonas sp. RCC299]
          Length = 211

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/205 (27%), Positives = 89/205 (43%), Gaps = 33/205 (16%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKK-----FAFVE 228
           A ++YVG LP   N + +   F      +           ++V  + E++     FAFV 
Sbjct: 32  AAKLYVGHLPSTMNAERMLEMFKPFGRVLQ----------IDVIPDRERQLSCKGFAFVL 81

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
             T EEA  A AL+G + EG ++ VR   +          P  P   +N     +   A 
Sbjct: 82  FSTPEEAIAAKALNGHVVEGKSIDVRLKAE----------PRAPREPVNAPVAPVNDDA- 130

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
                  +++V  +P ++   ++K LL+ +G      ++ DR+TG S+G+GF    D   
Sbjct: 131 -------KLYVAYMPDHYRAEELKMLLQPYGLPSDVRVITDRETGRSRGFGFAQMMDEQQ 183

Query: 349 TDIACAALNGLKMGDKTLTVRRATA 373
              A   LNG  +  KTL VR A A
Sbjct: 184 AMAAIQGLNGQMLDGKTLVVRIAGA 208


>gi|221129809|ref|XP_002164481.1| PREDICTED: RNA-binding motif protein, X-linked 2-like [Hydra
           magnipapillata]
          Length = 255

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 44/72 (61%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           +F+GGLPY  TE  +  +   +G +   +LV+D+ TG  KGYGF  Y+D   T +A    
Sbjct: 39  IFIGGLPYDLTEGDVLAVFSQYGEIVNINLVRDKKTGKFKGYGFLCYEDQRSTILAVDNF 98

Query: 357 NGLKMGDKTLTV 368
           NG+K+G +T+ V
Sbjct: 99  NGIKLGGRTIRV 110


>gi|326431687|gb|EGD77257.1| hypothetical protein PTSG_08350 [Salpingoeca sp. ATCC 50818]
          Length = 397

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 47/84 (55%)

Query: 293 GPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIA 352
            P  V+VG +PY  TE ++ +   + G +  F L+ D++TG SKG+GFC + D A  + A
Sbjct: 3   APTSVWVGNIPYEATEEELIKFFSAVGDVKNFHLITDQNTGRSKGFGFCYFLDAAAAESA 62

Query: 353 CAALNGLKMGDKTLTVRRATASGQ 376
              L+G  + D+ L V  AT   Q
Sbjct: 63  VRNLSGQPLRDRPLRVDLATPRSQ 86


>gi|261858408|dbj|BAI45726.1| RNA binding motif protein 23 [synthetic construct]
          Length = 406

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306


>gi|225711846|gb|ACO11769.1| Cleavage stimulation factor 64 kDa subunit [Lepeophtheirus
           salmonis]
          Length = 330

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 50/93 (53%), Gaps = 4/93 (4%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +  F LV DR+ G  KGYGFC Y+D  +   A   L
Sbjct: 18  VFVGNIPYEATEEKLKDIFSEVGPVTSFKLVYDRENGKPKGYGFCEYKDADMALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQ----SKTEQESIL 385
           NG ++  +TL V  A         +K E E I+
Sbjct: 78  NGYEIEGRTLRVDNACTEKNRLEMAKGEAEEIV 110


>gi|397473311|ref|XP_003808158.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Pan paniscus]
          Length = 426

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|198476543|ref|XP_001357388.2| GA10876 [Drosophila pseudoobscura pseudoobscura]
 gi|198137744|gb|EAL34457.2| GA10876 [Drosophila pseudoobscura pseudoobscura]
          Length = 625

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 80/200 (40%), Gaps = 28/200 (14%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKF---AFVE 228
           R AR V+   L      + +  FFS         S G    V  +  N  K+F   A++E
Sbjct: 264 RDARTVFCIQLSQRVRARDLEEFFS---------SVGKVRDVRLITCNKTKRFKGIAYIE 314

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAI 288
               E  + A+ L G    GV + V+        L  A    QP  ++            
Sbjct: 315 FEDPESVALALGLSGQRLLGVPIMVQHTQAEKNRLQNATPAFQPKSHV------------ 362

Query: 289 GGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAV 348
               GP R++VG L +  TE  ++ + E FG +    L+ D +T  SKGYGF  Y +   
Sbjct: 363 ----GPMRLYVGSLHFDITEEMLRGIFEPFGKIDAIQLIMDTETNRSKGYGFITYHNAED 418

Query: 349 TDIACAALNGLKMGDKTLTV 368
              A   LNG ++  + + V
Sbjct: 419 AKKALEQLNGFELAGRPMKV 438



 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 34/74 (45%), Gaps = 7/74 (9%)

Query: 441 EILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFG 500
           EI +D+ EEC K+G ++++ +             G V+++         A NAL GR F 
Sbjct: 535 EIRDDVLEECAKHGGVLHIHV-------DTASPTGTVYVKCPSTTTAVLAVNALHGRWFA 587

Query: 501 GNTVNAFYYPEDKY 514
           G  + A Y P   Y
Sbjct: 588 GRVITAAYVPVVNY 601


>gi|395547925|ref|XP_003775192.1| PREDICTED: cleavage stimulation factor subunit 2 [Sarcophilus
           harrisii]
          Length = 556

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 299 VGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNG 358
           VG +PY  TE Q+K++    G +  F LV DR+TG  KGYGFC YQD      A   LNG
Sbjct: 5   VGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQETALSAMRNLNG 64

Query: 359 LKMGDKTLTVRRATASGQSKTEQESI 384
            +   + L V  A AS ++K E +S+
Sbjct: 65  REFSGRALRVDNA-ASEKNKEELKSL 89


>gi|7022544|dbj|BAA91638.1| unnamed protein product [Homo sapiens]
          Length = 406

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306


>gi|256085765|ref|XP_002579083.1| rna recognition motif containing protein [Schistosoma mansoni]
 gi|360043212|emb|CCD78624.1| putative rna recognition motif containing protein [Schistosoma
           mansoni]
          Length = 412

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           +FVG +PY  TE ++ EL    G + GF LV DR++G  KGYGFC Y +PA+   A   L
Sbjct: 18  IFVGNIPYEATEEKLIELFSKAGPVIGFRLVYDRESGKPKGYGFCEYNNPAIAASALRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
             ++   + L  R   A+G+  + + ++
Sbjct: 78  QNIEFNGRPL--RIGPAAGEQNSAELAL 103


>gi|351697087|gb|EHB00006.1| Putative RNA-binding protein 23 [Heterocephalus glaber]
          Length = 436

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/203 (27%), Positives = 85/203 (41%), Gaps = 35/203 (17%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYI------NHEKKFA 225
           R AR V+   L      + +  FFS    AIG         V +V I         K  A
Sbjct: 163 RDARTVFCMQLAARIRPRDLEDFFS----AIG--------KVHDVRIISDRNSRRSKGIA 210

Query: 226 FVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLAS 285
           +VE   ++    A+ L G    GV + V+        LAA                 +A+
Sbjct: 211 YVEFCDIQSVPLAIGLTGQRLLGVPIVVQASQAEKNRLAA-----------------MAN 253

Query: 286 GAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD 345
               G+ GP R++VG L +  TE  ++ + E FG +    L+KD +TG+SKGYGF  + +
Sbjct: 254 NLQKGSGGPKRLYVGCLHFNITEDMLRGIFEPFGKIENIVLMKDSETGHSKGYGFITFSE 313

Query: 346 PAVTDIACAALNGLKMGDKTLTV 368
                 A   LNG ++  + + V
Sbjct: 314 SECARRAVEQLNGFELAGRPMRV 336


>gi|149756182|ref|XP_001494921.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Equus
           caballus]
          Length = 412

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 176 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 219

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 220 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 278

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 279 TFSDSECARRALEQLNGFELAGRPMRVGHVT 309


>gi|335310533|ref|XP_003362077.1| PREDICTED: probable RNA-binding protein 23 [Sus scrofa]
          Length = 443

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 209 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 252

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 253 -MANNLQKGTGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 311

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + +   T
Sbjct: 312 TFSDSECARRALEQLNGFELAGRPMRIGHVT 342


>gi|116734698|ref|NP_001070820.1| probable RNA-binding protein 23 isoform 3 [Homo sapiens]
 gi|119586628|gb|EAW66224.1| RNA binding motif protein 23, isoform CRA_c [Homo sapiens]
          Length = 405

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306


>gi|426376372|ref|XP_004054975.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Gorilla
           gorilla gorilla]
          Length = 437

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 207 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 251 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 309

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 310 TFSDSECARRALEQLNGFELAGRPMRVGHVT 340


>gi|294882869|ref|XP_002769861.1| splicing factor, putative [Perkinsus marinus ATCC 50983]
 gi|239873674|gb|EER02579.1| splicing factor, putative [Perkinsus marinus ATCC 50983]
          Length = 364

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/208 (26%), Positives = 86/208 (41%), Gaps = 28/208 (13%)

Query: 166 MTQQATRHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV--NVYINHEKK 223
           M QQA R    V V G+ P   E+ +  F SQ       N+    D  V  +   N  K 
Sbjct: 146 MVQQAHRDDCTVMVMGIHPKCTEKEVYVFMSQ-------NAGKVRDVQVIRDPRTNRSKG 198

Query: 224 FAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGL 283
            A+VE  T +    A+A +G    G  +R++                Q   N    A  +
Sbjct: 199 VAYVEFYTPDSILKALACNGQALMGHPIRIQ--------------ASQAEKNRAAEAARV 244

Query: 284 ASGAIGGAEGPDRVFVGGLP---YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGF 340
                   + P RV+VGGL     +  E++I++L   FG +   ++ K   TG  +G+ F
Sbjct: 245 VQNQ--QQDLPMRVYVGGLTGVLIHLQESEIRKLFAPFGDIQCIEIAKSPYTGRPRGFAF 302

Query: 341 CVYQDPAVTDIACAALNGLKMGDKTLTV 368
            +Y       +A AA++  ++ D TL V
Sbjct: 303 VIYSRACDARVAIAAMHKYRIADTTLEV 330


>gi|296483628|tpg|DAA25743.1| TPA: RNA binding motif protein 23 [Bos taurus]
          Length = 463

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 80/190 (42%), Gaps = 24/190 (12%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 222 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 265

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L    TE  ++ +LE FG +    L+KD +TG SKGYGF 
Sbjct: 266 -MANNLQKGSGGPVRLYVGSLHCNITEDMLRGILEPFGKIDNIVLMKDSETGRSKGYGFI 324

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-------EQESILAQAQQHIAI 394
            + D      A   LNG ++  + + +   T      T       +QE  L  A  H+ +
Sbjct: 325 TFSDSECARRALEQLNGFELAGRPMRIGHVTERPDGGTDITFPDGDQELDLGSAGGHLQL 384

Query: 395 QKMALQTSGM 404
                + SG+
Sbjct: 385 MAKLAEGSGI 394


>gi|198418855|ref|XP_002123179.1| PREDICTED: similar to cleavage stimulation factor, 3 pre-RNA,
           subunit 2, 64kDa [Ciona intestinalis]
          Length = 455

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 49/92 (53%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+K++    G +  F LV DR++G  KGYGF  YQD      +   L
Sbjct: 18  VFVGNIPYEATEEQLKDIFNEVGNVISFRLVFDRESGKPKGYGFAEYQDKETALSSMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQA 388
           NG ++  + L V  AT+      E  ++ + A
Sbjct: 78  NGRELHGRPLRVDHATSERNRNDEFNNLRSMA 109


>gi|10880789|gb|AAG24388.1|AF275678_1 PP239 protein [Homo sapiens]
          Length = 418

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 185 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 228

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 229 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 287

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 288 TFSDSECARRALEQLNGFELAGRPMRVGHVT 318


>gi|255082273|ref|XP_002508355.1| predicted protein [Micromonas sp. RCC299]
 gi|226523631|gb|ACO69613.1| predicted protein [Micromonas sp. RCC299]
          Length = 518

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 94/396 (23%), Positives = 156/396 (39%), Gaps = 81/396 (20%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINH---EKKFAFVE 228
           R  R V+   L   A+E+ I  FFS+         AG  + V  +Y  +    K  A++E
Sbjct: 133 RDTRTVFAYNLSTKADERDIYQFFSK---------AGTVNDVRIIYDRNTPRSKGMAYIE 183

Query: 229 MRTVEEASNAMALDGIIFEGVAVRVR-RPTDYNPTLAAALGPGQPSPNLNLAAVG----- 282
                  ++A+AL G +     V V+    + N    A     Q    L + A+G     
Sbjct: 184 FADKANITDALALTGQMLRNQVVMVKASEAEKNIAWEAE----QAQKKLEMKALGATDPA 239

Query: 283 ---LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
               A  A     GP ++ V GL     ET +K + E FG      + +D  TG S+G G
Sbjct: 240 SAAAAVNAQAHGNGPCKLQVHGLDVNIGETDLKAVFEPFGETDFISIQRD-STGRSRGVG 298

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-------------EQESILA 386
           F  Y+      +A + LNGL++  ++L V  A  +  +               EQE +  
Sbjct: 299 FVQYKQTQHAVLAISQLNGLELVGQSLKVTMAPIAASTLNAAQAASMVTDKIDEQEGVRL 358

Query: 387 QAQQHIAIQ-KMALQ--TSGMNTLGG-----GMSLFGETLA------------------- 419
            ++   A+  K+A Q  T G    GG     G+ +  E +A                   
Sbjct: 359 DSRSRAALMAKLAGQDETQGALYSGGIDPKTGLPVSAEEMAAAQRAAHMTEVEFAQGVLG 418

Query: 420 -------KVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETP 472
                  + L L          + E + +I ED+++EC K+G + ++ + +  +      
Sbjct: 419 PASPIPTQCLLLKNMFDPAEETEPEWWIDIGEDVKDECSKHGPVSHIHVDKESR------ 472

Query: 473 GVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
             G V+L++    G + A+ AL GR F G  + A +
Sbjct: 473 --GFVYLKFGSTEGASAARQALHGRWFAGKMIAAEF 506


>gi|410961884|ref|XP_003987508.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Felis catus]
          Length = 411

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 174 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 217

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 218 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 276

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 277 TFSDSECARRALEQLNGFELAGRPMRVGHVT 307


>gi|115497272|ref|NP_001069104.1| probable RNA-binding protein 23 [Bos taurus]
 gi|113911797|gb|AAI22594.1| RNA binding motif protein 23 [Bos taurus]
          Length = 463

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 80/190 (42%), Gaps = 24/190 (12%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 222 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 265

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L    TE  ++ +LE FG +    L+KD +TG SKGYGF 
Sbjct: 266 -MANNLQKGSGGPVRLYVGSLHCNITEDMLRGILEPFGKIDNIVLMKDSETGRSKGYGFI 324

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-------EQESILAQAQQHIAI 394
            + D      A   LNG ++  + + +   T      T       +QE  L  A  H+ +
Sbjct: 325 TFSDSECARRALEQLNGFELAGRPMRIGHVTERPDGGTDITFPDGDQELDLGSAGGHLQL 384

Query: 395 QKMALQTSGM 404
                + SG+
Sbjct: 385 MAKLAEGSGI 394


>gi|114652071|ref|XP_522797.2| PREDICTED: probable RNA-binding protein 23 isoform 12 [Pan
           troglodytes]
          Length = 408

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG+SKGYGF 
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGHSKGYGFI 275

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306


>gi|426376370|ref|XP_004054974.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Gorilla
           gorilla gorilla]
          Length = 421

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|397473315|ref|XP_003808160.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Pan paniscus]
          Length = 408

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306


>gi|291403543|ref|XP_002718110.1| PREDICTED: RNA binding motif protein 23 isoform 3 [Oryctolagus
           cuniculus]
          Length = 428

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 193 KGIAYVEFCDIQAVPLAIGLTGQRLLGVPIMVQASQAEKNRLAA---------------- 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 237 -MANNLQKGSGGPLRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 295

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 296 TFSDSECGRRALEQLNGFELAGRPMRVGHVT 326


>gi|226466604|emb|CAX69437.1| cleavage stimulation factor, 3' pre-RNA, subunit 2, 64kDa
           [Schistosoma japonicum]
          Length = 414

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           +FVG +PY  TE ++ EL    G + GF LV DR++G  KGYGFC Y +PA+   A   L
Sbjct: 18  IFVGNIPYEATEEKLIELFSKAGPVIGFRLVYDRESGKPKGYGFCEYNNPAIAASALRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESI 384
             ++   + L  R   A+G+  + + ++
Sbjct: 78  QNIEFNGRPL--RIGPAAGEQNSAELAL 103


>gi|291403541|ref|XP_002718109.1| PREDICTED: RNA binding motif protein 23 isoform 2 [Oryctolagus
           cuniculus]
          Length = 444

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 209 KGIAYVEFCDIQAVPLAIGLTGQRLLGVPIMVQASQAEKNRLAA---------------- 252

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 253 -MANNLQKGSGGPLRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 311

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 312 TFSDSECGRRALEQLNGFELAGRPMRVGHVT 342


>gi|224081877|ref|XP_002306512.1| predicted protein [Populus trichocarpa]
 gi|222855961|gb|EEE93508.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 90/347 (25%), Positives = 138/347 (39%), Gaps = 76/347 (21%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNL-NLAA 280
           K   +VE         A+AL G +  G  V V+              P +   NL   +A
Sbjct: 35  KGVGYVEFYDAMSVPMAIALSGQLLFGQPVMVK--------------PSEAEKNLVQSSA 80

Query: 281 VGLASGAIGGAEGP-DR-VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGY 338
               +  + G  GP DR ++VG L +  TE Q+++L E FGT+    L  D +TG  KG+
Sbjct: 81  SSGGTSGVAGPFGPVDRKLYVGNLHFNMTEMQLRQLFEPFGTVELVQLPLDLETGQCKGF 140

Query: 339 GFCVYQDPAVTDIACAALNG-LKMGDKTLTVRRATA-SGQSKTEQES---------ILAQ 387
           GF  +        A +ALNG L++  +T+ V   T   GQ  T  +S          LA 
Sbjct: 141 GFVQFTQLENAKAAQSALNGKLEIAGRTIKVSSVTEHGGQQDTGAKSADFDDDDGGGLAL 200

Query: 388 AQQHIAIQKMALQTSGMNTLGGG------------------MSLFGETL-------AKVL 422
             Q  A+    L  +G+ T   G                  + + G+T        A VL
Sbjct: 201 NAQSRALLMQKLDRTGIATSIAGSLGVPLLNGSASNQQAISLPIIGQTAIGAAALPAPVL 260

Query: 423 -------------CLTEAITADALADDE-EYE-EILEDMREECGKYGTLVNVVIPRPDQN 467
                        CL      D   + E +++ +I ED+ EEC KYG + ++ +   D+N
Sbjct: 261 SSPAYEPIGQPSECLMLKNMFDPATETEPDFDLDIKEDVEEECSKYGQVEHIFV---DKN 317

Query: 468 GGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKY 514
                  G V+L +      A A+ A+  R F    + A + P  +Y
Sbjct: 318 -----STGCVYLRFGSIEAAAGAQRAMHMRWFARRLILAVFMPTREY 359


>gi|196005405|ref|XP_002112569.1| hypothetical protein TRIADDRAFT_56724 [Trichoplax adhaerens]
 gi|190584610|gb|EDV24679.1| hypothetical protein TRIADDRAFT_56724 [Trichoplax adhaerens]
          Length = 316

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 50/91 (54%)

Query: 283 LASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCV 342
           +AS  I        VFVG +PY  TE Q+K++  S G +  F LV DR++G  KGYGFC 
Sbjct: 1   MASATISRERSLRSVFVGNIPYEATEEQLKDIFGSAGPVVSFRLVYDRESGKPKGYGFCE 60

Query: 343 YQDPAVTDIACAALNGLKMGDKTLTVRRATA 373
           +QD      A   L+G ++  ++L V  A +
Sbjct: 61  FQDKETALSAMRNLSGYELNGRSLRVDSAAS 91


>gi|91085985|ref|XP_972080.1| PREDICTED: similar to CG10466 CG10466-PA [Tribolium castaneum]
 gi|270010180|gb|EFA06628.1| hypothetical protein TcasGA2_TC009547 [Tribolium castaneum]
          Length = 266

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 46/72 (63%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VF+GGLP+  TE  I  +   +G +   +L++D+D+G SKG+ F  Y+D   TD+A    
Sbjct: 36  VFIGGLPFDLTEGDIICIFSQYGEVVNINLIRDKDSGKSKGFCFLCYEDQRSTDLAVDNF 95

Query: 357 NGLKMGDKTLTV 368
           NG+K+ ++T+ V
Sbjct: 96  NGIKILNRTIRV 107


>gi|255088499|ref|XP_002506172.1| predicted protein [Micromonas sp. RCC299]
 gi|226521443|gb|ACO67430.1| predicted protein [Micromonas sp. RCC299]
          Length = 628

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 64/258 (24%), Positives = 110/258 (42%), Gaps = 24/258 (9%)

Query: 139 AVPEMAQNMLPFGATQLGAFPLMPVQVMTQQATRHARRVYVGGLPPLANEQAIATFFSQV 198
           AV E A+      A  + A  L+  +V  +  T+ ARRV++G +      +  A      
Sbjct: 213 AVREEAKGKRGMNAAMVDAL-LLHNKVRWRDDTKPARRVHIGNVNAGVKAEEFARVLETR 271

Query: 199 MTAIGGNSA----------------GPGDAVV-NVYINHEKKFAFVEMRTVEEASNAMAL 241
           +  +   +                  PG  V+ ++Y+N +K F F+E   +E+    +AL
Sbjct: 272 IRTLSPEAVPWHYPLDKRGRVDERRAPGTRVIEHLYLN-DKGFGFLETTALEDVPAILAL 330

Query: 242 DGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDRVFVGG 301
           +G+   G   R RRP DY+P     +  G            + S  +   + P +VFVGG
Sbjct: 331 NGVRVNGGVTRFRRPKDYDPDNNPLVRDGSYRDVFQRVFTAVLSDEV--VDSPTKVFVGG 388

Query: 302 L-PYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQD-PAVTDIACAALNGL 359
           + P   T+  + E++ SFG L  F    D   G  +G+ +  Y +  +V   A A L+G 
Sbjct: 389 VEPRALTKLDLLEIVSSFGALTAFRCETD-GAGLCRGFAWMEYAEGESVAAKAVAGLSGY 447

Query: 360 KMGDKTLTVRRATASGQS 377
           ++  K +    AT   ++
Sbjct: 448 QLRGKPIAAALATPRAEA 465


>gi|313243391|emb|CBY42167.1| unnamed protein product [Oikopleura dioica]
          Length = 199

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/88 (43%), Positives = 50/88 (56%), Gaps = 6/88 (6%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA- 355
           VFVG +PY  TE QI+++    G +  F LV DR+TG  KGYGFC Y+D   TD A +A 
Sbjct: 27  VFVGNIPYEATEEQIRDIFNEVGVVLSFRLVYDRETGKPKGYGFCEYKD---TDTAMSAM 83

Query: 356 --LNGLKMGDKTLTVRRATASGQSKTEQ 381
             LN  ++  + L V  AT     + EQ
Sbjct: 84  RNLNTRELHGRNLRVDHATRDHGVEKEQ 111


>gi|255724230|ref|XP_002547044.1| predicted protein [Candida tropicalis MYA-3404]
 gi|240134935|gb|EER34489.1| predicted protein [Candida tropicalis MYA-3404]
          Length = 700

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 53/104 (50%), Gaps = 7/104 (6%)

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQ-------NGGET 471
           +KV+ L  A+T   L+D E Y+    D+  E  KYG +  VVIPRP +           +
Sbjct: 593 SKVIRLLNAVTERELSDVETYKFTKNDIYREASKYGVVEQVVIPRPIRGRTPGILKLNRS 652

Query: 472 PGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYF 515
           PG+G VF+EY D     TA   LSGR +   TV A ++  D Y 
Sbjct: 653 PGMGSVFIEYKDEKTALTAMMELSGRTYNDRTVLATFFDYDDYL 696


>gi|227206234|dbj|BAH57172.1| AT5G09880 [Arabidopsis thaliana]
          Length = 505

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 91/389 (23%), Positives = 149/389 (38%), Gaps = 78/389 (20%)

Query: 172 RHARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHEKKFAFVEMRT 231
           R  R V+   +P  A E+ +  FFS+               +++      K   ++E   
Sbjct: 143 RDQRTVFAYQMPLKATERDVYEFFSK------AGKVRDVRLIMDRNSRRSKGVGYIEFYD 196

Query: 232 VEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGA 291
           V     A+AL G +F G  V V+ P++    LA +                  S  +GG 
Sbjct: 197 VMSVPMAIALSGQLFLGQPVMVK-PSEAEKNLAQS-----------------NSTTVGGT 238

Query: 292 EGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTD 350
              DR ++VG L +  +E Q++++ E+FG +    L  D +TG  KG+GF  +     + 
Sbjct: 239 GPADRKLYVGNLHFNMSELQLRQIFEAFGPVELVQLPLDPETGQCKGFGFIQFVQLEHSK 298

Query: 351 IACAALNG-LKMGDKTLTVR----------RATASGQSKTEQESILAQAQQHIAIQKMAL 399
            A  ALNG L++  +T+ V            A  S     +    LA   Q  A+    L
Sbjct: 299 AAQIALNGKLEIAGRTIKVSSVSDHIGTQDSAPKSADFDDDDGGGLALNAQSRAMLMQKL 358

Query: 400 QTSGMNT----------LGG------GM---------------SLFGETL---AKVLCLT 425
             SG+ T          L G      GM               S   E +   ++ L L 
Sbjct: 359 DRSGIATSIVGSLGVPGLNGAAFNQPGMNPSFPTSVLPTTAIPSFVNEHVGLPSECLLLK 418

Query: 426 EAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAV 485
                    +     EI +D+ +EC KYG + ++ +   D+N       G V+L +    
Sbjct: 419 NMFDPATETEPNFDLEIRDDVADECSKYGPVNHIYV---DKNSA-----GFVYLRFQSVE 470

Query: 486 GCATAKNALSGRKFGGNTVNAFYYPEDKY 514
             A A+ A+  R F    ++A + P  +Y
Sbjct: 471 AAAAAQRAMHMRWFAQKMISATFMPPHEY 499


>gi|426376374|ref|XP_004054976.1| PREDICTED: probable RNA-binding protein 23 isoform 3 [Gorilla
           gorilla gorilla]
          Length = 403

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 173 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 216

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 217 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 275

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 276 TFSDSECARRALEQLNGFELAGRPMRVGHVT 306


>gi|291400895|ref|XP_002716702.1| PREDICTED: RNA binding motif protein, X-linked 2 [Oryctolagus
           cuniculus]
          Length = 198

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 45/76 (59%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVGGLPY  TE  I  +   +G +    LV+D+ TG S+G+GF  Y+D   T +A    
Sbjct: 37  VFVGGLPYELTEGDILCVFSQYGEIVNIHLVRDKKTGKSRGFGFICYEDQRSTVLAVDNF 96

Query: 357 NGLKMGDKTLTVRRAT 372
           NG+K+  +T+ V  A+
Sbjct: 97  NGIKIKGRTIRVDHAS 112


>gi|440797518|gb|ELR18604.1| RNA recognition motif domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 696

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 99/251 (39%), Gaps = 60/251 (23%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVV---NVYINHEKKFAFVEMR 230
           A R+YVG L    +E+ I T FS           GP  +V    +      K FAFVE  
Sbjct: 250 ACRIYVGSLNFELSEEDIKTAFSPF---------GPVKSVSLTKDPLTQRSKGFAFVEYA 300

Query: 231 TVEEASNAMA-LDGIIFEGVAVRVRRP------------------------TDYNPTLAA 265
             + A+ A+  ++G +  G  ++V RP                           NP+L  
Sbjct: 301 YPDAATAALKHMNGFMLAGRQLKVGRPHTPGAGLPGMPGMPGVMMPGLSPFPQLNPSLPV 360

Query: 266 ALGPG----------------------QPSPNLNLAAVGLASGAIGGAEGPDRVFVGGLP 303
            + P                       QP+P +      L       A   +R++VG + 
Sbjct: 361 -MNPSILLQANAAIEAQKAAAAAANGSQPTPEMMQEFTKLTGKTAADATASNRIYVGSIH 419

Query: 304 YYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGD 363
           +  T   IK + E+FGT+    L+ + +TG  KGYGF  Y++    + A   +NG  +G 
Sbjct: 420 WDLTSDDIKTVFEAFGTVKSCVLMPNPETGKHKGYGFVEYEESKSAEEAIQQMNGWDLGG 479

Query: 364 KTLTVRRATAS 374
           + + V RA +S
Sbjct: 480 RPIKVGRAISS 490



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 71/147 (48%), Gaps = 22/147 (14%)

Query: 377 SKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADD 436
           S  E  ++    Q++  +QK+A       T+ GG S            +  +    +A  
Sbjct: 567 SHEENLTLSTPNQRYALMQKLA-----RGTITGGKS------------SRCVVLKDMAGP 609

Query: 437 EEYEEILE-DMREECGKYGTLVNVVIPRPDQNGGETPG--VGKVFLEYYDAVGCATAKNA 493
           E+ ++ LE ++ +E  KYG +  VVI +  Q+  E PG  + K+F+ +  A     A  +
Sbjct: 610 EDVDDELEGEITDEATKYGIVERVVIYQERQS--EKPGDVIIKIFILFQSADQAQKALTS 667

Query: 494 LSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           L+GR FGG  + A +Y E K+  +DYS
Sbjct: 668 LNGRWFGGRQIKAAFYDEKKFLAEDYS 694



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/240 (24%), Positives = 88/240 (36%), Gaps = 40/240 (16%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           R++VG L +  +E  IK     FG +    L KD  T  SKG+ F  Y  P     A   
Sbjct: 252 RIYVGSLNFELSEEDIKTAFSPFGPVKSVSLTKDPLTQRSKGFAFVEYAYPDAATAALKH 311

Query: 356 LNGLKMGDKTLTVRRATASGQS-------------------------KTEQESILAQAQQ 390
           +NG  +  + L V R    G                                SIL QA  
Sbjct: 312 MNGFMLAGRQLKVGRPHTPGAGLPGMPGMPGVMMPGLSPFPQLNPSLPVMNPSILLQANA 371

Query: 391 HIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCLTEAITADALADDEEY------EEILE 444
            I  QK A   +       G     E + +   LT    ADA A +  Y      +   +
Sbjct: 372 AIEAQKAAAAAA------NGSQPTPEMMQEFTKLTGKTAADATASNRIYVGSIHWDLTSD 425

Query: 445 DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTV 504
           D++     +GT+ + V+  P+   G+  G G  F+EY ++     A   ++G   GG  +
Sbjct: 426 DIKTVFEAFGTVKSCVL-MPNPETGKHKGYG--FVEYEESKSAEEAIQQMNGWDLGGRPI 482


>gi|260942693|ref|XP_002615645.1| hypothetical protein CLUG_04527 [Clavispora lusitaniae ATCC 42720]
 gi|238850935|gb|EEQ40399.1| hypothetical protein CLUG_04527 [Clavispora lusitaniae ATCC 42720]
          Length = 559

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 56/215 (26%), Positives = 93/215 (43%), Gaps = 12/215 (5%)

Query: 307 TETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACA-ALNGLKM-GDK 364
           TET + + L+    +  F LV+   T  S G  F  +    ++   C    + LK+ G  
Sbjct: 342 TETMLLDDLQKIAKVKAFKLVRAVGTKESLGVAFVEF---YISSKECTNTKSALKLIGTY 398

Query: 365 TLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLFGETLAKVLCL 424
               ++     + +     I        +IQ   +    + +L     +     +KV+ L
Sbjct: 399 VEEAKKLDIVSKIEFSCIKIGENYTSLTSIQDCPIDFKTLKSLVRNEYVQFHPKSKVIQL 458

Query: 425 TEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGG------ETPGVGKVF 478
              +T + L +DE Y+ I  D+ EE   +GT++++ IP+P             PGVGKVF
Sbjct: 459 INIVTIEDLCNDETYKFIYSDIFEEAKTFGTVLSLKIPKPSYKKSPGVEEVNEPGVGKVF 518

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTV-NAFYYPED 512
           +EY D     +A   L+GR +   TV  AF+  ED
Sbjct: 519 VEYEDEKTALSAIMGLAGRSYNDRTVLCAFFNHED 553


>gi|440906315|gb|ELR56591.1| Putative RNA-binding protein 23 [Bos grunniens mutus]
          Length = 463

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 79/190 (41%), Gaps = 24/190 (12%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 222 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 265

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L    TE  ++ +LE FG +    L+KD +TG SKGYGF 
Sbjct: 266 -MANNLQKGNGGPVRLYVGSLHCNITEDMLRGILEPFGKIDNIVLMKDSETGRSKGYGFI 324

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKT-------EQESILAQAQQHIAI 394
            + D      A   LNG ++  + + +   T      T       +QE  L  A  H+ +
Sbjct: 325 TFSDSECARRALEQLNGFELAGRPMRIGHVTERPDGGTDITFPDGDQELDLGSAAGHLQL 384

Query: 395 QKMALQTSGM 404
                + SG+
Sbjct: 385 MAKLAEGSGI 394


>gi|170054071|ref|XP_001862961.1| cleavage stimulation factor 64 kDa subunit [Culex quinquefasciatus]
 gi|167874431|gb|EDS37814.1| cleavage stimulation factor 64 kDa subunit [Culex quinquefasciatus]
          Length = 400

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 52/93 (55%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +    LV DR++G  KGYGFC Y+D      A   L
Sbjct: 17  VFVGNIPYEATEEKLKDIFSEVGPVISLKLVFDRESGKPKGYGFCEYKDQETALSAMRNL 76

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G + L V  A  + +S+ E  ++L   Q
Sbjct: 77  NGYEIGGRALRVDNA-CTEKSRMEMAALLQGPQ 108


>gi|324514401|gb|ADY45855.1| Cleavage stimulation factor subunit 2 [Ascaris suum]
          Length = 324

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 44/84 (52%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG + Y   E Q+K++    G +    LV DR+TG  KGYGFC Y DP   + A   L
Sbjct: 25  VFVGNISYEVGEEQLKQVFSQVGPVVHLRLVHDRETGKPKGYGFCEYNDPQTAESAIRNL 84

Query: 357 NGLKMGDKTLTVRRATASGQSKTE 380
           NG ++  + L V  A    +S  E
Sbjct: 85  NGYELNGRQLRVDSAAGGERSADE 108


>gi|298248970|ref|ZP_06972774.1| RNP-1 like RNA-binding protein [Ktedonobacter racemifer DSM 44963]
 gi|297546974|gb|EFH80841.1| RNP-1 like RNA-binding protein [Ktedonobacter racemifer DSM 44963]
          Length = 104

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 44/76 (57%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           R++VGGLPY  TE  + +L E  G +    ++ DR+TG SKG+GF        T  A   
Sbjct: 2   RIYVGGLPYQSTEQDLIQLFEQIGQVTSATVITDRETGRSKGFGFVEMSSDDETRAAIEQ 61

Query: 356 LNGLKMGDKTLTVRRA 371
           LNG  +GD+T+TV  A
Sbjct: 62  LNGSTLGDRTITVNEA 77


>gi|291403539|ref|XP_002718108.1| PREDICTED: RNA binding motif protein 23 isoform 1 [Oryctolagus
           cuniculus]
          Length = 410

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 175 KGIAYVEFCDIQAVPLAIGLTGQRLLGVPIMVQASQAEKNRLAA---------------- 218

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 219 -MANNLQKGSGGPLRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 277

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 278 TFSDSECGRRALEQLNGFELAGRPMRVGHVT 308


>gi|148231281|ref|NP_001085808.1| RNA binding motif protein 23 [Xenopus laevis]
 gi|49118375|gb|AAH73374.1| MGC80803 protein [Xenopus laevis]
          Length = 416

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 47/154 (30%), Positives = 66/154 (42%), Gaps = 17/154 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   +     A+ L G    GV + V+        LAA       S NL     
Sbjct: 194 KGIAYVEFCEIHSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAAM------SNNLQ---- 243

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
                   G  GP R++VG L +  TE  ++ + E FG +    L+K+ DTG SKG+GF 
Sbjct: 244 -------RGNFGPMRLYVGSLHFNITEEMLRGIFEPFGKIENIQLLKEPDTGRSKGFGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRATASG 375
            + D      A   LNG ++  K + V   T  G
Sbjct: 297 TFTDAECARRALEQLNGFELAGKPMKVGHVTGGG 330


>gi|156543304|ref|XP_001603981.1| PREDICTED: RNA-binding motif protein, X-linked 2-like [Nasonia
           vitripennis]
          Length = 139

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 30/72 (41%), Positives = 45/72 (62%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           +F+GGLPY  TE  +  +   +G +   +LV+D+DTG  KGYGF  Y+D   T +A   L
Sbjct: 36  IFIGGLPYDLTEGDVIAVFSQYGEIVNINLVRDKDTGKQKGYGFLCYEDQRSTILAVDNL 95

Query: 357 NGLKMGDKTLTV 368
           NG+K+  +T+ V
Sbjct: 96  NGIKILGRTIRV 107


>gi|426232776|ref|XP_004010396.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Ovis aries]
          Length = 447

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 206 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAAT--------------- 250

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
             AS    G+ GP R++VG L +  TE  ++ + E FG +    L+KD +TG SKGYGF 
Sbjct: 251 --ASNLQKGSGGPVRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSETGCSKGYGFI 308

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + +   T
Sbjct: 309 TFSDSECARRALEQLNGFELAGRPMRIGHVT 339


>gi|426232774|ref|XP_004010395.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Ovis aries]
          Length = 463

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 222 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAAT--------------- 266

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
             AS    G+ GP R++VG L +  TE  ++ + E FG +    L+KD +TG SKGYGF 
Sbjct: 267 --ASNLQKGSGGPVRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSETGCSKGYGFI 324

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + +   T
Sbjct: 325 TFSDSECARRALEQLNGFELAGRPMRIGHVT 355


>gi|358254188|dbj|GAA54213.1| cleavage stimulation factor subunit 2 [Clonorchis sinensis]
          Length = 437

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 42/72 (58%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           +FVG +PY  TE ++ EL    G + GF LV DR++G  KGYGFC Y +PA+   A   L
Sbjct: 23  IFVGNIPYEATEEKLIELFGKAGPVIGFRLVYDRESGKPKGYGFCEYNNPAIAASALRNL 82

Query: 357 NGLKMGDKTLTV 368
             ++   + L +
Sbjct: 83  QNIEFNGRPLRI 94


>gi|348676634|gb|EGZ16451.1| hypothetical protein PHYSODRAFT_314245 [Phytophthora sojae]
          Length = 449

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 90/373 (24%), Positives = 152/373 (40%), Gaps = 70/373 (18%)

Query: 174 ARRVYVGGLPPLANEQAIATFFSQVMTAIGGNSAGPGDAVVNVYINHE------KKFAFV 227
           ARR+Y+G L     E+ I + F+            P  A+ ++ ++ E      K F F+
Sbjct: 119 ARRLYIGNLYYDLKEEDIRSAFA------------PFGAIHSIDLSLEPGASRSKGFCFL 166

Query: 228 EMRTVEEASNAM-ALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAVGLASG 286
           E   V  A +A+  L+G      A+RV RP             G  +PN +L+   +   
Sbjct: 167 EYEDVLAAESAVQVLNGTPLANRAMRVGRPHR-----------GNTNPNDSLS---IGQE 212

Query: 287 AIGGAEGPDR-VFVGGLPYYFTETQIKELLESFGTLHGFDL--VKDRDTGNSKGYGFCVY 343
           AI     P + +++  +        ++ +   FG +    +  V   ++G  +GYGF  +
Sbjct: 213 AIKNV--PTKCIYIANVRVELNSQHLESIFSPFGAIRSSVMAAVSPLESGVHRGYGFMEF 270

Query: 344 QDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHIAIQKMALQTSG 403
            + +    A   +NG ++  + L V +A+ +          LA +Q  +        T G
Sbjct: 271 VEESCAASAIQHMNGFELAGQPLKVGKASEAAMLIN-----LATSQDKVVRDGPGATTDG 325

Query: 404 MNTL---GGGMSLFGETLAK------------VLCLTEAITADALADDEEYEEILEDMRE 448
            N L       + F E   +             LCL   +    + ++ E E     +R 
Sbjct: 326 ANGLVPEPKKTATFAEDDVEGVKDVADGDDKCCLCLMNLVNRGEVDEELEDE-----VRG 380

Query: 449 ECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYDAVGCATAKNALSGRKFGGNTVNAFY 508
           ECGK+G +  V I        E     +VF+ + +A G + AK AL GR FGGN V A Y
Sbjct: 381 ECGKFGNVNKVEI-------HELADHVRVFVLFDEAFGASKAKQALHGRFFGGNQVQAHY 433

Query: 509 YPEDKYFNKDYSA 521
           YP  +   + Y++
Sbjct: 434 YPLRELEQQRYTS 446



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/116 (25%), Positives = 53/116 (45%)

Query: 277 NLAAVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSK 336
           +L ++ L S  +   +   R+++G L Y   E  I+     FG +H  DL  +     SK
Sbjct: 102 SLLSLNLPSAKVSPNDLARRLYIGNLYYDLKEEDIRSAFAPFGAIHSIDLSLEPGASRSK 161

Query: 337 GYGFCVYQDPAVTDIACAALNGLKMGDKTLTVRRATASGQSKTEQESILAQAQQHI 392
           G+ F  Y+D    + A   LNG  + ++ + V R      +  +  SI  +A +++
Sbjct: 162 GFCFLEYEDVLAAESAVQVLNGTPLANRAMRVGRPHRGNTNPNDSLSIGQEAIKNV 217


>gi|341882558|gb|EGT38493.1| hypothetical protein CAEBREN_09163 [Caenorhabditis brenneri]
          Length = 757

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 61/102 (59%), Gaps = 10/102 (9%)

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           + V+ L   +T D   D +EY E   ++REECGK+GT+++VVI         + GV K+F
Sbjct: 665 SSVIVLRNMVTPD---DIDEYLE--GEIREECGKFGTVLDVVIAN-----FASSGVVKIF 714

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           ++Y D++    AK AL GR FGGN+V A  Y +  + + DY+
Sbjct: 715 VKYADSMQVDRAKAALDGRFFGGNSVKAEAYDQILFDHADYT 756



 Score = 45.1 bits (105), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 19/73 (26%), Positives = 38/73 (52%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           R++VG + +   E  +++  + FG +   ++  D  TG+ K + F  Y+ P    +A  +
Sbjct: 103 RIYVGSISFEIREDMLRKAFDPFGPIKSINMSWDPATGHHKTFAFVEYEIPEAALLAQES 162

Query: 356 LNGLKMGDKTLTV 368
           +NG  +G + L V
Sbjct: 163 MNGQMLGGRNLKV 175


>gi|312371125|gb|EFR19385.1| hypothetical protein AND_22609 [Anopheles darlingi]
          Length = 377

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +KE+    G +    LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 18  VFVGNIPYDATEEALKEIFCEVGLVMSMKLVYDRETGKPKGYGFCEYKDKETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG   G + L V  A    +S+ E  ++L
Sbjct: 78  NGYVFGGRPLRVDNACTE-KSRLEMAALL 105


>gi|410989439|ref|XP_004000969.1| PREDICTED: RNA-binding motif protein, X-linked 2 [Felis catus]
          Length = 507

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 60/129 (46%), Gaps = 22/129 (17%)

Query: 262 TLAAALGPGQPSPNLNLAAV--------------------GLASGAIGGAEGPDR--VFV 299
            LAA  GP +  P+L  A +                    G+A      +E  D   +F+
Sbjct: 162 CLAAERGPAEMKPHLTSACLCSPLTKVKLINELNEREVQLGVADKVSWHSEYKDSAWIFL 221

Query: 300 GGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGL 359
           GGLPY  TE  I  +   +G +   +LV+D+ TG SKG+ F  Y+D   T +A    NG+
Sbjct: 222 GGLPYELTEGDIICVFSQYGEIVNINLVRDKKTGKSKGFCFLCYEDQRSTVLAVDNFNGI 281

Query: 360 KMGDKTLTV 368
           K+  +T+ V
Sbjct: 282 KIKGRTIRV 290


>gi|449665135|ref|XP_002159315.2| PREDICTED: cleavage stimulation factor subunit 2-like [Hydra
           magnipapillata]
          Length = 413

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 47/89 (52%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  ++ Q+K++    G +  F LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 28  VFVGNIPYEASDDQLKDIFSQAGPVLSFRLVYDRETGKPKGYGFCEYKDSETAQSAMRNL 87

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG ++  + L V  A +   +  E    L
Sbjct: 88  NGTEIHGRQLRVDSAASQKGNGVEDPKAL 116


>gi|395503034|ref|XP_003755878.1| PREDICTED: probable RNA-binding protein 23 [Sarcophilus harrisii]
          Length = 451

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 219 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 262

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKG+GF 
Sbjct: 263 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDPDTGRSKGFGFL 321

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 322 TFSDSECARRALEQLNGFELAGRPMRVGHVT 352


>gi|126277396|ref|XP_001369125.1| PREDICTED: probable RNA-binding protein 23 isoform 1 [Monodelphis
           domestica]
          Length = 449

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 217 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 260

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKG+GF 
Sbjct: 261 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDPDTGRSKGFGFL 319

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 320 TFSDSECARRALEQLNGFELAGRPMRVGHVT 350


>gi|197102126|ref|NP_001124751.1| probable RNA-binding protein 23 [Pongo abelii]
 gi|55725769|emb|CAH89665.1| hypothetical protein [Pongo abelii]
          Length = 423

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 69/151 (45%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+         A+     +P+        
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQ---------ASQAEKNRPA-------- 233

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 234 AMANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|221044666|dbj|BAH14010.1| unnamed protein product [Homo sapiens]
          Length = 270

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 37  KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 80

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 81  -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 139

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 140 TFSDSECARRALEQLNGFELAGRPMRVGHVT 170


>gi|158292144|ref|XP_313699.3| AGAP004414-PA [Anopheles gambiae str. PEST]
 gi|157017295|gb|EAA09129.3| AGAP004414-PA [Anopheles gambiae str. PEST]
          Length = 390

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 47/89 (52%), Gaps = 1/89 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE  +KE+    G +    LV DR+TG  KGYGFC Y+D      A   L
Sbjct: 18  VFVGNIPYDATEEALKEIFCEVGLVLSMKLVYDRETGKPKGYGFCEYKDKETALSAMRNL 77

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESIL 385
           NG   G + L V  A    +S+ E  ++L
Sbjct: 78  NGYVFGGRPLRVDNACTE-KSRMEMAALL 105


>gi|126277398|ref|XP_001369153.1| PREDICTED: probable RNA-binding protein 23 isoform 2 [Monodelphis
           domestica]
          Length = 433

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 67/151 (44%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 201 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 244

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKG+GF 
Sbjct: 245 -MANNLQKGSGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDPDTGRSKGFGFL 303

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 304 TFSDSECARRALEQLNGFELAGRPMRVGHVT 334


>gi|119586631|gb|EAW66227.1| RNA binding motif protein 23, isoform CRA_f [Homo sapiens]
          Length = 269

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 66/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 37  KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 80

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYGF 
Sbjct: 81  -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGFI 139

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 140 TFSDSECARRALEQLNGFELAGRPMRVGHVT 170


>gi|213408745|ref|XP_002175143.1| RNA-binding protein [Schizosaccharomyces japonicus yFS275]
 gi|212003190|gb|EEB08850.1| RNA-binding protein [Schizosaccharomyces japonicus yFS275]
          Length = 308

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 38/72 (52%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE Q+ ++    GT+  F LV D +T   KGYGFC + DP     A   L
Sbjct: 9   VFVGNIPYDATEKQMADIFHQIGTVRSFKLVLDPETNQPKGYGFCEFHDPETAASAVRNL 68

Query: 357 NGLKMGDKTLTV 368
           N    G + L V
Sbjct: 69  NNFPFGARKLRV 80


>gi|321460847|gb|EFX71885.1| hypothetical protein DAPPUDRAFT_93311 [Daphnia pulex]
          Length = 81

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 45/76 (59%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  +E Q+K +    G +  F +V+DR+TG S+G+GFC +Q P     A   L
Sbjct: 4   VFVGNIPYGVSEDQLKAIFSEAGPVVSFRIVQDRETGRSRGFGFCEFQSPDSAQTAMRNL 63

Query: 357 NGLKMGDKTLTVRRAT 372
           NG ++  ++L V  A 
Sbjct: 64  NGYELNGRSLRVDSAN 79


>gi|449460375|ref|XP_004147921.1| PREDICTED: zinc finger CCCH domain-containing protein 25-like
           [Cucumis sativus]
          Length = 395

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 51/89 (57%), Gaps = 2/89 (2%)

Query: 275 NLNLAAVGLASGAIGGAEGPDR--VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDT 332
           N   AA+G++  A   A+  D   VFVGG+PY  TE  +  +   +G +   +L++D+ T
Sbjct: 14  NSQEAALGISEEASWHAKYKDSAYVFVGGIPYDLTEGDLLAVFAQYGEIVDVNLIRDKGT 73

Query: 333 GNSKGYGFCVYQDPAVTDIACAALNGLKM 361
           G SKGY F  Y+D   T++A   LNG ++
Sbjct: 74  GKSKGYAFVAYEDQRSTNLAVDNLNGAQI 102


>gi|349604477|gb|AEQ00017.1| RNA-binding protein 39-like protein, partial [Equus caballus]
          Length = 374

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 64/151 (42%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 177 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 219

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 220 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 279

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 280 TFSDSECAKKALEQLNGFELAGRPMKVGHVT 310


>gi|190402270|gb|ACE77680.1| RNA binding motif protein 39 isoform a (predicted) [Sorex araneus]
          Length = 435

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 64/151 (42%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVT 327


>gi|281351269|gb|EFB26853.1| hypothetical protein PANDA_007127 [Ailuropoda melanoleuca]
          Length = 334

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 56/117 (47%), Gaps = 5/117 (4%)

Query: 254 RRPTDYNPTLAAALGPGQPSPNLNLAAVGLASGAIGGAEGPDR--VFVGGLPYYFTETQI 311
           R P   NP     L       N     +G+A      +E  D   +F+GGLPY  TE  I
Sbjct: 5   REPKKMNPLTKVKL---INELNEREVQLGVADKVSWHSEYKDSAWIFLGGLPYELTEGDI 61

Query: 312 KELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAALNGLKMGDKTLTV 368
             +   +G +   +LV+D+ TG SKG+ F  Y+D   T +A    NG+K+  +T+ V
Sbjct: 62  ICVFSQYGEIVNINLVRDKKTGKSKGFCFLCYEDQRSTILAVDNFNGIKIKGRTIRV 118


>gi|441628930|ref|XP_003275701.2| PREDICTED: ELAV-like protein 3 [Nomascus leucogenys]
          Length = 364

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 92/206 (44%), Gaps = 16/206 (7%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           + V  LP   T+ + K L  S G +    LV+D+ TG S GYGF  Y DP   D A   L
Sbjct: 157 LIVNYLPQNMTQDEFKSLFGSIGDIESCKLVRDKITGQSLGYGFVNYSDPNDADKAINTL 216

Query: 357 NGLKMGDKTLTVRRATASGQSKT--EQESILAQAQQHIAIQKMALQTSGMNTLGGGMSLF 414
           NGLK+  KT+ V  + ++  + T  E +++L  A        +A+   GM+ L G     
Sbjct: 217 NGLKLQTKTIKVGASFSNPPNSTTLELDNLLNMAYGVKRFSPIAID--GMSGLAGVGLSG 274

Query: 415 GETLAKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGV 474
           G   A        ++ +A   DE        + +  G +G + NV + R D    +  G 
Sbjct: 275 GAAGAGWCIFVYNLSPEA---DESV------LWQLFGPFGAVTNVKVIR-DFTTNKCKGF 324

Query: 475 GKVFLEYYDAVGCATAKNALSGRKFG 500
           G V +  YD    A A  +L+G + G
Sbjct: 325 GFVTMTNYDEAAMAIA--SLNGYRLG 348



 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 39/77 (50%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           +FV  L     E+ + +L   FG +    +++D  T   KG+GF    +     +A A+L
Sbjct: 283 IFVYNLSPEADESVLWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVTMTNYDEAAMAIASL 342

Query: 357 NGLKMGDKTLTVRRATA 373
           NG ++G++ L V   T+
Sbjct: 343 NGYRLGERVLQVSFKTS 359


>gi|341895702|gb|EGT51637.1| CBN-RNP-6 protein [Caenorhabditis brenneri]
          Length = 757

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 10/102 (9%)

Query: 419 AKVLCLTEAITADALADDEEYEEILEDMREECGKYGTLVNVVIPRPDQNGGETPGVGKVF 478
           + V+ L   +T D   D +EY E   ++REECGK+GT+++VVI         + GV K+F
Sbjct: 665 SSVIVLRNMVTPD---DIDEYLE--GEIREECGKFGTVLDVVIAN-----FASSGVVKIF 714

Query: 479 LEYYDAVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           ++Y D++    AK AL GR FGGN V A  Y +  + + DY+
Sbjct: 715 VKYADSMQVDRAKAALDGRFFGGNIVKAEAYDQILFDHADYT 756



 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 19/73 (26%), Positives = 38/73 (52%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           R++VG + +   E  +++  + FG +   ++  D  TG+ K + F  Y+ P    +A  +
Sbjct: 103 RIYVGSISFEIREDMLRKAFDPFGPIKSINMSWDPATGHHKTFAFVEYEIPEAALLAQES 162

Query: 356 LNGLKMGDKTLTV 368
           +NG  +G + L V
Sbjct: 163 MNGQMLGGRNLKV 175


>gi|157137809|ref|XP_001664044.1| RNA-binding protein [Aedes aegypti]
 gi|108869641|gb|EAT33866.1| AAEL013869-PA [Aedes aegypti]
          Length = 399

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 52/93 (55%), Gaps = 1/93 (1%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           VFVG +PY  TE ++K++    G +    LV DR++G  KGYGFC Y+D      A   L
Sbjct: 16  VFVGNIPYEATEEKLKDIFCEVGPVISLKLVFDRESGKPKGYGFCEYKDQETALSAMRNL 75

Query: 357 NGLKMGDKTLTVRRATASGQSKTEQESILAQAQ 389
           NG ++G + L V  A  + +S+ E  ++L   Q
Sbjct: 76  NGYEIGGRALRVDNA-CTEKSRMEMAALLQGPQ 107


>gi|27734072|ref|NP_775552.1| RNA-binding motif protein, X-linked 2 [Mus musculus]
 gi|61230302|sp|Q8R0F5.1|RBMX2_MOUSE RecName: Full=RNA-binding motif protein, X-linked 2
 gi|20071694|gb|AAH26976.1| RNA binding motif protein, X-linked 2 [Mus musculus]
 gi|26345092|dbj|BAC36195.1| unnamed protein product [Mus musculus]
 gi|74207503|dbj|BAE40004.1| unnamed protein product [Mus musculus]
          Length = 326

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 43/72 (59%)

Query: 297 VFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 356
           +FVGGLPY  TE  I  +   +G +   +LV+D+ TG SKG+ F  Y+D   T +A    
Sbjct: 38  IFVGGLPYELTEGDIICVFSQYGEIVNINLVRDKKTGKSKGFCFLCYEDQRSTVLAVDNF 97

Query: 357 NGLKMGDKTLTV 368
           NG+K+  +T+ V
Sbjct: 98  NGIKIKGRTIRV 109


>gi|119596572|gb|EAW76166.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_e [Homo
           sapiens]
          Length = 445

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 64/151 (42%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   V     A+ L G    GV + V                 Q S      A 
Sbjct: 194 KGIAYVEFVDVSSVPLAIGLTGQRVLGVPIIV-----------------QASQAEKNRAA 236

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYGF 
Sbjct: 237 AMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYGFI 296

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 297 TFSDSECAKKALEQLNGFELAGRPMKVGHVT 327


>gi|48146631|emb|CAG33538.1| RNPC4 [Homo sapiens]
          Length = 423

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 65/151 (43%), Gaps = 17/151 (11%)

Query: 222 KKFAFVEMRTVEEASNAMALDGIIFEGVAVRVRRPTDYNPTLAAALGPGQPSPNLNLAAV 281
           K  A+VE   ++    A+ L G    GV + V+        LAA                
Sbjct: 191 KGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLAA---------------- 234

Query: 282 GLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFC 341
            +A+    G  GP R++VG L +  TE  ++ + E FG +    L+KD DTG SKGYG  
Sbjct: 235 -MANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLMKDSDTGRSKGYGLI 293

Query: 342 VYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
            + D      A   LNG ++  + + V   T
Sbjct: 294 TFSDSECARRALEQLNGFELAGRPMRVGHVT 324


>gi|17510025|ref|NP_491176.1| Protein RNP-6, isoform b [Caenorhabditis elegans]
 gi|373220165|emb|CCD72565.1| Protein RNP-6, isoform b [Caenorhabditis elegans]
          Length = 749

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 57/97 (58%), Gaps = 6/97 (6%)

Query: 425 TEAITADALADDEEYEEILE-DMREECGKYGTLVNVVIPRPDQNGGETPGVGKVFLEYYD 483
           +  I    +   ++ +E LE ++REECGKYG +++VVI         + G+ K+F++Y D
Sbjct: 657 SNVIVLRNMVTPQDIDEFLEGEIREECGKYGNVIDVVI-----ANFASSGLVKIFVKYSD 711

Query: 484 AVGCATAKNALSGRKFGGNTVNAFYYPEDKYFNKDYS 520
           ++    AK AL GR FGGNTV A  Y +  + + DY+
Sbjct: 712 SMQVDRAKAALDGRFFGGNTVKAEAYDQILFDHADYT 748



 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 19/73 (26%), Positives = 37/73 (50%)

Query: 296 RVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAA 355
           R++VG + +   E  ++   + FG +   ++  D  TG+ K + F  Y+ P    +A  +
Sbjct: 103 RIYVGSISFEIREDMLRRAFDPFGPIKSINMSWDPATGHHKTFAFVEYEVPEAALLAQES 162

Query: 356 LNGLKMGDKTLTV 368
           +NG  +G + L V
Sbjct: 163 MNGQMLGGRNLKV 175


>gi|328850276|gb|EGF99443.1| hypothetical protein MELLADRAFT_31912 [Melampsora larici-populina
           98AG31]
          Length = 79

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 33/77 (42%), Positives = 46/77 (59%), Gaps = 9/77 (11%)

Query: 433 LADDEEYEEILEDMREECGKYGTLVNVVIPRPDQN---------GGETPGVGKVFLEYYD 483
           L DDEEY+EILED+ EEC KY  + +V IPRP +N              G+GKVF+++  
Sbjct: 2   LVDDEEYKEILEDIIEECSKYVKIEDVKIPRPKKNQKGRIHSKASESVEGLGKVFIKFEQ 61

Query: 484 AVGCATAKNALSGRKFG 500
              C  A +A++GR+F 
Sbjct: 62  IEDCGQALSAIAGRQFA 78


>gi|119596573|gb|EAW76167.1| RNA-binding region (RNP1, RRM) containing 2, isoform CRA_f [Homo
           sapiens]
          Length = 423

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 48/93 (51%)

Query: 280 AVGLASGAIGGAEGPDRVFVGGLPYYFTETQIKELLESFGTLHGFDLVKDRDTGNSKGYG 339
           A  +A+    G+ GP R++VG L +  TE  ++ + E FG +    L+ D +TG SKGYG
Sbjct: 235 AAAMANNLQKGSAGPMRLYVGSLHFNITEDMLRGIFEPFGRIESIQLMMDSETGRSKGYG 294

Query: 340 FCVYQDPAVTDIACAALNGLKMGDKTLTVRRAT 372
           F  + D      A   LNG ++  + + V   T
Sbjct: 295 FITFSDSECAKKALEQLNGFELAGRPMKVGHVT 327


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.134    0.391 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,788,772,934
Number of Sequences: 23463169
Number of extensions: 397342696
Number of successful extensions: 1680132
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 13389
Number of HSP's successfully gapped in prelim test: 12927
Number of HSP's that attempted gapping in prelim test: 1410265
Number of HSP's gapped (non-prelim): 158424
length of query: 521
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 374
effective length of database: 8,910,109,524
effective search space: 3332380961976
effective search space used: 3332380961976
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 79 (35.0 bits)