BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002280
(943 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359477752|ref|XP_002281833.2| PREDICTED: uncharacterized protein LOC100266510 [Vitis vinifera]
Length = 895
Score = 763 bits (1971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/963 (48%), Positives = 604/963 (62%), Gaps = 88/963 (9%)
Query: 1 MGRSGRKKEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSN 60
M RS R KEK+GK E L + EG+AARTRPFSFDEIM+RRK K L ++K+ A + N
Sbjct: 1 MSRSNRPKEKYGKSTELYLDNFKEGTAARTRPFSFDEIMLRRKNKKLYGDVKDGAGEEGN 60
Query: 61 IPGDCTIENVSDDHKS--GRHHGHKKSPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQ 118
I ++NVSD ++S G H PG H+SE++VKV S KK++N K +
Sbjct: 61 ISRKDIVKNVSDCYESDKGYKHNEDSFPGAINHSSEDFVKVSSRKKDENTSMKEGNLGNV 120
Query: 119 RDGENRDLETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKET 178
+D E+ + E KLK AK N T D +KE
Sbjct: 121 KDKESHNSEDKLK----AKPNKGMT-----------------------------DKSKEG 147
Query: 179 SSKPNVKSEKLIPSQGRSHNQSIVDSRDEATKKHSRDLTGKDRHADKSGEKSERESKRKH 238
V K I + R + S E KK SRD GKDR+AD+S KSE+ESKRKH
Sbjct: 148 KINQRVHGRKKIDERSRRSSDD---SESEPEKKFSRDSVGKDRYADRS-RKSEKESKRKH 203
Query: 239 RNRDDEKNRDKSAAKKVDLGKGHDLKVSERKEKKESPKSRHGNSRLKRRRSRSRERE-DR 297
R +DEKNR++++ KK D GK H+ + +RKE++ESP SR+ SR KRRRSRSRER+ DR
Sbjct: 204 RTGEDEKNRERNSMKKHDPGKRHESEFLDRKERRESPPSRYEESRPKRRRSRSRERDKDR 263
Query: 298 NRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNGLSGHYRRHDG 357
++RS SLSPRAQKRTS++ REH L HS K SGRQHSD D +R++ NG S H+RRH G
Sbjct: 264 DKRSSSLSPRAQKRTSHHGREHAELSLHSLKDRSGRQHSDADRNRISNNGSSSHFRRHGG 323
Query: 358 STSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVET-----YSVPSNVHTSNQ 412
S SGLGGYSPRKRRTEAA KTPSP NRSPEKKSA WD+ P T SV S++
Sbjct: 324 SASGLGGYSPRKRRTEAAIKTPSPTNRSPEKKSAGWDLPPSRTDGMNAGSVLSSLQVLKP 383
Query: 413 AASSNAHEMVS----SDPVTSTTQK-PLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLC 467
SSNA E+ S + PVT+TT K PL I A NVS+DS+QLTQ+ RPMRRL
Sbjct: 384 TVSSNADELPSAVPVAVPVTATTAKPPLPRIYSDAVSKNKNVSIDSIQLTQATRPMRRLY 443
Query: 468 VENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAA 527
VENLP+S+SEKALME LNNFLLSSG+ HV G+ PCI C+I +EKGQA VEFLT EDASAA
Sbjct: 444 VENLPVSSSEKALMECLNNFLLSSGINHVQGTPPCISCIIHKEKGQALVEFLTPEDASAA 503
Query: 528 LCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED 587
L DG SFSGSILKI+RPK+F +MEI AFGPLKAY F+VNED
Sbjct: 504 LSFDGISFSGSILKIRRPKDF------------------LMEIAAAFGPLKAYRFQVNED 545
Query: 588 HEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHA 647
EPCAF+EYVDQ VT KA AGLNG+K+GGQVLT VQA+ + M+N+GN PF+GIP+HA
Sbjct: 546 LGEPCAFLEYVDQSVTLKACAGLNGMKLGGQVLTVVQAIPNALAMENTGNLPFYGIPEHA 605
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNI 707
PLL++PT+VLKLKNV NP+ SSLSE E+EE+LED+RLEC RFG+VKSVN+VKY +S++
Sbjct: 606 KPLLERPTQVLKLKNVVNPDDLSSLSEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHV 665
Query: 708 STIQACEGNENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAG 767
ST++ E +NT G NL D + K E L TD+ S + ++ D K++ E
Sbjct: 666 STLEVYEAADNT-----GSNLGCDGNSMKAETLGGGTDNGSSDISGIKPPTDVKDLKEVD 720
Query: 768 EV---NNVKDNRPASGTMGDEPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDS 824
EV N++ D++ + + +E + +D++ AV+ S+ + +G+P Q+N +K
Sbjct: 721 EVVERNSISDDKSLTDLIKNELCEPSHIDSNTAVKEPGCPDGSDDIPRGLPDQLNNMKHE 780
Query: 825 PCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEVNGNSEAFTGASNEM----GMQSSAVE 880
+DK IQ + + + + +E+ N G S E+ G++S
Sbjct: 781 VELRNDKAADVIQEDFIIKNKLMTVEEETN--------RKLLGTSAELDSSPGIKSDFTG 832
Query: 881 NGDNENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRAR 940
D+E + +FE GCV VEY R EASCMAAH LH R FDDR+V V Y+ L+LYR +
Sbjct: 833 KNDSEKGLCDLDDMFEVGCVLVEYGRTEASCMAAHCLHGRYFDDRVVVVGYVALDLYRMK 892
Query: 941 FSK 943
F +
Sbjct: 893 FPR 895
>gi|118489922|gb|ABK96758.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 787
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/786 (55%), Positives = 533/786 (67%), Gaps = 75/786 (9%)
Query: 204 SRDEATKKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDEKNRDKSAAKKVDLGKGHDL 263
S EA KKHSR++ KDRH DKS KSERE K K+RN D+K+RD++AAKK DLGKGH L
Sbjct: 31 SESEAVKKHSREVQ-KDRHVDKSRGKSERERKEKYRNGIDDKSRDRNAAKKHDLGKGHHL 89
Query: 264 KVSERKEKKESPKSRHGNSRLKRRRSRSREREDRNRRSLSLSPRAQKRTSYYDREHEGLP 323
+ SERKE+KES KS H RLKRRRSRSRE EDRNRRS+SLSPRA KR SY+ REH L
Sbjct: 90 ETSERKERKESSKSHHEELRLKRRRSRSREHEDRNRRSISLSPRAHKRGSYHKREHVELS 149
Query: 324 SHSTKISSGRQHSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPIN 383
HS K SGRQ SD +++++ + S H RRH G SGLGGYSPRKR+TEAA KTPSP
Sbjct: 150 LHSVKERSGRQQSDAENNQLANSSSSRHQRRHGGFASGLGGYSPRKRKTEAAVKTPSPAK 209
Query: 384 RSPEKKSAKWDVAPVETYSV-----PSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
RSPEKKSAKWD+AP ET +V SN + NQ ASSN HE+VS+ PV S KP +G+
Sbjct: 210 RSPEKKSAKWDLAPEETNNVFPAVILSNFQSPNQTASSNIHEVVSAVPVVSAPMKPPSGV 269
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
S+S+ VS +S+QLTQ+ P+RRL +EN+P SASEKA+M+ LNNFL+SSGV H+ G
Sbjct: 270 SLSSLSTATKVSTESIQLTQATHPIRRLYMENIPASASEKAVMDCLNNFLISSGVHHIQG 329
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------ 552
+ PCI C+ Q+EKGQA VEFLT EDASAAL DG SFSGSI+K++RPK+F+EVA
Sbjct: 330 TQPCISCIRQKEKGQALVEFLTPEDASAALSFDGRSFSGSIIKVRRPKDFIEVATGELEK 389
Query: 553 ------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAF 594
IFIGGIS+ LSSKM+MEI AFGPLKAY FE +D +EP AF
Sbjct: 390 SAAAIDAIGDIVKDSPHKIFIGGISKVLSSKMLMEIASAFGPLKAYQFENRKDPDEPFAF 449
Query: 595 IEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKP 654
+EY D+ VT KA AGLNG+K+GGQV+TA+QAV + S + GN F I +HA LL+KP
Sbjct: 450 LEYADESVTFKACAGLNGMKLGGQVITAIQAVPNASSSGSDGNSQFGQISQHAKALLEKP 509
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACE 714
TEVLKLKNVF+ E SSLS EVEEVLEDVRLECARFGSVKS+NV+KY IST ++CE
Sbjct: 510 TEVLKLKNVFDSESLSSLSNTEVEEVLEDVRLECARFGSVKSINVIKYAAITISTSKSCE 569
Query: 715 GNENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNNVKD 774
N++T SA Q+L D TN K + D K + E N++ D
Sbjct: 570 FNDDTVSAEATQSLGCDGTNPKTRNIRGSIDQKFM------------------EGNSIGD 611
Query: 775 NRPASGTMGDEPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTC 834
++PAS M DEP Q ++D+DMAV+ A S+S+ SQ P V+ ++ DKVT
Sbjct: 612 DKPASDVMEDEPCQPGQVDSDMAVQDLACKSSSD--SQEPPQDVSD------SNVDKVTD 663
Query: 835 NIQLEHMSEENKSSAKED----------------LNLEEVNGNSE-AFTGASNEMGMQSS 877
+I++E + ENKS+A ED LNLEEV+G+ E AF S M M+ +
Sbjct: 664 DIEIEEVDAENKSTAGEDLNLKEVGDNKLMAGEELNLEEVSGDVEKAFVNDS--MEMKPN 721
Query: 878 AVENGDNENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLY 937
++E GD + QD + G IFE GCVFVE+ R E +CMAAH LH RLFDDR V VEY+PL++Y
Sbjct: 722 SIEKGDCKEQDCSLGLIFERGCVFVEFRRTEGACMAAHCLHGRLFDDRAVVVEYVPLDIY 781
Query: 938 RARFSK 943
ARF K
Sbjct: 782 LARFPK 787
>gi|255575831|ref|XP_002528813.1| splicing factor u2af large subunit, putative [Ricinus communis]
gi|223531725|gb|EEF33547.1| splicing factor u2af large subunit, putative [Ricinus communis]
Length = 844
Score = 696 bits (1797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/800 (50%), Positives = 524/800 (65%), Gaps = 47/800 (5%)
Query: 150 LKDRSPGNEKDRGNRGSDTKLKADVTKETSSKPNVKSEKLIPSQGRSHNQSIVDSRDEAT 209
+KD KDR + S+ KLK++V ++ KP ++ + I + +S Q + ++E
Sbjct: 86 MKDEYITKSKDREIQDSEVKLKSEVHRDL--KPKGRAGEEIYDRRKSDEQRSNNVKNEDL 143
Query: 210 KKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDEKNRDKSAAKKVDLGKGHDLKVSERK 269
KKH R LT + RH D S SERE K K+R DEKN+D+ +K DLGKGHD + ++K
Sbjct: 144 KKHPRHLTERVRHEDGSRGVSEREDKSKYRKGVDEKNKDRLPTRKHDLGKGHDSENLDKK 203
Query: 270 EKKESPKSRHGNSRLKRRRSRSREREDRNRRSLSLSPRAQKRTSYYDREHEGLPS-HSTK 328
EK E KS + +LK RRSRSREREDR RRS+S PR++K SY+DREH G PS H K
Sbjct: 204 EKDELSKSHYEEIKLKSRRSRSREREDRKRRSISPLPRSRKHASYHDREH-GEPSLHFLK 262
Query: 329 ISSGRQHSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSPEK 388
SG+QHSDID +++T NG +GHY+RH GS S LGGYSPRKRR+EAAA+TPSP SPEK
Sbjct: 263 GKSGQQHSDIDRNKITNNGSTGHYKRHGGSASRLGGYSPRKRRSEAAARTPSPTKHSPEK 322
Query: 389 KSAKWDVAP-----VETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSAS 443
K AKWD+AP + SVP SNQ AS NA VS+ PV S KPL+G+S +
Sbjct: 323 KKAKWDLAPEGADSTFSVSVPPIFKLSNQIASLNARATVSAVPVASIPVKPLSGVSSNIL 382
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
L N ++DSVQLTQ+ RPMRRL VEN+P ASEKA++E LNN L+SSGV H+ G+ PCI
Sbjct: 383 LTNKNDTIDSVQLTQATRPMRRLYVENIPAEASEKAVLERLNNLLISSGVNHIQGTQPCI 442
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLS 563
C+I +EKGQA VEFLT EDASAAL DG FSGS +KI+RPK+F
Sbjct: 443 SCIIHKEKGQALVEFLTPEDASAALSFDGSYFSGSTIKIRRPKDF--------------- 487
Query: 564 SKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAV 623
+MEI FGPLKAYHFE +D PCAF+EY DQ VT +A AGLNG+K+GGQV++AV
Sbjct: 488 ---IMEIASTFGPLKAYHFENIDDVNGPCAFVEYADQSVTFRACAGLNGMKLGGQVISAV 544
Query: 624 QAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
Q + + S ++ G PF+G+P+ A PLL KPT+VLKLKN+F+PE SLS +E+EEVLED
Sbjct: 545 QVIPNASTLEIDGKQPFYGVPEQAKPLLDKPTQVLKLKNLFDPETLPSLSRIEIEEVLED 604
Query: 684 VRLECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGVGQNLTNDETNEKGERLEEV 743
VRLECARFG+VKSVNVV+ G I T +AC+ NE+ SAG QNL DETN + E+
Sbjct: 605 VRLECARFGTVKSVNVVRNGPIPIFTSEACKMNEDMDSAGPQQNLGGDETNAETEKTIGD 664
Query: 744 TDHKSIKNNELEILNDSKEVMEAGEVNNVKDNRPASGTMGDEPSQLCELDTDMAVEYQAR 803
H+ ++ N+ +D K V E N V+D++PA M DE SQL + D++MAVE +
Sbjct: 665 IHHEPVEAND---TDDDKPV----EGNGVEDDKPADDLMEDESSQLGQFDSNMAVENLSG 717
Query: 804 DSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEVNGNSE 863
D E + +P Q + +S C H KVT ++Q++ E+K +++L E
Sbjct: 718 DGVPE-PQEPIPIQQTSKDESDCLH-GKVTDDVQMKDTIAEHKLPIQQELK--------E 767
Query: 864 AFTGASNEMGMQSSAVENGDNENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFD 923
+FT N+ ++S A GD+E + + +IF P CVFVE+ R EASC+AAH LH RL+D
Sbjct: 768 SFT---NDHAVESDATGKGDHEEHNCDLSYIFYPSCVFVEFGRTEASCIAAHCLHGRLYD 824
Query: 924 DRIVAVEYIPLNLYRARFSK 943
R V V YIPL++YR+RF K
Sbjct: 825 GRTVTVGYIPLDVYRSRFPK 844
>gi|296083697|emb|CBI23686.3| unnamed protein product [Vitis vinifera]
Length = 882
Score = 692 bits (1785), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/958 (44%), Positives = 552/958 (57%), Gaps = 152/958 (15%)
Query: 16 EPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSNIPGDCTIENVSDDHK 75
E L + EG+AARTRPFSFDEIM+RRK K L ++K+ A + NI ++NVSD ++
Sbjct: 47 ELYLDNFKEGTAARTRPFSFDEIMLRRKNKKLYGDVKDGAGEEGNISRKDIVKNVSDCYE 106
Query: 76 S--GRHHGHKKSPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDLETKLKED 133
S G H PG H+SE++VKV S KK++N K + +D E+ + E KLK
Sbjct: 107 SDKGYKHNEDSFPGAINHSSEDFVKVSSRKKDENTSMKEGNLGNVKDKESHNSEDKLK-- 164
Query: 134 YVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETSSKPNVKSEKLIPSQ 193
AK N T D +KE V K I +
Sbjct: 165 --AKPNKGMT-----------------------------DKSKEGKINQRVHGRKKIDER 193
Query: 194 GRSHNQSIVDSRDEATKKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDEKNRDKSAAK 253
R + S E KK SRD GKDR+AD+S KSE+ESKRKHR +DEKNR++++ K
Sbjct: 194 SRRSSDD---SESEPEKKFSRDSVGKDRYADRS-RKSEKESKRKHRTGEDEKNRERNSMK 249
Query: 254 KVDLGKGHDLKVSERKEKKESPKSRHGNSRLKRRRSRSREREDRNRRSLSLSPRAQKRTS 313
K D GK RH + L R+ R
Sbjct: 250 KHDPGK------------------RHESEFLDRKERR----------------------- 268
Query: 314 YYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTE 373
E PS RQHSD D +R++ NG S H+RRH GS SGLGGYSPRKRRTE
Sbjct: 269 ------ESPPSR-------RQHSDADRNRISNNGSSSHFRRHGGSASGLGGYSPRKRRTE 315
Query: 374 AAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQK 433
AA KTPSP NRSPEKKSA WD+ P T + + SN+ S+ + + PVT+TT K
Sbjct: 316 AAIKTPSPTNRSPEKKSAGWDLPPSRTDGMNAGSVLSNELPSA----VPVAVPVTATTAK 371
Query: 434 P-LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSG 492
P L I A NVS+DS+QLTQ+ RPMRRL VENLP+S+SEKALME LNNFLLSSG
Sbjct: 372 PPLPRIYSDAVSKNKNVSIDSIQLTQATRPMRRLYVENLPVSSSEKALMECLNNFLLSSG 431
Query: 493 VQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA 552
+ HV G+ PCI C+I +EKGQA VEFLT EDASAAL DG SFSGSILKI+RPK+FV++
Sbjct: 432 INHVQGTPPCISCIIHKEKGQALVEFLTPEDASAALSFDGISFSGSILKIRRPKDFVDMT 491
Query: 553 -----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE 589
IFIGGISR LSS M+MEI AFGPLKAY F+VNED
Sbjct: 492 GVQEKLVAAPDAISDIVKDSPHKIFIGGISRALSSDMLMEIAAAFGPLKAYRFQVNEDLG 551
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP 649
EPCAF+EYVDQ VT KA AGLNG+K+GGQVLT VQA+ + M+N+GN PF+GIP+HA P
Sbjct: 552 EPCAFLEYVDQSVTLKACAGLNGMKLGGQVLTVVQAIPNALAMENTGNLPFYGIPEHAKP 611
Query: 650 LLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNIST 709
LL++PT+VLKLKNV NP+ SSLSE E+EE+LED+RLEC RFG+VKSVN+VKY +S++ST
Sbjct: 612 LLERPTQVLKLKNVVNPDDLSSLSEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVST 671
Query: 710 IQACEGNENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEV 769
++ E +NT G NL D + K E L TD+ SI E E
Sbjct: 672 LEVYEAADNT-----GSNLGCDGNSMKAETLGGGTDNGSID--------------EVVER 712
Query: 770 NNVKDNRPASGTMGDEPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHD 829
N++ D++ + + +E + +D++ AV+ S+ + +G+P Q+N +K +
Sbjct: 713 NSISDDKSLTDLIKNELCEPSHIDSNTAVKEPGCPDGSDDIPRGLPDQLNNMKHEVELRN 772
Query: 830 DKVTCNIQLEHMSEENKSSAKEDLNLEEVNGNSEAFTGASNEM----GMQSSAVENGDNE 885
DK IQ + + + + +E+ N G S E+ G++S D+E
Sbjct: 773 DKAADVIQEDFIIKNKLMTVEEETN--------RKLLGTSAELDSSPGIKSDFTGKNDSE 824
Query: 886 NQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
+ +FE GCV VEY R EASCMAAH LH R FDDR+V V Y+ L+LYR +F +
Sbjct: 825 KGLCDLDDMFEVGCVLVEYGRTEASCMAAHCLHGRYFDDRVVVVGYVALDLYRMKFPR 882
>gi|449458894|ref|XP_004147181.1| PREDICTED: uncharacterized protein LOC101213128 [Cucumis sativus]
Length = 910
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 388/984 (39%), Positives = 538/984 (54%), Gaps = 115/984 (11%)
Query: 1 MGRSGRKKEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSN 60
M G K+ + + + ++S++ SAARTRPFSF++IM+RRKTK + ++ DA
Sbjct: 1 MSTHGCSKQYSRRSKKQTRSNSDDESAARTRPFSFEDIMLRRKTKGSAATVEVDATS--- 57
Query: 61 IPGDCTIENVSDDHKSGRHHGHKKSPGVDVHT---SEEYVKVGSGKKEDNALTKN---VD 114
+D+ S R H K +DV EE K S +K++ + KN V
Sbjct: 58 ----------TDNRASERIFRHSKGSSLDVQNLSLEEESAKDSSRRKKEETVLKNNMVVR 107
Query: 115 SSRQRDGENRDLETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADV 174
S R L +KLK H R E R ++ G +D + D+
Sbjct: 108 SDRNNYESGLSLMSKLK--------HDRNENDE-----RQKYGQESLGWGKNDQSCRIDI 154
Query: 175 TKETSSKPN----VKSEKLIPSQGRSHNQSIV------DSRDEATKKHSR---DLTGKD- 220
ET+ + + K + S+G S +S D R+ +K+ S+ D KD
Sbjct: 155 ETETAKRHSRDTACKDRRQDHSRGESEKESKRKPQNGEDDRNRDSKRKSQNVDDEINKDG 214
Query: 221 -RHADKSGEKSERESKRKHRNRDDEKNRDKSAAKKVDLGKGHDLKVSERKEKKESPKSRH 279
R + + R SKRK +N DD++NRDK AKK D GK HDL+ ERKE S SR+
Sbjct: 215 KRKSQNGDDDKNRVSKRKSQNGDDDRNRDKYIAKKHDHGKHHDLENRERKEAVVSLTSRY 274
Query: 280 GNSRLKRRRSRSREREDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDID 339
+SRLKRR+ RS +RE ++RRS+SLSPR+ K ++ R+ E LP S SGR SD +
Sbjct: 275 EDSRLKRRQKRSPDRESKHRRSVSLSPRSHKHSTKLARQKE-LPLESHVKKSGRWRSDSE 333
Query: 340 SSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVE 399
+ N + YRRH GSTSGLGGYSPRKRRTE+A KTPSP+ R+PE+K+ D+ P E
Sbjct: 334 RTGDFTNTSNSQYRRHSGSTSGLGGYSPRKRRTESAVKTPSPL-RTPEEKNEVLDLPPTE 392
Query: 400 TY-----SVPSNVHTSNQAASSN-AHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDS 453
SV SN SN S +++ +S K L+ +S + K VS+D
Sbjct: 393 KVGLFSGSVASNFPPSNPTVSLGISNDQSGGAFFSSAMGKSLSVVSSNNIAMKTKVSLDL 452
Query: 454 VQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQ 513
VQLTQ+ RPMRRL +ENLP SASEKA+++ LN FL+SSGV H+ G+ PCI C+I +++GQ
Sbjct: 453 VQLTQATRPMRRLYIENLPHSASEKAIIDCLNGFLMSSGVNHIEGTQPCISCIIHKDRGQ 512
Query: 514 AFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSSKMVMEIVCA 573
A VEFLT EDASAAL DG FSGS LKI+RPK+++E + ++V A
Sbjct: 513 ALVEFLTPEDASAALLFDGSDFSGSTLKIRRPKDYIET---------------LRDVVTA 557
Query: 574 FGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMD 633
FG LKAYHFE+N+D PCAF+EYVD+ V KA AGLNG+K+GGQVL AV + +
Sbjct: 558 FGRLKAYHFEINDDLNGPCAFLEYVDESVVSKACAGLNGMKIGGQVLKVFPAV-PFPLTE 616
Query: 634 NSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGS 693
+G P +GIP+H PLL++P+ VLK+ NVFN + LSE +++EVLED+R ECARFG+
Sbjct: 617 RTGCQPCYGIPEHVKPLLQRPSVVLKINNVFNADVLPVLSESDIDEVLEDIRFECARFGT 676
Query: 694 VKSVNVVKYGDSNISTIQACEGNENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNNE 753
VKSVN VK I A E + + ++T + + K E E T S +N+
Sbjct: 677 VKSVNFVK------PCISAEEEYKQIS------DIT--DVDIKHEIQENSTTVISRNSND 722
Query: 754 LEILNDSKEVMEAGEVNNVKDNRPASGTMGDEP--SQLCELDTDMAVEYQARDS------ 805
LE N + + + + N + N P +G DE +LC++ A ++ D
Sbjct: 723 LEDNNANLDNCPS-DSNQKQANCPDNGRHQDEAVEDKLCQMGNTDATCFEVADCENASER 781
Query: 806 TSEIVSQGVPTQVNTLKDSPCAH------DDKVTCNIQLEHMSEENKSSAKEDLNLEEVN 859
T ++S+ + N +D+ D K+ C E SS D + +N
Sbjct: 782 TGHVLSEQRSSPENDFQDAKVTEIIETGLDKKLVC--------VEASSSMMADNEKKSLN 833
Query: 860 GNSEAFTGASNEMGMQSSAVENGDNENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHR 919
G ASN AVE + ++ D NQ +F G VFVE+ R EASCMAAHSLH
Sbjct: 834 GLDPVVRIASN-------AVEKSEKKDPDNNQESLFVLGSVFVEFGRIEASCMAAHSLHG 886
Query: 920 RLFDDRIVAVEYIPLNLYRARFSK 943
R++D + +++EYIP LYR RF K
Sbjct: 887 RIYDGQEISIEYIPHGLYRKRFPK 910
>gi|449498643|ref|XP_004160593.1| PREDICTED: uncharacterized LOC101213128 [Cucumis sativus]
Length = 918
Score = 547 bits (1409), Expect = e-152, Method: Compositional matrix adjust.
Identities = 389/992 (39%), Positives = 541/992 (54%), Gaps = 123/992 (12%)
Query: 1 MGRSGRKKEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSN 60
M G K+ + + + ++S++ SAARTRPFSF++IM+RRKTK + ++ DA
Sbjct: 1 MSTHGCSKQYSRRSKKQTRSNSDDESAARTRPFSFEDIMLRRKTKGSAATVEVDATS--- 57
Query: 61 IPGDCTIENVSDDHKSGRHHGHKKSPGVDVHT---SEEYVKVGSGKKEDNALTKN---VD 114
+D+ S R H K +DV EE K S +K++ + KN V
Sbjct: 58 ----------TDNRASERIFRHSKGSSLDVQNLSLEEESAKDSSRRKKEETVLKNNMVVR 107
Query: 115 SSRQRDGENRDLETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADV 174
S R L +KLK H R E R ++ G +D + D+
Sbjct: 108 SDRNNYESGLSLMSKLK--------HDRNENDE-----RQKYGQESLGWGKNDQSCRIDI 154
Query: 175 TKETSSKPN----VKSEKLIPSQGRSHNQSIV------DSRDEATKKHSRDLT------G 218
ET+ + + K + S+G S +S D R+ +K+ S+++ G
Sbjct: 155 ETETAKRHSRDTACKDRRQDHSRGESEKESKRKPQNGEDDRNRDSKRKSQNVDDEINKDG 214
Query: 219 KDRHADKSGEKSE-------RESKRKHRNRDDEKNRDKSAAKKVDLGKGHDLKVSERKEK 271
D S KS+ R+SKRK +N DD++NRDK AKK D GK HDL+ ERKE
Sbjct: 215 DDDRNRDSKRKSQNGDDDRNRDSKRKSQNGDDDRNRDKYIAKKHDHGKHHDLENRERKEA 274
Query: 272 KESPKSRHGNSRLKRRRSRSREREDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISS 331
S SR+ +SRLKRR+ RS +RE ++RRS+SLSPR+ K ++ R+ E LP S S
Sbjct: 275 VVSLTSRYEDSRLKRRQKRSPDRESKHRRSVSLSPRSHKHSTKLARQKE-LPLESHVKKS 333
Query: 332 GRQHSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSA 391
GR SD + + N + YRRH GSTSGLGGYSPRKRRTE+A KTPSP+ R+PE+K+
Sbjct: 334 GRWRSDSERTGDFTNTSNSQYRRHSGSTSGLGGYSPRKRRTESAVKTPSPL-RTPEEKNE 392
Query: 392 KWDVAPVETY-----SVPSNVHTSNQAASSN-AHEMVSSDPVTSTTQKPLAGISVSASLA 445
D+ P E SV SN SN S +++ +S K L+ +S +
Sbjct: 393 VLDLPPTEKVGLFSGSVASNFPPSNPTVSLGISNDQSGGAFFSSAMGKSLSVVSSNNIAM 452
Query: 446 KLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
K VS+D VQLTQ+ RPMRRL +ENLP SASEKA+++ LN FL+SSGV H+ G+ PCI C
Sbjct: 453 KTKVSLDLVQLTQATRPMRRLYIENLPHSASEKAIIDCLNGFLMSSGVNHIEGTQPCISC 512
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSSK 565
+I +++GQA VEFLT EDASAAL DG FSGS LKI+RPK+++E
Sbjct: 513 IIHKDRGQALVEFLTPEDASAALLFDGSDFSGSTLKIRRPKDYIET-------------- 558
Query: 566 MVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
+ ++V AFG LKAYHFE+N+D PCAF+EYVD+ V KA AGLNG+K+GGQVL A
Sbjct: 559 -LRDVVTAFGRLKAYHFEINDDLNGPCAFLEYVDESVVSKACAGLNGMKIGGQVLKVFPA 617
Query: 626 VLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVR 685
V + + +G P +GIP+H PLL++P+ VLK+ NVFN + LSE +++EVLED+R
Sbjct: 618 V-PFPLTERTGCQPCYGIPEHVKPLLQRPSVVLKINNVFNADVLPVLSESDIDEVLEDIR 676
Query: 686 LECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGVGQNLTNDETNEKGERLEEVTD 745
ECARFG+VKSVN VK I A E + + ++T + + K E E T
Sbjct: 677 FECARFGTVKSVNFVK------PCISAEEEYKQIS------DIT--DVDIKHEIQENSTT 722
Query: 746 HKSIKNNELEILNDSKEVMEAGEVNNVKDNRPASGTMGDEP--SQLCELDTDMAVEYQAR 803
S +N+LE N + + + + N + N P +G DE +LC++ A ++
Sbjct: 723 VISRNSNDLEDNNANLDNCPS-DSNQKQANCPDNGRHQDEAVEDKLCQMGNTDATCFEVA 781
Query: 804 DS------TSEIVSQGVPTQVNTLKDSPCAH------DDKVTCNIQLEHMSEENKSSAKE 851
D T ++S+ + N +D+ D K+ C E SS
Sbjct: 782 DCENASERTGHVLSEQRSSPENDFQDAKVTEIIETGLDKKLVC--------VEASSSMMA 833
Query: 852 DLNLEEVNGNSEAFTGASNEMGMQSSAVENGDNENQDPNQGHIFEPGCVFVEYMRAEASC 911
D + +NG ASN AVE + ++ D NQ +F G VFVE+ R EASC
Sbjct: 834 DNEKKSLNGLDPVVRIASN-------AVEKSEKKDPDNNQESLFVLGSVFVEFGRIEASC 886
Query: 912 MAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
MAAHSLH R++D + +++EYIP LYR RF K
Sbjct: 887 MAAHSLHGRIYDGQEISIEYIPHGLYRKRFPK 918
>gi|356536627|ref|XP_003536838.1| PREDICTED: uncharacterized protein LOC100810537 [Glycine max]
Length = 735
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 331/753 (43%), Positives = 453/753 (60%), Gaps = 54/753 (7%)
Query: 203 DSRDEATKKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDEKNRDKSAAKKVDLGKGHD 262
DS + KKH R+ + KD+HA K ERE+KRK++N DDE +D++ ++K D K H+
Sbjct: 25 DSEEVPEKKHYRE-SDKDKHAGGRA-KYERETKRKYKNGDDE-TQDRNTSRKQDAVKHHN 81
Query: 263 LKVSERKEKKESPKSRHGNSRLKRRRSRSREREDRNRRSLSLSPRAQKRTSYYDREHEGL 322
+ + ERKE++ KS N L +R SR RE +RRS S SPR QKRT Y D E + L
Sbjct: 82 MHIYERKERRVKVKSH--NEELTAKRRCSRSREREDRRSPSFSPREQKRT-YQDGERKEL 138
Query: 323 PSHSTKISSGRQHSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPI 382
HS K SS ++ DID +RV+ NG SGH+ RH STSGLGGYSPRKR++EAAAKTPSP
Sbjct: 139 SMHSLKDSSRTKNPDIDRNRVSTNGSSGHHHRHGVSTSGLGGYSPRKRKSEAAAKTPSPS 198
Query: 383 NRSPEKKSAKWDVAPVETYS----VPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
S EKK A WD+ P T + V S+ SN A SN H++VS+ + KPL
Sbjct: 199 KHSLEKKRAGWDLPPAGTNNPSAVVSSSFPVSNCAVLSNMHDVVSTSSLDLALVKPLPVS 258
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
S N ++DSVQLTQ+ RP+RRL +ENLP SASEKA+M+ NN LLS+ V H+
Sbjct: 259 FPSDVSTGKNTNIDSVQLTQATRPIRRLYLENLPASASEKAVMDCFNNLLLSARVNHIQQ 318
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGI 558
+ PCI C++ ++KGQA VEFLTA+DASAAL DG GSI+KI+RPK+++E
Sbjct: 319 AQPCICCILHKDKGQALVEFLTADDASAALSFDGSMLFGSIVKIRRPKDYIE-------- 370
Query: 559 SRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQ 618
+MEI FG LKAYHFE ++ PCAF+EYVD VT KA AGLNG+K+GG+
Sbjct: 371 --------LMEIAGVFGSLKAYHFETKVNN-GPCAFLEYVDHSVTIKACAGLNGMKLGGE 421
Query: 619 VLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVE 678
VLT +QA+ D S ++N+G +G+P+HA PLL+KPT+VL++ NVF + SLS++ +E
Sbjct: 422 VLTVLQAMPDASPLENAGESLSYGVPEHAKPLLRKPTQVLEINNVFAADTILSLSDMAIE 481
Query: 679 EVLEDVRLECARFGSVKSVNVVKY--GDSNISTIQACEGNENTASAGVGQNLTNDETNEK 736
E+L+DVRLECARFG++KS+NVVK+ G++ + ++ C+ + V Q+ TN TN
Sbjct: 482 EILDDVRLECARFGTIKSINVVKHSSGENLATKLEECKVINKVDAKEVSQD-TNCITNNT 540
Query: 737 GERLEEVTDHKSIK-NNELEIL-NDSKEVMEAGEVNNVKDNRPASGTMGDEPSQLCELDT 794
+ + + N +EI N+ E ++ E + V ++ A + D S +D
Sbjct: 541 ESSFSDKATYPDFEGTNGMEIHDNNEMEEVKVDEGSCVYVDKNAE--VFDYKSCREHVD- 597
Query: 795 DMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKSSAKEDLN 854
D AVE + Q P Q +T D P +D V +I D+N
Sbjct: 598 DSAVEDVGDKGIPCSIIQECPDQQDTPNDGPEFYDKMVANDI---------------DVN 642
Query: 855 LE---EVNGNSEAFTGASNEMGMQSSAVENGDN-ENQDPNQGHIFEPGCVFVEYMRAEAS 910
+E E AF +E + + V + + ++ GH+F+PG V VEY RAEA
Sbjct: 643 IENNMESKDTVCAFQEGFSEWDISAELVSPQKSIDTEEDIYGHVFKPGSVLVEYGRAEAC 702
Query: 911 CMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
C AAHSLH R FD RIV V Y+ L+LYR+RF+K
Sbjct: 703 CSAAHSLHGRFFDGRIVTVGYVALSLYRSRFTK 735
>gi|147852616|emb|CAN81690.1| hypothetical protein VITISV_009755 [Vitis vinifera]
Length = 544
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 301/587 (51%), Positives = 373/587 (63%), Gaps = 61/587 (10%)
Query: 1 MGRSGRKKEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSN 60
M RS R KEK+GK E L + EG+AARTRPFSFDEIM+RRK K L ++K+ A + N
Sbjct: 1 MSRSNRPKEKYGKSTELYLDNFKEGTAARTRPFSFDEIMLRRKNKKLYGDVKDGAGEEGN 60
Query: 61 IPGDCTIENVSDDHKS--GRHHGHKKSPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQ 118
I ++NVSD ++S G H PG H+SE++VKV S KK++N K + +
Sbjct: 61 ISRKDIVKNVSDCYESDKGYKHNEDSFPGAINHSSEDFVKVSSRKKDENTSMKEGNLGKV 120
Query: 119 RDGENRDLETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKET 178
+D E+ + E KLK AK N K D +KE
Sbjct: 121 KDKESHNSEDKLK----AKPN-----------------------------KGMTDKSKEG 147
Query: 179 SSKPNVKSEKLIPSQGRSHNQSIVDSRDEATKKHSRDLTGKDRHADKSGEKSERESKRKH 238
V K I + R + S E KK SRD GKDR+AD+S KSE+ESKRKH
Sbjct: 148 KINQRVHGRKKIDERSRRSSDD---SESEPEKKFSRDSVGKDRYADRS-RKSEKESKRKH 203
Query: 239 RNRDDEKNRDKSAAKKVDLGKGHDLKVSERKEKKESPKSRHGNSRLKRRRSRSRERE-DR 297
R +DEKNR++++ KK D GK H+ + +RKE++ESP SR+ SR KRRRSRSRER+ DR
Sbjct: 204 RTGEDEKNRERNSMKKHDPGKRHESEFLDRKERRESPPSRYEESRPKRRRSRSRERDKDR 263
Query: 298 NRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNGLSGHYRRHDG 357
++RS SLSPRAQKRTS++ REH L SHS K SGRQHSD D +R++ NG S H+RRH G
Sbjct: 264 DKRSSSLSPRAQKRTSHHGREHAELSSHSLKDRSGRQHSDADRNRISNNGSSSHFRRHGG 323
Query: 358 STSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVET-----YSVPSNVHTSNQ 412
S SGLGGYSPRKRRTEAA KTPSP NRSPEKK+A WD+ P T SV S++
Sbjct: 324 SASGLGGYSPRKRRTEAAIKTPSPTNRSPEKKNAGWDLPPSRTDGMNAGSVLSSLQVLKP 383
Query: 413 AASSNAHEMVS----SDPVTSTTQK-PLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLC 467
SSNA E+ S + PVT+TT K PL I A NVS+DS+QLTQ+ RPMRRL
Sbjct: 384 TVSSNADELPSAVPVAVPVTATTAKPPLPRIYSDAVSKNKNVSIDSIQLTQATRPMRRLY 443
Query: 468 VENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAA 527
VENLP+S+SEKALME LNNFLLSSG+ HV G+ PCI C+I +EKGQA VEFLT EDASAA
Sbjct: 444 VENLPVSSSEKALMECLNNFLLSSGINHVQGTPPCISCIIHKEKGQALVEFLTPEDASAA 503
Query: 528 LCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSSKMVMEIVCAF 574
L DG SFSGSILKI+RPK+FV++ + +M++ I F
Sbjct: 504 LSFDGISFSGSILKIRRPKDFVDMVV-----------RMIITINAFF 539
>gi|218192051|gb|EEC74478.1| hypothetical protein OsI_09930 [Oryza sativa Indica Group]
Length = 1128
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 361/999 (36%), Positives = 510/999 (51%), Gaps = 126/999 (12%)
Query: 22 SNEGSAARTRPFSFDEIMIRRKTKNLSE--NIKEDAMDVSNIPGDCTIENVSDDHKSGRH 79
+++G+AARTRP S +IM RR+ K SE KE + SN S+ +SGR
Sbjct: 179 TSDGTAARTRPLSIQDIMSRREKKTASEAKKTKEGLEENSN--------GKSNHLESGRG 230
Query: 80 HGHKKSP---GVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDLETKL-KEDYV 135
+K ++ + K G G K+D ++ +R RD++ ++ K+D +
Sbjct: 231 SKSRKDVREMPLEGSKKDNSDKPGEGSKKDE--MRHTPREERRKDNMRDMQREVSKKDNL 288
Query: 136 AKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETS-----SKPNVKSEKLI 190
+ +++K + P + R ++ + T++ S K N S KL
Sbjct: 289 KDRPKDISKIDNLKVKVKVPSKDDQRDAPKKSSRKERSSTRDDSHLVDKDKGNHNSHKLS 348
Query: 191 P------SQGRSHNQSIVDSRD-EATKKHSRDLTGKDR-----HADKSGEKSER--ESKR 236
+ R N + +RD +AT + S+ GK D+ E+SER + KR
Sbjct: 349 TYTSGQVGKSRGGNHGEITARDGDATIQESQKRPGKRWIDEPVGNDRIKERSERRTDGKR 408
Query: 237 KHRNRDDEKNR--DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSRE 293
K R DDEK+ D+ KK D + D + +RK+ +KE+ K H R KRRRS SR+
Sbjct: 409 KSRGFDDEKSSQVDRPTLKKQDAVRLQDPRHFDRKDGRKENAKLHHEEPRSKRRRSTSRD 468
Query: 294 R-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSG 350
++R+ RS+S S R Q R SY H+ P + + S R+H + D R + N SG
Sbjct: 469 HYQERHDRSVSPSSREQ-RHSYRGHGHDYYPPYHSMDKSRRKHFETDRHRTSWNDGYSSG 527
Query: 351 HYRRHDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKKSAKWDVAPVETYSVPS 405
YRR++ S LGGYSPRKR+T + KT S + RSPEKKSA WD PV T S
Sbjct: 528 SYRRYE---SRLGGYSPRKRKTAPKDEQTTIKTASLVIRSPEKKSATWDQLPVATDL--S 582
Query: 406 NVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRR 465
N T+ Q+ V PV TT K ++ L N+++DSVQLTQ+ RP+RR
Sbjct: 583 NFVTTLQSTVGLKDSTV---PVNFTTSKQDPNTTIGTILTGSNLAVDSVQLTQATRPLRR 639
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS-LPCIGCVIQREKGQAFVEFLTAEDA 524
L +ENLP A+E L+ LN FLLSS H+ S PC+ CVI ++K QAFVEFLT EDA
Sbjct: 640 LHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKRQAFVEFLTPEDA 699
Query: 525 SAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIGGISR 560
+AAL DG SF GS LKI+RPKE+VE+A IFI GIS
Sbjct: 700 TAALSFDGRSFGGSSLKIRRPKEYVEMAHVAPKKPSEEIKLISDVVADSPHKIFIAGISG 759
Query: 561 TLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+SS+M+MEIV +FGPL AY F NED CAF+EY+D +T KA AGLNG+K+GG +L
Sbjct: 760 VISSEMLMEIVSSFGPLAAYRFLFNEDLGGACAFLEYIDHSITSKACAGLNGMKLGGGIL 819
Query: 621 TAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEV 680
TAV + + + PF+GIP A LL++PT+VL+LKNVF+ E + LS+ E+EE+
Sbjct: 820 TAVNVFPNSTEQAFNEASPFYGIPDSAKSLLEEPTKVLQLKNVFDQEEYLLLSKSELEEI 879
Query: 681 LEDVRLECARFGSVKSVNVVKYGDSNIS----TIQACE------------GNENTASAGV 724
LEDVR+ECARFG+VKS+NVV+Y S+ + TI CE GN + GV
Sbjct: 880 LEDVRVECARFGAVKSINVVEYPASSDNTTGDTITECEDGSTKIEPKEYGGNVSCTETGV 939
Query: 725 GQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNNVKDNRPASGTMGD 784
++ N T+ + EV D + + + D K + GE + + T GD
Sbjct: 940 ECSVLNQSTDVPDPSICEVQDPVELDTDSIPKGRDHKNLDTRGECD--------APTAGD 991
Query: 785 EPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEE 844
E +TD VE DST Q + H D +++ +
Sbjct: 992 E-------NTDQGVEADQTDSTD--------AQDDARGTIERGHADADQASLETSCSTAP 1036
Query: 845 NKSSAKEDLNLEEVNGNSEAFTGASNEMGMQSSAVENGDNENQDPNQGHIFEPGCVFVEY 904
+ K E+ G G S ++ AV+ DN + E GC+ VE+
Sbjct: 1037 GDGADKSGRENEQQGG-----AGVSESNTEKAPAVDARDNALA--SNTSALEAGCILVEF 1089
Query: 905 MRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
+R EA+C AAHSLH R F RIV+ Y P +LY ++++
Sbjct: 1090 LRKEAACTAAHSLHGRRFGSRIVSAGYAPHDLYLQKYTR 1128
>gi|108706080|gb|ABF93875.1| RNA recognition motif family protein, expressed [Oryza sativa
Japonica Group]
Length = 964
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 361/999 (36%), Positives = 508/999 (50%), Gaps = 126/999 (12%)
Query: 22 SNEGSAARTRPFSFDEIMIRRKTKNLSE--NIKEDAMDVSNIPGDCTIENVSDDHKSGRH 79
+++G+AARTRP S +IM RR+ K SE KE + SN S+ +SGR
Sbjct: 15 TSDGTAARTRPLSIQDIMSRREKKTASEAKKTKEGLEENSN--------GKSNHLESGRG 66
Query: 80 HGHKKSP---GVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDLETKL-KEDYV 135
+K ++ + K G G K+D ++ +R RD++ ++ K+D +
Sbjct: 67 SKSRKDVREMPLEGSKKDNSDKPGEGSKKDE--MRHTPREERRKDNMRDMQREVSKKDNL 124
Query: 136 AKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETS-----SKPNVKSEKLI 190
+ +++K + P + R +K + T++ S K N S KL
Sbjct: 125 KDRPKDISKIDNLKVKVKVPSKDDQRDAPKKSSKKERSSTRDDSHLVDKDKGNHNSHKLS 184
Query: 191 P------SQGRSHNQSIVDSRD-EATKKHSRDLTGKDR-----HADKSGEKSER--ESKR 236
+ R N + +RD +AT + S+ GK D+ E+SER + KR
Sbjct: 185 TYTSGRVGKSRGGNHGEITARDGDATIQESQKRPGKRWIDEPVGNDRIKERSERRTDGKR 244
Query: 237 KHRNRDDEKNR--DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSRE 293
R DDEK+ D+ KK D + D + +RK+ +KE+ K H R KRRRS SR+
Sbjct: 245 NSRGFDDEKSSQVDRPTLKKQDAVRLQDPRHFDRKDGRKENAKLHHEEPRSKRRRSTSRD 304
Query: 294 R-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSG 350
++R+ RS+S S R Q R SY H+ P + + S R+H + D R + N SG
Sbjct: 305 HYQERHDRSVSPSSREQ-RHSYRGHGHDYYPPYHSMDKSRRKHFETDRHRTSWNDGYSSG 363
Query: 351 HYRRHDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKKSAKWDVAPVETYSVPS 405
YRR++ S LGGYSPRKR+T + KT S + RSPEKKSA WD PV T S
Sbjct: 364 SYRRYE---SRLGGYSPRKRKTAPKDEQTTIKTASLVIRSPEKKSATWDQLPVATDL--S 418
Query: 406 NVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRR 465
N T+ Q+ V PV TT K ++ L N+++DSVQLTQ+ RP+RR
Sbjct: 419 NFVTTLQSTVGLKDSTV---PVNFTTSKQDPNTTIGTILTGSNLAVDSVQLTQATRPLRR 475
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS-LPCIGCVIQREKGQAFVEFLTAEDA 524
L +ENLP A+E L+ LN FLLSS H+ S PC+ CVI ++K QAFVEFLT EDA
Sbjct: 476 LHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKRQAFVEFLTPEDA 535
Query: 525 SAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIGGISR 560
+AAL DG SF GS LKI+RPKE+VE+A IFI GIS
Sbjct: 536 TAALSFDGRSFGGSSLKIRRPKEYVEMAHVAPKKPSEEIKLISDVVADSPHKIFIAGISG 595
Query: 561 TLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+SS+M+MEIV +FGPL AY F NE CAF+EY+D +T KA AGLNG+K+GG +L
Sbjct: 596 VISSEMLMEIVSSFGPLAAYRFLFNEYLGGACAFLEYIDHSITSKACAGLNGMKLGGGIL 655
Query: 621 TAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEV 680
TAV + + + PF+GIP A LL++PT+VL+LKNVF+ E + LS+ E+EE+
Sbjct: 656 TAVNVFPNSTEQAFNEASPFYGIPDSAKSLLEEPTKVLQLKNVFDQEEYLLLSKSELEEI 715
Query: 681 LEDVRLECARFGSVKSVNVVKYGDSNIS----TIQACE------------GNENTASAGV 724
LEDVR+ECARFG+VKS+NVVKY S+ + TI CE GN + GV
Sbjct: 716 LEDVRVECARFGAVKSINVVKYPASSDNTTGDTITECEDGSTKIEPKEYGGNVSCTETGV 775
Query: 725 GQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNNVKDNRPASGTMGD 784
++ N T+ + EV D + + + D K + GE + + T GD
Sbjct: 776 ECSVLNQSTDVPDPSICEVQDPVELDTDSIPKGRDHKNLDTRGECD--------APTAGD 827
Query: 785 EPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEE 844
E +TD VE DST Q + H D +++ +
Sbjct: 828 E-------NTDQGVEADQTDSTD--------AQDDARGTIERGHADADPASLETSCSTAP 872
Query: 845 NKSSAKEDLNLEEVNGNSEAFTGASNEMGMQSSAVENGDNENQDPNQGHIFEPGCVFVEY 904
+ K E+ G G S ++ AV+ DN + E GC+ VE+
Sbjct: 873 GDGADKSGRENEQQGG-----AGVSESNTEKAPAVDARDNALA--SNTSALEAGCILVEF 925
Query: 905 MRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
+R EA+C+AAHSLH R F RIV+ Y P +LY ++ +
Sbjct: 926 LRKEAACIAAHSLHGRRFGSRIVSAGYAPHDLYLQKYPR 964
>gi|108706079|gb|ABF93874.1| RNA recognition motif family protein, expressed [Oryza sativa
Japonica Group]
Length = 964
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 359/999 (35%), Positives = 506/999 (50%), Gaps = 126/999 (12%)
Query: 22 SNEGSAARTRPFSFDEIMIRRKTKNLSE--NIKEDAMDVSNIPGDCTIENVSDDHKSGRH 79
+++G+AARTRP S +IM RR+ K SE KE + SN S+ +SGR
Sbjct: 15 TSDGTAARTRPLSIQDIMSRREKKTASEAKKTKEGLEENSN--------GKSNHLESGRG 66
Query: 80 HGHKKSP---GVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDLETKL-KEDYV 135
+K ++ + K G G K+D ++ +R RD++ ++ K+D +
Sbjct: 67 SKSRKDVREMPLEGSKKDNSDKPGEGSKKDE--MRHTPREERRKDNMRDMQREVSKKDNL 124
Query: 136 AKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETS-----SKPNVKSEKLI 190
+ +++K + P + R +K + T++ S K N S KL
Sbjct: 125 KDRPKDISKIDNLKVKVKVPSKDDQRDAPKKSSKKERSSTRDDSHLVDKDKGNHNSHKLS 184
Query: 191 P------SQGRSHNQSIVDSRD-EATKKHSRDLTGKDR-----HADKSGEKSER--ESKR 236
+ R N + +RD +AT + S+ GK D+ E+SER + KR
Sbjct: 185 TYTSGRVGKSRGGNHGEITARDGDATIQESQKRPGKRWIDEPVGNDRIKERSERRTDGKR 244
Query: 237 KHRNRDDEKNR--DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSRE 293
R DDEK+ D+ KK D + D + +RK+ +KE+ K H R KRRRS SR+
Sbjct: 245 NSRGFDDEKSSQVDRPTLKKQDAVRLQDPRHFDRKDGRKENAKLHHEEPRSKRRRSTSRD 304
Query: 294 R-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSG 350
++R+ RS+S S R Q R SY H+ P + + S R+H + D R + N SG
Sbjct: 305 HYQERHDRSVSPSSREQ-RHSYRGHGHDYYPPYHSMDKSRRKHFETDRHRTSWNDGYSSG 363
Query: 351 HYRRHDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKKSAKWDVAPVETYSVPS 405
YRR++ S LGGYSPRKR+T + KT S + RSPEKKSA WD PV T S
Sbjct: 364 SYRRYE---SRLGGYSPRKRKTAPKDEQTTIKTASLVIRSPEKKSATWDQLPVATDL--S 418
Query: 406 NVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRR 465
N T+ Q+ V PV TT K ++ L N+++DSVQLTQ+ RP+RR
Sbjct: 419 NFVTTLQSTVGLKDSTV---PVNFTTSKQDPNTTIGTILTGSNLAVDSVQLTQATRPLRR 475
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS-LPCIGCVIQREKGQAFVEFLTAEDA 524
L +ENLP A+E L+ LN FLLSS H+ S PC+ CVI ++K QAFVEFLT EDA
Sbjct: 476 LHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKRQAFVEFLTPEDA 535
Query: 525 SAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIGGISR 560
+AAL DG SF GS LKI+RPKE+VE+A IFI GIS
Sbjct: 536 TAALSFDGRSFGGSSLKIRRPKEYVEMAHVAPKKPSEEIKLISDVVADSPHKIFIAGISG 595
Query: 561 TLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+SS+M+MEIV +FGPL AY F NE CAF+EY+D +T KA AGLNG+K+GG +L
Sbjct: 596 VISSEMLMEIVSSFGPLAAYRFLFNEYLGGACAFLEYIDHSITSKACAGLNGMKLGGGIL 655
Query: 621 TAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEV 680
TAV + + + PF+GIP A LL++PT+VL+LKNVF+ E + LS+ E+EE+
Sbjct: 656 TAVNVFPNSTEQAFNEASPFYGIPDSAKSLLEEPTKVLQLKNVFDQEEYLLLSKSELEEI 715
Query: 681 LEDVRLECARFGSVKSVNVVKYGDSNIS----TIQACE------------GNENTASAGV 724
LEDVR+E RFG+VKS+NVVKY S+ + TI CE GN + GV
Sbjct: 716 LEDVRVEYDRFGAVKSINVVKYPASSDNTTGDTITECEDGSTKIEPKEYGGNVSCTETGV 775
Query: 725 GQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNNVKDNRPASGTMGD 784
++ N T+ + EV D + + + D K + GE + + T GD
Sbjct: 776 ECSVLNQSTDVPDPSICEVQDPVELDTDSIPKGRDHKNLDTRGECD--------APTAGD 827
Query: 785 EPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEE 844
E +TD VE DST Q + H D +++ +
Sbjct: 828 E-------NTDQGVEADQTDSTD--------AQDDARGTIERGHADADPASLETSCSTAP 872
Query: 845 NKSSAKEDLNLEEVNGNSEAFTGASNEMGMQSSAVENGDNENQDPNQGHIFEPGCVFVEY 904
+ K E+ G G S ++ AV+ DN + E GC+ VE+
Sbjct: 873 GDGADKSGRENEQQGG-----AGVSESNTEKAPAVDARDNALA--SNTSALEAGCILVEF 925
Query: 905 MRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
+R EA+C+AAHSLH R F RIV+ Y P +LY ++ +
Sbjct: 926 LRKEAACIAAHSLHGRRFGSRIVSAGYAPHDLYLQKYPR 964
>gi|242037001|ref|XP_002465895.1| hypothetical protein SORBIDRAFT_01g047730 [Sorghum bicolor]
gi|241919749|gb|EER92893.1| hypothetical protein SORBIDRAFT_01g047730 [Sorghum bicolor]
Length = 969
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 354/1018 (34%), Positives = 506/1018 (49%), Gaps = 159/1018 (15%)
Query: 22 SNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSNIPGDCTIENVSDDHKSGRHHG 81
++EG+AARTRP S +IM+RR+ K SE+ K+ + N G T +HKS +
Sbjct: 15 TSEGTAARTRPLSIKDIMLRREKKAASES-KKAKEGLENGKGKSTHLEQGREHKSRKDAR 73
Query: 82 HKKSPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDLETKLKEDYVAKSNHR 141
G S + + GS K++ + + E KED
Sbjct: 74 DMPVEGSRKEKSRDATREGSKKEKPRHIPR---------------ENPKKED-------T 111
Query: 142 RTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETSSK-----PNVKSEKLIPS---- 192
R E+ KD N +DR N G+ D K + P S+K PS
Sbjct: 112 RYAPKEVSKKD----NSRDRPNTGAKMDDLKDAPKVPEKEGLGDAPKKASKKQRPSGDDN 167
Query: 193 ------QGRSHNQSIV--------DSRD--------------------EATKKHSRDLTG 218
+ H+Q +V +S+D E K+ + + G
Sbjct: 168 HSVRKDKDIHHSQKLVTNMSGRADESKDRNLSEIRERKGDVMRSEYQKEPGKRWNDEAVG 227
Query: 219 KDRHADKSGEKSERESKRKHRNRDDEKNR--DKSAAKKVDLGKGHDLKVSERKE-KKESP 275
D+ KS EK E+KRK R+ D+E++ D+ +KK D D K S+R + E
Sbjct: 228 DDKIKFKS-EKLRNETKRKDRSFDNERSPEVDRPMSKKQDSAWFQDSKHSDRYAGRNEYA 286
Query: 276 KSRHGNSRLKRRRSRSRERE-DRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQ 334
K HG KRRRSRSR+ + +R RS+S P ++R +Y R+ P H + S R+
Sbjct: 287 KPYHGEPGFKRRRSRSRDHDRERYGRSIS-PPLREERYNYRGRDFGNYPPHYSMEKSRRK 345
Query: 335 HSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKK 389
++++D +++G+G H S LGGYSPRK++T +A KTPS + +SPEKK
Sbjct: 346 YAEVDKQKLSGSGGYIG-GSHQKYESRLGGYSPRKKKTSLQAEQATTKTPSKVIQSPEKK 404
Query: 390 SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNV 449
SA WD PV+ SN T+ ++ P + +T K +V L ++
Sbjct: 405 SATWDQPPVKANQ--SNFPTTFLPTVGQ----MAPIPFSFSTIKKDPSTTVETMLVGNSL 458
Query: 450 SMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQR 509
+ DSVQLTQ+ RP+RRL +ENLP SA+E L++ LN+FLL +G++ S PC+ C I R
Sbjct: 459 TADSVQLTQATRPLRRLHIENLPDSATEDKLIDCLNDFLLPTGIK-PQRSKPCLSCTINR 517
Query: 510 EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------------- 552
EK QAFVEFLT EDA+AAL DG S +GS L+I+RPKE+VE
Sbjct: 518 EKRQAFVEFLTPEDATAALSFDGRSLNGSTLRIRRPKEYVETVNVTPKKPAEETALISDV 577
Query: 553 -------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPK 605
IFI GI+ +SS+M+MEIV AFGPL AY F N + PCAF+EY D+ +T K
Sbjct: 578 VADSPHKIFIAGIAGVISSEMLMEIVSAFGPLAAYRFLFNSELGGPCAFLEYADRSITSK 637
Query: 606 AIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFN 665
A AGLNG+ +GG VLTAV + + + PF+GIP++A LLK+PT+VL+LKN F
Sbjct: 638 ACAGLNGMMLGGCVLTAVHVFPNPPVEAANEASPFYGIPENAKSLLKEPTKVLQLKNTFE 697
Query: 666 PEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGVG 725
E + LS+ E+EE LEDVR+EC RFG+VKSV+VV+Y S + +NT +
Sbjct: 698 REEYMLLSKSELEETLEDVRVECTRFGAVKSVHVVEYPAGGGSAAE-----DNTVELKIE 752
Query: 726 QNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNNVKDNRPASGTMGDE 785
D N + V+++ N +++LN S EA E +V D P S D+
Sbjct: 753 CTEFADTEN----IAKAVSEYSVPINQSIDVLNHS----EASETKDV-DPIPESQDHKDK 803
Query: 786 --PSQ--LCELDTDMAVEYQARDSTSEIVSQGVPT-QVNTLKDSPCAHDD-------KVT 833
PS LCE +A E D T + +PT Q + + A D+ +VT
Sbjct: 804 HLPSNAALCECKAPVADEDAELDETQSRAA--LPTSQHAEVGHTEAAVDENKHTGAGEVT 861
Query: 834 CNIQLEHMSEENKS--------SAKEDLNLEEVNGNSEAFTGASNEMGMQSSAVENGDNE 885
+ + E++ S E + E G+++ T E + AVE D
Sbjct: 862 ATVMDDDAVEKSHQDPRTSETCSPAEPTDKVEKPGSADDVTENRPE---KVPAVETSDT- 917
Query: 886 NQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
G +FEPG V VE+MR EA+C+AAHSLH R F +R V Y P +LY ++ +
Sbjct: 918 ------GFVFEPGSVLVEFMRKEAACIAAHSLHGRRFGNRTVHAGYAPYDLYLQKYPR 969
>gi|224110608|ref|XP_002315576.1| predicted protein [Populus trichocarpa]
gi|222864616|gb|EEF01747.1| predicted protein [Populus trichocarpa]
Length = 500
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 260/513 (50%), Positives = 327/513 (63%), Gaps = 52/513 (10%)
Query: 1 MGRSGRKKEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSN 60
M RS R K+K+G+ E S EG+AARTRPFSFDEIM RK K SE ++ + D
Sbjct: 2 MSRSSRHKDKYGRSSEMSQDHQYEGTAARTRPFSFDEIMSIRKNKKASEILEGELKD--- 58
Query: 61 IPGDCTIENVSDDHKSGRHHGHKK--SPGVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQ 118
I G E SD H+S R +GH + S G+ H SEE+ K KKEDN
Sbjct: 59 ILGGVINEKASD-HRSERGNGHNEESSTGLRQHLSEEHGKASYRKKEDN----------- 106
Query: 119 RDGENRDLETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKET 178
+KEDY+AK +DR R S+TK KA + ++
Sbjct: 107 ---------VSMKEDYIAKG--------------------RDRDVRDSETKSKAKMNEDM 137
Query: 179 SSKPNVKSEKLIPSQGRSHNQSIVDSRDEATKKHSRDLTGKDRHADKSGEKSERESKRKH 238
++ K+ + I + + + S EA KKHSR++ KDRH DKS KSERE K K+
Sbjct: 138 RTEVKEKTNEKIHDRRKGDKRPSNISESEAVKKHSREVQ-KDRHVDKSRGKSERERKEKY 196
Query: 239 RNRDDEKNRDKSAAKKVDLGKGHDLKVSERKEKKESPKSRHGNSRLKRRRSRSREREDRN 298
RN D+K+RD++AAKK DLGKGH L+ SERKE+KES KS H RLKRRRSRSRE EDRN
Sbjct: 197 RNGIDDKSRDRNAAKKHDLGKGHHLETSERKERKESSKSHHEELRLKRRRSRSREHEDRN 256
Query: 299 RRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNGLSGHYRRHDGS 358
RRS+SLSPRA KR SY+ REH L SHS K SGRQ SD +++++T + S H RRH G
Sbjct: 257 RRSISLSPRAHKRGSYHKREHVELSSHSVKERSGRQQSDAENNQLTNSSSSRHQRRHGGF 316
Query: 359 TSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSV-P----SNVHTSNQA 413
SGLGGYSPRKR+TEAA KTPSP RSPEKKSAKWD+AP ET +V P SN + NQ
Sbjct: 317 ASGLGGYSPRKRKTEAAIKTPSPAKRSPEKKSAKWDLAPEETNNVFPAVILSNFQSPNQT 376
Query: 414 ASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPL 473
ASSN HE+VS+ PV S KPL+G+S+S+ + V+ +S+QLTQ+ P+RRL +EN+P
Sbjct: 377 ASSNIHEVVSAVPVVSAPMKPLSGVSLSSLSTAMKVATESIQLTQATHPIRRLYMENIPA 436
Query: 474 SASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
SASEKA+M+ LNNFL+SSGV H+ G+ PCI C+
Sbjct: 437 SASEKAVMDCLNNFLISSGVHHIQGTQPCISCI 469
>gi|297823139|ref|XP_002879452.1| hypothetical protein ARALYDRAFT_321074 [Arabidopsis lyrata subsp.
lyrata]
gi|297325291|gb|EFH55711.1| hypothetical protein ARALYDRAFT_321074 [Arabidopsis lyrata subsp.
lyrata]
Length = 497
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 231/486 (47%), Positives = 292/486 (60%), Gaps = 63/486 (12%)
Query: 208 ATKKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDEKNRDKSAAKKVDLGKGHDLKVSE 267
+KK R L D D+S K + E KRK++N D +KNR+ + +K+ D GK H ++VSE
Sbjct: 8 GSKKRLRSLVVADVPRDESSIKHDNEDKRKNQNGDHKKNREMTISKRHDSGKVHSVEVSE 67
Query: 268 RKEKKESPKSRHGNSRLKRRRSRSRER-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHS 326
R E+KE PK+ + R KRRRSRSR+ +DR +R L PRA+K T+
Sbjct: 68 RMERKEQPKAHERDMREKRRRSRSRDHGQDRQKRPSPL-PRAEKATA------------- 113
Query: 327 TKISSGRQHSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSP 386
S S LGGYSPRKRR EA+ K SP N S
Sbjct: 114 -------------------------------SKSELGGYSPRKRREEASTKAVSPPNLSS 142
Query: 387 EKKSAKWDVAPVETYSVPSN-VHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLA 445
EKKSAKWD+AP T + S V + QAA+ A+ +S + KPL +
Sbjct: 143 EKKSAKWDLAPTVTSGMFSGPVFSGLQAATQTAYPTISE--ASLMLLKPLMEGTFRTPPP 200
Query: 446 KLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S DSVQLT+S RPMRRL EN+P SASEK+L+E N ++LSSG H+ GS PCI C
Sbjct: 201 RQITSFDSVQLTESTRPMRRLYAENVPDSASEKSLIECFNGYMLSSGSNHIKGSEPCISC 260
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSSK 565
+I +EK QA VEFLT +DASAAL DGCSF+GS LKI+RPK++V +
Sbjct: 261 IINKEKSQALVEFLTPQDASAALSLDGCSFAGSNLKIRRPKDYVGTTL------------ 308
Query: 566 MVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
MEIV FGPLKAY F N D + CA++EY D VT KA AGLNG+K+GG V+TAV A
Sbjct: 309 --MEIVSVFGPLKAYRFVSNNDLNQQCAYLEYTDGSVTLKACAGLNGMKLGGSVITAVCA 366
Query: 626 VLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVR 685
D S + + NPPF+GIP HA PLL KP +LKLKNV +PE F+ LSE EV+E+L+DVR
Sbjct: 367 FPDASSVAVNENPPFYGIPGHAKPLLGKPKHILKLKNVVDPEDFTLLSEQEVKEILDDVR 426
Query: 686 LECARF 691
LECAR+
Sbjct: 427 LECARW 432
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/57 (57%), Positives = 43/57 (75%), Gaps = 2/57 (3%)
Query: 885 ENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARF 941
E +DP+ +FEPGC+F+EY R EA+C AAHSLH RL+D+RIV EY+ LY+ RF
Sbjct: 440 EERDPDD--LFEPGCIFIEYGRPEATCDAAHSLHGRLYDNRIVKAEYVSKELYQIRF 494
>gi|413956976|gb|AFW89625.1| hypothetical protein ZEAMMB73_282398 [Zea mays]
Length = 635
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 254/634 (40%), Positives = 342/634 (53%), Gaps = 70/634 (11%)
Query: 355 HDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKKSAKWDVAPVET--YSVPSN- 406
H S LGGYSPRK++T +A KTPS + +SPEKKSA WD PV+ + P+
Sbjct: 27 HQRYESRLGGYSPRKKKTSLQAEQATTKTPSKVIQSPEKKSATWDQPPVKANQSNFPTTF 86
Query: 407 VHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRL 466
+ T Q A + V DP T+ LAG S++A DSVQLTQ+ RP+RRL
Sbjct: 87 LPTVGQMAPTPFSFSVIKDPSTTAVTM-LAGNSLTA---------DSVQLTQATRPLRRL 136
Query: 467 CVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASA 526
+ENLP SA+E L++ LN+FLLS+G + + S PC+ C I REK QAFVEFLT EDA+A
Sbjct: 137 HIENLPDSATEDKLIDCLNDFLLSTGSK-LQRSKPCLSCTINREKRQAFVEFLTPEDATA 195
Query: 527 ALCCDGCSFSGSILKIKRPKEFVEVA-----------------------IFIGGISRTLS 563
A+ DG S +GS+L+I+RPKE+VE IFI GI+ +S
Sbjct: 196 AISFDGRSLNGSVLRIRRPKEYVETVNVTPKKAEETALISDVVADSPYKIFIAGIAGVIS 255
Query: 564 SKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAV 623
SKM+MEIV AFGPL AY F N + PCAF+EY D+ VT KA AGLNG+ +GG+VLTAV
Sbjct: 256 SKMLMEIVSAFGPLAAYRFLFNNELGGPCAFLEYADRSVTSKACAGLNGMMLGGRVLTAV 315
Query: 624 QAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+ + + PF+GIP +A LLK+PT+VL+LKNVF E + LS+ E+EE LED
Sbjct: 316 HVFPNPHVEAANEASPFYGIPDNAKLLLKEPTKVLQLKNVFEREEYMLLSKSELEETLED 375
Query: 684 VRLECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGV-GQNLTNDETNEKGERLEE 742
VR+EC RFG+VKSVNVV+Y + +S + EN + ++ E K
Sbjct: 376 VRVECTRFGAVKSVNVVEYPAAGVSAAE-----ENIVELKIECTEFSDTENIAKAVSEYS 430
Query: 743 VTDHKSIKNNELEILNDSKEVMEAGEVNNVKDNRPASGTMGDEPSQLCELDTDMAVEYQA 802
V SI + +D+K+V E + KD S + LCE + +A E
Sbjct: 431 VPIIPSIDVLNHSVASDTKDVDLIPESQDQKDKHLPSN------AALCESEAPVADEDAE 484
Query: 803 RDSTSEIVSQGVPTQVNTLKDSPCAHDD--------KVTCNIQLEHMSEENKSSAK--ED 852
D T +PT + D A D KVT + E++ + E
Sbjct: 485 LDETQS--RAALPTPQHAEADHTEAAVDENKHTGAGKVTATATDDDAVEKSHGDPRTSET 542
Query: 853 LNLEEVNGNSEAFTGASNEMGMQSSAVENGDNENQ---DPNQGHIFEPGCVFVEYMRAEA 909
N +E G +E G + + E Q + G +FEPG V VE++R EA
Sbjct: 543 CNPAGPTDKAEK-PGRYSEQGAGDVTEDRPEKEAQAVGTSDTGFVFEPGSVLVEFLREEA 601
Query: 910 SCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
+CMAAHSLH R F +R V Y P +LY ++ +
Sbjct: 602 ACMAAHSLHGRRFGNRTVHAGYAPYDLYLQKYPR 635
>gi|125584846|gb|EAZ25510.1| hypothetical protein OsJ_09334 [Oryza sativa Japonica Group]
Length = 942
Score = 370 bits (951), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 344/1007 (34%), Positives = 488/1007 (48%), Gaps = 164/1007 (16%)
Query: 22 SNEGSAARTRPFSFDEIMIRRKTKNLSE--NIKEDAMDVSNIPGDCTIENVSDDHKSGRH 79
+++G+AARTRP S +IM RR+ K SE KE + SN S+ +SGR
Sbjct: 15 TSDGTAARTRPLSIQDIMSRREKKTASEAKKTKEGLEENSN--------GKSNHLESGRG 66
Query: 80 HGHKKSP---GVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDLETKL-KEDYV 135
+K ++ + K G G K+D ++ +R RD++ ++ K+D +
Sbjct: 67 SKSRKDVREMPLEGSKKDNSDKPGEGSKKDE--MRHTPREERRKDNMRDMQREVSKKDNL 124
Query: 136 AKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETS-----SKPNVKSEKLI 190
+ +++K + P + R +K + T++ S K N S KL
Sbjct: 125 KDRPKDISKIDNLKVKVKVPSKDDQRDAPKKSSKKERSSTRDDSHLVDKDKGNHNSHKLS 184
Query: 191 P------SQGRSHNQSIVDSRD-EATKKHSRDLTGKDR-----HADKSGEKSER--ESKR 236
+ R N + +RD +AT + S+ GK D+ E+SER + KR
Sbjct: 185 TYTSGRVGKSRGGNHGEITARDGDATIQESQKRPGKRWIDEPVGNDRIKERSERRTDGKR 244
Query: 237 KHRNRDDEKNR--DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSRE 293
R DDEK+ D+ KK D + D + +RK+ +KE+ K H R KRRRS SR+
Sbjct: 245 NSRGFDDEKSSQVDRPTLKKQDAVRLQDPRHFDRKDGRKENAKLHHEEPRSKRRRSTSRD 304
Query: 294 R-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSG 350
++R+ RS+S S R Q R SY H+ P + + S R+H + D R + N SG
Sbjct: 305 HYQERHDRSVSPSSREQ-RHSYRGHGHDYYPPYHSMDKSRRKHFETDRHRTSWNDGYSSG 363
Query: 351 HYRRHDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKKSAKWDVAPVETYSVPS 405
YRR++ S LGGYSPRKR+T + KT S + RSPEKKSA WD PV T S
Sbjct: 364 SYRRYE---SRLGGYSPRKRKTAPKDEQTTIKTASLVIRSPEKKSATWDQLPVATDL--S 418
Query: 406 NVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRR 465
N T+ Q+ V PV TT K ++ L N+++DSVQLTQ+ RP+RR
Sbjct: 419 NFVTTLQSTVGLKDSTV---PVNFTTSKQDPNTTIGTILTGSNLAVDSVQLTQATRPLRR 475
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS-LPCIGCVIQREKGQAFVEFLTAEDA 524
L +ENLP A+E L+ LN FLLSS H+ S PC+ CVI ++K QAFVEFLT EDA
Sbjct: 476 LHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKRQAFVEFLTPEDA 535
Query: 525 SAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIGGISR 560
+AAL DG SF GS LKI+RPKE+VE+A IFI GIS
Sbjct: 536 TAALSFDGRSFGGSSLKIRRPKEYVEMAHVAPKKPSEEIKLISDVVADSPHKIFIAGISG 595
Query: 561 TLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+SS+M Y+D +T KA AGLNG+K+GG +L
Sbjct: 596 VISSEM------------------------------YIDHPITSKACAGLNGMKLGGGIL 625
Query: 621 TAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEV 680
TAV + + + PF+GIP A LL++PT+VL+LKNVF+ E + LS+ E+EE+
Sbjct: 626 TAVNVFPNSTEQAFNEASPFYGIPDSAKSLLEEPTKVLQLKNVFDQEEYLLLSKSELEEI 685
Query: 681 LEDVRLECA--------RFGSVKSVNVVKYGDSNIS----TIQACE------------GN 716
LEDVR+ECA RFG+VKS+NVVKY S+ + TI CE GN
Sbjct: 686 LEDVRVECASLHYGQDDRFGAVKSINVVKYPASSDNTTGDTITECEDGSTKIEPKEYGGN 745
Query: 717 ENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNNVKDNR 776
+ GV ++ N T+ + EV D + + + D K + GE +
Sbjct: 746 VSCTETGVECSVLNQSTDVPDPSICEVQDPVELDTDSIPKGRDHKNLDTRGECD------ 799
Query: 777 PASGTMGDEPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNI 836
+ T GDE +TD VE DST Q + H D ++
Sbjct: 800 --APTAGDE-------NTDQGVEADQTDSTD--------AQDDARGTIERGHADADPASL 842
Query: 837 QLEHMSEENKSSAKEDLNLEEVNGNSEAFTGASNEMGMQSSAVENGDNENQDPNQGHIFE 896
+ + + K E+ G G S ++ AV+ DN + E
Sbjct: 843 ETSCSTAPGDGADKSGRENEQQGG-----AGVSESNTEKAPAVDARDNALA--SNTSALE 895
Query: 897 PGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
GC+ VE++R EA+C+AAHSLH R F RIV+ Y P +LY ++ +
Sbjct: 896 AGCILVEFLRKEAACIAAHSLHGRRFGSRIVSAGYAPHDLYLQKYPR 942
>gi|357114131|ref|XP_003558854.1| PREDICTED: uncharacterized protein LOC100840355 [Brachypodium
distachyon]
Length = 840
Score = 366 bits (940), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 255/610 (41%), Positives = 352/610 (57%), Gaps = 67/610 (10%)
Query: 142 RTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETSSKPNVKSEKLIPSQGRSHNQSI 201
+ LKS IR +S G KD GS K ++S + E + GR+ ++
Sbjct: 25 KKLKSTIRNDSQSAG--KDEVTHGS--------QKTSTSMRGLADESKDANHGRTRARNA 74
Query: 202 VDSRDEATKKHSRDLTGKDRHADKSGEKSER---ESKRKHRNRDDE----KNRDKSAAKK 254
E+ K + G+ D+ EKSE+ E+KRK DD+ ++ D+ KK
Sbjct: 75 HTIGYESLKGPGKRWNGETVDNDRIKEKSEKPRNETKRKFHGFDDQIEKSQDLDRPVLKK 134
Query: 255 VDLGKGHDLKVSERKE-KKESPKSRHGN-SRLKRRRSRSREREDRNRRSLSLSPRAQKRT 312
D + D + S+R + +KE K +RR +R RS+SLSPR Q+R
Sbjct: 135 HDFARFRDSRHSDRNDGRKEYGKPYDEEPRSKRRRSRSRDHDRERRDRSVSLSPREQRR- 193
Query: 313 SYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNGL--SGHYRRHDGSTSGLGGYSPRKR 370
SY +H+ P GR++++ D R +GNG G YRR++ S LGGYSPRKR
Sbjct: 194 SYRGHDHDNYPP-------GRKYAETDRFRTSGNGGHNDGSYRRYE---SRLGGYSPRKR 243
Query: 371 RT-----EAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
+T + AKT P+ RSPE+KSA WD P + SN TS Q S +M SS
Sbjct: 244 KTAPQDEQMTAKTSFPVVRSPERKSATWDQGPAKADQ--SNFFTSLQPIVS---QMTSSV 298
Query: 426 PVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLN 485
V+S+ K + ++ L+ + S+DSVQLTQ+ RP+RRL +ENLP SASE L+ LN
Sbjct: 299 SVSSSAPKQNSATTLETILSGNSSSIDSVQLTQATRPLRRLHIENLPSSASEDMLIGCLN 358
Query: 486 NFLLSSGVQHVHGS-LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKR 544
+F LSS V H+ S PC+ C I +EK QAFVEFLT EDA+AAL DG SF+GS LKI+R
Sbjct: 359 DFFLSSDVNHIQKSKQPCLSCTINKEKHQAFVEFLTPEDATAALSFDGRSFNGSALKIRR 418
Query: 545 PKEFVEVA-----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYH 581
PKE++E+A IF+ GIS +SS+M+MEIV +FG L AY
Sbjct: 419 PKEYIEMANVVPKKTVEEIKLASDVADSPHKIFVAGISGVISSEMLMEIVSSFGQLAAYR 478
Query: 582 FEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFH 641
F+ +E CAF+EY+D +T KA AGLNG+K+GG +LTAVQ V + + PF+
Sbjct: 479 FQDHEALSGRCAFLEYIDHSITDKACAGLNGMKLGGCILTAVQ-VFPNPLEACNEASPFY 537
Query: 642 GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK 701
IP A LL+ PTEVL+LKNVF+ E + LS+ E+EE++ED+R+ECARFG+VKS+N+V+
Sbjct: 538 SIPDSAKMLLEAPTEVLQLKNVFDREEYLLLSKSELEEIMEDIRMECARFGAVKSINIVE 597
Query: 702 YGDSNISTIQ 711
Y S+ ST+Q
Sbjct: 598 YPASSDSTLQ 607
>gi|2459426|gb|AAB80661.1| putative splicing factor U2AF large chain [Arabidopsis thaliana]
Length = 475
Score = 332 bits (851), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 210/487 (43%), Positives = 271/487 (55%), Gaps = 89/487 (18%)
Query: 208 ATKKHSRDLTGKDRHADKSGEKSERESKRKHRNRDDEKNRDKSAAKKVDLGKGHDLKVSE 267
+KK R L D D+S K + KRK++N + +KNR+ + +K+ D GK H ++VSE
Sbjct: 8 CSKKRLRSLVVADVPRDESSIKPDNGDKRKNQNGNHKKNREINMSKRHDPGKVHSVEVSE 67
Query: 268 RKEKKESPKSRHGNSRLKRRRSRSREREDRNRRSLSLSPRAQKRTSYYDREHEGLPSHST 327
R E++E PK SR+R+ R +KR R+H
Sbjct: 68 RWERREQPK--------------SRQRDLR-----------EKRRRSRSRDH-------- 94
Query: 328 KISSGRQHSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSPE 387
G R+ S S LGGYSPRKRR +A+ K SP N S E
Sbjct: 95 ----------------------GQDRQKSASKSELGGYSPRKRREQASTKAASPPNLSSE 132
Query: 388 KKSAKWDVAPVETYSVPSN-VHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVS--ASL 444
KKSAKW +A T + S+ V + QAA+ A+ +S + T KPL +
Sbjct: 133 KKSAKWGLAATVTAGMFSDSVFSGLQAATQTAYPTISE--ASLTLLKPLMVMDAPFRTPP 190
Query: 445 AKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
A+ S DSVQLT+S R MRRL EN+P SASEK+L+E N ++LSSG H+ GS PCI
Sbjct: 191 ARQTTSFDSVQLTESTRRMRRLYAENVPDSASEKSLIECFNGYMLSSGSNHIKGSEPCIS 250
Query: 505 CVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSS 564
C+ FLT +DASAAL DGCSF+GS LKI+RPK+++
Sbjct: 251 CI-----------FLTPQDASAALSLDGCSFAGSNLKIRRPKDYL--------------- 284
Query: 565 KMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQ 624
MEIV FGPLKAY F N D + CAF+EY D VT KA AGLNG+++GG V+TAV
Sbjct: 285 ---MEIVSVFGPLKAYRFVSNNDLNQRCAFLEYTDGSVTLKACAGLNGMRLGGSVITAVC 341
Query: 625 AVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDV 684
A D S + + NPPF+GIP HA PLL KP +LKLKNV +PE +S SE EV+E+LEDV
Sbjct: 342 AFPDASSVAVNENPPFYGIPSHAKPLLGKPKNILKLKNVVDPEDLTSFSEQEVKEILEDV 401
Query: 685 RLECARF 691
RLECAR+
Sbjct: 402 RLECARW 408
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/48 (60%), Positives = 37/48 (77%)
Query: 894 IFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARF 941
+FE GC+F+EY R EA+C AAHSLH RL+D+RIV EY+ LY+ RF
Sbjct: 425 VFETGCIFIEYRRPEATCDAAHSLHGRLYDNRIVKAEYVSKELYQIRF 472
>gi|108706081|gb|ABF93876.1| RNA recognition motif family protein, expressed [Oryza sativa
Japonica Group]
Length = 704
Score = 325 bits (833), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 270/704 (38%), Positives = 374/704 (53%), Gaps = 80/704 (11%)
Query: 22 SNEGSAARTRPFSFDEIMIRRKTKNLSE--NIKEDAMDVSNIPGDCTIENVSDDHKSGRH 79
+++G+AARTRP S +IM RR+ K SE KE + SN S+ +SGR
Sbjct: 15 TSDGTAARTRPLSIQDIMSRREKKTASEAKKTKEGLEENSN--------GKSNHLESGRG 66
Query: 80 HGHKKSP---GVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDLETKL-KEDYV 135
+K ++ + K G G K+D ++ +R RD++ ++ K+D +
Sbjct: 67 SKSRKDVREMPLEGSKKDNSDKPGEGSKKDEM--RHTPREERRKDNMRDMQREVSKKDNL 124
Query: 136 AKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETS-----SKPNVKSEKLI 190
+ +++K + P + R +K + T++ S K N S KL
Sbjct: 125 KDRPKDISKIDNLKVKVKVPSKDDQRDAPKKSSKKERSSTRDDSHLVDKDKGNHNSHKLS 184
Query: 191 P------SQGRSHNQSIVDSRD-EATKKHSRDLTGKDR-----HADKSGEKSERES--KR 236
+ R N + +RD +AT + S+ GK D+ E+SER + KR
Sbjct: 185 TYTSGRVGKSRGGNHGEITARDGDATIQESQKRPGKRWIDEPVGNDRIKERSERRTDGKR 244
Query: 237 KHRNRDDEKNR--DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSRE 293
R DDEK+ D+ KK D + D + +RK+ +KE+ K H R KRRRS SR+
Sbjct: 245 NSRGFDDEKSSQVDRPTLKKQDAVRLQDPRHFDRKDGRKENAKLHHEEPRSKRRRSTSRD 304
Query: 294 R-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSG 350
++R+ RS+S S R Q R SY H+ P + + S R+H + D R + N SG
Sbjct: 305 HYQERHDRSVSPSSREQ-RHSYRGHGHDYYPPYHSMDKSRRKHFETDRHRTSWNDGYSSG 363
Query: 351 HYRRHDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKKSAKWDVAPVETYSVPS 405
YRR++ S LGGYSPRKR+T + KT S + RSPEKKSA WD PV T S
Sbjct: 364 SYRRYE---SRLGGYSPRKRKTAPKDEQTTIKTASLVIRSPEKKSATWDQLPVATDL--S 418
Query: 406 NVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRR 465
N T+ Q+ V PV TT K ++ L N+++DSVQLTQ+ RP+RR
Sbjct: 419 NFVTTLQSTVGLKDSTV---PVNFTTSKQDPNTTIGTILTGSNLAVDSVQLTQATRPLRR 475
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS-LPCIGCVIQREKGQAFVEFLTAEDA 524
L +ENLP A+E L+ LN FLLSS H+ S PC+ CVI ++K QAFVEFLT EDA
Sbjct: 476 LHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKRQAFVEFLTPEDA 535
Query: 525 SAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIGGISR 560
+AAL DG SF GS LKI+RPKE+VE+A IFI GIS
Sbjct: 536 TAALSFDGRSFGGSSLKIRRPKEYVEMAHVAPKKPSEEIKLISDVVADSPHKIFIAGISG 595
Query: 561 TLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+SS+M+MEIV +FGPL AY F NE CAF+EY+D +T KA AGLNG+K+GG +L
Sbjct: 596 VISSEMLMEIVSSFGPLAAYRFLFNEYLGGACAFLEYIDHSITSKACAGLNGMKLGGGIL 655
Query: 621 TAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVF 664
TAV + + + PF+GIP A LL++PT+VL+LKNV
Sbjct: 656 TAVNVFPNSTEQAFNEASPFYGIPDSAKSLLEEPTKVLQLKNVL 699
>gi|108706082|gb|ABF93877.1| RNA recognition motif family protein, expressed [Oryza sativa
Japonica Group]
Length = 720
Score = 324 bits (830), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 270/703 (38%), Positives = 374/703 (53%), Gaps = 80/703 (11%)
Query: 22 SNEGSAARTRPFSFDEIMIRRKTKNLSE--NIKEDAMDVSNIPGDCTIENVSDDHKSGRH 79
+++G+AARTRP S +IM RR+ K SE KE + SN S+ +SGR
Sbjct: 15 TSDGTAARTRPLSIQDIMSRREKKTASEAKKTKEGLEENSN--------GKSNHLESGRG 66
Query: 80 HGHKKSP---GVDVHTSEEYVKVGSGKKEDNALTKNVDSSRQRDGENRDLETKL-KEDYV 135
+K ++ + K G G K+D ++ +R RD++ ++ K+D +
Sbjct: 67 SKSRKDVREMPLEGSKKDNSDKPGEGSKKDEM--RHTPREERRKDNMRDMQREVSKKDNL 124
Query: 136 AKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKLKADVTKETS-----SKPNVKSEKLI 190
+ +++K + P + R +K + T++ S K N S KL
Sbjct: 125 KDRPKDISKIDNLKVKVKVPSKDDQRDAPKKSSKKERSSTRDDSHLVDKDKGNHNSHKLS 184
Query: 191 P------SQGRSHNQSIVDSRD-EATKKHSRDLTGKDR-----HADKSGEKSERES--KR 236
+ R N + +RD +AT + S+ GK D+ E+SER + KR
Sbjct: 185 TYTSGRVGKSRGGNHGEITARDGDATIQESQKRPGKRWIDEPVGNDRIKERSERRTDGKR 244
Query: 237 KHRNRDDEKNR--DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRSRSRE 293
R DDEK+ D+ KK D + D + +RK+ +KE+ K H R KRRRS SR+
Sbjct: 245 NSRGFDDEKSSQVDRPTLKKQDAVRLQDPRHFDRKDGRKENAKLHHEEPRSKRRRSTSRD 304
Query: 294 R-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNG--LSG 350
++R+ RS+S S R Q R SY H+ P + + S R+H + D R + N SG
Sbjct: 305 HYQERHDRSVSPSSREQ-RHSYRGHGHDYYPPYHSMDKSRRKHFETDRHRTSWNDGYSSG 363
Query: 351 HYRRHDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKKSAKWDVAPVETYSVPS 405
YRR++ S LGGYSPRKR+T + KT S + RSPEKKSA WD PV T S
Sbjct: 364 SYRRYE---SRLGGYSPRKRKTAPKDEQTTIKTASLVIRSPEKKSATWDQLPVATDL--S 418
Query: 406 NVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRR 465
N T+ Q+ V PV TT K ++ L N+++DSVQLTQ+ RP+RR
Sbjct: 419 NFVTTLQSTVGLKDSTV---PVNFTTSKQDPNTTIGTILTGSNLAVDSVQLTQATRPLRR 475
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS-LPCIGCVIQREKGQAFVEFLTAEDA 524
L +ENLP A+E L+ LN FLLSS H+ S PC+ CVI ++K QAFVEFLT EDA
Sbjct: 476 LHIENLPSLATEDMLIGCLNEFLLSSSASHIQRSKQPCLSCVINKDKRQAFVEFLTPEDA 535
Query: 525 SAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIGGISR 560
+AAL DG SF GS LKI+RPKE+VE+A IFI GIS
Sbjct: 536 TAALSFDGRSFGGSSLKIRRPKEYVEMAHVAPKKPSEEIKLISDVVADSPHKIFIAGISG 595
Query: 561 TLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+SS+M+MEIV +FGPL AY F NE CAF+EY+D +T KA AGLNG+K+GG +L
Sbjct: 596 VISSEMLMEIVSSFGPLAAYRFLFNEYLGGACAFLEYIDHSITSKACAGLNGMKLGGGIL 655
Query: 621 TAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNV 663
TAV + + + PF+GIP A LL++PT+VL+LKNV
Sbjct: 656 TAVNVFPNSTEQAFNEASPFYGIPDSAKSLLEEPTKVLQLKNV 698
>gi|297801306|ref|XP_002868537.1| hypothetical protein ARALYDRAFT_355725 [Arabidopsis lyrata subsp.
lyrata]
gi|297314373|gb|EFH44796.1| hypothetical protein ARALYDRAFT_355725 [Arabidopsis lyrata subsp.
lyrata]
Length = 1370
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 245/677 (36%), Positives = 347/677 (51%), Gaps = 132/677 (19%)
Query: 59 SNIPGDCTIENVSDDHKSGRHHGHKKSPGVDVHTSEEYVKVGSGKKED---NALTKNVDS 115
+ + G+ ++ VSD H+S + G+ +S + EE V+ S KKE+ ++ +N+D
Sbjct: 498 AQLLGNDLVDKVSDYHESEK--GYDRSEKL---RREERVRTSSRKKEEAISSSREENLDK 552
Query: 116 -----------SRQRDGENRDLETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNR 164
R+ +GE+ ETKL E++ S RR K E + +K +
Sbjct: 553 RKKEEEPAANRKRKAEGESSTAETKLVEEH---SKDRRRKKEETNFSCKEERRDKKKKKE 609
Query: 165 GSDTKLKADVTKETSSKPNVKSEKLIPSQGRSHNQSIVDSRDEATKKHSRDLTGKDRHAD 224
+ + T+ + + ++ Q RS +++ SR K R L DR D
Sbjct: 610 DRKIEGELPTTEIITMTDRDRLDETDSEQARSALENLSSSR-----KRLRSLVVTDRPRD 664
Query: 225 KSGEKSERESKRKHRNRDDEKNRDKSAAKKVDLGKGHDLKVSERKEKKESPKSRHGNSRL 284
++ K + +KRK++N D +KNR+++ +K+ D GK H ++VSER +K+E PKS R
Sbjct: 665 ENSMKPDNGNKRKNQNGDHKKNRERNMSKRHDPGKVHSVEVSERWQKREQPKSHQREMRK 724
Query: 285 KRRRSRSRER-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRV 343
KRRRSRSR+ +DR +RS L PRAQK TS + R+HE ++ SG+ H + + ++V
Sbjct: 725 KRRRSRSRDHGQDRQKRSSPL-PRAQKATSRHKRDHEERLENAVTDRSGKHHFNDNGNKV 783
Query: 344 TGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSV 403
++ +R+ S S LGGYSPRKRR EA+AK SP N S EKKSAKWD+AP T ++
Sbjct: 784 AST-VNNKSKRYSASKSELGGYSPRKRREEASAKAVSPPNLSSEKKSAKWDLAPAVTAAM 842
Query: 404 PS-NVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRP 462
S +V + QAA+ A+ S + T KPL A+ S+DSVQLT+S R
Sbjct: 843 FSGSVFSGLQAAAQTAYPTNSE--ASLTLLKPLMEAPFRTPSAREITSVDSVQLTESTRR 900
Query: 463 MRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAE 522
MRRL EN+ SASEK+L+E N+++LSSG H+ GS PCI C+I +EK QA VEFLT
Sbjct: 901 MRRLYAENVSDSASEKSLIECFNSYMLSSGSNHIKGSEPCISCIINKEKSQALVEFLTPH 960
Query: 523 DASAALCCDGCSFSGSILKIKRPKEFVE-------------------------------- 550
DASAAL DGCSF+G LKI+RPK +VE
Sbjct: 961 DASAALSLDGCSFAGLNLKIRRPKGYVETTGVYVGYVIIHIQEGDEAVCYVMVTIHEAGF 1020
Query: 551 --VAIF----------------------------IGGISRTLSSKMVMEIVCAFGPLKAY 580
VAIF IGG +++SS+M+MEIV FGPLKAY
Sbjct: 1021 QTVAIFMQSGELAKKEPATNAISDNVKDSSNKIFIGGFPKSISSEMLMEIVSVFGPLKAY 1080
Query: 581 HFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPF 640
F +N D + CAF+E + NPPF
Sbjct: 1081 RFVINNDLNKRCAFLEVNE-------------------------------------NPPF 1103
Query: 641 HGIPKHALPLLKKPTEV 657
+GIP+HA PLL KP E+
Sbjct: 1104 YGIPEHAKPLLGKPKEM 1120
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 52/84 (61%), Gaps = 5/84 (5%)
Query: 23 NEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSNIPGDCTIENVSDDHKSGRHHGH 82
NEGSAARTRP S DEI +RRK K EN+ E+ + V+ + G+ +E D H+S + G+
Sbjct: 19 NEGSAARTRPTSVDEIRLRRKRKESLENVTEETVGVAQLLGNDLVEKAPDYHESEK--GY 76
Query: 83 KKSPGVDVHTSEEYVKVGSGKKED 106
+S + EE+VK S KKED
Sbjct: 77 DRSKNL---RHEEHVKDSSRKKED 97
>gi|30685698|ref|NP_850209.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
gi|330253740|gb|AEC08834.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
Length = 979
Score = 269 bits (688), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 210/515 (40%), Positives = 295/515 (57%), Gaps = 36/515 (6%)
Query: 51 IKEDAMDVSNIPGDCTIENVSDDHKSGRHHGHKKSPGVDVHTSEEYVKVGSGKKEDNALT 110
IKED + + + G+ +E VSD H++ + G+ +S + + EE VK S KKE+
Sbjct: 477 IKEDHVGAAQLLGNDLVEMVSDYHETEK--GYDRSKKL---SREERVKDSSRKKEEA--- 528
Query: 111 KNVDSSRQRDGENRDLETKLKEDYVAKSNHRRTLKSEIRLKDRSPGNEKDRGNRGSDTKL 170
+ SSR+ EN D K K+D + SN +R + E + E + RG +
Sbjct: 529 --ISSSRE---ENLD---KQKKDE-STSNRKRKAEGECSTAETESIEEHSKDRRGKKEET 579
Query: 171 KADVTKETSSKP--------NVKSEKLIPSQGRSHNQSIVDSRDEATKKHSRDLTGKDRH 222
++ +E K N K E IP+ + +++ D +KK R L D
Sbjct: 580 NSNCREERRDKKMKKEDLASNRKIEGEIPT---TETKTMTDRDGLCSKKRLRSLVVADVP 636
Query: 223 ADKSGEKSERESKRKHRNRDDEKNRDKSAAKKVDLGKGHDLKVSERKEKKESPKSRHGNS 282
D+S K + KRK++N + +KNR+ + +K+ D GK H ++VSER E++E PKSR +
Sbjct: 637 RDESSIKPDNGDKRKNQNGNHKKNREINMSKRHDPGKVHSVEVSERWERREQPKSRQRDL 696
Query: 283 RLKRRRSRSRER-EDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSS 341
R KRRRSRSR+ +DR +RS L PRA+K TS + R HE + K SG+ H + +
Sbjct: 697 REKRRRSRSRDHGQDRQKRSSPL-PRAEKATSRHKRNHEERSENVVKDRSGKHHCNDNED 755
Query: 342 RVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETY 401
+VT +S RR+ S S LGGYSPRKRR +A+ K SP N S EKKSAKW +A T
Sbjct: 756 KVTST-VSNKSRRYSASKSELGGYSPRKRREQASTKAASPPNLSSEKKSAKWGLAATVTA 814
Query: 402 SVPSN-VHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASL--AKLNVSMDSVQLTQ 458
+ S+ V + QAA+ A+ +S + T KPL + A+ S DSVQLT+
Sbjct: 815 GMFSDSVFSGLQAATQTAYPTISE--ASLTLLKPLMVMDAPFRTPPARQTTSFDSVQLTE 872
Query: 459 SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEF 518
S R MRRL EN+P SASEK+L+E N ++LSSG H+ GS PCI C+I +EK QA VEF
Sbjct: 873 STRRMRRLYAENVPDSASEKSLIECFNGYMLSSGSNHIKGSEPCISCIINKEKSQALVEF 932
Query: 519 LTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAI 553
LT +DASAAL DGCSF+GS LKI+RPK++V +
Sbjct: 933 LTPQDASAALSLDGCSFAGSNLKIRRPKDYVRTTV 967
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 8 KEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSNIPGDCTI 67
K+K + G +L + EGSAARTRP S +EI +RRK K EN+KE+ + + + G +
Sbjct: 6 KQKENRDGHETLVN-EEGSAARTRPTSVEEIRLRRKRKESLENVKEETVGAAQLLGYDLV 64
Query: 68 ENVSDDHKS 76
E SD H+S
Sbjct: 65 EKASDYHES 73
>gi|147795302|emb|CAN69457.1| hypothetical protein VITISV_036574 [Vitis vinifera]
Length = 630
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 152/352 (43%), Positives = 212/352 (60%), Gaps = 23/352 (6%)
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT 655
+YVDQ VT KA AGLNG+K+GGQVLT VQA+ + M+N+GN PF+GIP+HA PLL++PT
Sbjct: 298 QYVDQSVTLKACAGLNGMKLGGQVLTVVQAIPNALAMENTGNLPFYGIPEHAKPLLERPT 357
Query: 656 EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACEG 715
+VLKLKNV NP+ SSLSE E+EE+LED+RLEC RFG+VKSVN+VKY +S++ T
Sbjct: 358 QVLKLKNVVNPDDLSSLSEAELEEILEDIRLECTRFGTVKSVNIVKYNNSHVLTWVVMVN 417
Query: 716 NENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNNVKDN 775
+ + G G D+ + ++ TD K D KEV E E N + D+
Sbjct: 418 SMKAETLGGG----TDDGSSDISGIKPPTDVK-----------DLKEVDEVVERNGISDD 462
Query: 776 RPASGTMGDEPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCN 835
+ + + +E + ++D++ AV+ S+ + +G+P Q+N +K ++K
Sbjct: 463 KSLTDLIKNELCEPSQIDSNTAVKEPGCPDGSDDIPRGLPDQLNNMKHEVELRNNKAADV 522
Query: 836 IQLEHMSEENKSSAKEDLNLEEVNGNSEAFTGASNEM----GMQSSAVENGDNENQDPNQ 891
IQ E + +NK E+L LEE N G S E+ G++S D+E N
Sbjct: 523 IQ-EDIIIKNKLMTVEELKLEETN---RKLLGTSAELDSSPGIKSDFTGKNDSEKGLCNL 578
Query: 892 GHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
+FE GCV VEY R EASCMAAH LH R FDDR+V V Y+ L+LYR +F +
Sbjct: 579 DDMFEVGCVLVEYGRTEASCMAAHCLHGRYFDDRVVVVGYVALDLYRMKFPR 630
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/35 (80%), Positives = 30/35 (85%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED 587
IFIGGISR LSS M+MEI AFGPLKAY F+VNED
Sbjct: 133 IFIGGISRALSSDMLMEIAAAFGPLKAYRFQVNED 167
>gi|115450613|ref|NP_001048907.1| Os03g0138100 [Oryza sativa Japonica Group]
gi|113547378|dbj|BAF10821.1| Os03g0138100 [Oryza sativa Japonica Group]
Length = 404
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 150/407 (36%), Positives = 211/407 (51%), Gaps = 46/407 (11%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNG 612
IFI GIS +SS+M+MEIV +FGPL AY F NE CAF+EY+D +T KA AGLNG
Sbjct: 28 IFIAGISGVISSEMLMEIVSSFGPLAAYRFLFNEYLGGACAFLEYIDHSITSKACAGLNG 87
Query: 613 LKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
+K+GG +LTAV + + + PF+GIP A LL++PT+VL+LKNVF+ E + L
Sbjct: 88 MKLGGGILTAVNVFPNSTEQAFNEASPFYGIPDSAKSLLEEPTKVLQLKNVFDQEEYLLL 147
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNIS----TIQACE------------GN 716
S+ E+EE+LEDVR+ECARFG+VKS+NVVKY S+ + TI CE GN
Sbjct: 148 SKSELEEILEDVRVECARFGAVKSINVVKYPASSDNTTGDTITECEDGSTKIEPKEYGGN 207
Query: 717 ENTASAGVGQNLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNNVKDNR 776
+ GV ++ N T+ + EV D + + + D K + GE +
Sbjct: 208 VSCTETGVECSVLNQSTDVPDPSICEVQDPVELDTDSIPKGRDHKNLDTRGECD------ 261
Query: 777 PASGTMGDEPSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNI 836
+ T GDE +TD VE DST Q + H D ++
Sbjct: 262 --APTAGDE-------NTDQGVEADQTDSTD--------AQDDARGTIERGHADADPASL 304
Query: 837 QLEHMSEENKSSAKEDLNLEEVNGNSEAFTGASNEMGMQSSAVENGDNENQDPNQGHIFE 896
+ + + K E+ G G S ++ AV+ DN + E
Sbjct: 305 ETSCSTAPGDGADKSGRENEQQGG-----AGVSESNTEKAPAVDARDNALA--SNTSALE 357
Query: 897 PGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
GC+ VE++R EA+C+AAHSLH R F RIV+ Y P +LY ++ +
Sbjct: 358 AGCILVEFLRKEAACIAAHSLHGRRFGSRIVSAGYAPHDLYLQKYPR 404
>gi|413956975|gb|AFW89624.1| hypothetical protein ZEAMMB73_282398 [Zea mays]
Length = 378
Score = 199 bits (507), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 142/396 (35%), Positives = 202/396 (51%), Gaps = 36/396 (9%)
Query: 566 MVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
M+MEIV AFGPL AY F N + PCAF+EY D+ VT KA AGLNG+ +GG+VLTAV
Sbjct: 1 MLMEIVSAFGPLAAYRFLFNNELGGPCAFLEYADRSVTSKACAGLNGMMLGGRVLTAVHV 60
Query: 626 VLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVR 685
+ + + PF+GIP +A LLK+PT+VL+LKNVF E + LS+ E+EE LEDVR
Sbjct: 61 FPNPHVEAANEASPFYGIPDNAKLLLKEPTKVLQLKNVFEREEYMLLSKSELEETLEDVR 120
Query: 686 LECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGV-GQNLTNDETNEKGERLEEVT 744
+EC RFG+VKSVNVV+Y + +S + EN + ++ E K V
Sbjct: 121 VECTRFGAVKSVNVVEYPAAGVSAAE-----ENIVELKIECTEFSDTENIAKAVSEYSVP 175
Query: 745 DHKSIKNNELEILNDSKEVMEAGEVNNVKDNRPASGTMGDEPSQLCELDTDMAVEYQARD 804
SI + +D+K+V E + KD S + LCE + +A E D
Sbjct: 176 IIPSIDVLNHSVASDTKDVDLIPESQDQKDKHLPSN------AALCESEAPVADEDAELD 229
Query: 805 STSEIVSQGVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEVNGN--- 861
T + P ++K T ++ ++A +D +E+ +G+
Sbjct: 230 ETQSRAALPTPQHAEADHTEAAVDENKHTGAGKV-------TATATDDDAVEKSHGDPRT 282
Query: 862 SEAFT-----------GASNEMGMQSSAVENGDNENQ---DPNQGHIFEPGCVFVEYMRA 907
SE G +E G + + E Q + G +FEPG V VE++R
Sbjct: 283 SETCNPAGPTDKAEKPGRYSEQGAGDVTEDRPEKEAQAVGTSDTGFVFEPGSVLVEFLRE 342
Query: 908 EASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
EA+CMAAHSLH R F +R V Y P +LY ++ +
Sbjct: 343 EAACMAAHSLHGRRFGNRTVHAGYAPYDLYLQKYPR 378
>gi|326492263|dbj|BAK01915.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 622
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 149/356 (41%), Positives = 201/356 (56%), Gaps = 34/356 (9%)
Query: 233 ESKRKHRNRDDEKNR--DKSAAKKVDLGKGHDLKVSERKE-KKESPKSRHGNSRLKRRRS 289
E+KRK DDEK D+ KK D + K +R + ++E K H RLKRRRS
Sbjct: 279 EAKRKIHGFDDEKTSYVDRPVLKKHDSARSQVSKHYDRNDARREYGKPYHEEPRLKRRRS 338
Query: 290 RSRERED-RNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRVT---G 345
RSR+ + R RSLSLSPR Q+R SY+ +H P GR++++ D R + G
Sbjct: 339 RSRDHDQGRRDRSLSLSPREQRR-SYHGHDHGNYPP-------GRKYAENDRHRTSDNVG 390
Query: 346 NGLSGHYRRHDGSTSGLGGYSPRKRRT-----EAAAKTPSPINRSPEKKSAKWDVAPVET 400
G G Y+R++ S LGGYSPRKR+T + + P+ RS EKK WD P
Sbjct: 391 QG-GGSYQRYE---SRLGGYSPRKRKTVPQDEQLSTTITPPVIRSSEKKPVTWDQPPAAA 446
Query: 401 YSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN 460
S T+ Q S + S ++ Q P ++ L+ + S+DSVQLTQ+
Sbjct: 447 DQ--SIFFTNLQPIVSQTSSVSVS--FSAPKQNP--ATALDTILSGNSSSVDSVQLTQAT 500
Query: 461 RPMRRLCVENLPLSASEKALMEFLNNFLLSSG-VQHVHGS-LPCIGCVIQREKGQAFVEF 518
RP+RRL +ENL SASE AL+ LN+FLLS+G + + S PC+ C I +EK QAFVEF
Sbjct: 501 RPLRRLHIENLASSASEDALIGCLNDFLLSTGDINRIQRSKQPCLSCTINKEKRQAFVEF 560
Query: 519 LTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSS--KMVMEIVC 572
LT EDA+AAL DG SF+GS LKI+RPKE++E+A+ +S L S + I C
Sbjct: 561 LTPEDATAALSFDGRSFNGSTLKIRRPKEYIEMAVRSPSLSHLLFSIDGFTISINC 616
>gi|224110606|ref|XP_002315575.1| predicted protein [Populus trichocarpa]
gi|222864615|gb|EEF01746.1| predicted protein [Populus trichocarpa]
Length = 251
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/146 (65%), Positives = 113/146 (77%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNG 612
IFIGGIS+ LSSKM+MEI AFGPLKAY FE ++D +EP AF+EY D+ VT KA AGLNG
Sbjct: 42 IFIGGISKVLSSKMLMEIASAFGPLKAYQFENSKDPDEPFAFLEYADESVTFKACAGLNG 101
Query: 613 LKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
+K+GGQV+TA+QAV + S + GN F I +HA LL+KPTEVLKLKNVF+ E SSL
Sbjct: 102 MKLGGQVITAIQAVPNASSSGSDGNSQFGQISQHAKALLEKPTEVLKLKNVFDSESLSSL 161
Query: 673 SELEVEEVLEDVRLECARFGSVKSVN 698
S EVEEVLEDVRLECAR+ +V V
Sbjct: 162 SNTEVEEVLEDVRLECARYYNVDKVT 187
Score = 89.4 bits (220), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 39/57 (68%), Positives = 43/57 (75%)
Query: 885 ENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARF 941
E D N G IFE GCVFVE+ R EA+CMAAH LH RLFDDR V VEY PL++Y ARF
Sbjct: 194 EVDDCNLGLIFERGCVFVEFRRTEAACMAAHCLHGRLFDDRAVVVEYFPLDIYLARF 250
>gi|334184658|ref|NP_850210.2| RNA recognition motif-containing protein [Arabidopsis thaliana]
gi|330253741|gb|AEC08835.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
Length = 322
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/183 (50%), Positives = 123/183 (67%), Gaps = 3/183 (1%)
Query: 566 MVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
M+MEIV FGPLKAY F N D + CAF+EY D VT KA AGLNG+++GG V+TAV A
Sbjct: 1 MLMEIVSVFGPLKAYRFVSNNDLNQRCAFLEYTDGSVTLKACAGLNGMRLGGSVITAVCA 60
Query: 626 VLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVR 685
D S + + NPPF+GIP HA PLL KP +LKLKNV +PE +S SE EV+E+LEDVR
Sbjct: 61 FPDASSVAVNENPPFYGIPSHAKPLLGKPKNILKLKNVVDPEDLTSFSEQEVKEILEDVR 120
Query: 686 LECARFGSVKSVNVV--KYGDSNISTIQACEGNENTASAGVGQNLTNDETNEKGERLEEV 743
LECARFG +KS+N++ K D +S E+T S + ++ +E +E E+ E++
Sbjct: 121 LECARFGVIKSINILEHKSKDITVSETNPLLNLESTDSKEMNVSVI-EEKDEGSEKAEDI 179
Query: 744 TDH 746
D+
Sbjct: 180 ADN 182
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 29/48 (60%), Positives = 37/48 (77%)
Query: 894 IFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARF 941
+FE GC+F+EY R EA+C AAHSLH RL+D+RIV EY+ LY+ RF
Sbjct: 272 VFETGCIFIEYRRPEATCDAAHSLHGRLYDNRIVKAEYVSKELYQIRF 319
>gi|357452335|ref|XP_003596444.1| hypothetical protein MTR_2g077660 [Medicago truncatula]
gi|355485492|gb|AES66695.1| hypothetical protein MTR_2g077660 [Medicago truncatula]
Length = 325
Score = 176 bits (447), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 131/338 (38%), Positives = 184/338 (54%), Gaps = 20/338 (5%)
Query: 613 LKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
+ +GG+VLT VQA+ D S ++N PP +GIP+HA PLL++PT+VL++KNVF E SSL
Sbjct: 1 MNLGGEVLTVVQAMPDASPVENDSKPPSYGIPEHAEPLLREPTQVLEIKNVFTVESLSSL 60
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNVVKYG-DSNIST-----IQACEGNENTASAGVGQ 726
S+ +EE+LEDVRLECARFG+VKS++V ++ D+N++T I+ E T
Sbjct: 61 SDTVIEEILEDVRLECARFGTVKSIHVARHCKDNNLATKSEEVIKKVGSEEPTPDT---H 117
Query: 727 NLTNDETNEKGERLEEVTDHKSIKNNELEILNDSKEVMEAGEVNN-VKDNRPASGTMGDE 785
+TND + EE T S +E D +V+E + N+ N + + D
Sbjct: 118 TVTNDAESSFS---EEATYSNSKGTGGMESHGD--KVLEEDKDNDGTSVNVDKNAEVFDN 172
Query: 786 PSQLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEEN 845
+ L +D V + QG P Q NT D P HD + +I +E N
Sbjct: 173 TTCQEHLVSDSTVIDAGNEDLPSSTIQGCPDQGNTPNDDPELHDSMIANDIDVEKTVIGN 232
Query: 846 KSSAKEDLNLEEVNGNSEAFTGASNEMGMQSSAVENGDNENQDPNQGHIFEPGCVFVEYM 905
S L+E G SE AS E+ ++ D E D H+FE G V VEY
Sbjct: 233 TDSENMVYPLQE--GFSEC--DASLELVGNRKDIKEEDEEEDD-TYNHVFEEGSVLVEYA 287
Query: 906 RAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
R+EA AAH +HRRLFD R+V+V+Y+ L+LYR RF+K
Sbjct: 288 RSEACRSAAHCMHRRLFDGRLVSVQYVALSLYRERFTK 325
>gi|218189760|gb|EEC72187.1| hypothetical protein OsI_05261 [Oryza sativa Indica Group]
Length = 485
Score = 172 bits (436), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 84/150 (56%), Positives = 109/150 (72%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNG 612
IFI GI R +SSKM+ +IV +FG L AY F NED CAF+EY+D +T KA AGLNG
Sbjct: 264 IFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGACAFLEYIDHSITSKACAGLNG 323
Query: 613 LKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
+K+GG V+TAV + D + PFHGIP + PLL PT+VL+LKNVF+ E +S L
Sbjct: 324 MKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLKNVFDQEEYSLL 383
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNVVKY 702
S+ EV+ VLEDVR++CAR+G+VKS+NVV+Y
Sbjct: 384 SKYEVDAVLEDVRVKCARYGAVKSINVVEY 413
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 6/124 (4%)
Query: 813 GVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEVNGNSEAFTGASNEM 872
VPTQV LK+ + + +++ + E+ + ++ +N E G+ N
Sbjct: 363 AVPTQVLQLKNVFDQEEYSLLSKYEVDAVLEDVRVKCARYGAVKSIN-VVEYPAGSDNT- 420
Query: 873 GMQSSAVENGDNENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYI 932
++ AV+ DN N E GC+ VE++ EAS MAAHSLH R F RIV+ Y
Sbjct: 421 --KAPAVDARDNALASNNTA--LEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGYA 476
Query: 933 PLNL 936
P +L
Sbjct: 477 PYDL 480
>gi|115442323|ref|NP_001045441.1| Os01g0956600 [Oryza sativa Japonica Group]
gi|57900079|dbj|BAD88141.1| splicing factor family protein-like [Oryza sativa Japonica Group]
gi|57900192|dbj|BAD88277.1| splicing factor family protein-like [Oryza sativa Japonica Group]
gi|113534972|dbj|BAF07355.1| Os01g0956600 [Oryza sativa Japonica Group]
gi|215736836|dbj|BAG95765.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 608
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 84/150 (56%), Positives = 109/150 (72%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNG 612
IFI GI R +SSKM+ +IV +FG L AY F NED CAF+EY+D +T KA AGLNG
Sbjct: 380 IFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGACAFLEYIDHSITSKACAGLNG 439
Query: 613 LKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
+K+GG V+TAV + D + PFHGIP + PLL PT+VL+LKNVF+ E +S L
Sbjct: 440 MKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLKNVFDQEEYSLL 499
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNVVKY 702
S+ EV+ VLEDVR++CAR+G+VKS+NVV+Y
Sbjct: 500 SKYEVDAVLEDVRVKCARYGAVKSINVVEY 529
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 6/124 (4%)
Query: 813 GVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEVNGNSEAFTGASNEM 872
VPTQV LK+ + + +++ + E+ + ++ +N E G+ N
Sbjct: 479 AVPTQVLQLKNVFDQEEYSLLSKYEVDAVLEDVRVKCARYGAVKSIN-VVEYPAGSDNT- 536
Query: 873 GMQSSAVENGDNENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYI 932
++ AV+ DN N E GC+ VE++ EAS MAAHSLH R F RIV+ Y
Sbjct: 537 --KAPAVDARDNALASNNTA--LEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGYA 592
Query: 933 PLNL 936
P +L
Sbjct: 593 PYDL 596
>gi|222619898|gb|EEE56030.1| hypothetical protein OsJ_04814 [Oryza sativa Japonica Group]
Length = 658
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 84/150 (56%), Positives = 109/150 (72%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGLNG 612
IFI GI R +SSKM+ +IV +FG L AY F NED CAF+EY+D +T KA AGLNG
Sbjct: 380 IFIAGIPRVISSKMLRDIVSSFGQLAAYRFLFNEDLGGACAFLEYIDHSITSKACAGLNG 439
Query: 613 LKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
+K+GG V+TAV + D + PFHGIP + PLL PT+VL+LKNVF+ E +S L
Sbjct: 440 MKLGGCVITAVGVLTDHPGQAGNEACPFHGIPANPKPLLAVPTQVLQLKNVFDQEEYSLL 499
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNVVKY 702
S+ EV+ VLEDVR++CAR+G+VKS+NVV+Y
Sbjct: 500 SKYEVDAVLEDVRVKCARYGAVKSINVVEY 529
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 6/124 (4%)
Query: 813 GVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKSSAKEDLNLEEVNGNSEAFTGASNEM 872
VPTQV LK+ + + +++ + E+ + ++ +N E G+ N
Sbjct: 479 AVPTQVLQLKNVFDQEEYSLLSKYEVDAVLEDVRVKCARYGAVKSINV-VEYPAGSDNT- 536
Query: 873 GMQSSAVENGDNENQDPNQGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYI 932
++ AV+ DN N E GC+ VE++ EAS MAAHSLH R F RIV+ Y
Sbjct: 537 --KAPAVDARDNALASNNTA--LEAGCILVEFLCKEASFMAAHSLHGRPFGSRIVSAGYA 592
Query: 933 PLNL 936
P +L
Sbjct: 593 PYDL 596
>gi|293336129|ref|NP_001170011.1| uncharacterized protein LOC100383918 [Zea mays]
gi|224032879|gb|ACN35515.1| unknown [Zea mays]
Length = 331
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 112/353 (31%), Positives = 172/353 (48%), Gaps = 44/353 (12%)
Query: 613 LKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
+ +GG+VLTAV + + + PF+GIP +A LLK+PT+VL+LKNVF E + L
Sbjct: 1 MMLGGRVLTAVHVFPNPHVEAANEASPFYGIPDNAKLLLKEPTKVLQLKNVFEREEYMLL 60
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGVGQNLTNDE 732
S+ E+EE LEDVR+EC RFG+VKSVNVV+Y + +S + EN + +D
Sbjct: 61 SKSELEETLEDVRVECTRFGAVKSVNVVEYPAAGVSAAE-----ENIVELKIECTEFSDT 115
Query: 733 TNEKGERLEEVTDHKSIKNNELEILN-----DSKEVMEAGEVNNVKDNRPASGTMGDEPS 787
N + V+++ +++LN D+K+V E + KD S +
Sbjct: 116 EN----IAKAVSEYSVPIIPSIDVLNHSVASDTKDVDLIPESQDQKDKHLPSN------A 165
Query: 788 QLCELDTDMAVEYQARDSTSEIVSQGVPTQVNTLKDSPCAHDDKVTCNIQLEHMSEENKS 847
LCE + +A E D T + P ++K T ++ +
Sbjct: 166 ALCESEAPVADEDAELDETQSRAALPTPQHAEADHTEAAVDENKHTGAGKV-------TA 218
Query: 848 SAKEDLNLEEVNGN---SEAFT-----------GASNEMGMQSSAVENGDNENQ---DPN 890
+A +D +E+ +G+ SE G +E G + + E Q +
Sbjct: 219 TATDDDAVEKSHGDPRTSETCNPAGPTDKAEKPGRYSEQGAGDVTEDRPEKEAQAVGTSD 278
Query: 891 QGHIFEPGCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
G +FEPG V VE++R EA+CMAAHSLH R F +R V Y P +LY ++ +
Sbjct: 279 TGFVFEPGSVLVEFLREEAACMAAHSLHGRRFGNRTVHAGYAPYDLYLQKYPR 331
>gi|71996485|ref|NP_001022969.1| Protein UAF-1, isoform c [Caenorhabditis elegans]
gi|351018336|emb|CCD62280.1| Protein UAF-1, isoform c [Caenorhabditis elegans]
Length = 474
Score = 135 bits (341), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 168/348 (48%), Gaps = 56/348 (16%)
Query: 377 KTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLA 436
+ P P KK WDV P + + + QAA V S P+
Sbjct: 101 REPEPQKPREPKKYRFWDVPPTGFETTTPMEYKNMQAAGQVPRGSVQS-------AVPVV 153
Query: 437 GISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHV 496
G SV+ QS RRL V N+P +E+A+++F N + G+
Sbjct: 154 GPSVTC---------------QS----RRLYVGNIPFGCNEEAMLDFFNQQMHLCGLAQA 194
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-------- 548
G+ P + C I +K AF+EF + ++ +A + DG +F G LK++RP+++
Sbjct: 195 PGN-PILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVRRPRDYQPSQNTFD 253
Query: 549 -----------VEVA--IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN-EDHEEPCAF 594
V+ A IFIGG+ L+ V E++C+FGPLKA+ V+ + + + AF
Sbjct: 254 MNSRMPVSTIVVDSANKIFIGGLPNYLTEDQVKELLCSFGPLKAFSLNVDSQGNSKGYAF 313
Query: 595 IEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLK-- 652
EY+D +T +AIAGLNG+++G + L VQ N+ N P + L +
Sbjct: 314 AEYLDPTLTDQAIAGLNGMQLGDKQLV-VQLACANQQRHNT-NLPNSASAIAGIDLSQGA 371
Query: 653 -KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ TE+L L N+ + ++ E EE+LEDVR EC+++G V+S+ +
Sbjct: 372 GRATEILCLMNMVTEDELK--ADDEYEEILEDVRDECSKYGIVRSLEI 417
>gi|71996490|ref|NP_001022970.1| Protein UAF-1, isoform d [Caenorhabditis elegans]
gi|351018337|emb|CCD62281.1| Protein UAF-1, isoform d [Caenorhabditis elegans]
Length = 471
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 168/348 (48%), Gaps = 56/348 (16%)
Query: 377 KTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLA 436
+ P P KK WDV P + + + QAA V S P+
Sbjct: 98 REPEPQKPREPKKYRFWDVPPTGFETTTPMEYKNMQAAGQVPRGSVQS-------AVPVV 150
Query: 437 GISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHV 496
G SV+ QS RRL V N+P +E+A+++F N + G+
Sbjct: 151 GPSVTC---------------QS----RRLYVGNIPFGCNEEAMLDFFNQQMHLCGLAQA 191
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-------- 548
G+ P + C I +K AF+EF + ++ +A + DG +F G LK++RP+++
Sbjct: 192 PGN-PILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVRRPRDYQPSQNTFD 250
Query: 549 -----------VEVA--IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN-EDHEEPCAF 594
V+ A IFIGG+ L+ V E++C+FGPLKA+ V+ + + + AF
Sbjct: 251 MNSRMPVSTIVVDSANKIFIGGLPNYLTEDQVKELLCSFGPLKAFSLNVDSQGNSKGYAF 310
Query: 595 IEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLK-- 652
EY+D +T +AIAGLNG+++G + L VQ N+ N P + L +
Sbjct: 311 AEYLDPTLTDQAIAGLNGMQLGDKQLV-VQLACANQQRHNT-NLPNSASAIAGIDLSQGA 368
Query: 653 -KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ TE+L L N+ + ++ E EE+LEDVR EC+++G V+S+ +
Sbjct: 369 GRATEILCLMNMVTEDELK--ADDEYEEILEDVRDECSKYGIVRSLEI 414
>gi|198432988|ref|XP_002130386.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
factor (U2AF) 2 isoform 1 [Ciona intestinalis]
Length = 482
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 169/353 (47%), Gaps = 55/353 (15%)
Query: 384 RSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSAS 443
+ P+K WDV PV V + + QAA P+ +T+Q + I+ A+
Sbjct: 97 KKPKKAYKYWDVPPVGYEHVTPLQYKAMQAAGQ--------IPLMATSQT-MGSITAEAT 147
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
L V + Q+T R RRL V N+P +E A+M+F NN + +G+ G P +
Sbjct: 148 L---QVPVAGSQMT---RQARRLYVGNIPFGVTEDAMMDFFNNQMQIAGLAQAPGQ-PIL 200
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------- 552
I +K AF+EF + ++ + AL DG +F LKI+RP ++ +
Sbjct: 201 AVQINLDKNFAFLEFRSVDETTQALAFDGINFMNQSLKIRRPSDYKPLPGSLEQPAIHLP 260
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
+FIGG+ L+ V E++ +FGPL+A++ + + AF E+
Sbjct: 261 GVISTVVQDSQHKMFIGGLPNYLNDDQVKELLTSFGPLRAFNLVKDSATALSKGYAFAEF 320
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD------GSIMDNSGNPPFHGIPKHALPLL 651
D +T +AIAGLNG+++G + L +A + G+IM P IP A
Sbjct: 321 ADYSLTDQAIAGLNGMQLGDKKLIVQRASIGAKNNPHGAIM----APVTLQIPGMAHATG 376
Query: 652 KKP-TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYG 703
P T VL L N+ PE + + E EE++EDV+ EC + GSV S+ + + G
Sbjct: 377 AGPATTVLCLMNMVLPEELT--DDEEYEEIMEDVKDECGKLGSVVSLEIPRPG 427
>gi|328721670|ref|XP_001951521.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 1
[Acyrthosiphon pisum]
Length = 416
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 166/354 (46%), Gaps = 65/354 (18%)
Query: 380 SPINRSPEKK-SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
SP N+S +K S WDV P + + + QAA ++ P T T P+ G
Sbjct: 37 SPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIP---ANTMPDTPQTAVPVVGS 93
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
+++ R RRL V N+P +E +MEF N + SG+ G
Sbjct: 94 TIT-------------------RQARRLYVGNIPFGVTEDEMMEFFNQQMHLSGLAQAAG 134
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------ 552
+ P + C I +K AF+EF + ++ + A+ DG +F G LKI+RP ++
Sbjct: 135 N-PVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPTPGMTESN 193
Query: 553 ------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPC 592
IFIGG+ L+ + V E++ +FG LKA++ + +
Sbjct: 194 PVTNYNSGMTLDMNSPHKIFIGGLPAYLNDEQVKELLTSFGQLKAFNLVKDAATGLSKGY 253
Query: 593 AFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHA-LPLL 651
AF EY D ++T +AIAGLNG+++G + L +A + + NP +P +P L
Sbjct: 254 AFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASI------GAKNPGLGQVPVTIQVPGL 307
Query: 652 K------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ P+ E E E++LED+R EC ++G V+S+ +
Sbjct: 308 TVVGTAGPPTEVLCLLNMVTPDELK--DEEEYEDILEDIREECNKYGVVRSLEI 359
>gi|402590758|gb|EJW84688.1| hypothetical protein WUBG_04401 [Wuchereria bancrofti]
Length = 477
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 162/339 (47%), Gaps = 56/339 (16%)
Query: 386 PEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLA 445
P+KK WDV PV + + QAA V S P+ G SV+
Sbjct: 113 PKKKYKFWDVPPVGYEHLTPKEYKELQAAGQIPRNNVQS-------AVPVVGPSVTC--- 162
Query: 446 KLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
QS RRL V N+P SE A+++F N + G+ G+ P + C
Sbjct: 163 ------------QS----RRLYVGNIPFGCSEDAMLDFFNQQMHLCGLAQAPGN-PVLAC 205
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------- 552
+ +K AF+EF + ++ +A + DG +F G LKI+RP+++ ++
Sbjct: 206 QMNLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPMSTSYDLGNMMVSNI 265
Query: 553 -------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVT 603
IFIGG+ L+++ V E++ +FG LKA++ E + + AF EY+D +T
Sbjct: 266 VPDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAEYLDPSLT 325
Query: 604 PKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPL---LKKPTEVLKL 660
+AIAGLNG+++G + L + + +N F I + L PTEVL L
Sbjct: 326 DQAIAGLNGMQLGDKNLVVQLSCANAR--NNVAQNTFPQIQVAGIDLSHGAGPPTEVLCL 383
Query: 661 KNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
N+ + + E E++LED+R ECA++G VKS+ +
Sbjct: 384 MNMVTEDELK--DDEEYEDILEDIREECAKYGIVKSLEI 420
>gi|341891946|gb|EGT47881.1| hypothetical protein CAEBREN_25972 [Caenorhabditis brenneri]
Length = 491
Score = 133 bits (334), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 167/351 (47%), Gaps = 66/351 (18%)
Query: 379 PSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
P P KK WDV P + + + QAA V S P+ G
Sbjct: 120 PEPQKPREPKKYRFWDVPPTGFENTSPMEYKNMQAAGQVPRGSVQS-------AVPVVGP 172
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
SV+ QS RRL V N+P +E+A+++F N + G+ G
Sbjct: 173 SVTC---------------QS----RRLYVGNIPFGCNEEAMLDFFNQQMHLCGLAQAPG 213
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF---------- 548
+ P + C I +K AF+EF + ++ +A + DG +F G LK++RP+++
Sbjct: 214 N-PILLCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVRRPRDYQPSQNTFDMN 272
Query: 549 ---------VEVA--IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN-EDHEEPCAFIE 596
V+ A IFIGG+ L+ V E++C+FGPLKA+ V+ + + + AF E
Sbjct: 273 SRMPVSSIVVDSANKIFIGGLPNYLTEDQVKELLCSFGPLKAFSLNVDSQGNSKGYAFAE 332
Query: 597 YVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPL------ 650
Y+D +T +AIAGLNG+++G + L VQ N+ H +P A +
Sbjct: 333 YLDPTLTDQAIAGLNGMQLGDKQLV-VQLACANQTRHNT-----H-LPNSASAIAGIDLS 385
Query: 651 --LKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ TEVL L N+ + S+ + EE+LEDVR EC+++G V+S+ +
Sbjct: 386 QGAGRATEVLCLMNMVTEDELK--SDEDYEEILEDVREECSKYGIVRSLEI 434
>gi|198432986|ref|XP_002130494.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
factor (U2AF) 2 isoform 2 [Ciona intestinalis]
Length = 472
Score = 133 bits (334), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 104/350 (29%), Positives = 161/350 (46%), Gaps = 59/350 (16%)
Query: 384 RSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSAS 443
+ P+K WDV PV V + + QAA S + T + P+AG
Sbjct: 97 KKPKKAYKYWDVPPVGYEHVTPLQYKAMQAAGQIPLMATSQTMGSITAEVPVAG------ 150
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
+Q R RRL V N+P +E A+M+F NN + +G+ G P +
Sbjct: 151 -------------SQMTRQARRLYVGNIPFGVTEDAMMDFFNNQMQIAGLAQAPGQ-PIL 196
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------- 552
I +K AF+EF + ++ + AL DG +F LKI+RP ++ +
Sbjct: 197 AVQINLDKNFAFLEFRSVDETTQALAFDGINFMNQSLKIRRPSDYKPLPGSLEQPAIHLP 256
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
+FIGG+ L+ V E++ +FGPL+A++ + + AF E+
Sbjct: 257 GVISTVVQDSQHKMFIGGLPNYLNDDQVKELLTSFGPLRAFNLVKDSATALSKGYAFAEF 316
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHG---IPKHALPLLKKP 654
D +T +AIAGLNG+++G + L +A + NP HG IP A P
Sbjct: 317 ADYSLTDQAIAGLNGMQLGDKKLIVQRASIGA-----KNNP--HGAIMIPGMAHATGAGP 369
Query: 655 -TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYG 703
T VL L N+ PE + + E EE++EDV+ EC + GSV S+ + + G
Sbjct: 370 ATTVLCLMNMVLPEELT--DDEEYEEIMEDVKDECGKLGSVVSLEIPRPG 417
>gi|270011684|gb|EFA08132.1| hypothetical protein TcasGA2_TC005736 [Tribolium castaneum]
Length = 432
Score = 133 bits (334), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 99/345 (28%), Positives = 163/345 (47%), Gaps = 65/345 (18%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 63 KPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 113
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 114 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQI 159
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 160 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMAESSISVPAGVIS 219
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYH------FEVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ F +++ + AF EY
Sbjct: 220 TVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSKGY----AFAEY 275
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK---P 654
+D +T +AIAGLNG+++G + L +A + N+ P I L L+ P
Sbjct: 276 IDITMTDQAIAGLNGMQLGDKRLIVQRASVGAK---NATVIPAVQIQVPGLSLVGASGPP 332
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
TEVL L N+ P+ E E E++LED++ EC ++G V+S+ +
Sbjct: 333 TEVLCLLNMVTPDELK--DEEEYEDILEDIKEECNKYGVVRSIEI 375
>gi|91088649|ref|XP_974465.1| PREDICTED: similar to U2 small nuclear ribonucleoprotein auxiliary
factor 2 [Tribolium castaneum]
Length = 450
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 99/345 (28%), Positives = 163/345 (47%), Gaps = 65/345 (18%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 81 KPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 131
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 132 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQI 177
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 178 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMAESSISVPAGVIS 237
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYH------FEVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ F +++ + AF EY
Sbjct: 238 TVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSKGY----AFAEY 293
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLL---KKP 654
+D +T +AIAGLNG+++G + L +A + N+ P I L L+ P
Sbjct: 294 IDITMTDQAIAGLNGMQLGDKRLIVQRASVGAK---NATVIPAVQIQVPGLSLVGASGPP 350
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
TEVL L N+ P+ E E E++LED++ EC ++G V+S+ +
Sbjct: 351 TEVLCLLNMVTPDELK--DEEEYEDILEDIKEECNKYGVVRSIEI 393
>gi|47575746|ref|NP_001001217.1| U2 small nuclear RNA auxiliary factor 2 isoform 2 [Xenopus
(Silurana) tropicalis]
gi|45709722|gb|AAH67966.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2
[Xenopus (Silurana) tropicalis]
Length = 456
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 161/341 (47%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 77 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 120
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 179
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 239
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT
Sbjct: 240 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTD 299
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A + + + P G+ + + PTEVL
Sbjct: 300 QAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVL 359
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE L + E EE++EDVR EC ++G+VKS+ +
Sbjct: 360 CLMNMVLPE--ELLDDDEYEEIVEDVRDECGKYGAVKSIEI 398
>gi|170575889|ref|XP_001893425.1| U2 auxiliary factor 65 kDa subunit [Brugia malayi]
gi|158600599|gb|EDP37742.1| U2 auxiliary factor 65 kDa subunit, putative [Brugia malayi]
Length = 502
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 162/339 (47%), Gaps = 56/339 (16%)
Query: 386 PEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLA 445
P+KK WDV PV + + QAA V S P+ G SV+
Sbjct: 138 PKKKYKFWDVPPVGYEHLTPKEYKELQAAGQIPRNNVQS-------AVPVVGPSVTC--- 187
Query: 446 KLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
QS RRL V N+P SE A+++F N + G+ G+ P + C
Sbjct: 188 ------------QS----RRLYVGNIPFGCSEDAMLDFFNQQMHLCGLAQAPGN-PVLAC 230
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------- 552
+ +K AF+EF + ++ +A + DG +F G LKI+RP+++ ++
Sbjct: 231 QMNLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPMSTSYDLGNMMVSNI 290
Query: 553 -------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVT 603
IFIGG+ L+++ V E++ +FG LKA++ E + + AF EY+D +T
Sbjct: 291 VPDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAEYLDPSLT 350
Query: 604 PKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPL---LKKPTEVLKL 660
+AIAGLNG+++G + L + + +N F I + L PTEVL L
Sbjct: 351 DQAIAGLNGMQLGDKNLVVQLSCANAR--NNVAQNTFPQIQVAGIDLSHGAGPPTEVLCL 408
Query: 661 KNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
N+ + + E E++LED+R ECA++G VKS+ +
Sbjct: 409 MNMVTEDELK--DDEEYEDILEDIREECAKYGIVKSLEI 445
>gi|328721668|ref|XP_003247369.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 2
[Acyrthosiphon pisum]
Length = 451
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 169/372 (45%), Gaps = 79/372 (21%)
Query: 376 AKTPSPINRSPEKK-SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKP 434
+++ SP N+S +K S WDV P + + + QAA ++ P T T P
Sbjct: 54 SRSKSPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIP---ANTMPDTPQTAVP 110
Query: 435 LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ 494
+ G +++ R RRL V N+P +E +MEF N + SG+
Sbjct: 111 VVGSTIT-------------------RQARRLYVGNIPFGVTEDEMMEFFNQQMHLSGLA 151
Query: 495 HVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-- 552
G+ P + C I +K AF+EF + ++ + A+ DG +F G LKI+RP ++
Sbjct: 152 QAAGN-PVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPTPGM 210
Query: 553 ------------------------------------IFIGGISRTLSSKMVMEIVCAFGP 576
IFIGG+ L+ + V E++ +FG
Sbjct: 211 TESNPVTNYNSGMTLDMMKYDSSSFGLGTVPDSPHKIFIGGLPAYLNDEQVKELLTSFGQ 270
Query: 577 LKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN 634
LKA++ + + AF EY D ++T +AIAGLNG+++G + L +A +
Sbjct: 271 LKAFNLVKDAATGLSKGYAFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASI------G 324
Query: 635 SGNPPFHGIPKHA-LPLLK------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLE 687
+ NP +P +P L PTEVL L N+ P+ E E E++LED+R E
Sbjct: 325 AKNPGLGQVPVTIQVPGLTVVGTAGPPTEVLCLLNMVTPDELK--DEEEYEDILEDIREE 382
Query: 688 CARFGSVKSVNV 699
C ++G V+S+ +
Sbjct: 383 CNKYGVVRSLEI 394
>gi|393909510|gb|EJD75480.1| hypothetical protein LOAG_17389 [Loa loa]
Length = 502
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 162/339 (47%), Gaps = 56/339 (16%)
Query: 386 PEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLA 445
P+KK WDV PV + + QAA V S P+ G SV+
Sbjct: 138 PKKKYKFWDVPPVGYEHLTPKEYKELQAAGQIPRNNVQS-------AVPVVGPSVTC--- 187
Query: 446 KLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
QS RRL V N+P SE A+++F N + G+ G+ P + C
Sbjct: 188 ------------QS----RRLYVGNIPFGCSEDAMLDFFNQQMHLCGLAQAPGN-PVLAC 230
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------- 552
+ +K AF+EF + ++ +A + DG +F G LKI+RP+++ ++
Sbjct: 231 QMNLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPMSTSYDLGNMMVSNI 290
Query: 553 -------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVT 603
IFIGG+ L+++ V E++ +FG LKA++ E + + AF EY+D +T
Sbjct: 291 VPDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAEYLDPSLT 350
Query: 604 PKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPL---LKKPTEVLKL 660
+AIAGLNG+++G + L + + +N F I + L PTEVL L
Sbjct: 351 DQAIAGLNGMQLGDKNLVVQLSCANAR--NNVAQNTFPQIQVAGIDLSHGAGPPTEVLCL 408
Query: 661 KNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
N+ + + E E++LED+R ECA++G VKS+ +
Sbjct: 409 MNMVTEDELK--DDEEYEDILEDIREECAKYGIVKSLEI 445
>gi|114052735|ref|NP_001040494.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 [Bombyx mori]
gi|95103122|gb|ABF51502.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 1
[Bombyx mori]
Length = 417
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 160/341 (46%), Gaps = 54/341 (15%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 45 KPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 95
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ MEF N + SG+ G+ P + C I
Sbjct: 96 -------------RQARRLYVGNIPFGVTEEETMEFFNQQMHLSGLAQAAGN-PVLACQI 141
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 142 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGTENPAINVPAGVIS 201
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQL 601
IFIGG+ L+ V E++ +FG L+A++ + + + AF EYVD
Sbjct: 202 TVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGYAFAEYVDIS 261
Query: 602 VTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH-GIPKHALPLLKKPTEVL 658
+T +AIAGLNG+++G + L +A + S + +G P + L PTEVL
Sbjct: 262 MTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTLALTGAAPVQIQVAGLTLAGAGPPTEVL 321
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ P+ E E E++LED++ EC ++G V+S+ +
Sbjct: 322 CLLNMVTPDELR--DEEEYEDILEDIKEECNKYGVVRSIEI 360
>gi|348510221|ref|XP_003442644.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Oreochromis niloticus]
Length = 467
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 103/352 (29%), Positives = 172/352 (48%), Gaps = 57/352 (16%)
Query: 384 RSPEKKSAK-WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSA 442
R+ +K++ K WDV P + + + QAA + + TSTT
Sbjct: 80 RTRKKRTCKYWDVPPPGFEHITPMQYKAMQAAGQIPTIALLA---TSTT--------TGV 128
Query: 443 SLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPC 502
+ A V + Q+T R RRL V N+P +E+++ EF N + +G+ S P
Sbjct: 129 AAAPTQVPIVGSQMT---RQARRLYVGNIPFGVTEESMAEFFNAQMRLAGLSQA-PSNPV 184
Query: 503 IGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---------- 552
+ I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 185 LAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHV 244
Query: 553 --------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIE 596
+FIGG+ L+ V E++ +FGPLKA++ + + AF E
Sbjct: 245 PGVVSTVVPDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 304
Query: 597 YVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIPKHALPLLKK 653
YVD T +A+AGLNG+++G + L +A + + + + SGN P +P L++
Sbjct: 305 YVDISATDQAVAGLNGMQLGDKKLIVQRASVGAKNANPVSTSGNTPV----TLQVPGLQR 360
Query: 654 ------PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ PE + + + EE+LED+R EC ++G+V+S+ +
Sbjct: 361 LQNSGMPTEVLCLLNMVMPEEL--VDDEDYEEILEDIREECCKYGTVRSIEI 410
>gi|332375140|gb|AEE62711.1| unknown [Dendroctonus ponderosae]
Length = 374
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 161/343 (46%), Gaps = 65/343 (18%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 64 KPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 114
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E +ME+ N + SG+ G+ P + C I
Sbjct: 115 -------------RQARRLYVGNIPFGVTEDEMMEYFNQQMHLSGLAQAAGN-PVLACQI 160
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 161 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMSENSISVPAGVIS 220
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYH------FEVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ F +++ + AF EY
Sbjct: 221 TVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDTAFGLSKGY----AFAEY 276
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK---P 654
+D +T +AIAGLNG+++G + L +A + N+ P I L L+ P
Sbjct: 277 IDISMTDQAIAGLNGMQLGDKRLIVQRASVGAK---NATVLPAVQIQVPGLSLVGASGPP 333
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSV 697
TEVL L N+ P+ E E E++LED++ EC ++G V+S+
Sbjct: 334 TEVLCLLNMVTPDELK--DEEEYEDILEDIKEECNKYGVVRSI 374
>gi|268575804|ref|XP_002642882.1| C. briggsae CBR-UAF-1 protein [Caenorhabditis briggsae]
gi|60415989|sp|P90727.2|U2AF2_CAEBR RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; Short=U2AF65; AltName:
Full=U2 snRNP auxiliary factor large subunit
Length = 488
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 103/342 (30%), Positives = 166/342 (48%), Gaps = 59/342 (17%)
Query: 384 RSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSAS 443
R P KK WDV P ++ + + QA+ + V S P+ G SV+
Sbjct: 122 REP-KKYRFWDVPPTGFENITPMEYKNMQASGAVPRGSVQS-------AVPVVGPSVTC- 172
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
QS RRL V N+P +E+A+++F N + + G+ P +
Sbjct: 173 --------------QS----RRLYVGNIPFGCNEEAMLDFFNQQMHLCNLAQAPGN-PIL 213
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------- 548
C I +K AF+EF + ++ +A + DG +F G LK++RP+++
Sbjct: 214 LCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVRRPRDYQPSQNTFDMNARMPV 273
Query: 549 ----VEVA--IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN-EDHEEPCAFIEYVDQL 601
V+ A IFIGG+ L+ V E++C+FGPLKA+ V+ + + + AF EY+D
Sbjct: 274 SSIVVDSANKIFIGGLPNYLTEDQVKELLCSFGPLKAFSLNVDSQGNSKGYAFAEYLDPT 333
Query: 602 VTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNP----PFHGIPKHALPLLKKPTEV 657
+T +AIAGLNG+++G + L VQ N+ P GI + TE+
Sbjct: 334 LTDQAIAGLNGMQLGDKQLV-VQLACANQTRHNTHLPNSASAIAGIDLSQGA--GRATEI 390
Query: 658 LKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L L N+ + S+ + EE+LEDVR EC+++G V+S+ +
Sbjct: 391 LCLMNMVTEDELR--SDEDYEEILEDVREECSKYGIVRSLEI 430
>gi|1710361|gb|AAB38280.1| splicing factor U2AF65 [Caenorhabditis briggsae]
Length = 488
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 103/342 (30%), Positives = 166/342 (48%), Gaps = 59/342 (17%)
Query: 384 RSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSAS 443
R P KK WDV P ++ + + QA+ + V S P+ G SV+
Sbjct: 122 REP-KKYRFWDVPPTGFENITPMEYKNMQASGAVPRGSVQS-------AVPVVGPSVTC- 172
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
QS RRL V N+P +E+A+++F N + + G+ P +
Sbjct: 173 --------------QS----RRLYVGNIPFGCNEEAMLDFFNQQMHLCNLAQAPGN-PIL 213
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------- 548
C I +K AF+EF + ++ +A + DG +F G LK++RP+++
Sbjct: 214 LCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVRRPRDYQPSQNTFDMNARMPV 273
Query: 549 ----VEVA--IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN-EDHEEPCAFIEYVDQL 601
V+ A IFIGG+ L+ V E++C+FGPLKA+ V+ + + + AF EY+D
Sbjct: 274 SSIVVDSANKIFIGGLPNYLTEDQVKELLCSFGPLKAFSLNVDSQGNSKGYAFAEYLDPT 333
Query: 602 VTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNP----PFHGIPKHALPLLKKPTEV 657
+T +AIAGLNG+++G + L VQ N+ P GI + TE+
Sbjct: 334 LTDQAIAGLNGMQLGDKQLV-VQLACANQTRHNTHLPNSASAIAGIDLSQGA--GRATEI 390
Query: 658 LKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L L N+ + S+ + EE+LEDVR EC+++G V+S+ +
Sbjct: 391 LCLMNMVTEDELR--SDEDYEEILEDVREECSKYGIVRSLEI 430
>gi|147902896|ref|NP_001080595.1| U2 small nuclear RNA auxiliary factor 2 [Xenopus laevis]
gi|111185517|gb|AAH44032.2| U2af2 protein [Xenopus laevis]
Length = 456
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 161/341 (47%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WD+ P + + + QAA Q P + + + L V+
Sbjct: 77 WDIPPPGFEHITPLQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 120
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 179
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 239
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT
Sbjct: 240 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTD 299
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A + + + P G+ + + PTEVL
Sbjct: 300 QAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVL 359
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE + + E EE++EDVR EC ++G+VKS+ +
Sbjct: 360 CLMNMVVPE--ELIDDDEYEEIVEDVRDECGKYGAVKSIEI 398
>gi|157132061|ref|XP_001662443.1| splicing factor u2af large subunit [Aedes aegypti]
gi|108881728|gb|EAT45953.1| AAEL002818-PA [Aedes aegypti]
Length = 418
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 101/345 (29%), Positives = 161/345 (46%), Gaps = 59/345 (17%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 43 KPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 93
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 94 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQI 139
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-----------VEVA---- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ V V
Sbjct: 140 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDSAAVSVPEKFS 199
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG LKA++ + + AF EY
Sbjct: 200 GVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAATGLSKGYAFAEY 259
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLL---KKP 654
V+ +T +AIAGLNG+++G + L +A + + + P I L L+ P
Sbjct: 260 VEYTITDQAIAGLNGMQLGDKKLIVQRASVGAKNANVAAVAPVQ-IQVPGLSLVGSSGPP 318
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
TEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 319 TEVLCLLNMVTPDELK--DEEEYEDILEDIKEECNKYGVVRSVEI 361
>gi|393236224|gb|EJD43774.1| hypothetical protein AURDEDRAFT_137718 [Auricularia delicata
TFB-10046 SS5]
Length = 389
Score = 129 bits (325), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 161/307 (52%), Gaps = 54/307 (17%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFL--LSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTA 521
RRL + ++ +E+ L +F N + ++ G Q+ G P + + EK AFVEF +A
Sbjct: 29 RRLYIGSITPEITEENLTKFFNQKMREMNLGQQNASGD-PVLAVQVNYEKNYAFVEFRSA 87
Query: 522 EDASAALCCDGCSFSGSILKIKRPKEFV--EVA---------------------IFIGGI 558
+DA+AA+ DG F L+I+RPK+++ E + IF+GG+
Sbjct: 88 DDATAAMAFDGIIFQSGPLRIRRPKDYMGNEYSAPSAMHVPGVVSTNVPDSLHKIFVGGL 147
Query: 559 SRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPC---AFIEYVDQLVTPKAIAGLNGLKV 615
L+ + VME++ +FG LKA++ V E++ P AF EYVD+ VT AI GLNG+++
Sbjct: 148 PTYLNEEQVMELLKSFGELKAFNL-VRENNNGPSKGYAFFEYVDEEVTEVAIQGLNGMEL 206
Query: 616 GGQVLTAVQAVL---DGSIMDNSGNP--PFHGIPKHALPLLKKPTE---VLKLKNVFNPE 667
G +VL +A + +G ++ N P +P+ +PL + PT+ +L + N+ PE
Sbjct: 207 GDRVLAVQRASVGSKNGMVVPNPDIPYDQMPEVPRPIMPLNEAPTQDARILLMLNMVVPE 266
Query: 668 GFSSLSELEVEEVLEDVRLECARFGSVKSVNV--------VKYGDSNISTIQACEGNENT 719
+ + E E+ EDV+ ECA+FG+V+ + + KYG + A E
Sbjct: 267 DL--VDDEEFAELYEDVKEECAKFGAVEDLRIPRPAKRAGPKYGPA------AVEAQRVD 318
Query: 720 ASAGVGQ 726
+AGVG+
Sbjct: 319 EAAGVGR 325
>gi|347968827|ref|XP_311994.4| AGAP002908-PA [Anopheles gambiae str. PEST]
gi|333467820|gb|EAA08228.4| AGAP002908-PA [Anopheles gambiae str. PEST]
Length = 446
Score = 129 bits (324), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 100/345 (28%), Positives = 159/345 (46%), Gaps = 59/345 (17%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 71 KPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 121
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 122 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQI 167
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ D +F G LKI+RP ++ +
Sbjct: 168 NLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPMPGMTDSAAVNVPEKFS 227
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG LKA++ + + AF EY
Sbjct: 228 GVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAATGLGKGYAFAEY 287
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLL---KKP 654
V+ VT +AIAGLNG+++G + L +A + + + P I L L+ P
Sbjct: 288 VEYTVTDQAIAGLNGMQLGDKKLIVQRASVGAKNSNAAVVAPVQ-IQVPGLSLVGSSGPP 346
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
TEVL L N+ P+ E E E++LED+R EC ++G V+SV +
Sbjct: 347 TEVLCLLNMVTPDELK--DEEEYEDILEDIREECNKYGVVRSVEI 389
>gi|45387787|ref|NP_991252.1| U2 small nuclear RNA auxiliary factor 2b [Danio rerio]
gi|41389016|gb|AAH65869.1| U2 small nuclear RNA auxiliary factor 2b [Danio rerio]
Length = 475
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 177/373 (47%), Gaps = 60/373 (16%)
Query: 368 RKRRTEAAAKTPSP-INRSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEM 421
R RR++ P ++RSP ++ K WDV P + + + QAA
Sbjct: 64 RHRRSDHTQNHPQENVSRSPHREKKKKIKKYWDVPPPGFEHITPMQYKAMQAAG------ 117
Query: 422 VSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKA 479
Q P + + + L V+ V + +Q R RRL V N+P +E++
Sbjct: 118 ----------QIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEES 167
Query: 480 LMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI 539
+M+F N + G+ G+ P + I ++K AF+EF + ++ + A+ DG F
Sbjct: 168 MMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQAQS 226
Query: 540 LKIKRPKEFVEVA------------------------IFIGGISRTLSSKMVMEIVCAFG 575
LKI+RP ++ + +FIGG+ L+ V E++ +FG
Sbjct: 227 LKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSIHKLFIGGLPNYLNDDQVKELLTSFG 286
Query: 576 PLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL---DGS 630
PLKA++ + + AF EYVD V +AIAGLNG+++ + L +A + + +
Sbjct: 287 PLKAFNLVKDSATGLSKGYAFCEYVDVNVNDQAIAGLNGMQLADKKLLVQRASVGAKNAT 346
Query: 631 IMDNSGNPPFHGIPK-HALPLLKK---PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRL 686
+ + P +P + P+++ PTEVL L N+ PE + + E EE++EDV+
Sbjct: 347 MTSINETPVTLQVPGLTSNPMIQMGGIPTEVLCLMNMVAPEEL--IDDEEYEEIVEDVKE 404
Query: 687 ECARFGSVKSVNV 699
EC+++G VKS+ +
Sbjct: 405 ECSKYGQVKSIEI 417
>gi|308477324|ref|XP_003100876.1| CRE-UAF-1 protein [Caenorhabditis remanei]
gi|308264450|gb|EFP08403.1| CRE-UAF-1 protein [Caenorhabditis remanei]
Length = 496
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/346 (29%), Positives = 169/346 (48%), Gaps = 67/346 (19%)
Query: 384 RSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSAS 443
R P KK WDV P ++ + + QA+ + V S P+ G SV+
Sbjct: 131 REP-KKYRFWDVPPNGFENITPMEYKNMQASGAVPRGSVQS-------AVPVVGPSVTC- 181
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
QS RRL V N+P +E+A+++F N + G+ G+ P +
Sbjct: 182 --------------QS----RRLYVGNIPFGCNEEAMLDFFNQQMHLCGLAQAPGN-PIL 222
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------VEV 551
C I +K AF+EF + ++ +A + DG +F G LK++RP+++ + V
Sbjct: 223 LCQINLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKVRRPRDYQPSQNTFDMNARMPV 282
Query: 552 A---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN-EDHEEPCAFIEYVDQL 601
+ IFIGG+ L+ V E++C+FGPLKA+ ++ + + + AF EY+D
Sbjct: 283 SSIVVDSPNKIFIGGLPNYLTEDQVKELLCSFGPLKAFSLNMDSQGNSKGYAFAEYLDPT 342
Query: 602 VTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPL--------LKK 653
+T +AIAGLNG+++G + L VQ N+ H +P A + +
Sbjct: 343 LTDQAIAGLNGMQLGDKQLV-VQLACANQTRHNT-----H-LPNSASAIAGIDLSQGAGR 395
Query: 654 PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
TE+L L N+ + S+ + EE+LEDVR EC+++G V+S+ +
Sbjct: 396 ATEILCLMNMVTEDELR--SDEDYEEILEDVREECSKYGIVRSLEI 439
>gi|289741197|gb|ADD19346.1| splicing factor U2AF large subunit [Glossina morsitans morsitans]
Length = 423
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 100/345 (28%), Positives = 157/345 (45%), Gaps = 58/345 (16%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P P H + A + P + + P A + V S
Sbjct: 47 KPSLYWDVPP------PGFEHIT--PLQYKAMQAAGQIPANALPEIPQAAVPVVGSTI-- 96
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E +MEF N + +G+ G+ P + C I
Sbjct: 97 ------------TRQARRLYVGNIPFGVTEDEMMEFFNQQMHLTGLAQAAGN-PVLACQI 143
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF---------VEVA------ 552
+K AF+EF + ++ + A+ DG SF G LKI+RP ++ VA
Sbjct: 144 NLDKNFAFLEFRSTDETTQAMAFDGISFKGQSLKIRRPHDYQPMPGVVESTPVAQPVANG 203
Query: 553 ------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYV 598
IFIGG+ L+ V E++ +FG L+A++ + + AF EY+
Sbjct: 204 VISAVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLRAFNLVKDAATGLSKGYAFCEYI 263
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN---SGNPPFHGIPKHALPLLK-KP 654
D +T +AIAGLNG+++G + L +A + N + P +P ++ + P
Sbjct: 264 DHSITDQAIAGLNGMQLGDKKLIVQRASVGAKNAQNNHTTAAPVMIQVPGLSMVGISGPP 323
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
TEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 324 TEVLCLLNMVTPDELR--DEEEYEDILEDIKEECNKYGVVRSVEI 366
>gi|170054347|ref|XP_001863087.1| splicing factor u2af large subunit [Culex quinquefasciatus]
gi|167874693|gb|EDS38076.1| splicing factor u2af large subunit [Culex quinquefasciatus]
Length = 438
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 159/345 (46%), Gaps = 59/345 (17%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 63 KPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 113
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 114 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQI 159
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 160 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDSAVAPVQEKFS 219
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG LKA++ + + AF EY
Sbjct: 220 GVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAATGLSKGYAFAEY 279
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLL---KKP 654
V+ +T +AIAGLNG+++G + L +A + + + P I L L+ P
Sbjct: 280 VEYSITDQAIAGLNGMQLGDKKLIVQRASVGAKNANVAAVAPVQ-IQVPGLSLVGSSGPP 338
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
TEVL L N+ P+ E E E++LED++ EC ++G V+S +
Sbjct: 339 TEVLCLLNMVTPDELK--DEEEYEDILEDIKEECNKYGVVRSAEI 381
>gi|348510223|ref|XP_003442645.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
[Oreochromis niloticus]
Length = 467
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 102/351 (29%), Positives = 172/351 (49%), Gaps = 56/351 (15%)
Query: 384 RSPEKKSAK-WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSA 442
R+ +K++ K WDV P + + + QAA + + TSTT G++ +
Sbjct: 81 RTRKKRTCKYWDVPPPGFEHITPMQYKAMQAAGQIPTIALLA---TSTT----TGVAAAP 133
Query: 443 SLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPC 502
+ V + Q+T R RRL V N+P +E+++ EF N + +G+ S P
Sbjct: 134 T----QVPIVGSQMT---RQARRLYVGNIPFGVTEESMAEFFNAQMRLAGLSQA-PSNPV 185
Query: 503 IGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---------- 552
+ I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 186 LAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHV 245
Query: 553 --------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIE 596
+FIGG+ L+ V E++ +FGPLKA++ + + AF E
Sbjct: 246 PGVVSTVVPDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 305
Query: 597 YVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP--FHGIPKHALPLLKK- 653
YVD T +A+AGLNG+++G + L +A S+ + NP +P L++
Sbjct: 306 YVDISATDQAVAGLNGMQLGDKKLIVQRA----SVGAKNANPTSIIETPVTLQVPGLQRL 361
Query: 654 -----PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ PE + + + EE+LED+R EC ++G+V+S+ +
Sbjct: 362 QNSGMPTEVLCLLNMVMPEEL--VDDEDYEEILEDIREECCKYGTVRSIEI 410
>gi|443924699|gb|ELU43686.1| rRNA primary transcript binding protein [Rhizoctonia solani AG-1
IA]
Length = 678
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 150/315 (47%), Gaps = 48/315 (15%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+ A+E L EF N ++ + G P IG I EK AFVEF +AED
Sbjct: 236 RRLYVGNITYEANENNLQEFFNRKMVEMKIGTGGGGDPVIGVQINHEKSYAFVEFRSAED 295
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF----------VEVA-------------IFIGGISR 560
A+AA+ DG F LKI+RPK++ V V IF+GG+
Sbjct: 296 ATAAMAFDGIMFQSGPLKIRRPKDYTGSDLSAPMGVHVPGVVSTNVPDSINKIFVGGLPT 355
Query: 561 TLSSKMVMEIVCAFGPLKAYHF---EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
L VME++ +FG LKA++ N C YVD VT AI GLNG+++G
Sbjct: 356 YLDENQVMELLKSFGELKAFNLVRENGNGAFRRDCQV--YVDPSVTDIAIQGLNGMELGD 413
Query: 618 QVLTAVQAVLDGSIMDNSGNPP---FHGIPKHALPLLKK----PTE--VLKLKNVFNPEG 668
+ L VQ G+ G PP IP+ +P+ + P++ +L L N+ PE
Sbjct: 414 RFLV-VQRASVGAKSGIPGVPPELFAPAIPRPIMPITETADPNPSDSTILLLLNMVAPED 472
Query: 669 FSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYG-------DSNISTIQACEGNENTAS 721
+ + E E++EDVR EC++FG V+S+ + + D+N + G A
Sbjct: 473 LT--DDGEYTEIVEDVREECSKFGPVRSLAIPRPAKKEKSKWDANAGAL-VTAGGAVLAP 529
Query: 722 AGVGQNLTNDETNEK 736
VG + +T+E+
Sbjct: 530 GAVGATAGDGKTDEQ 544
>gi|193629757|ref|XP_001950852.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Acyrthosiphon
pisum]
Length = 446
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/364 (28%), Positives = 166/364 (45%), Gaps = 71/364 (19%)
Query: 380 SPINRSPEKK-SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
SP N+S +K S WDV P + + + QAA ++ P T T P+ G
Sbjct: 53 SPKNKSRRRKPSLYWDVPPPGFEHIAPLQYKAMQAAGQIP---ANTMPDTPQTAVPVVGS 109
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
+++ R RRL V N+P +E +MEF N + SG+ G
Sbjct: 110 TIT-------------------RQARRLYVGNIPFGVTEDEMMEFFNQQMHLSGLAQAAG 150
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------ 552
+ P + C I +K AF+EF + ++ + A+ DG +F G LKI+RP ++
Sbjct: 151 N-PVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPTPGMTESN 209
Query: 553 --------------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAY 580
IFIGG+ L+ + V E++ +FG LKA+
Sbjct: 210 PVTNYNSGMTLDMMKYDSSSFGLGTVPDSPHKIFIGGLPAYLNDEQVKELLTSFGQLKAF 269
Query: 581 HF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNP 638
+ + + AF EY D ++T +AIAGLNG+++G + L +A + G+ G
Sbjct: 270 NLVKDAATGLSKGYAFCEYADVVMTDQAIAGLNGMQLGEKKLIVQRASI-GAKNPGLGQA 328
Query: 639 PFHGIPKHALPLL---KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVK 695
P I L ++ PTEVL L N+ P+ E E E++LED+R EC ++G V+
Sbjct: 329 PVT-IQVPGLTVVGTAGPPTEVLCLLNMVTPDELK--DEEEYEDILEDIREECNKYGVVR 385
Query: 696 SVNV 699
S+ +
Sbjct: 386 SLEI 389
>gi|62859443|ref|NP_001016998.1| U2 small nuclear RNA auxiliary factor 2 isoform 1 [Xenopus
(Silurana) tropicalis]
gi|89269799|emb|CAJ83531.1| U2 (RNU2) small nuclear RNA auxiliary factor 2 [Xenopus (Silurana)
tropicalis]
gi|115292148|gb|AAI22001.1| U2 small nuclear RNA auxiliary factor 2 [Xenopus (Silurana)
tropicalis]
Length = 465
Score = 126 bits (316), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 101/350 (28%), Positives = 162/350 (46%), Gaps = 62/350 (17%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 77 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 120
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 179
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 239
Query: 553 ------IFIGGISRTLS---------SKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFI 595
+FIGG+ L+ S V E++ +FGPLKA++ + + AF
Sbjct: 240 PDSAHKLFIGGLPNYLNDDQVTMESLSLWVKELLTSFGPLKAFNLVKDSATGLSKGYAFC 299
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALP 649
EYVD VT +AIAGLNG+++G + L +A + + + P G+ +
Sbjct: 300 EYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQ 359
Query: 650 LLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PTEVL L N+ PE L + E EE++EDVR EC ++G+VKS+ +
Sbjct: 360 MGGHPTEVLCLMNMVLPE--ELLDDDEYEEIVEDVRDECGKYGAVKSIEI 407
>gi|348510219|ref|XP_003442643.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Oreochromis niloticus]
Length = 466
Score = 125 bits (315), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 102/351 (29%), Positives = 172/351 (49%), Gaps = 56/351 (15%)
Query: 384 RSPEKKSAK-WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSA 442
R+ +K++ K WDV P + + + QAA + + TSTT G++ +
Sbjct: 80 RTRKKRTCKYWDVPPPGFEHITPMQYKAMQAAGQIPTIALLA---TSTT----TGVAAAP 132
Query: 443 SLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPC 502
+ V + Q+T R RRL V N+P +E+++ EF N + +G+ S P
Sbjct: 133 T----QVPIVGSQMT---RQARRLYVGNIPFGVTEESMAEFFNAQMRLAGLSQA-PSNPV 184
Query: 503 IGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---------- 552
+ I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 185 LAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPAFHV 244
Query: 553 --------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIE 596
+FIGG+ L+ V E++ +FGPLKA++ + + AF E
Sbjct: 245 PGVVSTVVPDSPHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCE 304
Query: 597 YVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP--FHGIPKHALPLLKK- 653
YVD T +A+AGLNG+++G + L +A S+ + NP +P L++
Sbjct: 305 YVDISATDQAVAGLNGMQLGDKKLIVQRA----SVGAKNANPTSIIETPVTLQVPGLQRL 360
Query: 654 -----PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ PE + + + EE+LED+R EC ++G+V+S+ +
Sbjct: 361 QNSGMPTEVLCLLNMVMPEEL--VDDEDYEEILEDIREECCKYGTVRSIEI 409
>gi|386764548|ref|NP_001245708.1| U2 small nuclear riboprotein auxiliary factor 50, isoform B
[Drosophila melanogaster]
gi|383293438|gb|AFH07421.1| U2 small nuclear riboprotein auxiliary factor 50, isoform B
[Drosophila melanogaster]
Length = 427
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 156/351 (44%), Gaps = 69/351 (19%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QA+ +V P T+ P+ G +++
Sbjct: 50 KPSLYWDVPPPGFEHITPMQYKAMQASGQIPASVVPDTPQTAV---PVVGSTIT------ 100
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + G+ GS P + C I
Sbjct: 101 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQI 146
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG + G LKI+RP ++ +
Sbjct: 147 NLDKNFAFLEFRSIDETTQAMAFDGINLKGQSLKIRRPHDYQPMPGITDTPAIKPAVVSS 206
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ + + AF EY
Sbjct: 207 GVISTVVPDSPHKIFIGGLPNYLNDDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEY 266
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN---------PPFHGIPKHAL 648
VD +T ++IAGLNG+++G + L +A + N+ N P +
Sbjct: 267 VDLSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTTQSVMLQVPGLSNVVTSG- 325
Query: 649 PLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 326 ----PPTEVLCLLNMVTPDELR--DEEEYEDILEDIKEECTKYGVVRSVEI 370
>gi|195042782|ref|XP_001991497.1| GH12033 [Drosophila grimshawi]
gi|195134983|ref|XP_002011915.1| GI14308 [Drosophila mojavensis]
gi|193901255|gb|EDW00122.1| GH12033 [Drosophila grimshawi]
gi|193909169|gb|EDW08036.1| GI14308 [Drosophila mojavensis]
Length = 416
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/346 (27%), Positives = 159/346 (45%), Gaps = 59/346 (17%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QA+ +V P T+ P+ G +++
Sbjct: 39 KPSLYWDVPPPGFEHITPMQYKAMQASGQIPASVVPDTPQTAV---PVVGSTIT------ 89
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + G+ GS P + C I
Sbjct: 90 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQI 135
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG + G LKI+RP ++ +
Sbjct: 136 NLDKNFAFLEFRSIDETTQAMAFDGINLKGQDLKIRRPHDYQPMPGITDTPAVKPAVVSS 195
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ + V E++ +FG L+A++ + + AF EY
Sbjct: 196 GVISTVVPDSPHKIFIGGLPNYLNDEQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEY 255
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP---FHGIPKHALPLLK-K 653
VD +T ++IAGLNG+++G + L +A + N+ N +P + +
Sbjct: 256 VDLSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTSQSVMLQVPGLSTVVTSGP 315
Query: 654 PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 316 PTEVLCLLNMVTPDELR--DEEEYEDILEDIKEECTKYGVVRSVEI 359
>gi|195448282|ref|XP_002071589.1| GK10063 [Drosophila willistoni]
gi|194167674|gb|EDW82575.1| GK10063 [Drosophila willistoni]
Length = 416
Score = 125 bits (313), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/346 (27%), Positives = 158/346 (45%), Gaps = 59/346 (17%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QA+ +V P T+ P+ G +++
Sbjct: 39 KPSLYWDVPPPGFEHITPMQYKAMQASGQIPASVVPDTPQTAV---PVVGSTIT------ 89
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + G+ GS P + C I
Sbjct: 90 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQI 135
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG + G LKI+RP ++ +
Sbjct: 136 NLDKNFAFLEFRSIDETTQAMAFDGINLKGQSLKIRRPHDYQPMPGITDTPAIKPAVVSS 195
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ + + AF EY
Sbjct: 196 GVISTVVPDSPHKIFIGGLPNYLNDDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEY 255
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP---FHGIPKHALPLLK-K 653
VD +T ++IAGLNG+++G + L +A + N+ N +P + +
Sbjct: 256 VDLSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTTQSVMLQVPGLSTVVTSGP 315
Query: 654 PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 316 PTEVLCLLNMVTPDELR--DEEEYEDILEDIKEECTKYGVVRSVEI 359
>gi|345480698|ref|XP_001604333.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Nasonia
vitripennis]
Length = 455
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/340 (30%), Positives = 164/340 (48%), Gaps = 59/340 (17%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 90 KLSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 140
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 141 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQI 186
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 187 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPDSPH 246
Query: 553 -IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAG 609
IFIGG+ L+ + V E++ +FG L+A++ + + AF EYVD +T +AIAG
Sbjct: 247 KIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAG 306
Query: 610 LNGLKVGGQVLTAVQAVLDGSIMDNSGNP-PFHG------IPKHALPLL---KKPTEVLK 659
LNG+++G + L +A + + NP P G I L ++ PTEVL
Sbjct: 307 LNGMQLGDKKLIVQRASV------GAKNPMPMIGAQAPVQIQVPGLSMVGTSGPPTEVLC 360
Query: 660 LKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE + E E E++LED++ EC ++G V+SV +
Sbjct: 361 LLNMVTPE--ELMEEEEYEDILEDIKEECNKYGVVRSVEI 398
>gi|195167317|ref|XP_002024480.1| GL15893 [Drosophila persimilis]
gi|198469588|ref|XP_001355063.2| GA22177 [Drosophila pseudoobscura pseudoobscura]
gi|194107878|gb|EDW29921.1| GL15893 [Drosophila persimilis]
gi|198146942|gb|EAL32119.2| GA22177 [Drosophila pseudoobscura pseudoobscura]
Length = 418
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/346 (27%), Positives = 158/346 (45%), Gaps = 59/346 (17%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QA+ +V P T+ P+ G +++
Sbjct: 41 KPSLYWDVPPPGFEHITPMQYKAMQASGQIPASVVPDTPQTAV---PVVGSTIT------ 91
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + G+ GS P + C I
Sbjct: 92 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQI 137
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG + G LKI+RP ++ +
Sbjct: 138 NLDKNFAFLEFRSIDETTQAMAFDGINLKGQSLKIRRPHDYQPMPGITDTPAIKPAVVSS 197
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ + + AF EY
Sbjct: 198 GVISTVVPDSPHKIFIGGLPNYLNDDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEY 257
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP---FHGIPKHALPLLK-K 653
VD +T ++IAGLNG+++G + L +A + N+ N +P + +
Sbjct: 258 VDLSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNASNSSQSVMLQVPGLSTVVSSGP 317
Query: 654 PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 318 PTEVLCLLNMVTPDELR--DEEEYEDILEDIKEECTKYGVVRSVEI 361
>gi|194770152|ref|XP_001967161.1| GF19596 [Drosophila ananassae]
gi|190619281|gb|EDV34805.1| GF19596 [Drosophila ananassae]
Length = 416
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/346 (27%), Positives = 158/346 (45%), Gaps = 59/346 (17%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QA+ +V P T+ P+ G +++
Sbjct: 39 KPSLYWDVPPPGFEHITPMQYKAMQASGQIPASVVPDTPQTAV---PVVGSTIT------ 89
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + G+ GS P + C I
Sbjct: 90 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQI 135
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG + G LKI+RP ++ +
Sbjct: 136 NLDKNFAFLEFRSIDETTQAMAFDGINLKGQSLKIRRPHDYQPMPGITDTPAIKPAVVSS 195
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ + + AF EY
Sbjct: 196 GVISTVVPDSPHKIFIGGLPNYLNDDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEY 255
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP---FHGIPKHALPLLK-K 653
VD +T ++IAGLNG+++G + L +A + N+ N +P + +
Sbjct: 256 VDLSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNASNTTQSVMLQVPGLSTVVTSGP 315
Query: 654 PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 316 PTEVLCLLNMVTPDELR--DEEEYEDILEDIKEECTKYGVVRSVEI 359
>gi|17136764|ref|NP_476891.1| U2 small nuclear riboprotein auxiliary factor 50, isoform A
[Drosophila melanogaster]
gi|386764552|ref|NP_001245710.1| U2 small nuclear riboprotein auxiliary factor 50, isoform D
[Drosophila melanogaster]
gi|195351420|ref|XP_002042232.1| GM13406 [Drosophila sechellia]
gi|195479195|ref|XP_002100800.1| GE15975 [Drosophila yakuba]
gi|195555160|ref|XP_002077042.1| GD24494 [Drosophila simulans]
gi|4033485|sp|Q24562.1|U2AF2_DROME RecName: Full=Splicing factor U2AF 50 kDa subunit; AltName: Full=U2
auxiliary factor 50 kDa subunit; AltName: Full=U2 snRNP
auxiliary factor large subunit
gi|349761|gb|AAA03548.1| RNA binding protein [Drosophila melanogaster]
gi|7293214|gb|AAF48596.1| U2 small nuclear riboprotein auxiliary factor 50, isoform A
[Drosophila melanogaster]
gi|17861976|gb|AAL39465.1| LD03714p [Drosophila melanogaster]
gi|194124075|gb|EDW46118.1| GM13406 [Drosophila sechellia]
gi|194188324|gb|EDX01908.1| GE15975 [Drosophila yakuba]
gi|194203060|gb|EDX16636.1| GD24494 [Drosophila simulans]
gi|220943258|gb|ACL84172.1| U2af50-PA [synthetic construct]
gi|220953438|gb|ACL89262.1| U2af50-PA [synthetic construct]
gi|383293440|gb|AFH07423.1| U2 small nuclear riboprotein auxiliary factor 50, isoform D
[Drosophila melanogaster]
Length = 416
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 156/351 (44%), Gaps = 69/351 (19%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QA+ +V P T+ P+ G +++
Sbjct: 39 KPSLYWDVPPPGFEHITPMQYKAMQASGQIPASVVPDTPQTAV---PVVGSTIT------ 89
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + G+ GS P + C I
Sbjct: 90 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQI 135
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG + G LKI+RP ++ +
Sbjct: 136 NLDKNFAFLEFRSIDETTQAMAFDGINLKGQSLKIRRPHDYQPMPGITDTPAIKPAVVSS 195
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ + + AF EY
Sbjct: 196 GVISTVVPDSPHKIFIGGLPNYLNDDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEY 255
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN---------PPFHGIPKHAL 648
VD +T ++IAGLNG+++G + L +A + N+ N P +
Sbjct: 256 VDLSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTTQSVMLQVPGLSNVVTSG- 314
Query: 649 PLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 315 ----PPTEVLCLLNMVTPDELR--DEEEYEDILEDIKEECTKYGVVRSVEI 359
>gi|194893848|ref|XP_001977952.1| GG19328 [Drosophila erecta]
gi|190649601|gb|EDV46879.1| GG19328 [Drosophila erecta]
Length = 416
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 156/351 (44%), Gaps = 69/351 (19%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QA+ +V P T+ P+ G +++
Sbjct: 39 KPSLYWDVPPPGFEHITPMQYKAMQASGQIPASVVPDTPQTAV---PVVGSTIT------ 89
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + G+ GS P + C I
Sbjct: 90 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQI 135
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG + G LKI+RP ++ +
Sbjct: 136 NLDKNFAFLEFRSIDETTQAMAFDGINLKGQSLKIRRPHDYQPMPGITDTPAIKPAVVSS 195
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG L+A++ + + AF EY
Sbjct: 196 GVISTVVPDSPHKIFIGGLPNYLNDDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEY 255
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN---------PPFHGIPKHAL 648
VD +T ++IAGLNG+++G + L +A + N+ N P +
Sbjct: 256 VDLSITDQSIAGLNGMQLGDKKLIVQRASVGAKNAQNAANTTQSVMLQVPGLSNVVTSG- 314
Query: 649 PLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 315 ----PPTEVLCLLNMVTPDELR--DEEEYEDILEDIKEECTKYGVVRSVEI 359
>gi|390341852|ref|XP_792919.3| PREDICTED: splicing factor U2AF 50 kDa subunit-like
[Strongylocentrotus purpuratus]
Length = 386
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 137/277 (49%), Gaps = 37/277 (13%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL V N+P +E A++EF N + + G+ G P + + +K AF+
Sbjct: 56 SQMTRQARRLYVGNIPFGVTEDAMVEFFNGKMHNVGLAQAPGP-PVLAVQVNHDKNFAFL 114
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------ 552
EF + E+ + A+ DG F LKI+RPK++ +
Sbjct: 115 EFRSVEETTQAMAFDGILFQNQALKIRRPKDYQAIPGMSATPTVHVPGVVSTVVQDSPNK 174
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLV-------T 603
IFIGG+ L+ V E++ +FGPLKA++ + + AF EYV+ + T
Sbjct: 175 IFIGGLPNYLNDDQVKELLSSFGPLKAFNLVKDSATSLSKGYAFCEYVETNLTDLGWETT 234
Query: 604 PKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKN 662
KAIAGLNG+++G + L +A + N G IP +LP P TE+L L N
Sbjct: 235 DKAIAGLNGMQLGEKKLIVQRASVGAKNAMNQGQQVQINIPGLSLPGTTGPNTEILCLMN 294
Query: 663 VFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PE + E ++++EDV+ EC ++G V+S+ +
Sbjct: 295 MVTPEELK--DDEEYDDIVEDVKEECQKYGQVRSLEI 329
>gi|228543|prf||1805352A splicing factor U2AF:SUBUNIT=large
Length = 475
Score = 123 bits (308), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 168/359 (46%), Gaps = 59/359 (16%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP++ +K WDV P + + + QAA Q P +
Sbjct: 79 SPLHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP-----------F 640
AF EYVD VT +AIAGLNG+++G + L VQ G+ +PP
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLL-VQRASVGAKNATLVSPPSTINQTPVTLQV 360
Query: 641 HGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G+ + + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 361 PGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 417
>gi|410903107|ref|XP_003965035.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
[Takifugu rubripes]
Length = 461
Score = 122 bits (307), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 141/277 (50%), Gaps = 41/277 (14%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL V N+P +E+++ EF N + +G+ + P + I ++K AF+
Sbjct: 135 SQMTRQARRLYVGNIPFGVTEESMAEFFNAQMRLAGLSQAPSN-PVLAVQINQDKNFAFL 193
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------ 552
EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 194 EFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVPGVVSTVVPDSPHK 253
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD T +A+AGL
Sbjct: 254 LFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCEYVDVSATDQAVAGL 313
Query: 611 NGLKVGGQVLTAVQAVLDGSIMDNSGNPP--FHGIPKHALPLLKK------PTEVLKLKN 662
NG+++G + L +A S+ + NP +P L++ PTEVL L N
Sbjct: 314 NGMQLGDKKLIVQRA----SVGAKNANPSAIIEAPVTLQVPGLQRLQNSGMPTEVLCLLN 369
Query: 663 VFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PE + + + EE+LEDVR EC ++GSV+S+ +
Sbjct: 370 MVMPEEL--VDDDDYEEILEDVREECCKYGSVRSIEI 404
>gi|324503285|gb|ADY41429.1| Splicing factor U2AF 65 kDa subunit [Ascaris suum]
Length = 522
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 86/262 (32%), Positives = 140/262 (53%), Gaps = 32/262 (12%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+P SE A+++F N + G+ G+ P + C + +K AF+EF + ++
Sbjct: 210 RRLYVGNIPFGCSEDAMLDFFNQQMHLCGLAQAPGN-PVLACQMNLDKNFAFIEFRSIDE 268
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEVA--------------------IFIGGISRTLS 563
+A + DG +F G LKI+RP+++ ++ IFIGG+ L+
Sbjct: 269 TTAGMAFDGINFMGQQLKIRRPRDYQPMSTSYDMGNMMVSNIVADSPYKIFIGGLPSYLN 328
Query: 564 SKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLT 621
++ V E++ +FG LKA++ +V+ + AF EY+D +T +AIAGLNG+++G + L
Sbjct: 329 AEQVKELLSSFGQLKAFNLVTDVSTGVSKGYAFAEYLDPSLTDQAIAGLNGMQLGDKNLV 388
Query: 622 AVQAVLDG-SIMDNSGNPPFH--GIP-KHALPLLKKPTEVLKLKNVFNPEGFSSLSELEV 677
+ + + M + P GI H PTEVL L N+ E + E
Sbjct: 389 VQLSCANARAAMSTTAFPQIQVAGIDLSHG---AGPPTEVLCLMNMVTEE--ELKEDEEY 443
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
E++LED+R ECA++G VKS+ V
Sbjct: 444 EDILEDIREECAKYGFVKSIEV 465
>gi|410351435|gb|JAA42321.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
Length = 471
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 166/354 (46%), Gaps = 53/354 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 361
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 413
>gi|410903103|ref|XP_003965033.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Takifugu rubripes]
Length = 446
Score = 122 bits (306), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 141/277 (50%), Gaps = 41/277 (14%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL V N+P +E+++ EF N + +G+ + P + I ++K AF+
Sbjct: 120 SQMTRQARRLYVGNIPFGVTEESMAEFFNAQMRLAGLSQAPSN-PVLAVQINQDKNFAFL 178
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------ 552
EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 179 EFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVPGVVSTVVPDSPHK 238
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD T +A+AGL
Sbjct: 239 LFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCEYVDVSATDQAVAGL 298
Query: 611 NGLKVGGQVLTAVQAVLDGSIMDNSGNPP--FHGIPKHALPLLKK------PTEVLKLKN 662
NG+++G + L +A S+ + NP +P L++ PTEVL L N
Sbjct: 299 NGMQLGDKKLIVQRA----SVGAKNANPSAIIEAPVTLQVPGLQRLQNSGMPTEVLCLLN 354
Query: 663 VFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PE + + + EE+LEDVR EC ++GSV+S+ +
Sbjct: 355 MVMPEEL--VDDDDYEEILEDVREECCKYGSVRSIEI 389
>gi|358338608|dbj|GAA57078.1| splicing factor U2AF large subunit [Clonorchis sinensis]
Length = 518
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 163/369 (44%), Gaps = 79/369 (21%)
Query: 385 SPEKKSAKWDVAPV--------------ETYSVPSNVHTSNQAASSNAHEMVSSDPVTST 430
+P WDV P + +P NV+ + Q AH + P+T T
Sbjct: 118 TPPLIYKYWDVPPPGFEHVTPAQYKAMQASGQIPVNVYAAGQIPMP-AH--APNAPLTLT 174
Query: 431 TQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLS 490
T P AG +V R RRL V N+P +A+E+ +MEF N + +
Sbjct: 175 TNIPFAGSAVC-------------------RQARRLYVGNIPFTATEENMMEFFNKQMRA 215
Query: 491 SGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVE 550
G+ GS P I I EK AF+EF + ++ + L DG F LK++RP+++
Sbjct: 216 QGLVQAEGS-PIIAVQINMEKNFAFLEFRSVDETTQGLALDGILFHNQALKLRRPRDYAP 274
Query: 551 VA------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EV 584
+ +F+GG+ L+ V E++ +FGPLK ++ +
Sbjct: 275 LPGVSETPSVIVPGVVSTVVQDSPHKVFVGGLPNYLNEDQVKELLLSFGPLKGFNLVKDG 334
Query: 585 NEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIP 644
+ + AF EYVD VT A AGLNG+++G + L +A + + P +P
Sbjct: 335 STGLSKGYAFCEYVDPNVTDHACAGLNGMQLGDKKLIVQRASVGAKHNATTAAPALLQLP 394
Query: 645 KHALPLLK--------------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECAR 690
L++ PTEVL L N+ +P + E E+++EDVR EC++
Sbjct: 395 GLTDTLVQNTTGTGNITIRSGGPPTEVLCLMNMIDPA--ELEDDEEYEDIVEDVRAECSK 452
Query: 691 FGSVKSVNV 699
+G V+S+ +
Sbjct: 453 YGVVRSLEI 461
>gi|410351437|gb|JAA42322.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
Length = 475
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 167/359 (46%), Gaps = 59/359 (16%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP-----------F 640
AF EYVD VT +AIAGLNG+++G + L VQ G+ +PP
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLL-VQRASVGAKNATLVSPPSTINQTPVTLQV 360
Query: 641 HGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G+ + + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 361 PGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 417
>gi|410903105|ref|XP_003965034.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Takifugu rubripes]
Length = 454
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 141/277 (50%), Gaps = 41/277 (14%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL V N+P +E+++ EF N + +G+ + P + I ++K AF+
Sbjct: 128 SQMTRQARRLYVGNIPFGVTEESMAEFFNAQMRLAGLSQAPSN-PVLAVQINQDKNFAFL 186
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------ 552
EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 187 EFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVPGVVSTVVPDSPHK 246
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD T +A+AGL
Sbjct: 247 LFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATSLSKGYAFCEYVDVSATDQAVAGL 306
Query: 611 NGLKVGGQVLTAVQAVLDGSIMDNSGNPP--FHGIPKHALPLLKK------PTEVLKLKN 662
NG+++G + L +A S+ + NP +P L++ PTEVL L N
Sbjct: 307 NGMQLGDKKLIVQRA----SVGAKNANPSAIIEAPVTLQVPGLQRLQNSGMPTEVLCLLN 362
Query: 663 VFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PE + + + EE+LEDVR EC ++GSV+S+ +
Sbjct: 363 MVMPEEL--VDDDDYEEILEDVREECCKYGSVRSIEI 397
>gi|350417886|ref|XP_003491628.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 2
[Bombus impatiens]
Length = 428
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 161/327 (49%), Gaps = 45/327 (13%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 70 WDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT----------- 115
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I +K
Sbjct: 116 --------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINLDKN 166
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------------IFIG 556
AF+EF + ++ + A+ DG +F G LKI+RP ++ + IFIG
Sbjct: 167 FAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPDSPHKIFIG 226
Query: 557 GISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLK 614
G+ L+ + V E++ +FG L+A++ + + AF EYVD +T +AIAGLNG++
Sbjct: 227 GLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAGLNGMQ 286
Query: 615 VGGQVLTAVQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFNPEGFSSL 672
+G + L +A + + M + P +P ++ P TEVL L N+ PE +
Sbjct: 287 LGDKKLIVQRASVGAKNPMIGAQAPVQIQVPGLSMVGTSGPATEVLCLLNMVTPE--ELM 344
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNV 699
E E E++LED++ EC ++G V+SV +
Sbjct: 345 EEEEYEDILEDIKEECNKYGVVRSVEI 371
>gi|297302956|ref|XP_001119590.2| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial
[Macaca mulatta]
Length = 432
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 168/359 (46%), Gaps = 53/359 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 361
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGD 704
+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ + + D
Sbjct: 362 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRLVD 418
>gi|1334149|emb|CAA45875.1| unnamed protein product [Mus musculus]
Length = 492
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 164/358 (45%), Gaps = 57/358 (15%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 96 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 139
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 140 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 199
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 200 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 258
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 259 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 318
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLT----------AVQAVLDGSIMDNSGNPPFH 641
AF EYVD VT +AIAGLNG+++G + L A L +I
Sbjct: 319 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSLPSTINQTPVTLQVP 378
Query: 642 GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G+ + + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 379 GLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 434
>gi|195380577|ref|XP_002049047.1| GJ20975 [Drosophila virilis]
gi|194143844|gb|EDW60240.1| GJ20975 [Drosophila virilis]
Length = 428
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 164/369 (44%), Gaps = 62/369 (16%)
Query: 368 RKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPV 427
R R+ + P + R E+ ++ WD+ P+ + + + QA+ A +V P
Sbjct: 27 RPTRSSSITSQPIQVKRRKERINSSWDIPPMGYEHLSPVQYKAMQASGQIASRIVPDAPP 86
Query: 428 TSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNF 487
T SA++A + R RRL V N+P S +++ +M F N
Sbjct: 87 TGE----------SAAIATVT------------RQARRLYVGNIPFSTTDEDMMAFFNEQ 124
Query: 488 LLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRP-- 545
+ H + C +K AF+EF + ++A+ A+ DG S+ G LKI+RP
Sbjct: 125 IHRLNGTQGHDGNAVLTCQTNLDKNFAFLEFRSMDEATQAITFDGISYRGQTLKIRRPHD 184
Query: 546 ---------KEFVEVA-----------------------IFIGGISRTLSSKMVMEIVCA 573
E VE+A I+IGG+ L+ + V E++
Sbjct: 185 YHPVASITTAEIVEIAKGATQIHASNLPISPVVPDSPHKIYIGGLPTCLNEEQVKELLVT 244
Query: 574 FGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSI 631
FG L+ ++ + + AF EY+D +T +AIAGLNG+++G + L VQ + G
Sbjct: 245 FGQLRGFNLVKDTITGQSKGFAFCEYLDPSITEQAIAGLNGMQLGDRKL-VVQRSIAGVR 303
Query: 632 MDNSGNPPFHGIPKHALPL-LKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECAR 690
+ P +P + + K TEVL L N+ P L E E++ D++ ECA+
Sbjct: 304 NMVASQLPVLQVPGFPIDVSTCKATEVLCLLNMVLPSEL--LDNDEYEDIRTDIKQECAK 361
Query: 691 FGSVKSVNV 699
+G VKS+ +
Sbjct: 362 YGKVKSLKI 370
>gi|307195151|gb|EFN77144.1| Splicing factor U2AF 50 kDa subunit [Harpegnathos saltator]
Length = 402
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 161/327 (49%), Gaps = 45/327 (13%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 44 WDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT----------- 89
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I +K
Sbjct: 90 --------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINLDKN 140
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------------IFIG 556
AF+EF + ++ + A+ DG +F G LKI+RP ++ + IFIG
Sbjct: 141 FAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPDSPHKIFIG 200
Query: 557 GISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLK 614
G+ L+ + V E++ +FG L+A++ + + AF EYVD +T +AIAGLNG++
Sbjct: 201 GLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAGLNGMQ 260
Query: 615 VGGQVLTAVQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFNPEGFSSL 672
+G + L +A + + M + P +P ++ P TEVL L N+ PE +
Sbjct: 261 LGDKKLIVQRASVGAKNPMIGTQAPVQIQVPGLSMVGSSGPATEVLCLLNMVTPE--ELM 318
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNV 699
E E E++LED++ EC ++G V+SV +
Sbjct: 319 EEEEYEDILEDIKEECNKYGVVRSVEI 345
>gi|60279268|ref|NP_001012496.1| splicing factor U2AF 65 kDa subunit isoform b [Homo sapiens]
gi|164565377|ref|NP_598432.2| splicing factor U2AF 65 kDa subunit isoform 2 [Mus musculus]
gi|109461136|ref|XP_001060115.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 6 [Rattus
norvegicus]
gi|338709958|ref|XP_001496159.3| PREDICTED: splicing factor U2AF 65 kDa subunit [Equus caballus]
gi|348551787|ref|XP_003461710.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 1 [Cavia
porcellus]
gi|359318549|ref|XP_003638845.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Canis lupus
familiaris]
gi|395861318|ref|XP_003802936.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Otolemur garnettii]
gi|397471087|ref|XP_003807136.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Pan paniscus]
gi|403308602|ref|XP_003944746.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Saimiri boliviensis
boliviensis]
gi|14250571|gb|AAH08740.1| U2 small nuclear RNA auxiliary factor 2 [Homo sapiens]
gi|27695339|gb|AAH43071.1| U2af2 protein [Mus musculus]
gi|39644972|gb|AAH30574.1| U2 small nuclear RNA auxiliary factor 2 [Homo sapiens]
gi|119592810|gb|EAW72404.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_c [Homo
sapiens]
gi|148699339|gb|EDL31286.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2,
isoform CRA_a [Mus musculus]
gi|149016700|gb|EDL75886.1| similar to U2 (RNU2) small nuclear RNA auxiliary factor 2 isoform b
[Rattus norvegicus]
gi|261858294|dbj|BAI45669.1| U2 small nuclear RNA auxiliary factor 2 [synthetic construct]
gi|325463253|gb|ADZ15397.1| U2 small nuclear RNA auxiliary factor 2 [synthetic construct]
gi|380783067|gb|AFE63409.1| splicing factor U2AF 65 kDa subunit isoform b [Macaca mulatta]
gi|389618965|gb|AFK92990.1| U2 small nuclear RNA auxiliary factor 2 [Sus scrofa]
gi|410212802|gb|JAA03620.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|410260572|gb|JAA18252.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|410291502|gb|JAA24351.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|431902970|gb|ELK09152.1| Splicing factor U2AF 65 kDa subunit [Pteropus alecto]
Length = 471
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 166/354 (46%), Gaps = 53/354 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 361
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 413
>gi|197692223|dbj|BAG70075.1| U2 small nuclear RNA auxiliary factor 2 isoform b [Homo sapiens]
gi|197692475|dbj|BAG70201.1| U2 small nuclear RNA auxiliary factor 2 isoform b [Homo sapiens]
Length = 471
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 166/354 (46%), Gaps = 53/354 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 LSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 361
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 413
>gi|6005926|ref|NP_009210.1| splicing factor U2AF 65 kDa subunit isoform a [Homo sapiens]
gi|267188|sp|P26368.4|U2AF2_HUMAN RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; Short=hU2AF(65);
Short=hU2AF65; AltName: Full=U2 snRNP auxiliary factor
large subunit
gi|37545|emb|CAA45409.1| splicing factor U2AF [Homo sapiens]
gi|380783065|gb|AFE63408.1| splicing factor U2AF 65 kDa subunit isoform a [Macaca mulatta]
gi|410212804|gb|JAA03621.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|410260574|gb|JAA18253.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
gi|410291504|gb|JAA24352.1| U2 small nuclear RNA auxiliary factor 2 [Pan troglodytes]
Length = 475
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 167/359 (46%), Gaps = 59/359 (16%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP-----------F 640
AF EYVD VT +AIAGLNG+++G + L VQ G+ +PP
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLL-VQRASVGAKNATLVSPPSTINQTPVTLQV 360
Query: 641 HGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G+ + + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 361 PGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 417
>gi|389610875|dbj|BAM19048.1| U2 small nuclear riboprotein auxiliary factor 50 [Papilio polytes]
Length = 422
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 156/341 (45%), Gaps = 59/341 (17%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 50 WDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT----------- 95
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
R RRL V N+P +E+ MEF N + SG+ G+ P + C I +K
Sbjct: 96 --------RQARRLYVGNIPFGVTEEETMEFFNQQMHLSGLAQAAGN-PVLACQINLDKN 146
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-------------------- 552
AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 147 FAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGTENPAINVPAGVISTVVPD 206
Query: 553 ----IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKA 606
IFIGG+ L+ V E++ +FG L+A++ + + + AF EYVD +T +A
Sbjct: 207 SPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGYAFAEYVDISMTDQA 266
Query: 607 IAGLNGLKVGGQVLTAVQA-------VLDGSIMDNSGNPPFH-GIPKHALPLLKKPTEVL 658
IAGLNG+++G + L +A L I +G P + L TEVL
Sbjct: 267 IAGLNGMQLGDKKLIVQRASIGAKNSTLGVYIQSMTGAAPVTLQVAGLTLAGAGPATEVL 326
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ P+ E E E++LED++ EC ++G V+S+ +
Sbjct: 327 CLLNMVTPDELR--DEEEYEDILEDIKEECNKYGCVRSIEI 365
>gi|395331854|gb|EJF64234.1| hypothetical protein DICSQDRAFT_144911 [Dichomitus squalens
LYAD-421 SS1]
Length = 587
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 171/366 (46%), Gaps = 61/366 (16%)
Query: 377 KTPSPINRSP----EKKSAKWDV-AP-VETYSVPSNVHTSNQAASSNAHEMVSSDPVTST 430
++P+P + +P ++K++ WDV AP E Y T+ QA + + P +
Sbjct: 147 RSPTPPDAAPLSQRKRKASGWDVHAPGYEQY-------TAMQAKQTGLFNL----PGANR 195
Query: 431 TQKP--LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFL 488
TQ P LA + + M + +R RRL + ++ +E+ L +F N+ +
Sbjct: 196 TQIPPILAIPGLPPPMPVSTFGMGTGVNPNLSRQSRRLYIGSITPDINEQNLTDFFNSKM 255
Query: 489 LSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF 548
+ P + EK AFVEF +AEDA+AA+ DG F LKI+RPK++
Sbjct: 256 KEMNLGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLNGPLKIRRPKDY 315
Query: 549 ---------VEVA-------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNE 586
+ V IF+GG+ L+ + VME++ +FG LKA++ V E
Sbjct: 316 GGPDVIAPNMHVPGVVSTNVPDSANKIFVGGLPTYLNEEQVMELLSSFGELKAFNL-VRE 374
Query: 587 DHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHG- 642
+ P AF EYVD VT AI L+G+++G + L +A S+ G P G
Sbjct: 375 NGNGPSKGFAFFEYVDPSVTDVAIQSLSGMELGDKYLVVQRA----SVGAKPGQSPIPGM 430
Query: 643 ------IPKHALP---LLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGS 693
IPK LP L + +L + N+ PE + E ++LEDV+ EC ++G
Sbjct: 431 YDLNPEIPKPILPVGDLSESQDRILLMLNMVVPEELQ--DDQEYADILEDVKEECGKYGE 488
Query: 694 VKSVNV 699
V+ + +
Sbjct: 489 VEDLRI 494
>gi|351710523|gb|EHB13442.1| Splicing factor U2AF 65 kDa subunit [Heterocephalus glaber]
Length = 904
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 123/436 (28%), Positives = 197/436 (45%), Gaps = 69/436 (15%)
Query: 305 SPRAQKRTS---YYDREHEGLPSHSTKISSGRQHSDIDSSRVTGNGL--SGHYRRHD--G 357
SPR +K+ Y+D G H T + Q+ + + +G SG Y+
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGF-EHITPM----QYKAMQAPGPSGGRAQRSGSYQVVSCCW 133
Query: 358 STSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSN 417
+ S L +S R AA SP + +K WDV P + + + QAA
Sbjct: 134 AESSLACWSCRT----AAQSCRSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG-- 187
Query: 418 AHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSA 475
Q P + + + L V+ V + +Q R RRL V N+P
Sbjct: 188 --------------QIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGI 233
Query: 476 SEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF 535
+E+A+M+F N + G+ G+ P + I ++K AF+EF + ++ + A+ DG F
Sbjct: 234 TEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIF 292
Query: 536 SGSILKIKRPKEFVEVA------------------------IFIGGISRTLSSKMVMEIV 571
G LKI+RP ++ + +FIGG+ L+ V E++
Sbjct: 293 QGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELL 352
Query: 572 CAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG 629
+FGPLKA++ + + AF EYVD VT +AIAGLNG+++G + L +A +
Sbjct: 353 TSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGA 412
Query: 630 --SIMDNSGNPPFH----GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+ + P G+ + + PTEVL L N+ PE L + E EE++ED
Sbjct: 413 KNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVED 470
Query: 684 VRLECARFGSVKSVNV 699
VR EC+++G VKS+ +
Sbjct: 471 VRDECSKYGLVKSIEI 486
>gi|344270173|ref|XP_003406920.1| PREDICTED: LOW QUALITY PROTEIN: splicing factor U2AF 65 kDa
subunit-like [Loxodonta africana]
Length = 471
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 166/354 (46%), Gaps = 53/354 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 361
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 413
>gi|432908695|ref|XP_004077988.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Oryzias latipes]
Length = 458
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 102/356 (28%), Positives = 167/356 (46%), Gaps = 58/356 (16%)
Query: 383 NRSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAG 437
NRSP ++ K WDV P + + + QAA Q P
Sbjct: 64 NRSPHREKKKKIKKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATA 107
Query: 438 ISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQH 495
+ + + L V+ V + +Q R RRL V N+P +E+++M+F N + G+
Sbjct: 108 LLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFFNAQMRLGGLTQ 167
Query: 496 VHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 168 APGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMS 226
Query: 553 ---------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHE 589
+FIGG+ L+ V E++ +FGPLKA++ +
Sbjct: 227 ENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLS 286
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIP-- 644
+ AF EYVD + +AIAGLNG+++G + L +A + + ++ + P +P
Sbjct: 287 KGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVGSKNATLTSINQTPVTLQVPGL 346
Query: 645 -KHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PTEVL L N+ PE L + E EE++EDVR EC ++G VKS+ +
Sbjct: 347 NSSVTQMGGLPTEVLCLMNMVAPE--ELLDDEEYEEIVEDVREECGKYGQVKSIEI 400
>gi|327365322|ref|NP_001192160.1| splicing factor U2AF 65 kDa subunit isoform 1 [Mus musculus]
gi|348551789|ref|XP_003461711.1| PREDICTED: splicing factor U2AF 65 kDa subunit isoform 2 [Cavia
porcellus]
gi|392343893|ref|XP_003748811.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Rattus norvegicus]
gi|136628|sp|P26369.3|U2AF2_MOUSE RecName: Full=Splicing factor U2AF 65 kDa subunit; AltName: Full=U2
auxiliary factor 65 kDa subunit; AltName: Full=U2 snRNP
auxiliary factor large subunit
gi|55101|emb|CAA45874.1| splicing factor U2AF [Mus musculus]
gi|26347321|dbj|BAC37309.1| unnamed protein product [Mus musculus]
Length = 475
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 164/358 (45%), Gaps = 57/358 (15%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLT----------AVQAVLDGSIMDNSGNPPFH 641
AF EYVD VT +AIAGLNG+++G + L A L +I
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSLPSTINQTPVTLQVP 361
Query: 642 GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G+ + + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 GLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 417
>gi|194756144|ref|XP_001960339.1| GF13310 [Drosophila ananassae]
gi|190621637|gb|EDV37161.1| GF13310 [Drosophila ananassae]
Length = 434
Score = 120 bits (301), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 164/365 (44%), Gaps = 76/365 (20%)
Query: 390 SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNV 449
+ WDV P + YS + + AS + D Q+P A
Sbjct: 46 ATSWDVPP-DGYSHLTPLQYKAMQASGQIASRIVPD------QQPTA------------- 85
Query: 450 SMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG-SLPCIGCVIQ 508
+S + R RRL V N+P +E+ +M F N L++ G + + C
Sbjct: 86 --ESAAIAMVTRQARRLYVGNIPFGVTEEEMMGFFNQQLIALGSSSLKTDGKAVLTCQTN 143
Query: 509 REKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---------------- 552
EK AF+EF + ++A+ A+ DG F G LKI+RP ++ VA
Sbjct: 144 LEKNFAFLEFRSMDEATQAINFDGIVFRGQTLKIRRPHDYHPVASISCSEPGFATTTMTS 203
Query: 553 ------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNE 586
I+IGG+ L+ + E++ +FG LK ++ + N
Sbjct: 204 PQIVVSTTGPNHVISTLVPDSPQKIYIGGLPTCLNETQIKELLLSFGQLKGFNLVKDANT 263
Query: 587 DHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP--FHGIP 644
+ AF EYVD LVT +AIAGLNG+++G + L +++ G NSG P +P
Sbjct: 264 SLSKGFAFFEYVDPLVTEQAIAGLNGMQLGDRKLVVQRSIAGGR---NSGGVPATVLQVP 320
Query: 645 K-HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV---V 700
A+P + PTEVL L N+ PE L + E E++ D++ ECA++G V+S+ + +
Sbjct: 321 GLTAIPNTESPTEVLCLLNMVLPE--ELLDDEEYEDIRTDIQQECAKYGDVRSLKIPRPI 378
Query: 701 KYGDS 705
GDS
Sbjct: 379 PKGDS 383
>gi|403413555|emb|CCM00255.1| predicted protein [Fibroporia radiculosa]
Length = 582
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 171/365 (46%), Gaps = 60/365 (16%)
Query: 377 KTPSPINRSP----EKKSAKWDV-AP-VETYSVPSNVHTSNQAASSNAHEMVSSDPVTST 430
++P+P + +P ++K++ WDV AP E Y T+ QA + + P +
Sbjct: 143 RSPTPPDATPLLQRKRKASGWDVHAPGYEQY-------TAMQAKQTGLFNL----PGANR 191
Query: 431 TQKP--LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFL 488
TQ P LA + + + M +R RRL + ++ +E+ L +F N+ +
Sbjct: 192 TQIPPILAIPGLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITPDINEQNLADFFNSKM 251
Query: 489 LSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF 548
+ P + EK AFVEF +AEDA+AA+ DG F LKI+RPK++
Sbjct: 252 KEMSIGTGGPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIRRPKDY 311
Query: 549 VEV----------------------AIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNE 586
+ IF+GG+ L+ + VME++ +FG LKA++ V E
Sbjct: 312 GGMDSIAPSMHVPGVVSTNVPDSINKIFVGGLPTYLNEEQVMELLKSFGDLKAFNL-VRE 370
Query: 587 DHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHG- 642
+ P AF EYVD VT AI L+G+++G + L +A S+ G PP G
Sbjct: 371 NGNGPSKGFAFFEYVDPGVTDVAIQSLSGMELGDKFLVVQRA----SVGAKPGQPPIPGL 426
Query: 643 -----IPKHALP---LLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV 694
IP+ LP + +L + N+ PE + + E +V EDV+ EC+++G V
Sbjct: 427 YDQVEIPRPILPAGDVEGTDARILLMLNMVVPEDLT--DDQEYADVYEDVKEECSKYGLV 484
Query: 695 KSVNV 699
+ + +
Sbjct: 485 EDLRI 489
>gi|339243511|ref|XP_003377681.1| splicing factor U2AFsubunit [Trichinella spiralis]
gi|316973494|gb|EFV57074.1| splicing factor U2AFsubunit [Trichinella spiralis]
Length = 402
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 83/266 (31%), Positives = 141/266 (53%), Gaps = 38/266 (14%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+P +E+A+M+F N + G+ G+ P + C I +K AF+EF + +
Sbjct: 88 RRLYVGNIPFGCTEEAMMDFFNQQMHLCGLAQALGN-PILACQINLDKNFAFIEFRSIAE 146
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEVA--------------------IFIGGISRTLS 563
+AA+ DG ++ G LKI+RP+++ + +FIGG+ LS
Sbjct: 147 TTAAMAFDGINYQGQSLKIRRPRDYQPLPGQNDTLAGLVSSVVADSPYKLFIGGLPNYLS 206
Query: 564 SKMVMEIVCAFGPLKAYHF------EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
+ V E++ +FG LKA++ ++++ + AF EY D +T +AIAGLNG+++G
Sbjct: 207 EEQVKELLISFGQLKAFNLIKDPATQISKGY----AFAEYSDSTLTDQAIAGLNGMQLGD 262
Query: 618 QVLTAVQAVLDGSI--MDNSGNPPFHGIPKHAL--PLLKKPTEVLKLKNVFNPEGFSSLS 673
+ L VQ G+ M ++ P +P + P TE+L L N+ E +
Sbjct: 263 KKLV-VQLASVGAKNNMFSAAAPVAIQVPGMNVVNPAATPATEILCLMNMVVAE--ELVD 319
Query: 674 ELEVEEVLEDVRLECARFGSVKSVNV 699
E ++++ED++ EC ++GSVKSV +
Sbjct: 320 NEEYDDIVEDIKEECCKYGSVKSVEI 345
>gi|383854116|ref|XP_003702568.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Megachile
rotundata]
Length = 432
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 161/331 (48%), Gaps = 49/331 (14%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 70 WDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT----------- 115
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I +K
Sbjct: 116 --------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINLDKN 166
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-------------------- 552
AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 167 FAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPGTVPDSPHK 226
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
IFIGG+ L+ + V E++ +FG L+A++ + + AF EYVD +T +AIAGL
Sbjct: 227 IFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAGL 286
Query: 611 NGLKVGGQVLTAVQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFNPEG 668
NG+++G + L +A + + M + P +P ++ P TEVL L N+ PE
Sbjct: 287 NGMQLGDKKLIVQRASVGAKNPMIGAQAPVQIQVPGLSMVGTSGPATEVLCLLNMVTPE- 345
Query: 669 FSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ E E E++LED++ EC ++G V+SV +
Sbjct: 346 -ELMEEEEYEDILEDIKEECNKYGVVRSVEI 375
>gi|340715832|ref|XP_003396412.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Bombus
terrestris]
gi|350417884|ref|XP_003491627.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like isoform 1
[Bombus impatiens]
Length = 432
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 161/331 (48%), Gaps = 49/331 (14%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 70 WDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT----------- 115
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I +K
Sbjct: 116 --------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINLDKN 166
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-------------------- 552
AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 167 FAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPGTVPDSPHK 226
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
IFIGG+ L+ + V E++ +FG L+A++ + + AF EYVD +T +AIAGL
Sbjct: 227 IFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAGL 286
Query: 611 NGLKVGGQVLTAVQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFNPEG 668
NG+++G + L +A + + M + P +P ++ P TEVL L N+ PE
Sbjct: 287 NGMQLGDKKLIVQRASVGAKNPMIGAQAPVQIQVPGLSMVGTSGPATEVLCLLNMVTPE- 345
Query: 669 FSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ E E E++LED++ EC ++G V+SV +
Sbjct: 346 -ELMEEEEYEDILEDIKEECNKYGVVRSVEI 375
>gi|410982179|ref|XP_003997437.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Felis catus]
Length = 471
Score = 120 bits (300), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 161/341 (47%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 92 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 135
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++
Sbjct: 136 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 194
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 195 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 254
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT
Sbjct: 255 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTD 314
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A + + + P G+ + + PTEVL
Sbjct: 315 QAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVL 374
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 375 CLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 413
>gi|380016747|ref|XP_003692335.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Apis florea]
Length = 428
Score = 120 bits (300), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 97/330 (29%), Positives = 161/330 (48%), Gaps = 45/330 (13%)
Query: 390 SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNV 449
S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 67 SLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT-------- 115
Query: 450 SMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQR 509
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 116 -----------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINL 163
Query: 510 EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------------I 553
+K AF+EF + ++ + A+ D +F G LKI+RP ++ + I
Sbjct: 164 DKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPDSPHKI 223
Query: 554 FIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLN 611
FIGG+ L+ + V E++ +FG L+A++ + + AF EYVD +T +AIAGLN
Sbjct: 224 FIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAGLN 283
Query: 612 GLKVGGQVLTAVQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFNPEGF 669
G+++G + L +A + + M + P +P ++ P TEVL L N+ PE
Sbjct: 284 GMQLGDKKLIVQRASVGAKNPMIGAQAPVQIQVPGLSMVGTSGPATEVLCLLNMVTPE-- 341
Query: 670 SSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ E E E++LED++ EC ++G V+SV +
Sbjct: 342 ELMEEEEYEDILEDIKEECNKYGVVRSVEI 371
>gi|391337926|ref|XP_003743315.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Metaseiulus
occidentalis]
Length = 430
Score = 120 bits (300), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 159/344 (46%), Gaps = 55/344 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQ--KPLAGISVSASL 444
+K S WDV P + + + AS + + P+T+ Q +P+ G +++
Sbjct: 54 KKPSLYWDV-PAPGFEHITPLQYKAMQASGQIPATLLAPPITTNHQPSQPVIGSTIT--- 109
Query: 445 AKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
R RRL V N+P +E+ ++++ N + + G+ P +
Sbjct: 110 ----------------RQARRLYVGNIPFGCTEQEMIDYFNVQMHACAFAQAQGN-PVLA 152
Query: 505 CVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------ 552
C I +K AF+EF + ++ SAA+ DG +F G LKI+RP ++ +
Sbjct: 153 CQINMDKNFAFLEFRSIDETSAAMSFDGINFKGQSLKIRRPHDYQPMPGMSESQGSVIPG 212
Query: 553 ------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYV 598
+FIGG+ L+ V E++ +FG LKA++ + + AF EY
Sbjct: 213 VVSTVVQDSPHKVFIGGLPNYLNEDQVRELLMSFGQLKAFNLVKDTATGLSKGYAFCEYA 272
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNP---PFHGIPKHALPLLKKPT 655
+ +T AIAGLNG+++G + L VQ G+ N P G+P + T
Sbjct: 273 EVTITDDAIAGLNGMQLGDKKLI-VQRASVGAKNSNMAVPVQIQVPGMPNVPIGSSGPAT 331
Query: 656 EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
EVL L N+ P+ E E +++LED++ EC ++G VKS+ +
Sbjct: 332 EVLCLMNLVTPDELR--DEEEYDDILEDIQDECNKYGHVKSIEI 373
>gi|327280715|ref|XP_003225097.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Anolis carolinensis]
Length = 456
Score = 119 bits (299), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 161/341 (47%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 77 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 120
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 179
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 239
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT
Sbjct: 240 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTD 299
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A + + + P G+ + + PTEVL
Sbjct: 300 QAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVL 359
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 360 CLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGVVKSIEI 398
>gi|440633242|gb|ELR03161.1| hypothetical protein GMDG_05987 [Geomyces destructans 20631-21]
Length = 559
Score = 119 bits (299), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 102/348 (29%), Positives = 164/348 (47%), Gaps = 55/348 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+P+ + A +++
Sbjct: 174 KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA-----PRQQPMDPSKLQAFMSQ 223
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S+ + L SN R +RL V NLP + SE+ ++EF N L +G+ V GS PC+
Sbjct: 224 PSGSITNAALKPSNSRQAKRLLVHNLPKTLSEEGIVEFFN--LQLNGLNVVEGSDPCLTA 281
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSF---SGS-ILKIKRPKEFVEVA--------- 552
+ ++K A VEF T DA+ AL DG GS L I+RPK+++ A
Sbjct: 282 QVSKDKSFALVEFKTTSDATVALAMDGIGIEENGGSRALSIRRPKDYIVPAVDEAMHEPG 341
Query: 553 ------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYV 598
I I + L+ + V E++ +FG LKA+ + EE AF EYV
Sbjct: 342 VVTNVVPDTPNKISISNVPPYLTDEQVTELLVSFGELKAFVLAKDSTTEESRGIAFCEYV 401
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT--- 655
D T A+ GLNG+++G + L +A SI G+ +A+ +L T
Sbjct: 402 DAAATDIAVEGLNGMELGDKHLKVQRA----SIGTTQTAGLEMGV--NAMSMLAGTTTDG 455
Query: 656 ----EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
VL+L N+ E + + EE+LEDV+ EC ++G V + +
Sbjct: 456 LDEGRVLQLLNMVTAEEL--IDNEDYEEILEDVKEECEKYGKVLDIKI 501
>gi|444724150|gb|ELW64768.1| Splicing factor U2AF 65 kDa subunit [Tupaia chinensis]
Length = 447
Score = 119 bits (299), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 161/341 (47%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 68 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 111
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++
Sbjct: 112 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 170
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 171 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 230
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT
Sbjct: 231 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTD 290
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A + + + P G+ + + PTEVL
Sbjct: 291 QAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVL 350
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 351 CLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 389
>gi|392565476|gb|EIW58653.1| hypothetical protein TRAVEDRAFT_37512 [Trametes versicolor
FP-101664 SS1]
Length = 548
Score = 119 bits (299), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 117/410 (28%), Positives = 190/410 (46%), Gaps = 75/410 (18%)
Query: 371 RTEAAAKTPSPINRSP----EKKSAKWDV-AP-VETYSVPSNVHTSNQAASSNAHEMVSS 424
R ++P+P + +P ++K++ WDV AP E Y T+ QA + +
Sbjct: 100 RVMTTRRSPTPSDATPLSQRKRKASGWDVHAPGYEQY-------TAMQAKQTGLFNL--- 149
Query: 425 DPVTSTTQKP--LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALME 482
P + TQ P LA + + M + +R RRL + ++ +E+ L +
Sbjct: 150 -PGANRTQIPPILAIPGLPPPMPVNTFGMGTGVNPNLSRQSRRLYIGSITPDINEQNLTD 208
Query: 483 FLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKI 542
F N+ + + P + EK AFVEF +AEDA+AA+ DG F LKI
Sbjct: 209 FFNSKMKEMNLGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLNGPLKI 268
Query: 543 KRPKEF--------VEVA-------------IFIGGISRTLSSKMVMEIVCAFGPLKAYH 581
+RPK++ + V IF+GG+ L+ + VME++ +FG LKA++
Sbjct: 269 RRPKDYGGPDMLANMHVPGVVSTNVPDSANKIFVGGLPTYLNEEQVMELLSSFGELKAFN 328
Query: 582 FEVNEDHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNP 638
V E+ P AF EYVD VT AI L+G+++G + L +A S+ G
Sbjct: 329 L-VRENGNGPSKGFAFFEYVDPSVTDVAIPSLSGMELGDKYLVVQRA----SVGAKPGQS 383
Query: 639 PFHG---------IPKHALPLLKKPTE-----VLKLKNVFNPEGFSSLSELEVEEVLEDV 684
P G IPK +P+ ++ E +L + N+ PE S + E ++ EDV
Sbjct: 384 PIPGMGMFDMAPEIPKPIMPVGERDLEAMQDRILLMLNMVVPEELS--DDQEYGDLYEDV 441
Query: 685 RLECARFGSVKSVNV--------VKYGDSNISTIQACEGNENTASAGVGQ 726
+ EC ++G+V+ + + K+G+ S I A +E +AGVG+
Sbjct: 442 KEECEKYGTVEDLRIPRPVKKDKAKWGEGRESAIAAQRADE---AAGVGR 488
>gi|357623461|gb|EHJ74600.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 [Danaus
plexippus]
Length = 350
Score = 119 bits (299), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 84/269 (31%), Positives = 136/269 (50%), Gaps = 32/269 (11%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
R RRL V N+P +E+ MEF N + SG+ G+ P + C I +K AF+EF
Sbjct: 28 TRQARRLYVGNIPFGVTEEETMEFFNQQMHLSGLAQAAGN-PVLACQINLDKNFAFLEFR 86
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFI 555
+ ++ + A+ DG +F G LKI+RP ++ + IFI
Sbjct: 87 SIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGTENPAINVPAGVISTVVPDSPHKIFI 146
Query: 556 GGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGL 613
GG+ L+ V E++ +FG L+A++ + + + AF EYVD +T +AIAGLNG+
Sbjct: 147 GGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGYAFAEYVDISMTDQAIAGLNGM 206
Query: 614 KVGGQVLTAVQAVLDG--SIMDNSGNPPFH-GIPKHALPLLKKPTEVLKLKNVFNPEGFS 670
++G + L +A + S + +G P + L TEVL L N+ P+
Sbjct: 207 QLGDKKLIVQRASIGAKNSTLAMTGAAPVTLQVAGLTLAGAGPATEVLCLLNMVTPDELR 266
Query: 671 SLSELEVEEVLEDVRLECARFGSVKSVNV 699
E E E++LED++ EC ++G V+S+ +
Sbjct: 267 --DEEEYEDILEDIKEECNKYGCVRSIEI 293
>gi|195489053|ref|XP_002092574.1| GE11595 [Drosophila yakuba]
gi|194178675|gb|EDW92286.1| GE11595 [Drosophila yakuba]
Length = 437
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 139/294 (47%), Gaps = 50/294 (17%)
Query: 452 DSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGV---QHVHGSLPCIGCVIQ 508
+S + R RRL V N+P +E+ +MEF N L++ G+ Q++ G + C
Sbjct: 90 ESAAIAMITRQARRLYVGNIPFGVTEEEMMEFFNQQLMALGLEGAQYLDGK-AVLTCQTN 148
Query: 509 REKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---------------- 552
EK AF+EF + ++A+ AL DG F G ILKI+RP ++ V
Sbjct: 149 LEKNFAFLEFRSMDEATQALNFDGIIFRGQILKIRRPHDYQPVPSIRVSNMESYRSFRLP 208
Query: 553 ------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNE 586
I++GG+ L + +++ +FG LK + ++N
Sbjct: 209 ATTTTNPPISTIAVSSIVPDSPNKIYVGGLPTCLDQDQIKDLLQSFGELKGLNLVKDINT 268
Query: 587 DHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKH 646
+ AF EY+D VT AIAGL+G+++G + L VQ + G S P +P
Sbjct: 269 SLSKGFAFFEYIDPSVTDHAIAGLHGMQLGDRRLV-VQRSIPGGKNGLSVQQPIVQVPGI 327
Query: 647 ALPL-LKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ L PTE+L L N+ PE L E E++ D++ ECA++G V+S+ +
Sbjct: 328 STLLDPGSPTEILCLLNMVLPEEL--LDNEEFEDIRSDIKQECAKYGDVRSIKI 379
>gi|195999450|ref|XP_002109593.1| hypothetical protein TRIADDRAFT_21652 [Trichoplax adhaerens]
gi|190587717|gb|EDV27759.1| hypothetical protein TRIADDRAFT_21652 [Trichoplax adhaerens]
Length = 476
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 164/347 (47%), Gaps = 53/347 (15%)
Query: 386 PEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLA 445
P+K S WDV P +V + ++ E+ V + AG+ + + A
Sbjct: 93 PKKPSPYWDVPPAGYETV----------SPADYKELQRDGKVPAQGVPAAAGLGIDPTAA 142
Query: 446 KLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S Q R RRL V N+P +E+A+M+F N ++++G+ +G P +
Sbjct: 143 PGSGS-------QITRQSRRLYVGNIPFGITEQAMMDFFNEKMVTTGLTQANGD-PVLAV 194
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------VEVA- 552
I +K AF+EF + E+ + A+ DG F LKI+RPK++ + V
Sbjct: 195 QINFDKNFAFLEFRSIEETTNAMAFDGIMFQNQALKIRRPKDYQPPTGDPNSSANIHVPG 254
Query: 553 ------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYV 598
+FIGG+ L+ V E++ +FG LKA++ + + AF EYV
Sbjct: 255 VISTVVPDTPHKLFIGGLPNYLTEDQVKELLQSFGELKAFNLVKDSATGLSKGYAFCEYV 314
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG----SIMDNSGNPPFHGIPKHALP--LLK 652
VT +AIAGLN +++G + L +A + S+ SG P +P + L
Sbjct: 315 VVEVTDQAIAGLNNMQLGEKKLVVQRASVGAKHNYSVRCLSGIPVTVQVPGLQISNNALG 374
Query: 653 KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ TE+L+L N+ E + + E E+++EDVR E ++ VKS+ +
Sbjct: 375 EVTEILQLMNMVTEEEL--VDDEEYEDIIEDVRAEVSKIAPVKSLEI 419
>gi|348526426|ref|XP_003450720.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Oreochromis niloticus]
Length = 487
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 178/390 (45%), Gaps = 62/390 (15%)
Query: 353 RRHDGSTSGLGGYSPRKRRTEAAAKTPSP----INRSPEKKSAK-----WDVAPVETYSV 403
RRH + + Y ++++ + SP + RSP ++ K WDV P +
Sbjct: 59 RRHRRRSVPVCNYIWASKQSKLLLRQESPHYTGMYRSPHREKKKKVKKYWDVPPPGFEHI 118
Query: 404 PSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQL--TQSNR 461
+ + QAA Q P + + + L V+ V + +Q R
Sbjct: 119 TPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPTPVPVVGSQMTR 162
Query: 462 PMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTA 521
RRL V N+P +E+++M+F N + G+ G+ P + I ++K AF+EF +
Sbjct: 163 QARRLYVGNIPFGITEESMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSV 221
Query: 522 EDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIGG 557
++ + A+ DG F G LKI+RP ++ + +FIGG
Sbjct: 222 DETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGG 281
Query: 558 ISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKV 615
+ L+ V E++ +FGPLKA++ + + AF EYVD + +AIAGLNG+++
Sbjct: 282 LPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLNDQAIAGLNGMQL 341
Query: 616 GGQVLTAVQAVLD------GSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGF 669
G + L +A + SI G+ + PTEVL L N+ PE
Sbjct: 342 GDKKLLVQRASVGSKNATLSSINQTPVTLQVPGLNSSVTQMGGLPTEVLCLMNMVAPE-- 399
Query: 670 SSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L + E EE++EDVR EC+++G VKS+ +
Sbjct: 400 ELLDDEEYEEIVEDVRDECSKYGQVKSIEI 429
>gi|115496604|ref|NP_001068804.1| splicing factor U2AF 65 kDa subunit [Bos taurus]
gi|89994093|gb|AAI14161.1| U2 small nuclear RNA auxiliary factor 2 [Bos taurus]
gi|296477253|tpg|DAA19368.1| TPA: U2 (RNU2) small nuclear RNA auxiliary factor 2 [Bos taurus]
Length = 475
Score = 119 bits (298), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 165/358 (46%), Gaps = 57/358 (15%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG------SIMDNSGNPPFH---- 641
AF EYVD VT +AIAGLNG+++G + L +A + S + P
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLVSPLSTINQTPVTLQVP 361
Query: 642 GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G+ + + PTEVL L N+ PE L + E EE++EDVR EC ++G VKS+ +
Sbjct: 362 GLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECGKYGLVKSIEI 417
>gi|307176032|gb|EFN65791.1| Splicing factor U2AF 50 kDa subunit [Camponotus floridanus]
Length = 432
Score = 119 bits (298), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 160/331 (48%), Gaps = 49/331 (14%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 70 WDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT----------- 115
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I +K
Sbjct: 116 --------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINLDKN 166
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-------------------- 552
AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 167 FAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPGTVPDSPHK 226
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
IFIGG+ L+ V E++ +FG L+A++ + + AF EYVD +T +AIAGL
Sbjct: 227 IFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAGL 286
Query: 611 NGLKVGGQVLTAVQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFNPEG 668
NG+++G + L +A + + M + P +P ++ P TEVL L N+ PE
Sbjct: 287 NGMQLGDKKLIVQRASVGAKNPMIGAQAPVQIQVPGLSMVGTSGPATEVLCLLNMVTPE- 345
Query: 669 FSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ E E E++LED++ EC ++G V+SV +
Sbjct: 346 -ELMEEEEYEDILEDIKEECNKYGVVRSVEI 375
>gi|417410850|gb|JAA51891.1| Putative splicing factor u2af large subunit rrm superfamily,
partial [Desmodus rotundus]
Length = 455
Score = 119 bits (298), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 161/341 (47%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 76 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 119
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++
Sbjct: 120 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 178
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 179 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 238
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT
Sbjct: 239 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTD 298
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A + + + P G+ + + PTEVL
Sbjct: 299 QAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVL 358
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 359 CLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 397
>gi|432908699|ref|XP_004077990.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
[Oryzias latipes]
Length = 479
Score = 119 bits (298), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 101/356 (28%), Positives = 167/356 (46%), Gaps = 58/356 (16%)
Query: 383 NRSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAG 437
+RSP ++ K WDV P + + + QAA Q P
Sbjct: 85 SRSPHREKKKKIKKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATA 128
Query: 438 ISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQH 495
+ + + L V+ V + +Q R RRL V N+P +E+++M+F N + G+
Sbjct: 129 LLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFFNAQMRLGGLTQ 188
Query: 496 VHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 189 APGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMS 247
Query: 553 ---------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHE 589
+FIGG+ L+ V E++ +FGPLKA++ +
Sbjct: 248 ENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLS 307
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIP-- 644
+ AF EYVD + +AIAGLNG+++G + L +A + + ++ + P +P
Sbjct: 308 KGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVGSKNATLTSINQTPVTLQVPGL 367
Query: 645 -KHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PTEVL L N+ PE L + E EE++EDVR EC ++G VKS+ +
Sbjct: 368 NSSVTQMGGLPTEVLCLMNMVAPE--ELLDDEEYEEIVEDVREECGKYGQVKSIEI 421
>gi|260800970|ref|XP_002595369.1| hypothetical protein BRAFLDRAFT_113856 [Branchiostoma floridae]
gi|229280615|gb|EEN51381.1| hypothetical protein BRAFLDRAFT_113856 [Branchiostoma floridae]
Length = 524
Score = 119 bits (298), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 140/278 (50%), Gaps = 41/278 (14%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL V N+P +E+A+++F N + + + G+ P + C + +K AF+
Sbjct: 195 SQMTRQARRLYVGNIPFGVTEEAMIDFFNTQMHRASLAQAPGN-PVLACQVNLDKNFAFL 253
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------ 552
EF + ++ + A+ DG F G LK++RP ++ V
Sbjct: 254 EFRSVDETTLAMAFDGIIFQGQSLKLRRPHDYQPVPGMAENPDIHVPGGFPVIPGVVSTV 313
Query: 553 -------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVT 603
IFIGG+ L+ V E++ +FG LKA++ + + + AF EYVD VT
Sbjct: 314 VQDSPHKIFIGGLPNYLNDDQVKELLLSFGQLKAFNLVKDSSTALSKGYAFCEYVDPNVT 373
Query: 604 PKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFH-GIPKHALPL-LKKPTEVLKLK 661
+AIAGLNG+++G + L +A + N+ N P IP L PTEVL L
Sbjct: 374 DQAIAGLNGMQLGDKKLIVQRASVGAK---NAQNQPVQLQIPGLTLTGNAGPPTEVLCLM 430
Query: 662 NVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
N+ PE + E E E++LEDVR EC ++G+V SV +
Sbjct: 431 NMVMPE--ELMDEEEYEDILEDVREECGKYGAVLSVEI 466
>gi|390479436|ref|XP_002762565.2| PREDICTED: splicing factor U2AF 65 kDa subunit [Callithrix jacchus]
Length = 453
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/275 (32%), Positives = 141/275 (51%), Gaps = 35/275 (12%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++K AF+
Sbjct: 127 SQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFL 185
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------ 552
EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 186 EFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHK 245
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT +AIAGL
Sbjct: 246 LFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGL 305
Query: 611 NGLKVGGQVLTAVQAVLDG--SIMDNSGNPP----FHGIPKHALPLLKKPTEVLKLKNVF 664
NG+++G + L +A + + + P G+ + + PTEVL L N+
Sbjct: 306 NGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMV 365
Query: 665 NPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 366 LPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 398
>gi|386764550|ref|NP_001245709.1| U2 small nuclear riboprotein auxiliary factor 50, isoform C
[Drosophila melanogaster]
gi|383293439|gb|AFH07422.1| U2 small nuclear riboprotein auxiliary factor 50, isoform C
[Drosophila melanogaster]
Length = 360
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 133/279 (47%), Gaps = 47/279 (16%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
R RRL V N+P +E+ +MEF N + G+ GS P + C I +K AF+EF
Sbjct: 33 TRQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQINLDKNFAFLEFR 91
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------------------- 552
+ ++ + A+ DG + G LKI+RP ++ +
Sbjct: 92 SIDETTQAMAFDGINLKGQSLKIRRPHDYQPMPGITDTPAIKPAVVSSGVISTVVPDSPH 151
Query: 553 -IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAG 609
IFIGG+ L+ V E++ +FG L+A++ + + AF EYVD +T ++IAG
Sbjct: 152 KIFIGGLPNYLNDDQVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAG 211
Query: 610 LNGLKVGGQVLTAVQAVLDGSIMDNSGN---------PPFHGIPKHALPLLKKPTEVLKL 660
LNG+++G + L +A + N+ N P + PTEVL L
Sbjct: 212 LNGMQLGDKKLIVQRASVGAKNAQNAANTTQSVMLQVPGLSNVVTSG-----PPTEVLCL 266
Query: 661 KNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
N+ P+ E E E++LED++ EC ++G V+SV +
Sbjct: 267 LNMVTPDELR--DEEEYEDILEDIKEECTKYGVVRSVEI 303
>gi|355756173|gb|EHH59920.1| hypothetical protein EGM_10153, partial [Macaca fascicularis]
Length = 442
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 172/376 (45%), Gaps = 61/376 (16%)
Query: 365 YSPRKR--RTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMV 422
Y P R + E SP + +K WDV P + + + QAA
Sbjct: 49 YKPLTRGAKEEHGGLIRSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG------- 101
Query: 423 SSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKAL 480
Q P + + + L V+ V + +Q R RRL V N+P +E+A+
Sbjct: 102 ---------QIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAM 152
Query: 481 MEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSIL 540
M+F N + G+ G+ P + I ++K AF+EF + ++ + A+ DG F G L
Sbjct: 153 MDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSL 211
Query: 541 KIKRPKEFVEVA------------------------IFIGGISRTLSSKMVMEIVCAFGP 576
KI+RP ++ + +FIGG+ L+ V E++ +FGP
Sbjct: 212 KIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGP 271
Query: 577 LKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN 634
LKA++ + + AF EYVD VT +AIAGLNG+++G + L VQ G+
Sbjct: 272 LKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLL-VQRASVGAKNAT 330
Query: 635 SGNPP-----------FHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+PP G+ + + PTEVL L N+ PE L + E EE++ED
Sbjct: 331 LVSPPSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVED 388
Query: 684 VRLECARFGSVKSVNV 699
VR EC+++G VKS+ +
Sbjct: 389 VRDECSKYGLVKSIEI 404
>gi|297277970|ref|XP_001091568.2| PREDICTED: splicing factor U2AF 65 kDa subunit [Macaca mulatta]
Length = 471
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 168/362 (46%), Gaps = 64/362 (17%)
Query: 382 INRSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLA 436
+ RSP + K WDV P + + + QAA Q P
Sbjct: 72 LIRSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPAT 115
Query: 437 GISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ 494
+ + + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 116 ALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLT 175
Query: 495 HVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 176 QAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGM 234
Query: 553 ----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDH 588
+FIGG+ L+ V E++ +FGPLKA++ +
Sbjct: 235 SENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 294
Query: 589 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP--------- 639
+ AF EYVD VT +AIAGLNG+++G + L VQ G+ +PP
Sbjct: 295 SKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLL-VQRASVGAKNATLVSPPSTINQTPVT 353
Query: 640 --FHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSV 697
G+ + + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+
Sbjct: 354 LQVPGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSI 411
Query: 698 NV 699
+
Sbjct: 412 EI 413
>gi|71480064|ref|NP_001025127.1| U2 small nuclear RNA auxiliary factor 2a [Danio rerio]
gi|68533572|gb|AAH98548.1| U2 small nuclear RNA auxiliary factor 2a [Danio rerio]
Length = 465
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/348 (28%), Positives = 161/348 (46%), Gaps = 66/348 (18%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 85 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPEGLAVTPT 128
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+++M+F N + G+ G+ P + I ++
Sbjct: 129 PVPVVGSQMTRQARRLYVGNIPFGITEESMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 187
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 188 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 247
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD ++
Sbjct: 248 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNISD 307
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK----------- 653
+AIAGLNG+++G + L +A + S N GI + + L
Sbjct: 308 QAIAGLNGMQLGDKKLLVQRASV------GSKNTTLTGINQTPVTLQVPGLMNSSVNQMG 361
Query: 654 --PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 GIPTEVLCLMNMVAPE--ELLDDEEYEEIVEDVRDECSKYGQVKSIEI 407
>gi|405976087|gb|EKC40607.1| Splicing factor U2AF 50 kDa subunit [Crassostrea gigas]
Length = 428
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/287 (33%), Positives = 146/287 (50%), Gaps = 42/287 (14%)
Query: 449 VSMDSVQLTQSNRPM---------RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS 499
+S+ +V LT + P RRL V N+P +E+A+M+F N+ + +G+ GS
Sbjct: 91 ISVPNVALTNTTVPFAGSAISRQARRLYVGNIPFGVTEEAMMDFFNHQMKMTGLAQAEGS 150
Query: 500 LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------- 552
P I I +K AF+EF + ++ + A+ DG +F G LKI+RP+++ +
Sbjct: 151 -PVIAVQINLDKNFAFLEFRSVDETTQAMAFDGINFQGQSLKIRRPRDYQPLPGMAETPS 209
Query: 553 -----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCA 593
IFIGG+ L+ V E++ +FGPLKA++ + + A
Sbjct: 210 VNVPGVVSTVVQDSPHKIFIGGLPNYLNEDQVKELLTSFGPLKAFNLVKDSATGLSKGYA 269
Query: 594 FIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPL-LK 652
F EYVD VT + AGLNG+++G + L +A L NS P IP L
Sbjct: 270 FCEYVDPNVTDQGCAGLNGMQLGDKKLIVQRASLGAK---NSQVPVQLQIPGLNLNQGAG 326
Query: 653 KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ PE E E E++LEDV+ EC+++G V+S+ +
Sbjct: 327 PPTEVLCLMNMIVPE--ELEDEEEYEDILEDVKEECSKYGVVRSIEI 371
>gi|355703931|gb|EHH30422.1| hypothetical protein EGK_11092, partial [Macaca mulatta]
Length = 453
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 172/376 (45%), Gaps = 61/376 (16%)
Query: 365 YSPRKR--RTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMV 422
Y P R + E SP + +K WDV P + + + QAA
Sbjct: 49 YKPLTRGAKEEHGGLIRSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG------- 101
Query: 423 SSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKAL 480
Q P + + + L V+ V + +Q R RRL V N+P +E+A+
Sbjct: 102 ---------QIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAM 152
Query: 481 MEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSIL 540
M+F N + G+ G+ P + I ++K AF+EF + ++ + A+ DG F G L
Sbjct: 153 MDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSL 211
Query: 541 KIKRPKEFVEVA------------------------IFIGGISRTLSSKMVMEIVCAFGP 576
KI+RP ++ + +FIGG+ L+ V E++ +FGP
Sbjct: 212 KIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGP 271
Query: 577 LKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN 634
LKA++ + + AF EYVD VT +AIAGLNG+++G + L VQ G+
Sbjct: 272 LKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLL-VQRASVGAKNAT 330
Query: 635 SGNPP-----------FHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+PP G+ + + PTEVL L N+ PE L + E EE++ED
Sbjct: 331 LVSPPSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVED 388
Query: 684 VRLECARFGSVKSVNV 699
VR EC+++G VKS+ +
Sbjct: 389 VRDECSKYGLVKSIEI 404
>gi|322792032|gb|EFZ16137.1| hypothetical protein SINV_12499 [Solenopsis invicta]
Length = 344
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 140/259 (54%), Gaps = 23/259 (8%)
Query: 461 RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLT 520
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I +K AF+EF +
Sbjct: 32 RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINLDKNFAFLEFRS 90
Query: 521 AEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------------IFIGGISRTLSS 564
++ + A+ DG +F G LKI+RP ++ + IFIGG+ L+
Sbjct: 91 IDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPDSPHKIFIGGLPNYLNE 150
Query: 565 KMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTA 622
+ V E++ +FG L+A++ + + AF EYVD +T +AIAGLNG+++G + L
Sbjct: 151 EQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAGLNGMQLGDKKLIV 210
Query: 623 VQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFNPEGFSSLSELEVEEV 680
+A + + M + P +P ++ P TEVL L N+ PE + E E E++
Sbjct: 211 QRASVGAKNPMIGAQAPVQIQVPGLSMVGTSGPATEVLCLLNMVTPE--ELMEEEEYEDI 268
Query: 681 LEDVRLECARFGSVKSVNV 699
LED++ EC ++G V+SV +
Sbjct: 269 LEDIKEECNKYGVVRSVEI 287
>gi|301782083|ref|XP_002926459.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Ailuropoda
melanoleuca]
Length = 496
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 166/354 (46%), Gaps = 53/354 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 104 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 147
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 148 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 207
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 208 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 266
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 267 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 326
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 327 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 386
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 387 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 438
>gi|310793965|gb|EFQ29426.1| U2 snRNP auxilliary factor [Glomerella graminicola M1.001]
Length = 549
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 118/430 (27%), Positives = 187/430 (43%), Gaps = 96/430 (22%)
Query: 364 GYSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTS 410
G+ R+R+ A TP P R P +++ +WD+ P +V T+
Sbjct: 124 GFGRRERQRSA---TPPPKKREPTPDLTDVTSVLERKRRLTQWDIKPPGYENV-----TA 175
Query: 411 NQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVE 469
QA S + + Q+P+ + A + + ++S L SN R +RL +
Sbjct: 176 EQAKLSGMFPLPGA-----PRQQPMDPSKLQAIMNQPGGQVNSAALKPSNSRQAKRLLIN 230
Query: 470 NLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALC 529
NLP SA+E+ + F N L +G+ + + PC C + ++ A VEF A +A+ AL
Sbjct: 231 NLPPSATEEGIQNFFN--LQLNGLNIIESTDPCTSCQVSKDHSFAVVEFRNASEATVALA 288
Query: 530 CDGCSF------SGSI----LKIKRPKEFVEVAIF----------------------IGG 557
DG S +G+ L I+RPK+++ A+ +
Sbjct: 289 LDGISMEAEDATNGAAADQGLVIRRPKDYIVPAVVDDVPYEPGVVSNVVVDTHNKISVAN 348
Query: 558 ISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKV 615
+ LS + V E++ +FG LKA+ ++ EE AF EYVD T AI GLNG+ +
Sbjct: 349 MPVYLSEEQVSELLVSFGELKAFVLVRDKSTEESRGIAFCEYVDPAATDVAIQGLNGMDL 408
Query: 616 GGQVLTAVQAVLDGSIMDNSGNPPFHGIPK--HALPLL-------KKPTEVLKLKNVFNP 666
G + L +A + G G+ A+ +L + T VL+L N+ P
Sbjct: 409 GDKRLKVQKASV--------GVTQVAGVEMGVAAMSMLAGTTSTDSEETRVLQLLNMVTP 460
Query: 667 EGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK-YGDSNISTIQACEGNENTASAGVG 725
E + + EE+ EDV+ ECA+FG+V V + + G S SAGVG
Sbjct: 461 EEL--MDNDDYEEIKEDVQEECAKFGTVLDVKIPRPVGGSR-------------QSAGVG 505
Query: 726 QNLTNDETNE 735
+ ET E
Sbjct: 506 KIFVKFETKE 515
>gi|63101571|gb|AAH94451.1| U2af2 protein, partial [Mus musculus]
Length = 403
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 166/354 (46%), Gaps = 53/354 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 11 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 54
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 55 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 114
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 115 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 173
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 174 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 233
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 234 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 293
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 294 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 345
>gi|299745153|ref|XP_001831503.2| rRNA primary transcript binding protein [Coprinopsis cinerea
okayama7#130]
gi|298406457|gb|EAU90350.2| rRNA primary transcript binding protein [Coprinopsis cinerea
okayama7#130]
Length = 550
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 148/308 (48%), Gaps = 51/308 (16%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
+R RRL + ++ +E+ L EF N + + P + EK AFVEF
Sbjct: 192 SRQSRRLYIGSITPDVNEQNLAEFFNKKMAEMNIGTGSTGNPVLAVQCNYEKNYAFVEFR 251
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEF----------VEVA-------------IFIG 556
+A+DA+AA+ DG F LKI+RPK++ + V +F+G
Sbjct: 252 SADDATAAMAFDGIIFINGPLKIRRPKDYGGEVVTGSPGIHVPGAVSTNVPDSINKVFVG 311
Query: 557 GISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP---CAFIEYVDQLVTPKAIAGLNGL 613
G+ L+ + VME++ +FG LKA++ V E+ P AF EYVD VT AI LNG+
Sbjct: 312 GLPTYLNEEQVMELLKSFGELKAFNL-VRENGNGPSKGFAFFEYVDSSVTDVAIQSLNGM 370
Query: 614 KVGGQVLTAVQAVLDGSIMDNSGNP--PFHGIPKHALPLLKKPTEV-----LKLKNVFNP 666
++G + L +A S+ G P P+ P P++ EV L + N+ P
Sbjct: 371 ELGDRYLVVQRA----SVGAKPGAPGLPYDQFPDIPRPIMPAGAEVTDARILLMLNMVTP 426
Query: 667 EGFSSLSELEVEEVLEDVRLECARFGSVKSVNV--------VKYGDSNISTIQACEGNEN 718
+ + + E ++ EDV+ EC+++G V+ + + K+G+ IS A +E
Sbjct: 427 DDL--IDDEEYGDLYEDVKEECSKYGEVEDLRIPRPVKKDKAKWGEGQISAQDAQRIDE- 483
Query: 719 TASAGVGQ 726
+AGVG+
Sbjct: 484 --AAGVGR 489
>gi|66520699|ref|XP_623055.1| PREDICTED: splicing factor U2AF 50 kDa subunit [Apis mellifera]
Length = 432
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/334 (29%), Positives = 161/334 (48%), Gaps = 49/334 (14%)
Query: 390 SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNV 449
S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 67 SLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT-------- 115
Query: 450 SMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQR 509
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 116 -----------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINL 163
Query: 510 EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------------- 552
+K AF+EF + ++ + A+ D +F G LKI+RP ++ +
Sbjct: 164 DKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPGTVPDS 223
Query: 553 ---IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAI 607
IFIGG+ L+ + V E++ +FG L+A++ + + AF EYVD +T +AI
Sbjct: 224 PHKIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAI 283
Query: 608 AGLNGLKVGGQVLTAVQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFN 665
AGLNG+++G + L +A + + M + P +P ++ P TEVL L N+
Sbjct: 284 AGLNGMQLGDKKLIVQRASVGAKNPMIGAQAPVQIQVPGLSMVGTSGPATEVLCLLNMVT 343
Query: 666 PEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PE + E E E++LED++ EC ++G V+SV +
Sbjct: 344 PE--ELMEEEEYEDILEDIKEECNKYGVVRSVEI 375
>gi|358059688|dbj|GAA94557.1| hypothetical protein E5Q_01209 [Mixia osmundae IAM 14324]
Length = 564
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 168/373 (45%), Gaps = 48/373 (12%)
Query: 371 RTEAAAKTPS---PINRSPEKKSAKWDVAPVETYSVPS-NVHTSNQAASSNAHEMVSSDP 426
R E KTP PI++ ++A WDV P+ +V + S + +V P
Sbjct: 133 REEVREKTPENTIPISKRRRAQTA-WDVRPIGFETVSAETARMSGHFLLPGQNGVVRFPP 191
Query: 427 VTSTTQKPLAGISVSASLAKLNVSMDSVQ-LTQSNRPMRRLCVENLPLSASEKALMEFLN 485
+ G+++S + M VQ + R RRL V N+ +A E+ + EF N
Sbjct: 192 GFHEGRGAFGGLNMSGA-GSAAAPMGGVQPIISFARQQRRLYVGNIMPTADEQNVTEFFN 250
Query: 486 NFLLSSGVQ------HVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI 539
+ +G+ V + P + + EK AFVEF + E+AS+A+ DG F
Sbjct: 251 AKMRENGLSLDDKKVDVQTADPVVSVQVNHEKSYAFVEFRSPEEASSAMSFDGIVFQDQQ 310
Query: 540 LKIKRPKEF--------------------VEVAIFIGGISRTLSSKMVMEIVCAFGPLKA 579
LKI+RPK++ +F+GG+ L + V+E++ +FG L++
Sbjct: 311 LKIRRPKDYTGDESGGTHLPGVISSNVPDTPNKVFVGGLPSYLDDEQVLELLSSFGELRS 370
Query: 580 YHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG- 636
++ E ++ + AF EY D VT A AGLNG+++G + L +A + ++ + G
Sbjct: 371 FNLVKEGPQNASKGFAFCEYADPNVTDAACAGLNGMEIGDRYLVVQRAQVGANVYKHPGG 430
Query: 637 ----NPPFHGIPKHALPLL------KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRL 686
NP P + T L++ N+ PE + + + ++ ED++
Sbjct: 431 YGGSNPALPPALARVAPTIFGQDETAPATRCLQMLNMVTPEEL--VDDQDYADINEDIKD 488
Query: 687 ECARFGSVKSVNV 699
EC+++G V V +
Sbjct: 489 ECSKYGEVIDVKI 501
>gi|332026432|gb|EGI66560.1| Splicing factor U2AF 50 kDa subunit [Acromyrmex echinatior]
Length = 435
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/332 (29%), Positives = 156/332 (46%), Gaps = 48/332 (14%)
Query: 393 WDVAPVETYSVPSNVHTSNQAA-----SSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
WDV P P H + A A + P P A + V S
Sbjct: 70 WDVPP------PGFEHITPLQARLLILYYKAMQAAGQIPANIVADTPQAAVPVVGSTI-- 121
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 122 ------------TRQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQI 168
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 169 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDNPSMNVPDSPH 228
Query: 553 -IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAG 609
IFIGG+ L+ + V E++ +FG L+A++ + + AF EYVD +T +AIAG
Sbjct: 229 KIFIGGLPNYLNEEQVKELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVDVSMTDQAIAG 288
Query: 610 LNGLKVGGQVLTAVQAVLDG-SIMDNSGNPPFHGIPKHALPLLKKP-TEVLKLKNVFNPE 667
LNG+++G + L +A + + M + P +P ++ P TEVL L N+ PE
Sbjct: 289 LNGMQLGDKKLIVQRASVGAKNPMIGAQAPVQIQVPGLSMVGTSGPATEVLCLLNMVTPE 348
Query: 668 GFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ E E E++LED++ EC ++G V+SV +
Sbjct: 349 --ELMEEEEYEDILEDIKEECNKYGVVRSVEI 378
>gi|195124159|ref|XP_002006561.1| GI21125 [Drosophila mojavensis]
gi|193911629|gb|EDW10496.1| GI21125 [Drosophila mojavensis]
Length = 427
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 167/367 (45%), Gaps = 71/367 (19%)
Query: 382 INRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVS 441
+ R E+ + WD+ P+ + + + QA+ A +V P T S
Sbjct: 41 VKRRKERVRSSWDIPPLGYEHLTPVRYKAMQASGQIASRIVPDAPPTGE----------S 90
Query: 442 ASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFL-LSSGVQHVHGSL 500
A++A + R RRL V N+P S +++ +M F N + +G V G+
Sbjct: 91 AAIATVT------------RQARRLYVGNIPFSTTDEDMMAFFNEQINRLNGTNGVDGN- 137
Query: 501 PCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-------- 552
+ C +K AF+EF + ++A+ A+ DG + G LKI+RP ++ +A
Sbjct: 138 AVLTCQTNLDKNFAFLEFRSMDEATQAINFDGILYRGQTLKIRRPHDYHPMASVSSSEAA 197
Query: 553 -------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVN 585
I++GG+ L+ + V E++ FG L+ ++ E
Sbjct: 198 DAAKGSATHVNSVPISPMVPDSPHKIYVGGLPTCLNEEQVKELLVTFGKLRGFNLVKEAV 257
Query: 586 EDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK 645
+ AF EYVD +T +AIAGLNG+++G + L VQ + G + P +P
Sbjct: 258 TGQSKGFAFCEYVDPCITEQAIAGLNGMQLGDRKLI-VQRSIAGVRNLVANQLPVLQVPG 316
Query: 646 HALPL-LKKPTEVLKLKNVFNPEGFSSLSEL----EVEEVLEDVRLECARFGSVKSVNVV 700
+ + K TEVL L N+ P SEL E +++ D++ ECA++G VKS+ +
Sbjct: 317 FPVDVSTGKATEVLCLLNMVLP------SELTDDDEYDDIRTDIKQECAKYGKVKSLKIP 370
Query: 701 KYGDSNI 707
+ GD +I
Sbjct: 371 RPGDDSI 377
>gi|195429288|ref|XP_002062695.1| GK19586 [Drosophila willistoni]
gi|194158780|gb|EDW73681.1| GK19586 [Drosophila willistoni]
Length = 466
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 156/375 (41%), Gaps = 72/375 (19%)
Query: 368 RKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPV 427
R RR+ AAK+ + ++ + WD AP+ + + + QA+ A +V
Sbjct: 63 RPRRSLTAAKS---LVGKKQRAPSSWDKAPLGYEHLTPVQYKAMQASGQIASRIVPD--- 116
Query: 428 TSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNF 487
+ +S L R RRL V N+P ++K +M F N
Sbjct: 117 -------------------ILPEAESPALAMVTRQARRLYVGNIPFGVTDKEMMNFFNVQ 157
Query: 488 LLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKE 547
L S G++ H P + C EK AF+EF + + + A+ DG +F G LKI+RP +
Sbjct: 158 LQSLGLKQFHDGTPVLTCQTNLEKNFAFLEFRSMGETTQAIAFDGVNFRGQTLKIRRPHD 217
Query: 548 FVEVA--------------------------------------IFIGGISRTLSSKMVME 569
+ V I+IG + L + E
Sbjct: 218 YHPVTSLSSLETVGLSDTIVTSAHTPVPMKDLVSTLVPDSPQKIYIGSLPPCLDEAQIKE 277
Query: 570 IVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG L+ ++ + N + AF EYVD VT +AIAGLNG+ +G + L +++
Sbjct: 278 LLLSFGRLRGFNLVKDANTGMSKGYAFFEYVDSAVTEQAIAGLNGMLLGDRRLVVQRSIA 337
Query: 628 DGSIMDNSGNPPFHGIPKHALPLL---KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDV 684
G N +P P + TE+L L N+ PE L + E E++ D+
Sbjct: 338 GGRNASNHSPASVLQVP--GFPSVFSTGAATEILCLLNMVQPEDL--LDDEEYEDICVDI 393
Query: 685 RLECARFGSVKSVNV 699
+ EC + G VK + +
Sbjct: 394 KQECDKHGKVKGLKI 408
>gi|348526424|ref|XP_003450719.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 1
[Oreochromis niloticus]
Length = 475
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/359 (28%), Positives = 166/359 (46%), Gaps = 58/359 (16%)
Query: 380 SPINRSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKP 434
S + RSP ++ K WDV P + + + QAA Q P
Sbjct: 78 SKLLRSPHREKKKKVKKYWDVPPPGFEHITPMQYKAMQAAG----------------QIP 121
Query: 435 LAGISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSG 492
+ + + L V+ V + +Q R RRL V N+P +E+++M+F N + G
Sbjct: 122 ATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFFNAQMRLGG 181
Query: 493 VQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA 552
+ G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 182 LTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLP 240
Query: 553 ------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNE 586
+FIGG+ L+ V E++ +FGPLKA++ +
Sbjct: 241 GMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSAT 300
Query: 587 DHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD------GSIMDNSGNPPF 640
+ AF EYVD + +AIAGLNG+++G + L +A + SI
Sbjct: 301 GLSKGYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVGSKNATLSSINQTPVTLQV 360
Query: 641 HGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 361 PGLNSSVTQMGGLPTEVLCLMNMVAPE--ELLDDEEYEEIVEDVRDECSKYGQVKSIEI 417
>gi|432908697|ref|XP_004077989.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Oryzias latipes]
Length = 474
Score = 117 bits (294), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 166/355 (46%), Gaps = 58/355 (16%)
Query: 384 RSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
RSP ++ K WDV P + + + QAA Q P +
Sbjct: 81 RSPHREKKKKIKKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATAL 124
Query: 439 SVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHV 496
+ + L V+ V + +Q R RRL V N+P +E+++M+F N + G+
Sbjct: 125 LPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEESMMDFFNAQMRLGGLTQA 184
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 185 PGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSE 243
Query: 553 --------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEE 590
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 244 NPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSK 303
Query: 591 PCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIP--- 644
AF EYVD + +AIAGLNG+++G + L +A + + ++ + P +P
Sbjct: 304 GYAFCEYVDVNLNDQAIAGLNGMQLGDKKLLVQRASVGSKNATLTSINQTPVTLQVPGLN 363
Query: 645 KHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PTEVL L N+ PE L + E EE++EDVR EC ++G VKS+ +
Sbjct: 364 SSVTQMGGLPTEVLCLMNMVAPE--ELLDDEEYEEIVEDVREECGKYGQVKSIEI 416
>gi|440900150|gb|ELR51345.1| Splicing factor U2AF 65 kDa subunit, partial [Bos grunniens mutus]
Length = 411
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 165/354 (46%), Gaps = 53/354 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 18 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 61
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 62 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 121
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 122 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 180
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 181 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 240
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 241 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 300
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC ++G VKS+ +
Sbjct: 301 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECGKYGLVKSIEI 352
>gi|390596686|gb|EIN06087.1| hypothetical protein PUNSTDRAFT_145447 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 602
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 172/364 (47%), Gaps = 59/364 (16%)
Query: 377 KTPSPINRSP----EKKSAKWDV-AP-VETYSVPSNVHTSNQAASSNAHEMVSSDPVTST 430
++P+P P ++K++ WDV AP E Y T+ QA + + P +
Sbjct: 160 RSPTPEGAVPLSQRKRKASGWDVHAPGYEQY-------TAMQAKQTGLFPL----PGANR 208
Query: 431 TQKP--LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFL 488
TQ P LA + + + M +R RRL + ++ +E+ L +F N+ +
Sbjct: 209 TQVPPILAVPGLPPPMPVPSFGMGIGGNPNLSRQSRRLYIGSITPEITEQNLADFFNSKM 268
Query: 489 LSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF 548
+ + P + EK AFVEF +AEDA+AA+ DG F LKI+RPK++
Sbjct: 269 IEMSIGTGGPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLNGPLKIRRPKDY 328
Query: 549 ---------VEVA-------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNE 586
+ V +F+GG+ L+ + VME++ +FG LKA++ V E
Sbjct: 329 GGPDPMGAGLHVPGVVSTNVPDSINKVFVGGLPAYLNEEQVMELLTSFGELKAFNL-VRE 387
Query: 587 DHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNP--PFH 641
+ P AF EYVD+ VT AI LNG+++G + L +A S+ G P P+
Sbjct: 388 NGNGPSKGFAFFEYVDESVTDVAIQALNGMELGDRYLVVQRA----SVGAKPGMPNLPYE 443
Query: 642 GIPKHALPLLK------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVK 695
P+ P++ + +L + ++ PE + + E ++ EDV+ EC +FG+V+
Sbjct: 444 QFPELPRPIMPAGDVSNRDARILLMLSMVVPEDL--VDDQEYADICEDVKEECEKFGAVE 501
Query: 696 SVNV 699
+ +
Sbjct: 502 DLRI 505
>gi|242019185|ref|XP_002430045.1| Splicing factor U2AF 50 kDa subunit, putative [Pediculus humanus
corporis]
gi|212515110|gb|EEB17307.1| Splicing factor U2AF 50 kDa subunit, putative [Pediculus humanus
corporis]
Length = 445
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 158/339 (46%), Gaps = 54/339 (15%)
Query: 390 SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNV 449
S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 75 SLYWDVPPPGFEHITPLQYKAMQAAGQIPANVVADTPQAAV---PVVGSTIT-------- 123
Query: 450 SMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQR 509
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 124 -----------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQINL 171
Query: 510 EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-----------VEVA------ 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ V V
Sbjct: 172 DKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMSENPSVNVPAGVIST 231
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH--EEPCAFIEYVDQLV 602
IFIGG+ L+ + E++ +FG L+A++ ++ + AF +VD V
Sbjct: 232 VVPDSPHKIFIGGLPNYLNEDQLKELLMSFGQLRAFNLVMDSTTGLSKGYAFCLFVDINV 291
Query: 603 TPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFH-GIPK-HALPLLKKPTEVLKL 660
T +AIAGLNG+++G + L +A + P +P ++ + PTEVL L
Sbjct: 292 TDQAIAGLNGMQLGDKKLIVQRASVGAKNTALGQQAPVQIQVPGLTSVGMSGPPTEVLCL 351
Query: 661 KNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
N+ P + E E E++LED++ EC + G VKS+ +
Sbjct: 352 LNMVTPSELN--DEEEYEDILEDIKEECNKHGVVKSLEI 388
>gi|226483519|emb|CAX74060.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
gi|226483521|emb|CAX74061.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
Length = 518
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 164/378 (43%), Gaps = 97/378 (25%)
Query: 385 SPEKKSAKWDVAPV--------------ETYSVPSNVHTSNQAASSNAHEMVSSDPVTST 430
SP WDV P + +P NV+ + Q H + P+T T
Sbjct: 118 SPTLVYKYWDVPPPGFEHVTPAQYKALQASGQIPVNVYAAGQVPMP-VH--APNAPLTLT 174
Query: 431 TQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLS 490
T P AG +V R RRL V N+P +A+E+ +MEF N + +
Sbjct: 175 TNVPFAGSAVC-------------------RQARRLYVGNIPFTATEENMMEFFNKQMRA 215
Query: 491 SGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVE 550
G+ G+ P I I EK AF+EF + ++ + L DG F LK++RP+++
Sbjct: 216 QGLIQAEGN-PIIAVQINMEKNFAFLEFRSVDETTQGLALDGVLFQNQALKLRRPRDYAP 274
Query: 551 VA------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EV 584
+ IF+GG+ L+ V E++ +FGPLK ++ +
Sbjct: 275 LPGVSEQPSVIVPGVVSTVVQDSPHKIFVGGLPNYLNEDQVKELLLSFGPLKGFNLVKDG 334
Query: 585 NEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVG--------------------GQVLTAVQ 624
+ + AF EYVD VT A AGLNG+++G Q L ++
Sbjct: 335 STGLSKGYAFCEYVDANVTDHACAGLNGMQLGDKKLIVQRASVGAKHTTGVLPQTLLSLP 394
Query: 625 AVLDGSIMDNSGNPPF---HGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVL 681
+ DG++ + +G+ G P PTEVL L N+ E + E E+++
Sbjct: 395 GLEDGTVQNTTGSGNITIRSGGP---------PTEVLCLMNMI--ETSELEDDEEYEDIV 443
Query: 682 EDVRLECARFGSVKSVNV 699
EDVR EC+++G V+S+ +
Sbjct: 444 EDVRAECSKYGVVRSLEI 461
>gi|47221657|emb|CAF97922.1| unnamed protein product [Tetraodon nigroviridis]
Length = 458
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 161/341 (47%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 79 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 122
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+++M+F N + G+ G+ P + I ++
Sbjct: 123 PVPVVGSQMTRQARRLYVGNIPFGITEESMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 181
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 182 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 241
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD +
Sbjct: 242 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLND 301
Query: 605 KAIAGLNGLKVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIP---KHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A + + ++ + P +P + PTEVL
Sbjct: 302 QAIAGLNGMQLGDKKLLVQRASVGSKNATLTSINQTPVTLQVPGLNSSVTQMGGVPTEVL 361
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 CLMNMVAPE--ELLDDEEYEEIVEDVRDECSKYGQVKSIEI 400
>gi|410903109|ref|XP_003965036.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 4
[Takifugu rubripes]
Length = 455
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 142/286 (49%), Gaps = 50/286 (17%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL V N+P +E+++ EF N + +G+ + P + I ++K AF+
Sbjct: 120 SQMTRQARRLYVGNIPFGVTEESMAEFFNAQMRLAGLSQAPSN-PVLAVQINQDKNFAFL 178
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------ 552
EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 179 EFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYRPLPGISEQPVFHVPGVVSTVVPDSPHK 238
Query: 553 IFIGGISRTLSSKMVM---------EIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQL 601
+FIGG+ L+ V+ E++ +FGPLKA++ + + AF EYVD
Sbjct: 239 LFIGGLPNYLNDDQVLIRRLGWRVKELLTSFGPLKAFNLVKDSATSLSKGYAFCEYVDVS 298
Query: 602 VTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP--FHGIPKHALPLLKK------ 653
T +A+AGLNG+++G + L +A S+ + NP +P L++
Sbjct: 299 ATDQAVAGLNGMQLGDKKLIVQRA----SVGAKNANPSAIIEAPVTLQVPGLQRLQNSGM 354
Query: 654 PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ PE + + + EE+LEDVR EC ++GSV+S+ +
Sbjct: 355 PTEVLCLLNMVMPEEL--VDDDDYEEILEDVREECCKYGSVRSIEI 398
>gi|256074204|ref|XP_002573416.1| splicing factor u2af large subunit [Schistosoma mansoni]
gi|238658595|emb|CAZ29648.1| splicing factor u2af large subunit, putative [Schistosoma mansoni]
Length = 521
Score = 116 bits (291), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 164/378 (43%), Gaps = 97/378 (25%)
Query: 385 SPEKKSAKWDVAPV--------------ETYSVPSNVHTSNQAASSNAHEMVSSDPVTST 430
SP WDV P + +P NV+ + Q H + P+T T
Sbjct: 121 SPTLVYKYWDVPPPGFEHVTPAQYKALQASGQIPVNVYAAGQVPMP-VH--APNAPLTLT 177
Query: 431 TQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLS 490
T P AG +V R RRL V N+P +A+E+ +MEF N + +
Sbjct: 178 TNVPFAGSAVC-------------------RQARRLYVGNIPFTATEENMMEFFNKQMRA 218
Query: 491 SGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVE 550
G+ G+ P I I EK AF+EF + ++ + L DG F LK++RP+++
Sbjct: 219 QGLIQAEGN-PIIAVQINMEKNFAFLEFRSVDETTQGLALDGVLFQNQALKLRRPRDYAP 277
Query: 551 VA------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EV 584
+ IF+GG+ L+ V E++ +FGPLK ++ +
Sbjct: 278 LPGVSEQPSVIVPGVVSTVVQDSPHKIFVGGLPNYLNEDQVKELLLSFGPLKGFNLVKDG 337
Query: 585 NEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVG--------------------GQVLTAVQ 624
+ + AF EYVD VT A AGLNG+++G Q L ++
Sbjct: 338 STGLSKGYAFCEYVDANVTDHACAGLNGMQLGDKKLIVQRASVGAKHTTGVLPQTLLSLP 397
Query: 625 AVLDGSIMDNSGNPPF---HGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVL 681
+ DG++ + +G+ G P PTEVL L N+ E + E E+++
Sbjct: 398 GLEDGTVQNTTGSGNITIRSGGP---------PTEVLCLMNMI--ETSELEDDEEYEDIV 446
Query: 682 EDVRLECARFGSVKSVNV 699
EDVR EC+++G V+S+ +
Sbjct: 447 EDVRAECSKYGVVRSLEI 464
>gi|348526428|ref|XP_003450721.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 3
[Oreochromis niloticus]
Length = 458
Score = 116 bits (291), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 158/341 (46%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 79 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 122
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+++M+F N + G+ G+ P + I ++
Sbjct: 123 PVPVVGSQMTRQARRLYVGNIPFGITEESMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 181
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 182 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 241
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD +
Sbjct: 242 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLND 301
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLD------GSIMDNSGNPPFHGIPKHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A + SI G+ + PTEVL
Sbjct: 302 QAIAGLNGMQLGDKKLLVQRASVGSKNATLSSINQTPVTLQVPGLNSSVTQMGGLPTEVL 361
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 CLMNMVAPE--ELLDDEEYEEIVEDVRDECSKYGQVKSIEI 400
>gi|410905623|ref|XP_003966291.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Takifugu
rubripes]
Length = 458
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 156/341 (45%), Gaps = 53/341 (15%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 79 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 122
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+++M+F N + G+ G+ P + I ++
Sbjct: 123 PVPVVGSQMTRQARRLYVGNIPFGITEESMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 181
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 182 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 241
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD +
Sbjct: 242 PDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDVNLND 301
Query: 605 KAIAGLNGLKVGGQVLTAVQA------VLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVL 658
+AIAGLNG+++G + L +A SI G+ + PTEVL
Sbjct: 302 QAIAGLNGMQLGDKKLLVQRASXXXXXSFQTSINQTPVTLQVPGLNSSVTQMGGVPTEVL 361
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE L + E EE++EDVR EC ++G VKS+ +
Sbjct: 362 CLMNMVAPE--ELLDDEEYEEIVEDVRDECGKYGQVKSIEI 400
>gi|426196755|gb|EKV46683.1| hypothetical protein AGABI2DRAFT_206147 [Agaricus bisporus var.
bisporus H97]
Length = 558
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/397 (27%), Positives = 181/397 (45%), Gaps = 53/397 (13%)
Query: 368 RKRRTEAAAKTPSPINRSPEKKSAKWDV-AP-VETYSVPSNVHTSNQAASSNAHEMVSSD 425
++RR+ PS R ++K++ WDV AP E YS + QA + +
Sbjct: 117 QERRSPTPENCPSLSQR--KRKASGWDVHAPGYEQYS-------AMQAKQTGLFNL---- 163
Query: 426 PVTSTTQKP--LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEF 483
P + TQ P L+ + + + M +R RRL + ++ +E+ L +F
Sbjct: 164 PGANRTQIPPILSIPGLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITQEVNEQNLADF 223
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIK 543
N + + P + EK AFVEF +AEDA+AA+ DG F LKI+
Sbjct: 224 FNAKMAEMNIGTGITGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFINGPLKIR 283
Query: 544 RPKEF---------VEVA-------------IFIGGISRTLSSKMVMEIVCAFGPLKAYH 581
RPK++ V V +F+GG+ L+ + VME++ +FG LKA++
Sbjct: 284 RPKDYGGDTIVSPGVHVPGVVSTNVPDSINKVFVGGLPTYLNEEQVMELLKSFGELKAFN 343
Query: 582 F--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG-----SIMDN 634
E + AF EYVDQ VT AI LNG+++G + L +A + ++ N
Sbjct: 344 LVRENGTGTSKGFAFFEYVDQAVTDVAIQSLNGMELGDRYLVVQRASVGAKPGTPGMIPN 403
Query: 635 SGNPPFHGIPKHALPLLKKPT----EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECAR 690
F IP+ +P K T +L + N+ PE + E ++ +DV+ EC++
Sbjct: 404 LPYDQFPEIPRPIMPAGKDQTSSESRILLMLNMVTPEDLH--EDDEYGDLYDDVKAECSK 461
Query: 691 FGSVKSVNVVK-YGDSNISTIQACEGNENTASAGVGQ 726
+G ++ + + + S++ A + +AGVG+
Sbjct: 462 YGELEDLRIPRPVKKDKTSSMSAQDAQRIDEAAGVGR 498
>gi|291190480|ref|NP_001167275.1| Splicing factor U2AF 65 kDa subunit [Salmo salar]
gi|223648990|gb|ACN11253.1| Splicing factor U2AF 65 kDa subunit [Salmo salar]
Length = 474
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 141/277 (50%), Gaps = 38/277 (13%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++K AF+
Sbjct: 144 SQMTRQARRLYVGNIPFGITEEAMMDFFNAQMCLGGLTQAPGN-PVLAVQINQDKNFAFL 202
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------ 552
EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 203 EFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSESPSVYVPGVVSTVVPDSAHK 262
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
+FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD + +AIAGL
Sbjct: 263 LFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATALSKGYAFCEYVDVNLNDQAIAGL 322
Query: 611 NGLKVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIPKHALP-----LLKKPTEVLKLKN 662
NG+++G + L +A + + ++ + P +P +P L PTEVL L N
Sbjct: 323 NGMQLGDKKLLVQRASVGAKNAALTGMNQTPVTLQVPG-LMPTSMASLGGLPTEVLCLMN 381
Query: 663 VFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ E L + E EE++EDVR EC ++G VKS+ +
Sbjct: 382 MVAVE--ELLDDEEYEEIVEDVRDECGKYGQVKSIEI 416
>gi|409040470|gb|EKM49957.1| hypothetical protein PHACADRAFT_264412 [Phanerochaete carnosa
HHB-10118-sp]
Length = 575
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 176/383 (45%), Gaps = 62/383 (16%)
Query: 387 EKKSAKWDV-AP-VETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKP--LAGISVSA 442
++K++ WD+ AP E Y T+ QA + + P + TQ P LA +
Sbjct: 152 KRKASGWDIHAPGYEQY-------TAMQAKQTGLFNL----PGANRTQIPPILAVPGLPP 200
Query: 443 SLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPC 502
+ + M +R RRL + ++ +E+ L +F N + + P
Sbjct: 201 PMPVQSFGMGMGVNPNLSRQSRRLYIGSITPEINEQNLADFFNEKMKEMSIGTGAPGNPV 260
Query: 503 IGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF---------VEVA- 552
+ EK AFVEF +AEDA+AA+ DG F LKI+RPK++ + V
Sbjct: 261 LAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLSGPLKIRRPKDYGGSENLAPSMHVPG 320
Query: 553 ------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP---CAFIEY 597
+F+GG+ L+ + VME++ +FG LKA++ V E+ P AF EY
Sbjct: 321 VVSTNVPDSINKVFVGGLPPYLNEEQVMELLTSFGDLKAFNL-VRENGNGPSKGFAFFEY 379
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIPKHALP---LL 651
VD VT AI LN +++G + L +A + +G I + P IPK LP L
Sbjct: 380 VDPAVTDVAIQSLNEMELGDKYLVVQRASVGAKNGQIPPQALYP--TEIPKPILPAGDLE 437
Query: 652 KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV--------VKYG 703
+L + N+ PE + + E ++ EDV+ EC + G ++ + + K+G
Sbjct: 438 GVEARILLMLNMVVPEDLN--DDQEYADIYEDVKDECEKHGPIEDLRIPRPVKKDKAKWG 495
Query: 704 DSNISTIQACEGNENTASAGVGQ 726
+S + + A +E +AGVG+
Sbjct: 496 ESGLDPLSAQRVDE---AAGVGR 515
>gi|344277364|ref|XP_003410472.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Loxodonta
africana]
Length = 471
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 158/343 (46%), Gaps = 57/343 (16%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA ++ ++ + + T LA I V M
Sbjct: 92 WDVPPPGFEHITPMQYKAMQAAG----QIPATAFLPTMTPDGLAMIPTP-------VPMG 140
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+T R RRL V N+P +E+A+M+F N + GV P + I ++K
Sbjct: 141 GSQMT---RKARRLYVGNIPFGITEEAMMDFFN-IQMRLGVLTQAPGNPILAVQINQDKN 196
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-------------------- 552
AF+EF + ++ + A DG F G LKI+RP ++ +
Sbjct: 197 FAFLEFRSVDETTQATALDGIIFQGQSLKIRRPHDYQPLPSMSENLSAYMAGVASTVVPD 256
Query: 553 ----IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKA 606
+FI G+ L+ V E++ +FGPLKA+ + + A EYVD T +A
Sbjct: 257 SDHKLFIEGLPTYLNDDQVKELLTSFGPLKAFSLVKDSATGLSKGYAVCEYVDINDTDQA 316
Query: 607 IAGLNGLKVGGQVLTAVQAVLDGSIMDNSG----------NPPFHGIPKHALPLLKKPTE 656
AGLNG+++G + L VL GS+ +G P G+ + + PTE
Sbjct: 317 TAGLNGMQLGDKKLL----VLRGSVGAKNGTLSTINQVPVTPQVPGLRSSQVQMGGHPTE 372
Query: 657 VLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
VL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 373 VLCLMNMVLPE--ELLDDEEYEEIMEDVREECSKYGLVKSMEI 413
>gi|194884971|ref|XP_001976363.1| GG20057 [Drosophila erecta]
gi|190659550|gb|EDV56763.1| GG20057 [Drosophila erecta]
Length = 440
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 157/364 (43%), Gaps = 72/364 (19%)
Query: 382 INRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVS 441
+ R + WDV P E Y + + AS + D + + S
Sbjct: 45 LQRRLGRAPTSWDVPP-EGYGQLTPLQYKAMQASGQIASRIVPDAMPTGE---------S 94
Query: 442 ASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLS---SGVQHVHG 498
A++A + R RRL V N+P +E+ +M+F N LL+ +G Q++ G
Sbjct: 95 AAIAMIT------------RQARRLYVGNIPFGVTEEEMMKFFNQQLLALGLAGAQYMDG 142
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------ 552
+ C EK AF+EF + ++A+ AL DG F G +LKI+RP ++ V
Sbjct: 143 KA-VLTCQTNLEKNFAFLEFRSMDEATQALNFDGILFRGQVLKIRRPHDYQPVPSIRVSA 201
Query: 553 ----------------------------------IFIGGISRTLSSKMVMEIVCAFGPLK 578
IF+GG+ L + +++ +FG LK
Sbjct: 202 MESYRSFRLPDNTVTHPPLATIPLSSIVPDSPNKIFVGGLPTCLGQDQIRDLLQSFGELK 261
Query: 579 AYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG 636
+ + N + AF EY D VT AIAGL+G+++G + L VQ + G SG
Sbjct: 262 RLNLVKDTNTCLSKGFAFFEYFDPTVTDHAIAGLHGMQLGNRRLV-VQRSIPGGKHAVSG 320
Query: 637 NPPFHGIPKHALPL-LKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVK 695
P +P + L PTE++ L N+ PE L E E++ D+ ECA++G V+
Sbjct: 321 QQPLVQVPGISTLLDPGSPTEIICLLNMVLPE--ELLDNEEFEDIRTDIEQECAKYGEVR 378
Query: 696 SVNV 699
S+ +
Sbjct: 379 SIKI 382
>gi|380481793|emb|CCF41637.1| U2 snRNP auxilliary factor [Colletotrichum higginsianum]
Length = 550
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/403 (26%), Positives = 170/403 (42%), Gaps = 100/403 (24%)
Query: 363 GGYSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHT 409
G+ R+R+ A TP P R P +++ +WD+ P +V T
Sbjct: 124 AGFGRRERQRSA---TPPPKKREPTPDLTDITSVLERKRRLTQWDIKPPGYENV-----T 175
Query: 410 SNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCV 468
+ QA S + + Q+P+ + A + + ++S L SN R +RL +
Sbjct: 176 AEQAKLSGMFPLPGA-----PRQQPIDPSKLQAIMNQPGGQVNSAALKPSNSRQAKRLLI 230
Query: 469 ENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAAL 528
NLP SA+E+++ F N L +G+ + + PC C + ++ A VEF A +A+ AL
Sbjct: 231 NNLPPSATEESIQSFFN--LQLNGLNIIESADPCTSCQVSKDNSFAVVEFRNASEATIAL 288
Query: 529 CCDGCSF----------SGSILKIKRPKEFVEVAIF----------------------IG 556
DG S + L I RPK+++ A+ I
Sbjct: 289 ALDGISMEADDATNGEAANQGLSIHRPKDYIVPAVVDDVPYEPGVVSNVVIDTPSKLSIA 348
Query: 557 GISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLK 614
+ LS + V E++ +FG LKA+ + EE AF EYVD T AI GLNG+
Sbjct: 349 NLPTYLSDEQVSELLVSFGELKAFVLVRDRSTEESRGIAFCEYVDPAATDVAIQGLNGMD 408
Query: 615 VGGQVL------------------TAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE 656
+G + L A ++L G+ +SG T
Sbjct: 409 LGDKKLRVQKASVGVTQVAGVEMGVAAMSMLAGTTSTDSGE-----------------TR 451
Query: 657 VLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
VL+L N+ PE + + EE+ EDV+ EC++FG+V + +
Sbjct: 452 VLQLLNMVTPEEL--MDNDDYEEIKEDVQEECSKFGNVLDIKI 492
>gi|430813569|emb|CCJ29085.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 545
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 170/366 (46%), Gaps = 63/366 (17%)
Query: 381 PINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSN---------AHEMVSSDPVTSTT 431
PIN ++ + WDV P ++ T+ QA S + P+
Sbjct: 139 PINER-RRRMSMWDVKPTGYEAI-----TAEQAKMSGKTILNIFLLWLRVPGLFPLPGAP 192
Query: 432 QKPLAGISVSASLAKLNVSMD-----SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNN 486
++ + +S +++ K +M+ ++Q QS R RR+ V N+P E+ L+ F N+
Sbjct: 193 RQSMMDLSKLSTVHKGPGAMNIPNPQALQPLQS-RQSRRIHVGNIPQPIDEEHLVNFFND 251
Query: 487 FLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPK 546
+ V G P I + EKG AF+EF EDA+ A+ DG S+ + LKI+RP
Sbjct: 252 TMSCLNVT-TSGDNPVISAQVNHEKGYAFLEFRQPEDATVAIGFDGISYMNNSLKIRRPM 310
Query: 547 EFVEVA-----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF- 582
+++ I IGG+ L + V+E++ +FG LKA++
Sbjct: 311 DYIVPQMPTDDGSYVPGVISTNFTDTPNKIHIGGLPTYLDDEQVIELLKSFGELKAFNLI 370
Query: 583 -EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPP 639
+ + + AF EYVD VT A GLNG+++G ++L +A + + SG
Sbjct: 371 KDAATNESKGFAFCEYVDPDVTDIACEGLNGMELGDKILVVKRASIGTKQKPISTSGG-- 428
Query: 640 FHGIPKHALPLLK------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGS 693
GI ++ +L +PT VL++ N+ PE + E EE+ ED+R EC+++G
Sbjct: 429 --GI--ASITMLAEEEGQLRPTRVLQMFNMVTPEELQ--DDDEYEEISEDIRDECSKYGK 482
Query: 694 VKSVNV 699
V + +
Sbjct: 483 VLDLKI 488
>gi|291241059|ref|XP_002740425.1| PREDICTED: U2 (RNU2) small nuclear RNA auxiliary factor 2-like
[Saccoglossus kowalevskii]
Length = 466
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 161/362 (44%), Gaps = 63/362 (17%)
Query: 380 SPINRSPEKKSAK----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPL 435
+P +RS K K WD+ P + + + Q A Q P
Sbjct: 69 TPPSRSARPKRKKPFMYWDIPPPGFEHIAPLQYKAMQGAG----------------QIPQ 112
Query: 436 AGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQH 495
+ + A N +M V +Q R RRL V N+P +E+A+M+F N + S +
Sbjct: 113 TALENQMAQAAANSNMPIVG-SQMTRQARRLYVGNIPFGVTEEAMMDFFNRQMKSFRITQ 171
Query: 496 VHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--- 552
G+ P + I K AF+EF + ++ + A+ DG F G LKI+RPK++ V
Sbjct: 172 AQGN-PVLAVQIDLNKNFAFLEFRSVDETTQAMAFDGILFQGQSLKIRRPKDYQPVPGMA 230
Query: 553 ----------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF-- 582
IFIGG+ L+ V E++ +FG LKA++
Sbjct: 231 EMPSVHVPDYLFSPTGVVSTVVPDSPHKIFIGGLPNYLNEDQVKELLTSFGELKAFNLVK 290
Query: 583 EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG-----SIMDNSGN 637
+ + AF EY+D+ +T +AIAGLNG+++G + L +A + + M N
Sbjct: 291 DSATSLSKGYAFCEYIDEKITDQAIAGLNGMQLGDKKLIVQRASVGAKNAQTAQMIAQLN 350
Query: 638 PPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSV 697
G+ L+ TEVL L N+ P+ E EE+L+DVR EC ++G V+S+
Sbjct: 351 IQVPGV-NIGQGLVGPTTEVLCLMNMVTPDELQDEE--EYEEILDDVRQECGKYGQVRSL 407
Query: 698 NV 699
+
Sbjct: 408 EI 409
>gi|443898020|dbj|GAC75358.1| splicing factor U2AF, large subunit [Pseudozyma antarctica T-34]
Length = 699
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 139/306 (45%), Gaps = 42/306 (13%)
Query: 432 QKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSS 491
Q P AG A N ++ + S R RRL V N+ A+E +++ F N +L
Sbjct: 300 QPPFAGQYQQGHPASHNQALATAD---SGRQARRLYVGNITHQANEPSMVAFFNEQMLKL 356
Query: 492 GVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFV-- 549
+ G P + + +KG AFVEF ++A+ A+ DG F G LKI+RPK++
Sbjct: 357 KLGTEPGE-PAVSAQVNVDKGYAFVEFRHPDEATNAMSFDGIVFQGQSLKIRRPKDYTGP 415
Query: 550 ---------------------EVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNE 586
IF+GG+ L+ V+E++ AFG L++++ +
Sbjct: 416 DVRPPSSIHVPGVISTNVPDSPFKIFVGGLPTYLTDDQVIELLQAFGELRSFNLVKDPAT 475
Query: 587 DHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG-----SIMDNSGNPPFH 641
+ + AF EYVD +T A GLNG+++G + L +A + +I N
Sbjct: 476 NASKGFAFCEYVDTALTDLACQGLNGMELGDRNLVVQRASVGSEKKAQAIAAYGANVGAL 535
Query: 642 GIPKHALPL------LKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVK 695
G+P +PT + + N+ PE + E +++ED+R EC +FG+V
Sbjct: 536 GVPSSVQQFAGAGGDAGEPTSCMVMLNMVTPEELQ--DDEEYADIVEDIRDECTKFGTVN 593
Query: 696 SVNVVK 701
V V +
Sbjct: 594 DVRVPR 599
>gi|302685922|ref|XP_003032641.1| hypothetical protein SCHCODRAFT_75908 [Schizophyllum commune H4-8]
gi|300106335|gb|EFI97738.1| hypothetical protein SCHCODRAFT_75908 [Schizophyllum commune H4-8]
Length = 556
Score = 113 bits (283), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 146/315 (46%), Gaps = 60/315 (19%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
+R RRL + ++ +E L EF N+ + + P + EK AFVEF
Sbjct: 190 SRQSRRLYIGSITPEINEHNLAEFFNSKMTEMNIGTGGPGNPVLAVQCNYEKNYAFVEFR 249
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEF-VEVA---------------------IFIGG 557
+A+DA+AA+ DG F LKI+RPK++ + VA IF+GG
Sbjct: 250 SADDATAAMAFDGIIFLNGPLKIRRPKDYDISVASAPMIHVPGIISTNVPDSANKIFVGG 309
Query: 558 ISRTLSSKMVMEIVCAFGPLKAYHFEVNED----HEEPCAFIEYVDQLVTPKAIAGLNGL 613
+ L+ + V E++ +FG LKA++ V E ++ AF EYVD VT AI LNG+
Sbjct: 310 LPAYLNEEQVQELLTSFGELKAFNL-VRETGTGASKQGYAFFEYVDPNVTDVAIQSLNGM 368
Query: 614 KVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIPKHALPLLKKP------------TEVL 658
++G + L +A + DG+I + +P +P + KP VL
Sbjct: 369 ELGDRFLVVQRASVGAKDGTIPN---------LPAELMPAIPKPIMPAGQTDTSGDARVL 419
Query: 659 KLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV-------KYGDSNISTIQ 711
+ N+ P+ + + E ++LED++ EC++FG V+ + V K+
Sbjct: 420 LMLNMVTPDDL--VDDDEYGDLLEDIKEECSKFGPVEDLRVPRPVKKEKKWAPGEGGREA 477
Query: 712 ACEGNENTASAGVGQ 726
A E +AGVG+
Sbjct: 478 AVEAQRADEAAGVGR 492
>gi|170088030|ref|XP_001875238.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164650438|gb|EDR14679.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 370
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 85/270 (31%), Positives = 137/270 (50%), Gaps = 38/270 (14%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL + ++ +E+ L +F N+ ++ + P + EK AFVEF +AED
Sbjct: 29 RRLYIGSITPEVNEQNLADFFNSKMIEMSIGTGGPGNPVLAVQCNYEKNYAFVEFRSAED 88
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF--VEVA--------------------IFIGGISRT 561
A+AA+ DG F LKI+RPK++ +E+A IF+GG+
Sbjct: 89 ATAAMAFDGIIFINGPLKIRRPKDYGGMEIASPGVHVPGVVSTNVPDSINKIFVGGLPTY 148
Query: 562 LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQ 618
L+ + VME++ +FG LKA++ V E+ P AF EYVD VT AI LNG+++G +
Sbjct: 149 LNEEQVMELLKSFGDLKAFNL-VRENGNGPSKGFAFFEYVDVGVTDVAIQSLNGMELGDR 207
Query: 619 VLTAVQAVLDGSIMDNSGNPP------FHGIPKHALPLLKKP---TEVLKLKNVFNPEGF 669
L VQ G+ G P F IP+ +P K P + +L + N+ P+
Sbjct: 208 YLV-VQRASVGAKPGTPGMIPNLPYDQFPEIPRPIMPAGKDPATDSRILLMLNMVTPDDL 266
Query: 670 SSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + E ++ EDV+ EC+ +G+V+ + +
Sbjct: 267 T--DDQEYGDLYEDVKEECSNYGAVEDLRI 294
>gi|327280717|ref|XP_003225098.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like isoform 2
[Anolis carolinensis]
Length = 467
Score = 113 bits (282), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 162/352 (46%), Gaps = 64/352 (18%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P + + + QAA Q P + + + L V+
Sbjct: 77 WDVPPPGFEHITPMQYKAMQAAG----------------QIPATALLPTMTPDGLAVTPT 120
Query: 453 SVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
V + +Q R RRL V N+P +E+A+M+F N + G+ G+ P + I ++
Sbjct: 121 PVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQD 179
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 180 KNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVV 239
Query: 553 ------IFIGGISRTLSSKMVM-----------EIVCAFGPLKAYHF--EVNEDHEEPCA 593
+FIGG+ L+ VM E++ +FGPLKA++ + + A
Sbjct: 240 PDSAHKLFIGGLPNYLNDDQVMFLPPFLSCQVKELLTSFGPLKAFNLVKDSATGLSKGYA 299
Query: 594 FIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHA 647
F EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 300 FCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQ 359
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 360 VQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECSKYGVVKSIEI 409
>gi|367042858|ref|XP_003651809.1| hypothetical protein THITE_2112508 [Thielavia terrestris NRRL 8126]
gi|346999071|gb|AEO65473.1| hypothetical protein THITE_2112508 [Thielavia terrestris NRRL 8126]
Length = 563
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 100/380 (26%), Positives = 175/380 (46%), Gaps = 69/380 (18%)
Query: 367 PRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDP 426
PRKR PI +++ +WD+ P +V T+ QA S + +
Sbjct: 148 PRKREPTPDLTDVVPILER-KRRLTQWDIKPPGYDNV-----TAEQAKLSGMFPLPGA-- 199
Query: 427 VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLN 485
Q+ + + A + + S++S L +N R +RL VENLP SA+E++++ F+N
Sbjct: 200 ---PRQQAMDPTKLQAFMNQPGGSVNSAALRPTNSRQSKRLVVENLPASATEESMVNFIN 256
Query: 486 NFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSG-------- 537
L +G+ + + PC+ C+I ++ A +EF + DA+ AL DG S
Sbjct: 257 --LQLNGLNVIENTDPCLQCLIAPDRSFAMLEFRNSPDATVALAFDGISMEADDAHAANG 314
Query: 538 -----SILKIKRPKEFVEVAIF------------------------IGGISRTLSSKMVM 568
+ L+I+RPK+++ A+ + + L+ VM
Sbjct: 315 NGAAPAGLRIRRPKDYIVPAVVEDPNYDPDSDVPSSVVIDSPNKISVTNLPLYLTDDQVM 374
Query: 569 EIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAV 626
E++ +FG LK++ + +E AF+EYVD VT AI GLN + +G + L +A
Sbjct: 375 ELLVSFGKLKSFVLVKDNGTQESRGIAFLEYVDPSVTNVAIQGLNNMMLGERALKVQKAS 434
Query: 627 LDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEE 679
+ I +G G+ +A+ +L T VL+L N+ + + + EE
Sbjct: 435 I--GITQVAGE---MGV--NAMSMLAGTTSTDSDVSRVLQLLNMVTADEL--MDNDDYEE 485
Query: 680 VLEDVRLECARFGSVKSVNV 699
+ +DV+ EC +FG+V S+ +
Sbjct: 486 IRDDVQDECEKFGTVLSLKI 505
>gi|388856534|emb|CCF49840.1| related to pre-mRNA splicing factor U2AF large chain [Ustilago
hordei]
Length = 718
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 84/307 (27%), Positives = 139/307 (45%), Gaps = 42/307 (13%)
Query: 434 PLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGV 493
P G ++ + + V +NR RRL V N+ SA+E ++ F N +L +
Sbjct: 313 PQHGFQPGNGASQPAPTPEEVAQQNNNRQARRLYVGNITHSANEPNMVAFFNEQMLKLKL 372
Query: 494 QHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----- 548
G P + + +KG AFVEF E+A+ A+ DG F G LKI+RPK++
Sbjct: 373 GTEPGE-PAVSAQVNVDKGYAFVEFRHPEEATNAMSFDGIVFQGQSLKIRRPKDYTGPDV 431
Query: 549 -----VEVA-------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDH 588
+ V IF+GG+ L+ V+E++ AFG L+A++ +
Sbjct: 432 RPASNIHVPGVISTNVPDSPHKIFVGGLPTYLTDDQVIELLQAFGELRAFNLVKDTANGA 491
Query: 589 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD--------GSIMDNSGNPPF 640
+ AF EYVD +T A GLNG+++G + L +A + + N+G
Sbjct: 492 SKGFAFCEYVDTALTDLACQGLNGMELGDRNLVVQRASVGSEKKAQAIAATGANAGALGD 551
Query: 641 HGIPKHALPL------LKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV 694
G+P +P + + N+ PE + E +++ED+R EC ++G+V
Sbjct: 552 AGMPSSVQQFAGEGGDAGEPRSCMVMLNMVTPEELQ--DDEEYADIVEDIREECTKYGTV 609
Query: 695 KSVNVVK 701
V V +
Sbjct: 610 TDVRVPR 616
>gi|389745686|gb|EIM86867.1| hypothetical protein STEHIDRAFT_57258 [Stereum hirsutum FP-91666
SS1]
Length = 417
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 82/274 (29%), Positives = 133/274 (48%), Gaps = 45/274 (16%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL + ++ +E+ L +F N+ ++ + P + EK AFVEF +AED
Sbjct: 64 RRLYIGSITPDITEQNLTDFFNSKMIEMNIGTGAPGPPVLAVQCNYEKNYAFVEFRSAED 123
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF--VEVA--------------------IFIGGISRT 561
A+AA+ DG F LKI+RPK++ +E+ IF+GG+
Sbjct: 124 ATAAMAFDGIIFVNGPLKIRRPKDYGGMEMPAPPLHVPGVVSTNVPDSPHKIFVGGLPSY 183
Query: 562 LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQ 618
L+ + VME++ +FG LKA++ V E+ P AF EYVD VT AI L+G+++G +
Sbjct: 184 LNEEQVMELLKSFGDLKAFNL-VRENGNGPSKGFAFFEYVDPEVTDVAIQSLSGMELGDR 242
Query: 619 VLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKP-------------TEVLKLKNVFN 665
L +A S+ +G P +P P + +P + +L + N+
Sbjct: 243 YLVVQRA----SVGAKAGQPGMPNLPYDQFPEIPRPILPAGASDLSSANSRILLMLNMVT 298
Query: 666 PEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PE + + E ++LED+R E A +G V V +
Sbjct: 299 PEDL--IDDSEYADLLEDIREEVANYGDVDDVRI 330
>gi|449547880|gb|EMD38847.1| hypothetical protein CERSUDRAFT_81656 [Ceriporiopsis subvermispora
B]
Length = 476
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 107/396 (27%), Positives = 182/396 (45%), Gaps = 65/396 (16%)
Query: 377 KTPSPINRSP----EKKSAKWDV-AP-VETYSVPSNVHTSNQAASSNAHEMVSSDPVTST 430
++P+P + P ++K++ WDV AP E Y T+ QA + + P +
Sbjct: 36 RSPTPPSAVPISQRKRKASGWDVHAPGYEQY-------TAMQAKQTGLFNL----PGANR 84
Query: 431 TQKP--LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFL 488
TQ P LA + + + M +R RRL + ++ +E+ L EF N +
Sbjct: 85 TQIPPILAIPGLPPPMPVQSFGMGIGGNPNLSRQSRRLYIGSITPDINEQNLAEFFNGKM 144
Query: 489 LSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF 548
+ P + EK AFVEF +AEDA+AA+ DG F LKI+RPK++
Sbjct: 145 KEMDIGTGAPGNPVLAVQCNYEKNYAFVEFRSAEDATAAMAFDGIIFLNGPLKIRRPKDY 204
Query: 549 ---------VEVA-------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNE 586
+ V +F+GG+ L+ + VME++ +FG LKA++ V E
Sbjct: 205 GGPDVLAPMMHVPGVVSTNVPDSANKVFVGGLPMYLNEEQVMELLKSFGELKAFNL-VRE 263
Query: 587 DHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG-----SIMDNSGNP 638
+ P AF EYVD VT AI L+G+++G + L +A + I + G+P
Sbjct: 264 NGNGPSKGFAFFEYVDPSVTDVAIQSLSGMELGDKYLVVQRASVGAKPGQSPIDEMYGSP 323
Query: 639 PFHGIPKHALP---LLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVK 695
IP+ +P + + +L + N+ PE + E ++ ED+ EC R+G+V+
Sbjct: 324 --APIPRPIMPATDIDSTQSRILLMLNMVVPEELQ--DDQEYADIYEDITEECGRYGAVE 379
Query: 696 SVNV--------VKYGDSNISTIQACEGNENTASAG 723
+ + K+G++ + + +A + + A G
Sbjct: 380 DLRIPRPVKRDKAKWGENGMDSARAAQLADEAAGVG 415
>gi|156404394|ref|XP_001640392.1| predicted protein [Nematostella vectensis]
gi|156227526|gb|EDO48329.1| predicted protein [Nematostella vectensis]
Length = 332
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 131/276 (47%), Gaps = 42/276 (15%)
Query: 461 RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLT 520
R RRL V N+P +E ++EF N + + + G+ P I I E+ AF+E +
Sbjct: 3 RQARRLYVGNIPFGVTENLMIEFFNAKMKEAKLNTAPGN-PVIAAQINTEQNFAFIELRS 61
Query: 521 AEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIG 556
E+ + A+ DG G LKI+RPK++ + IFIG
Sbjct: 62 VEETTQAMAFDGIILQGQALKIRRPKDYQPIPGMSENASVHVPGVVSTVVPDSPHKIFIG 121
Query: 557 GISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLK 614
G+ L+ V E++ +FG L+A++ + + AF EYVD +T AI GLNG++
Sbjct: 122 GLPNYLNEDQVKELLSSFGELRAFNLVKDSATGLSKGYAFCEYVDLGITDVAIQGLNGMQ 181
Query: 615 VGGQVLTAVQAVLDGSIMDNSGNP-PFHGIPKH----------ALPLLKKPTEVLKLKNV 663
+G + L +A + N NP + +P A+P TEVL L N+
Sbjct: 182 LGDKKLIVQRASVGAK--QNLNNPQAMNMVPAQLQIPGLDISMAVPGAVAATEVLALMNM 239
Query: 664 FNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
P+ + E EE+ +DVR EC+++G VKS+ +
Sbjct: 240 VTPDELG--DDEEFEEIYDDVREECSKYGRVKSMEI 273
>gi|347842431|emb|CCD57003.1| hypothetical protein [Botryotinia fuckeliana]
Length = 606
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 165/382 (43%), Gaps = 82/382 (21%)
Query: 378 TPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS 424
TP P R P +++ +WD+ P +V T+ QA S + +
Sbjct: 189 TPPPKKREPTPDLTDVVSVLERKRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA 243
Query: 425 DPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A +A+ + + + L SN R +RL V N+P E+ ++ F
Sbjct: 244 -----PRQQPMDPSKLQAFMAQPSGQVTNAALKPSNSRQSKRLLVHNIPADTKEETIVSF 298
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI---- 539
N L +G+ + GS P I + ++ A +EF T DA+ AL DG + G+
Sbjct: 299 FN--LQLNGLNVIEGSDPLISAQVSKDGSFALLEFKTQSDATVALALDGITMDGNDHMET 356
Query: 540 ---------LKIKRPKEFVEVA----------------------IFIGGISRTLSSKMVM 568
L I+RPK+++ A I + I L+ + V
Sbjct: 357 GNGSADTRGLSIRRPKDYIVPAVTDETPFEPGVISNVVADTQNKISVTNIPHYLNDEQVT 416
Query: 569 EIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAV 626
E++ +FG LKA+ + N D AF EYVD T A+ GLNG+++G + L +A
Sbjct: 417 ELLVSFGELKAFVLVKDSNTDESRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRAS 476
Query: 627 LDGSIMDNSGNPPFHGIPK--HALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ GN G+ +A+ +L T VL+L N+ PE + +
Sbjct: 477 I--------GNTQVSGLEMSVNAMSMLAGTTSQDLENGRVLQLLNMVTPEEL--IDNEDY 526
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
EE+ EDV+ EC ++G V + V
Sbjct: 527 EEICEDVKEECEKYGKVLEMKV 548
>gi|384939342|gb|AFI33276.1| splicing factor U2AF 65 kDa subunit isoform b [Macaca mulatta]
Length = 471
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 166/354 (46%), Gaps = 53/354 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPK 645
AF EYVD VT +AIAGLNG+++G + L +A + + + P G+
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMS 361
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++E+VR EC+++G VKS+ +
Sbjct: 362 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEEVRDECSKYGLVKSIEI 413
>gi|154319442|ref|XP_001559038.1| hypothetical protein BC1G_02202 [Botryotinia fuckeliana B05.10]
Length = 596
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 165/382 (43%), Gaps = 82/382 (21%)
Query: 378 TPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS 424
TP P R P +++ +WD+ P +V T+ QA S + +
Sbjct: 179 TPPPKKREPTPDLTDVVSVLERKRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA 233
Query: 425 DPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A +A+ + + + L SN R +RL V N+P E+ ++ F
Sbjct: 234 -----PRQQPMDPSKLQAFMAQPSGQVTNAALKPSNSRQSKRLLVHNIPADTKEETIVSF 288
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI---- 539
N L +G+ + GS P I + ++ A +EF T DA+ AL DG + G+
Sbjct: 289 FN--LQLNGLNVIEGSDPLISAQVSKDGSFALLEFKTQSDATVALALDGITMDGNDHMET 346
Query: 540 ---------LKIKRPKEFVEVA----------------------IFIGGISRTLSSKMVM 568
L I+RPK+++ A I + I L+ + V
Sbjct: 347 ENGSADTRGLSIRRPKDYIVPAVTDETPFEPGVISNVVADTQNKISVTNIPHYLNDEQVT 406
Query: 569 EIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAV 626
E++ +FG LKA+ + N D AF EYVD T A+ GLNG+++G + L +A
Sbjct: 407 ELLVSFGELKAFVLVKDSNTDESRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRAS 466
Query: 627 LDGSIMDNSGNPPFHGIPK--HALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ GN G+ +A+ +L T VL+L N+ PE + +
Sbjct: 467 I--------GNTQVSGLEMSVNAMSMLAGTTSQDLENGRVLQLLNMVTPEEL--IDNEDY 516
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
EE+ EDV+ EC ++G V + V
Sbjct: 517 EEICEDVKEECEKYGKVLEMKV 538
>gi|156061663|ref|XP_001596754.1| hypothetical protein SS1G_02977 [Sclerotinia sclerotiorum 1980]
gi|154700378|gb|EDO00117.1| hypothetical protein SS1G_02977 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 518
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 163/360 (45%), Gaps = 69/360 (19%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+P+ + A +A+
Sbjct: 126 KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA-----PRQQPMDPSKLQAFMAQ 175
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S+ + L SN R +RL V N+P E+ L+ F N L +G+ + GS PCI
Sbjct: 176 PSGSVTNAALKPSNSRQSKRLLVHNIPADTKEETLVGFFN--LQLNGLNVIEGSDPCISA 233
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSF---------SGSI----LKIKRPKEFVEVA 552
+ ++ A +EF T DA+ AL DG + +GS L I+RPK+++ A
Sbjct: 234 QVSKDGSFALLEFKTQSDATVALALDGITMENNDHMVTGNGSADTQGLSIRRPKDYIVPA 293
Query: 553 ----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDH 588
I + I L+ + V E++ +FG LKA+ + N D
Sbjct: 294 VTDETPFEPGVVSNIVPDTQNKISVANIPHYLNDEQVTELLVSFGELKAFVLVKDSNTDE 353
Query: 589 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK--H 646
AF EYVD T A+ GLNG+++G + L +A + GN G+ +
Sbjct: 354 SRGIAFCEYVDPAATDIAVEGLNGMELGDKHLKVQRASI--------GNTQVSGLEMSVN 405
Query: 647 ALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
A+ +L T VL+L N+ PE + + EE+ EDV+ EC ++G V + V
Sbjct: 406 AMSMLAGTTSQDLENGRVLQLLNMVTPEEL--IDNEDYEEICEDVKEECEKYGKVLEMKV 463
>gi|195057468|ref|XP_001995263.1| GH23055 [Drosophila grimshawi]
gi|193899469|gb|EDV98335.1| GH23055 [Drosophila grimshawi]
Length = 453
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 166/374 (44%), Gaps = 91/374 (24%)
Query: 383 NRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSA 442
+R + S+ WDV PV + + S QA+ A +V + T SA
Sbjct: 69 SRKCRRTSSAWDVPPVGYEHLSPVQYKSMQASGQIALRIVPNALPTGE----------SA 118
Query: 443 SLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSL-- 500
S+A + R RRL V N+P + ++ + F N +Q + G+L
Sbjct: 119 SIATVT------------RQARRLYVGNIPFNTTDDEMRAFFN-----VQIQRMCGALEN 161
Query: 501 ---PCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------- 548
+ C EK AF+E + ++ + A+ DG ++ G LKI+RP ++
Sbjct: 162 DGKAVLTCQTNLEKNFAFLELRSMDETTLAISFDGINYRGQSLKIRRPHDYHAGGTTGSF 221
Query: 549 ---------------VEVA---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEV 584
+A I+IGG+ L+ V E++ FG L+ ++
Sbjct: 222 VGATGYVSGAVVQSNAAIATVVPDTPHKIYIGGLPTCLNDDQVKELLMTFGHLRGFNMVK 281
Query: 585 NE-DHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGI 643
+E H + AF EY+D +T +AIAGLNG+++G + L VQ L G +
Sbjct: 282 DELGHGKGYAFCEYMDASITEQAIAGLNGMQLGERKLI-VQRSLAG----------VRNL 330
Query: 644 PKHALPLLKKP-----------TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG 692
H LP+L+ P TEVL L N+ P+ L + E E++ +D++ ECA+FG
Sbjct: 331 VTHQLPVLQVPGFPADVKVGKATEVLCLLNMVMPDEL--LDDAEYEDIRKDIKEECAKFG 388
Query: 693 SVKSVNVVK-YGDS 705
V S+ + + +G+S
Sbjct: 389 KVISIKIPRPFGES 402
>gi|384939340|gb|AFI33275.1| splicing factor U2AF 65 kDa subunit isoform a [Macaca mulatta]
Length = 475
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 167/359 (46%), Gaps = 59/359 (16%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 PSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKG 301
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP-----------F 640
AF EYVD VT +AIAGLNG+++G + L VQ G+ +PP
Sbjct: 302 YAFCEYVDINVTDQAIAGLNGMQLGDKKLL-VQRASVGAKNATLVSPPSTINQTPVTLQV 360
Query: 641 HGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G+ + + PTEVL L N+ PE L + E EE++E+VR EC+++G VKS+ +
Sbjct: 361 PGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEEVRDECSKYGLVKSIEI 417
>gi|354486866|ref|XP_003505598.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Cricetulus
griseus]
gi|344242983|gb|EGV99086.1| Splicing factor U2AF 65 kDa subunit [Cricetulus griseus]
Length = 469
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 162/352 (46%), Gaps = 61/352 (17%)
Query: 387 EKKSAK--WDVAPVETYSVPSNVHTSNQAA----SSNAHEMVSSD-PVTSTTQKPLAGIS 439
EKK + WDV P + + + QAA ++ V+SD V S P+ G
Sbjct: 82 EKKKVRKYWDVPPPGFEHITPMQYKAMQAAGQIPATALLPTVTSDGLVASPMPVPVVG-- 139
Query: 440 VSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS 499
+Q R RRL V N+P +E+A+ +F N + + V G+
Sbjct: 140 -----------------SQMTRQARRLYVGNIPFGITEEAMKDFFNAQMQLGVLTQVPGN 182
Query: 500 LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------- 552
P + I +EK AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 -PVLAVQINQEKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPS 241
Query: 553 -----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCA 593
+FIGG+ L+ V E++ +FG LKA++ + + A
Sbjct: 242 VYVPGVVSTVVPDSAHKLFIGGMPSYLNDDKVKELLTSFGTLKAFNLVKDSATGLSKGYA 301
Query: 594 FIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD------GSIMDNSGNPPFHGIPKHA 647
F EY+D T +AIAGLNG+++G + L +A + +I G+
Sbjct: 302 FCEYLDINATDQAIAGLNGMQLGDKKLIVQRASVGSKNATLSTINQTPVTVQVPGLMSSQ 361
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ P+ L + E EE++EDVR EC+++G VKS+ +
Sbjct: 362 VQMGGHPTEVLCLMNMVLPK--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 411
>gi|340373805|ref|XP_003385430.1| PREDICTED: splicing factor U2AF 50 kDa subunit-like [Amphimedon
queenslandica]
Length = 529
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 129/273 (47%), Gaps = 34/273 (12%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
+Q R RRL + N+P +E+ ++ F N +L + + G +P + I +K AF+
Sbjct: 197 SQLTRQARRLYIGNIPFGIAEEVMVNFFNEKMLEAKLCSAPG-IPVLAVQINMDKNFAFI 255
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-----------------------I 553
EF + E+ + A+ DG G LKI+RPK++ + +
Sbjct: 256 EFRSVEETTNAMAFDGIVLQGQSLKIRRPKDYAPIPGVDIMPKHVPGVISTVVPDGPHKV 315
Query: 554 FIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLN 611
F GG+ LS V E++ +FG LKA++ + + F EY+D VT AI GLN
Sbjct: 316 FCGGLPTYLSDDQVKELLSSFGDLKAFNLVKDSGTSFSKGYCFFEYLDTDVTDGAIQGLN 375
Query: 612 GLKVGGQVLTAVQAVLDGSIM---DNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEG 668
G+ +G + L +A + +M D S + +P ++P L+ P+ V
Sbjct: 376 GMALGDKKLVVQRASVGAKVMEEYDISTDITSMAMP-ISIPGLQMPSTAQTATTVLCLMN 434
Query: 669 FSSLSEL----EVEEVLEDVRLECARFGSVKSV 697
++ EL E E +LEDVR EC+ +G V SV
Sbjct: 435 MTTEEELRDDDEYEGILEDVREECSNYGQVLSV 467
>gi|171684585|ref|XP_001907234.1| hypothetical protein [Podospora anserina S mat+]
gi|170942253|emb|CAP67905.1| unnamed protein product [Podospora anserina S mat+]
Length = 585
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 165/358 (46%), Gaps = 67/358 (18%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
++ +WD+ P +V T+ QA S + + Q+ + + A +++
Sbjct: 191 RRLTQWDIKPPGYDNV-----TAEQAKLSGMFPLPGA-----PRQQAMDPTKLQAFMSQP 240
Query: 448 NVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
+++S L +N R +RL + N+P SA++ +++ F N L +G+ + + PC+ C
Sbjct: 241 GGAVNSAALKPTNSRQAKRLILSNIPASATDDSIVNFFN--LQLNGLNVIEQTDPCLLCN 298
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSF------------SGSILKIKRPKEFVEVAIF 554
I ++ A +EF DA+ AL DG + + + LKI+RPK+++ AI
Sbjct: 299 ISPDRSFAMLEFRNNTDATVALALDGITMDADDHQANGNGAAATGLKIRRPKDYIVPAIV 358
Query: 555 ------------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEE 590
+ I L+ VME++ +FG LK++ F + +E
Sbjct: 359 EDPNYDPDSSVPSTNVVDGPNKISVTNIPPYLTEDQVMELLVSFGKLKSFVFVKDNGTQE 418
Query: 591 P--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHAL 648
P AF+EY D VT AI+GLN + +G + L +A + I +G + +A+
Sbjct: 419 PRGIAFLEYADSSVTDVAISGLNNMMLGEKALKVQKASI--GITQVAGE-----LSVNAM 471
Query: 649 PLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+L T VL+L N+ + + + EE+ +DV+ EC +FG + S+ +
Sbjct: 472 SMLAGTTPSDNDAGRVLQLLNMVTADEL--MDNDDYEEIRDDVQEECEKFGKILSLKI 527
>gi|71022561|ref|XP_761510.1| hypothetical protein UM05363.1 [Ustilago maydis 521]
gi|46101379|gb|EAK86612.1| hypothetical protein UM05363.1 [Ustilago maydis 521]
Length = 727
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 133/287 (46%), Gaps = 52/287 (18%)
Query: 459 SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEF 518
+NR RRL V N+ A+E+ ++ F N +L + G P + + +KG AFVEF
Sbjct: 348 ANRQARRLYVGNITHQANEQNIVAFFNEQMLKLKLGTEPGE-PAVSAQVNVDKGYAFVEF 406
Query: 519 LTAEDASAALCCDGCSFSGSILKIKRPKEF----------VEVA-------------IFI 555
E+A+ A+ DG F LKI+RPK++ + V IF+
Sbjct: 407 RHPEEATNAMSFDGIVFQAQSLKIRRPKDYTGPDIRPPSNIHVPGVISTNVPDSPHKIFV 466
Query: 556 GGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGL 613
GG+ L+ V+E++ AFG L+A++ + + AF EYVD +T A GLNG+
Sbjct: 467 GGLPTYLNDDQVIELLQAFGELRAFNLVKDTGTGASKGFAFCEYVDPALTDLACQGLNGM 526
Query: 614 KVG----------------GQVLTAVQA---VLDGSIMDNSGNPPFHGIPKHALPLLKKP 654
++G Q L A A L G+ + +S F G A +P
Sbjct: 527 ELGDRNLVVQRASVGSEKKAQALAATGANMGALGGAAVPSSVQ-KFAGDGGDA----GEP 581
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK 701
T + + N+ PE + E +++ED+R EC ++G+V V + +
Sbjct: 582 TTCMVMLNMVTPEELQ--DDEEYADIVEDIRDECNKYGAVSDVRIPR 626
>gi|409081516|gb|EKM81875.1| hypothetical protein AGABI1DRAFT_70425 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 375
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/302 (29%), Positives = 144/302 (47%), Gaps = 38/302 (12%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
+R RRL + ++ +E+ L +F N + + P + EK AFVEF
Sbjct: 17 SRQSRRLYIGSITQEVNEQNLADFFNAKMAEMNIGTGITGNPVLAVQCNYEKNYAFVEFR 76
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEF---------VEVA-------------IFIGG 557
+AEDA+AA+ DG F LKI+RPK++ V V +F+GG
Sbjct: 77 SAEDATAAMAFDGIIFINGPLKIRRPKDYGGDTIVSPGVHVPGVVSTNVPDSINKVFVGG 136
Query: 558 ISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKV 615
+ L+ + VME++ +FG LKA++ E + AF EYVDQ VT AI LNG+++
Sbjct: 137 LPTYLNEEQVMELLKSFGELKAFNLVRENGTGTSKGFAFFEYVDQAVTDVAIQSLNGMEL 196
Query: 616 GGQVLTAVQAVLDGSIMDNSGNPP------FHGIPKHALPLLKKPTE----VLKLKNVFN 665
G + L VQ G+ G P F IP+ +P K T +L + N+
Sbjct: 197 GDRYLV-VQRASVGAKPGTPGMIPNLPYDQFPEIPRPIMPAGKDQTSSESRILLMLNMVT 255
Query: 666 PEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK-YGDSNISTIQACEGNENTASAGV 724
PE + E ++ +DV+ EC+++G ++ + + + S++ A + +AGV
Sbjct: 256 PEDLH--EDDEYGDLYDDVKAECSKYGELEDLRIPRPVKKDKTSSMSAQDAQRIDEAAGV 313
Query: 725 GQ 726
G+
Sbjct: 314 GR 315
>gi|336381013|gb|EGO22165.1| hypothetical protein SERLADRAFT_371639 [Serpula lacrymans var.
lacrymans S7.9]
Length = 381
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 132/273 (48%), Gaps = 44/273 (16%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL + ++ +E+ L +F N+ ++ + P + EK AFVEF +AED
Sbjct: 21 RRLYIGSITPDVNEQNLADFFNSKMIEMSIGTGAPGNPVLAVQCNYEKNYAFVEFRSAED 80
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEV----------------------AIFIGGISRT 561
A+AA+ DG F LKI+RPK++ V +F+GG+
Sbjct: 81 ATAAMAFDGIIFINGPLKIRRPKDYGGVDMSAPSVHVPGVVSTNVPDSINKVFVGGLPTY 140
Query: 562 LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQ 618
L+ + VME++ +FG LKA++ V E+ P AF EYVD VT AI LNG+++G +
Sbjct: 141 LNEEQVMELLKSFGELKAFNL-VRENGNGPSKGFAFFEYVDISVTDVAIQSLNGMELGDR 199
Query: 619 VLTAVQAVLDGSIMDNSGNP------PFHGIPKHALPLLK------KPTEVLKLKNVFNP 666
L +A S+ G P P+ P+ P++ +L + N+ P
Sbjct: 200 YLVVQRA----SVGAKPGTPGMIPNLPYDQFPEIPRPIMPAGENSSADARILLMLNMVTP 255
Query: 667 EGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + + E ++ EDV+ EC+ +G+V+ + +
Sbjct: 256 DDLT--DDQEYGDLYEDVKEECSVYGAVEDLRI 286
>gi|195585954|ref|XP_002082743.1| GD25073 [Drosophila simulans]
gi|194194752|gb|EDX08328.1| GD25073 [Drosophila simulans]
Length = 445
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 162/376 (43%), Gaps = 93/376 (24%)
Query: 382 INRSPEKKSAKWDVAPVETYS-VPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISV 440
+ R + + WD++P E YS + + + QA+ A ++S GI
Sbjct: 47 LQRRFGRPATSWDISP-EGYSHLTPQQYKAMQASGQIASRILSD------------GIHA 93
Query: 441 SASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ----HV 496
S A +SM + R RRL V N+P +E+ +M+F N+ + + G + H
Sbjct: 94 GESAA---ISMIT-------RQARRLYVGNIPFGVTEEEMMQFFNHRITALGYEAKSSHY 143
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---- 552
+ C EK AF+EF + ++A+ AL DG F G LKI+RP ++ V
Sbjct: 144 MDGKAVLTCQTNLEKNFAFLEFRSIDEATQALNFDGMVFRGQTLKIRRPHDYQPVPSISF 203
Query: 553 ------------------------------------IFIGGISRTLSSKMVMEIVCAFGP 576
I++GG+ L+ V E++ +FG
Sbjct: 204 SAMENYRSFRVPDTTIANPPNVTIPVTTIVPDSPNKIYVGGLPTCLNQDQVKELLQSFGE 263
Query: 577 LKAYHFEV--NEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN 634
LK + + N + AF EY D LVT AIAGL+G+ +G + L +++ G
Sbjct: 264 LKGLNLVMDGNTSLNKGFAFFEYYDPLVTDHAIAGLHGMLLGDRRLVVQRSIPGGK---- 319
Query: 635 SGNPPFHGIPKHALPLLK-----------KPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+ P H P+++ PTE L L N+ PE L + E E++ D
Sbjct: 320 ------NAFPGHTAPVVQVPGISTLLDPGSPTETLCLLNMVRPE--ELLDDEEFEDIRTD 371
Query: 684 VRLECARFGSVKSVNV 699
++ ECA+FG V+S+ +
Sbjct: 372 IKQECAKFGEVRSIKI 387
>gi|449669310|ref|XP_004206989.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Hydra
magnipapillata]
Length = 480
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/347 (26%), Positives = 151/347 (43%), Gaps = 63/347 (18%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K++ WDV P + Y + V AS EM + PL+G + A+
Sbjct: 104 KRNTLWDVPP-KGYEDITPVQFKALRASGQV-EMPT----------PLSGGVIPATALP- 150
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
Q Q RRL NLP +E +++F N + S + G+ P + C I
Sbjct: 151 -------QGAQMTMQARRLYCGNLPFGITEDLMVDFFNAKMRESDMARQPGN-PVLACQI 202
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
EK AF+EF + E+ + A+ DG G LKI+RPK++ +
Sbjct: 203 NLEKNFAFLEFRSVEETTLAMAFDGIILQGQALKIRRPKDYQPIPGINGMAYPTLFAESQ 262
Query: 553 ------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPC 592
+F+GG+ L+ V E++ FG L+A++ + +
Sbjct: 263 ATHIPGVVSTVVSDTINKVFVGGLPNYLNEDQVKELLSTFGELRAFNLVKDSATGLSKGY 322
Query: 593 AFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLK 652
AF EYVD +T AIAG+NG+++G + L +A + M N P + K
Sbjct: 323 AFCEYVDIGITDVAIAGMNGMQLGDKKLIVQRASVGSKTMTAQLNIPGFDLNKEI----- 377
Query: 653 KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
T +L L N+ E + + + +E+ ED+R EC+++G ++S+ +
Sbjct: 378 TATNILCLMNMVVAE--ELMDDEDYDEIFEDIREECSKYGRIRSMQI 422
>gi|119592808|gb|EAW72402.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_a [Homo
sapiens]
Length = 376
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 126/275 (45%), Gaps = 50/275 (18%)
Query: 384 RSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
RSP + K WDV P + + + QAA Q P +
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATAL 121
Query: 439 SVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHV 496
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 122 LPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQA 181
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 182 PGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSE 240
Query: 553 --------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEE 590
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 241 NPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSK 300
Query: 591 PCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
AF EYVD VT +AIAGLNG+++G + L +A
Sbjct: 301 GYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRA 335
>gi|350295506|gb|EGZ76483.1| hypothetical protein NEUTE2DRAFT_76972 [Neurospora tetrasperma FGSC
2509]
Length = 592
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 162/357 (45%), Gaps = 66/357 (18%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+ + + A + +
Sbjct: 200 KRRMTQWDIKPPGYGNV-----TAEQAKLSGMFPLPGA-----PRQQAMDPTKLQAFMTQ 249
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+++S L +N R +RL V N+P SA++++L+ F N L +G+ + + PC+ C
Sbjct: 250 PGGAVNSAALKPTNSRQSKRLIVSNIPPSATDESLLGFFN--LQLNGLNVIDSADPCVQC 307
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSG---------SILKIKRPKEFVEVAIF-- 554
I + A +EF + DA+ AL DG + LKI+RPK+++ AI
Sbjct: 308 QISPDHSFAMLEFKNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDYIVPAIVED 367
Query: 555 ----------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP- 591
+ I LS + +ME++ AFG LK++ V + H E
Sbjct: 368 PNYDPDSEVPSSIVIDSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVL-VKDKHTEES 426
Query: 592 --CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP 649
AF EY D VT AI GLN + +G + L +A I +G + +A+
Sbjct: 427 RGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKASY--GIQQVAGE-----LSVNAMS 479
Query: 650 LLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+L T V++L N+ + + + EE+ +DV+ EC +FG++ S+ +
Sbjct: 480 MLAGTTSLDGDVSRVVQLLNMVTADEL--MDNDDYEEIRDDVQEECEKFGTIVSLKI 534
>gi|195346998|ref|XP_002040041.1| GM15574 [Drosophila sechellia]
gi|194135390|gb|EDW56906.1| GM15574 [Drosophila sechellia]
Length = 445
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 162/376 (43%), Gaps = 93/376 (24%)
Query: 382 INRSPEKKSAKWDVAPVETYS-VPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISV 440
+ R + + WD+ P E YS + + + QA+ A ++S GI
Sbjct: 47 LQRRFGRPATSWDIPP-EGYSHLTPQQYKAMQASGQIASRILSD------------GIHA 93
Query: 441 SASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ----HV 496
S A +SM + R RRL V N+P +E+ +M+F N+ +++ G + H
Sbjct: 94 GESAA---ISMIT-------RQARRLYVGNIPFGVTEEEMMQFFNHRIMAQGYEAKSSHY 143
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---- 552
+ C EK AF+EF + ++A+ AL DG + G LKI+RP ++ V
Sbjct: 144 MDGKAVLTCQTHLEKNFAFLEFRSIDEATQALNFDGMVYRGQTLKIRRPHDYQPVPSISF 203
Query: 553 ------------------------------------IFIGGISRTLSSKMVMEIVCAFGP 576
I++GG+ L+ V E++ +FG
Sbjct: 204 SAMENYRSFRVPATTIANPPNVTIPVTTIVPDSPNKIYVGGLPTCLNQDQVKELLQSFGE 263
Query: 577 LKAYHFEV--NEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN 634
LK + + N + AF EY D LVT AIAGL+G+ +G + L +++ G
Sbjct: 264 LKGLNLVMDGNTSLNKGFAFFEYYDPLVTDHAIAGLHGMLLGDRRLVVQRSIPGGK---- 319
Query: 635 SGNPPFHGIPKHALPLLK-----------KPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+ P H P+++ PTE L L N+ PE L + E E++ D
Sbjct: 320 ------NAFPGHTGPVVQVPGISTLLDPGSPTETLCLLNMVRPE--ELLDDEEFEDIRTD 371
Query: 684 VRLECARFGSVKSVNV 699
++ ECA+FG V+S+ +
Sbjct: 372 IKQECAKFGEVRSIKI 387
>gi|112490659|pdb|2G4B|A Chain A, Structure Of U2af65 Variant With Polyuridine Tract
Length = 172
Score = 107 bits (267), Expect = 3e-20, Method: Composition-based stats.
Identities = 61/168 (36%), Positives = 98/168 (58%), Gaps = 7/168 (4%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+P +E+A+M+F N + G+ G+ P + I ++K AF+EF + ++
Sbjct: 5 RRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDE 63
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEV----AIFIGGISRTLSSKMVMEIVCAFGPLKA 579
+ A+ DG F G LKI+RP ++ + +FIGG+ L+ V E++ +FGPLKA
Sbjct: 64 TTQAMAFDGIIFQGQSLKIRRPHDYQPLPGAHKLFIGGLPNYLNDDQVKELLTSFGPLKA 123
Query: 580 YHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
++ + + AF EYVD VT +AIAGLNG+++G + L +A
Sbjct: 124 FNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRA 171
>gi|85111663|ref|XP_964044.1| hypothetical protein NCU03039 [Neurospora crassa OR74A]
gi|28925805|gb|EAA34808.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 584
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 162/357 (45%), Gaps = 66/357 (18%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+ + + A + +
Sbjct: 192 KRRMTQWDIKPPGYGNV-----TAEQAKLSGMFPLPGA-----PRQQAMDPTKLQAFMTQ 241
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+++S L +N R +RL V N+P SA++++L+ F N L +G+ + + PC+ C
Sbjct: 242 PGGAVNSAALKPTNSRQSKRLIVSNIPPSATDESLLGFFN--LQLNGLNVIDSADPCVQC 299
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSG---------SILKIKRPKEFVEVAIF-- 554
I + A +EF + DA+ AL DG + LKI+RPK+++ AI
Sbjct: 300 QISPDHSFAMLEFRNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDYIVPAIVED 359
Query: 555 ----------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP- 591
+ I LS + +ME++ AFG LK++ V + H E
Sbjct: 360 PNYDPDSEVPSSIVIDSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVL-VKDKHTEES 418
Query: 592 --CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP 649
AF EY D VT AI GLN + +G + L +A I +G + +A+
Sbjct: 419 RGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKASY--GIQQVAGE-----LSVNAMS 471
Query: 650 LLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+L T V++L N+ + + + EE+ +DV+ EC +FG++ S+ +
Sbjct: 472 MLAGTTSLDGDVSRVVQLLNMVTADEL--MDNDDYEEIRDDVQEECEKFGTIVSLKI 526
>gi|449802099|pdb|3VAF|A Chain A, Structure Of U2af65 Variant With Bru3 Dna
gi|449802100|pdb|3VAF|B Chain B, Structure Of U2af65 Variant With Bru3 Dna
gi|449802101|pdb|3VAG|A Chain A, Structure Of U2af65 Variant With Bru3c2 Dna
gi|449802102|pdb|3VAG|B Chain B, Structure Of U2af65 Variant With Bru3c2 Dna
gi|449802103|pdb|3VAH|A Chain A, Structure Of U2af65 Variant With Bru3c4 Dna
gi|449802104|pdb|3VAH|B Chain B, Structure Of U2af65 Variant With Bru3c4 Dna
gi|449802105|pdb|3VAI|A Chain A, Structure Of U2af65 Variant With Bru3c5 Dna
gi|449802106|pdb|3VAI|B Chain B, Structure Of U2af65 Variant With Bru3c5 Dna
gi|449802107|pdb|3VAJ|A Chain A, Structure Of U2af65 Variant With Bru5c6 Dna
gi|449802108|pdb|3VAJ|B Chain B, Structure Of U2af65 Variant With Bru5c6 Dna
gi|449802109|pdb|3VAK|A Chain A, Structure Of U2af65 Variant With Bru5 Dna
gi|449802110|pdb|3VAK|B Chain B, Structure Of U2af65 Variant With Bru5 Dna
gi|449802113|pdb|3VAL|A Chain A, Structure Of U2af65 Variant With Bru5c1 Dna
gi|449802114|pdb|3VAL|B Chain B, Structure Of U2af65 Variant With Bru5c1 Dna
gi|449802115|pdb|3VAL|D Chain D, Structure Of U2af65 Variant With Bru5c1 Dna
gi|449802116|pdb|3VAL|I Chain I, Structure Of U2af65 Variant With Bru5c1 Dna
gi|449802117|pdb|3VAM|A Chain A, Structure Of U2af65 Variant With Bru5c2 Dna
gi|449802118|pdb|3VAM|B Chain B, Structure Of U2af65 Variant With Bru5c2 Dna
Length = 174
Score = 107 bits (267), Expect = 3e-20, Method: Composition-based stats.
Identities = 61/168 (36%), Positives = 98/168 (58%), Gaps = 7/168 (4%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+P +E+A+M+F N + G+ G+ P + I ++K AF+EF + ++
Sbjct: 7 RRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDE 65
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEV----AIFIGGISRTLSSKMVMEIVCAFGPLKA 579
+ A+ DG F G LKI+RP ++ + +FIGG+ L+ V E++ +FGPLKA
Sbjct: 66 TTQAMAFDGIIFQGQSLKIRRPHDYQPLPGAHKLFIGGLPNYLNDDQVKELLTSFGPLKA 125
Query: 580 YHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
++ + + AF EYVD VT +AIAGLNG+++G + L +A
Sbjct: 126 FNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRA 173
>gi|168036006|ref|XP_001770499.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678207|gb|EDQ64668.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 207
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/123 (44%), Positives = 80/123 (65%), Gaps = 2/123 (1%)
Query: 567 VMEIVCAFGPLKAYHFEVNEDHEEPC--AFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQ 624
V EI+ AFG LK++H++ + C AF EY+D VT +A GLNG+ +GGQVLT +Q
Sbjct: 14 VKEIITAFGQLKSFHWKTMHHNNGGCSIAFFEYLDPTVTHQACKGLNGMNLGGQVLTVLQ 73
Query: 625 AVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDV 684
A D S NPPF+ IP+ ALPLL+ PT VL++ N+ + E + + +V +++EDV
Sbjct: 74 ATPDAKYEVFSKNPPFYDIPEEALPLLETPTNVLEIHNLVSEEEVLHMQDKDVRKLMEDV 133
Query: 685 RLE 687
R+E
Sbjct: 134 RIE 136
>gi|336465212|gb|EGO53452.1| hypothetical protein NEUTE1DRAFT_92746 [Neurospora tetrasperma FGSC
2508]
Length = 584
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 162/357 (45%), Gaps = 66/357 (18%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+ + + A + +
Sbjct: 192 KRRMTQWDIKPPGYGNV-----TAEQAKLSGMFPLPGA-----PRQQAMDPTKLQAFMTQ 241
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+++S L +N R +RL V N+P SA++++L+ F N L +G+ + + PC+ C
Sbjct: 242 PGGAVNSTALKPTNSRQSKRLIVSNIPPSATDESLLGFFN--LQLNGLNVIDSADPCVQC 299
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSG---------SILKIKRPKEFVEVAIF-- 554
I + A +EF + DA+ AL DG + LKI+RPK+++ AI
Sbjct: 300 QISPDHSFAMLEFRNSPDATVALALDGITMEAEDANGAAGAGGLKIRRPKDYIVPAIVED 359
Query: 555 ----------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP- 591
+ I LS + +ME++ AFG LK++ V + H E
Sbjct: 360 PNYDPDSEVPSSIVIDSPNKISVTNIPAYLSEEQIMELLVAFGKLKSFVL-VKDKHTEES 418
Query: 592 --CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP 649
AF EY D VT AI GLN + +G + L +A I +G + +A+
Sbjct: 419 RGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKASY--GIQQVAGE-----LSVNAMS 471
Query: 650 LLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+L T V++L N+ + + + EE+ +DV+ EC +FG++ S+ +
Sbjct: 472 MLAGTTSLDGDVSRVVQLLNMVTADEL--MDNDDYEEIRDDVQEECEKFGTIVSLKI 526
>gi|427789501|gb|JAA60202.1| Putative splicing factor u2af large subunit rrm superfamily
[Rhipicephalus pulchellus]
Length = 462
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 131/267 (49%), Gaps = 30/267 (11%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
R RRL V N+P SE+ +M++ N + + G G+ P + C I +K AF+EF
Sbjct: 142 TRQARRLYVGNIPFGCSEEEMMDYFNAQMHACGFSQAPGN-PVLACQINLDKNFAFLEFR 200
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFI 555
+ ++ + A+ DG +F G LKI+RP ++ + IFI
Sbjct: 201 SIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMSETPSVAVPGVISTVVQDSPHKIFI 260
Query: 556 GGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGL 613
GG+ L+ V E++ +FG L+A++ + + AF EYV+ T +AI GLNG+
Sbjct: 261 GGLPNYLNEDQVRELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVEVATTDQAIMGLNGM 320
Query: 614 KVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP-LLKKPTEVLKLKNVFNPEGFSSL 672
++G + L +A + + P +P L PTEVL L N+ PE
Sbjct: 321 QLGDKKLIVQRASVGAKNSQMNQAPVQIQVPGLQLQGGAGPPTEVLCLMNLVCPE--ELK 378
Query: 673 SELEVEEVLEDVRLECARFGSVKSVNV 699
E E E++LED+ EC ++G VKS+ +
Sbjct: 379 DEEEYEDILEDIHEECNKYGVVKSIEI 405
>gi|343426615|emb|CBQ70144.1| related to pre-mRNA splicing factor U2AF large chain [Sporisorium
reilianum SRZ2]
Length = 710
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 127/282 (45%), Gaps = 42/282 (14%)
Query: 459 SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEF 518
+NR RRL V N+ +E ++ F N +L + G P + + +KG AFVEF
Sbjct: 334 ANRQARRLYVGNITHQTNEHNIVAFFNEQMLKLKLGTEPGE-PAVSAQVNVDKGYAFVEF 392
Query: 519 LTAEDASAALCCDGCSFSGSILKIKRPKEFV-----------------------EVAIFI 555
++A+ A+ DG F LKI+RPK++ IF+
Sbjct: 393 RHPDEATNAMSFDGIVFQAQSLKIRRPKDYTGPDVRPPSNIHVPGVISTNVPDSPFKIFV 452
Query: 556 GGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGL 613
GG+ L+ V+E++ AFG L+A++ + + AF EYVD +T A GLNG+
Sbjct: 453 GGLPTYLTDDQVIELLQAFGELRAFNLVKDTGTGASKGFAFCEYVDTALTDLACQGLNGM 512
Query: 614 KVGGQVLTAVQAVLD--------GSIMDNSGNPPFHGIPKHALPL------LKKPTEVLK 659
++G + L +A + + NSG +P +PT +
Sbjct: 513 ELGDRNLVVQRASVGSEKKAQALAATGANSGALGIAAVPSSVQQSAGEDGDAGEPTSCMV 572
Query: 660 LKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK 701
+ N+ PE + E +++ED+R EC ++G+V V V +
Sbjct: 573 MLNMVTPEELQ--DDEEYADIVEDIRDECTKYGAVTDVRVPR 612
>gi|358391563|gb|EHK40967.1| hypothetical protein TRIATDRAFT_135674 [Trichoderma atroviride IMI
206040]
Length = 558
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 97/353 (27%), Positives = 159/353 (45%), Gaps = 55/353 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P Y ++ T+ QA S + + Q+P+ + A + +
Sbjct: 163 KRRLTQWDIKP-PGY----DLVTAEQAKLSGMFPLPGA-----PRQQPMDPTKLQAFITQ 212
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S L SN R +RL V N+P A E AL+ F N L +G+ + S PC+ C
Sbjct: 213 PGGQVTSAGLKASNSRQAKRLLVSNVPSGAGEDALISFFN--LQLNGLNVIESSDPCVLC 270
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSG------------SILKIKRPKEFVEVA- 552
++ A +EF A DA+ AL DG S S L I+RPK++V A
Sbjct: 271 QFSADRAFAVLEFRNASDATVALALDGISMEADDAMNGTADGVSSGLNIRRPKDYVMPAL 330
Query: 553 ---------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP 591
+ I I L+ + V+E++ AFG KA+ ++ EE
Sbjct: 331 PDEMPFDPEVISNVVPDTVHKLCITNIPSFLTEEQVIELLAAFGKPKAFVLVKDQSTEES 390
Query: 592 --CAFIEYVD-QLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKH 646
AF EY++ A+ LNG+ VGG+ L +A + + + N G G+
Sbjct: 391 RGIAFTEYLEPSSANEPALNSLNGMDVGGKKLKVTKASIGPTQVANFDVGITAISGLASQ 450
Query: 647 ALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+++ + V++L N+ PE + + EE+ EDV+ ECA+FG V + V
Sbjct: 451 TSNDIER-SSVIQLLNMVTPEEL--IDNDDYEEICEDVQDECAKFGKVVELKV 500
>gi|226478958|emb|CAX72974.1| Splicing factor U2AF 65 kDa subunit [Schistosoma japonicum]
Length = 520
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 127/478 (26%), Positives = 204/478 (42%), Gaps = 129/478 (26%)
Query: 274 SPKSRHGNSRLKRRRSRSREREDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGR 333
SP++++ +S R RS+ RER +RS S R ++ + R++ G H + R
Sbjct: 63 SPQTKYDSSSDGRSRSKKRER----KRSHSHGHRRHSKSRH--RDYSG--GHKS-----R 109
Query: 334 QHSDIDSSRVTGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSPEKKSAKW 393
+H R N +S H + D G +P + + A +T +
Sbjct: 110 RHQ--SHHRSPSNSVSAH-KYWDVPPPGFEHVTPAQYK---ALQTSGQV----------- 152
Query: 394 DVAPVETYS---VPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVS 450
PV Y+ VP VH N P+T TT P AG +V
Sbjct: 153 ---PVNVYAAGQVPMPVHAPNA-------------PLTLTTNVPFAGSAVC--------- 187
Query: 451 MDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
R RRL V N+P +A+E+ +MEF N + + G+ G+ P I I E
Sbjct: 188 ----------RQARRLYVGNIPFTATEENMMEFFNKQMRAQGLIQAEGN-PIIAVQINME 236
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------ 552
K AF+EF + ++ + L DG F LK++RP+++ +
Sbjct: 237 KNFAFLEFRSVDETTQGLALDGVLFQNQALKLRRPRDYAPLPGVSEQPSVIVPGVVSTVV 296
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
IF+GG+ L+ V E++ +FGPLK ++ + + + AF EYVD VT
Sbjct: 297 QDSPHKIFVGGLPNYLNEDQVKELLLSFGPLKGFNLVKDGSTGLSKGYAFCEYVDSNVTD 356
Query: 605 KAIAGLNGLKVG--------------------GQVLTAVQAVLDGSIMDNSGNPPF---H 641
A AGLNG+++G Q L + + +G++ + +G+
Sbjct: 357 HACAGLNGMQLGDKKLIVQRASVGAKHTTGVLPQCLLQMSGLEEGAVQNTTGSGNLTVRS 416
Query: 642 GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G P PTEVL L N+ E + E E+++EDVR EC+++G V+S+ +
Sbjct: 417 GGP---------PTEVLCLMNMI--ETSELEDDEEYEDIVEDVRAECSKYGVVRSLEI 463
>gi|119592809|gb|EAW72403.1| U2 (RNU2) small nuclear RNA auxiliary factor 2, isoform CRA_b [Homo
sapiens]
Length = 356
Score = 107 bits (266), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 126/275 (45%), Gaps = 50/275 (18%)
Query: 384 RSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
RSP + K WDV P + + + QAA Q P +
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATAL 121
Query: 439 SVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHV 496
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 122 LPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQA 181
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 182 PGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSE 240
Query: 553 --------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEE 590
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 241 NPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSK 300
Query: 591 PCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
AF EYVD VT +AIAGLNG+++G + L +A
Sbjct: 301 GYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRA 335
>gi|50882018|gb|AAT85577.1| U2 small nuclear ribonucleoprotein auxiliary factor large subunit
[Alvinella pompejana]
Length = 479
Score = 106 bits (265), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 101/345 (29%), Positives = 164/345 (47%), Gaps = 49/345 (14%)
Query: 393 WDVAPVET-YSVPSN---VHTSNQAASSNAHEMVSSDPVTS---TTQKPLAGISVSASLA 445
WDV PV + P +H A M PV + P A ++++ ++
Sbjct: 89 WDVPPVGFEHMTPMQYKALHGLGPPGGLAAPGMPVVAPVIAANNVVASPTAPMALNTTIP 148
Query: 446 KLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
++ +R RRL V N+P +E+ +M++ N + +G+ G+ P I C
Sbjct: 149 FAGSAI--------SRQARRLYVGNIPFGVTEEMMMDYFNTQMKMAGLAQAEGN-PVIAC 199
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------VEVA------- 552
+ +K AF+EF + ++ + A+ DG +F G LKI+RPK++ EV
Sbjct: 200 HVNLDKNFAFLEFRSVDETTQAMAFDGINFQGQSLKIRRPKDYQPLPGMAEVPSVAVPGV 259
Query: 553 -----------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVD 599
IFIGG+ L+ V E++ +FG LKA++ + + AF EY+D
Sbjct: 260 VSTVVQDSAHKIFIGGLPNYLNEDQVKELLTSFGLLKAFNLVKDSATGLSKGYAFCEYLD 319
Query: 600 QLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN--PPFHGIPKHALPLLKKP--- 654
+T +A AGLNG+++G + L +A + N P IP + ++ P
Sbjct: 320 PSITDQACAGLNGMQLGDEKLIVQRASVGAKNAQGGPNVLPVQLQIPGLNMAQVQGPGPT 379
Query: 655 TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
TEVL L N+ PE E E EE+LEDV+ EC+++G VKS+ +
Sbjct: 380 TEVLCLMNMVTPE--DLEDEEEYEEILEDVKEECSKYGYVKSIEI 422
>gi|312078073|ref|XP_003141580.1| U2af splicing factor protein 1 [Loa loa]
Length = 460
Score = 106 bits (265), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 156/342 (45%), Gaps = 64/342 (18%)
Query: 386 PEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLA 445
P+KK WDV PV + + QAA V S P+ G SV+
Sbjct: 138 PKKKYKFWDVPPVGYEHLTPKEYKELQAAGQIPRNNVQS-------AVPVVGPSVTC--- 187
Query: 446 KLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
QS RRL V N+P SE A+++F N + G+ G+ P + C
Sbjct: 188 ------------QS----RRLYVGNIPFGCSEDAMLDFFNQQMHLCGLAQAPGN-PVLAC 230
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------- 552
+ +K AF+EF + ++ +A + DG +F G LKI+RP+++ ++
Sbjct: 231 QMNLDKNFAFIEFRSIDETTAGMAFDGINFMGQQLKIRRPRDYQPMSTSYDLGNMMVSNI 290
Query: 553 -------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVT 603
IFIGG+ L+++ V E++ +FG LKA++ E + + AF EY+D +T
Sbjct: 291 VPDSPHKIFIGGLPSYLNAEQVKELLSSFGQLKAFNLVTEQSTGVSKGYAFAEYLDPSLT 350
Query: 604 PKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPL---LKKPTEVLKL 660
+AIAGLNG+++G + L + + +N F I + L PTEVL L
Sbjct: 351 DQAIAGLNGMQLGDKNLVVQLSCANAR--NNVAQNTFPQIQVAGIDLSHGAGPPTEVLCL 408
Query: 661 KNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKY 702
N+ + EL+ +E E + C+ F K N++ +
Sbjct: 409 MNMVTED------ELKDDEEYEGI---CS-FSLFKMANIIWF 440
>gi|148699340|gb|EDL31287.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2,
isoform CRA_b [Mus musculus]
Length = 356
Score = 106 bits (265), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 126/275 (45%), Gaps = 50/275 (18%)
Query: 384 RSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
RSP + K WDV P + + + QAA Q P +
Sbjct: 78 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATAL 121
Query: 439 SVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHV 496
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 122 LPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQA 181
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 182 PGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSE 240
Query: 553 --------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEE 590
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 241 NPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSK 300
Query: 591 PCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
AF EYVD VT +AIAGLNG+++G + L +A
Sbjct: 301 GYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRA 335
>gi|170053756|ref|XP_001862821.1| splicing factor U2AF 50 kDa subunit [Culex quinquefasciatus]
gi|167874130|gb|EDS37513.1| splicing factor U2AF 50 kDa subunit [Culex quinquefasciatus]
Length = 382
Score = 106 bits (265), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 122/268 (45%), Gaps = 53/268 (19%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 77 KPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT------ 127
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + SG+ G+ P + C I
Sbjct: 128 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-PVLACQI 173
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--------------- 552
+K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 174 NLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGMTDSAVAPVQEKFS 233
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEY 597
IFIGG+ L+ V E++ +FG LKA++ + + AF EY
Sbjct: 234 GVISTVVPDSPHKIFIGGLPNYLNEDQVKELLLSFGQLKAFNLVKDAATGLSKGYAFAEY 293
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
V+ +T +AIAGLNG+++G + L +A
Sbjct: 294 VEYSITDQAIAGLNGMQLGDKKLIVQRA 321
>gi|336274240|ref|XP_003351874.1| hypothetical protein SMAC_00421 [Sordaria macrospora k-hell]
gi|380096157|emb|CCC06204.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 594
Score = 106 bits (264), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 161/357 (45%), Gaps = 66/357 (18%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+ + + A + +
Sbjct: 202 KRRMTQWDIKPPGYGNV-----TAEQAKLSGMFPLPGA-----PRQQAMDPTKLQAFMTQ 251
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+++S L +N R +RL V N+P SA++++L+ F N L +G+ + PC+ C
Sbjct: 252 PGGTVNSTALKPTNSRQSKRLIVSNIPPSATDESLLGFFN--LQLNGLNVIDSVDPCVQC 309
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI---------LKIKRPKEFVEVAIF-- 554
I + A VEF + DA+ AL DG + LKI+RPK+++ AI
Sbjct: 310 QISPDHSFAMVEFRNSPDATVALALDGITMEADDANDAASAGGLKIRRPKDYIVPAIVED 369
Query: 555 ----------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP- 591
+ I L+ + +ME++ AFG LK++ V + H E
Sbjct: 370 PNYDPDSEVPSNIVIDSPNKISVTNIPAYLTEEQIMELLVAFGKLKSFVL-VKDKHTEES 428
Query: 592 --CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP 649
AF EY D VT AI GLN + +G + L +A I +G + +A+
Sbjct: 429 RGIAFCEYHDSSVTSVAIDGLNNMMLGDRALKVQKASY--GIQQVAGE-----LSVNAMS 481
Query: 650 LLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+L T V++L N+ + + + EE+ +DV+ EC +FG++ S+ +
Sbjct: 482 MLAGTTSLDGDVSRVVQLLNMVTADEL--MDNDDYEEIRDDVQEECEKFGTIVSLKI 536
>gi|336368252|gb|EGN96595.1| hypothetical protein SERLA73DRAFT_93106 [Serpula lacrymans var.
lacrymans S7.3]
Length = 396
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 132/273 (48%), Gaps = 44/273 (16%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL + ++ +E+ L +F N+ ++ + P + EK AFVEF +AED
Sbjct: 52 RRLYIGSITPDVNEQNLADFFNSKMIEMSIGTGAPGNPVLAVQCNYEKNYAFVEFRSAED 111
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEV----------------------AIFIGGISRT 561
A+AA+ DG F LKI+RPK++ V +F+GG+
Sbjct: 112 ATAAMAFDGIIFINGPLKIRRPKDYGGVDMSAPSVHVPGVVSTNVPDSINKVFVGGLPTY 171
Query: 562 LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQ 618
L+ + VME++ +FG LKA++ V E+ P AF EYVD VT AI LNG+++G +
Sbjct: 172 LNEEQVMELLKSFGELKAFNL-VRENGNGPSKGFAFFEYVDISVTDVAIQSLNGMELGDR 230
Query: 619 VLTAVQAVLDGSIMDNSGNP------PFHGIPKHALPLLK------KPTEVLKLKNVFNP 666
L +A S+ G P P+ P+ P++ +L + N+ P
Sbjct: 231 YLVVQRA----SVGAKPGTPGMIPNLPYDQFPEIPRPIMPAGENSSADARILLMLNMVTP 286
Query: 667 EGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + + E ++ EDV+ EC+ +G+V+ + +
Sbjct: 287 DDLT--DDQEYGDLYEDVKEECSVYGAVEDLRI 317
>gi|406867695|gb|EKD20733.1| U2 snRNP auxilliary factor [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 613
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 167/374 (44%), Gaps = 81/374 (21%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+P+ + A +++
Sbjct: 202 KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA-----PRQQPMDPSKLQAFMSQ 251
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S+ + L SN R +RL V NLP + +E+A++ F N L +G+ + GS PC+
Sbjct: 252 PSGSVTNAALKPSNSRQAKRLLVSNLPPTVNEEAIVNFFN--LQLNGLNVIEGSDPCVSA 309
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCS-----FSGSI-------LKIKRPKEFV---- 549
I ++K A +EF DA+ AL DG + +GS L I+RPK+++
Sbjct: 310 QISKDKTFALLEFKQTSDATIALALDGITMEDEHMNGSATNGDTQGLSIRRPKDYIVPTI 369
Query: 550 ------------------EVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHE 589
+ I I I L+ + V E++ +FG LKA+ + D
Sbjct: 370 TDETPFEAGVISNVVVDTQNKISISNIPLYLTDEQVTELLVSFGELKAFVLVKDNGTDES 429
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK--HA 647
AF EYVD + T A+ GLNG+++G + L +A + G+ G+ +A
Sbjct: 430 RGIAFCEYVDPVATDIAVEGLNGMELGDKHLKVQRASI--------GHTQVSGLEMSVNA 481
Query: 648 LPLLKKPT-------EVLKLKNVFNP---------EGFSSLSEL------EVEEVLEDVR 685
+ +L T VL+L N+ P EG L+ L E+ EDV+
Sbjct: 482 MSMLAGTTSQGLEEGRVLQLLNMVTPEELIDNEDYEGTILLTTLIQSLLTMYPEICEDVK 541
Query: 686 LECARFGSVKSVNV 699
EC ++G V + V
Sbjct: 542 EECEKYGKVLDMKV 555
>gi|325191172|emb|CCA25959.1| splicing factor U2af large subunit putative [Albugo laibachii Nc14]
Length = 553
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 147/320 (45%), Gaps = 57/320 (17%)
Query: 456 LTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAF 515
+ Q R RR+ V + +E + F N+ + + + G + I RE+ AF
Sbjct: 207 MAQQTRHARRIYVGGIG-EVNETEISAFFNDVIDRALGERQEGGA-VVSVYINRERHFAF 264
Query: 516 VEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------VEVA--------- 552
VE + E +A + DG +F G LK++RP ++ + +A
Sbjct: 265 VELKSIELTTACMNLDGIAFRGQPLKVRRPNDYNPGLVPKDLGPIPALNLAALGIVSTTV 324
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
+FIGGI L+ + + E++ AFGPLK++H ++ + + AF EY+D VT
Sbjct: 325 QDGPGKVFIGGIPYHLTEEQIKELLQAFGPLKSFHLVKDLTTNLSKGYAFCEYMDSGVTD 384
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDGS--IMDNSGNPPFHGI-----PKHA---------- 647
A GLN +K+G + LT +A+ S ++ N+ G+ P A
Sbjct: 385 AACIGLNDMKLGDRTLTVRRALSQESAKVIANAAGTVNAGVEMGLDPSRAAMQTISMAGV 444
Query: 648 -LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK----Y 702
L + P+ VL L+N+ PE E E ++++D+R EC R+G V ++ + + Y
Sbjct: 445 HLGPIGSPSRVLVLRNMVTPEELE--DEDEYRDIMDDIRSECERYGRVTTIILPRAKEGY 502
Query: 703 GDSNISTIQACEGNENTASA 722
GD + + G+ +T+ A
Sbjct: 503 GDEALGKVYIEFGDISTSQA 522
>gi|346978171|gb|EGY21623.1| splicing factor U2AF 50 kDa subunit [Verticillium dahliae VdLs.17]
Length = 589
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 111/428 (25%), Positives = 183/428 (42%), Gaps = 95/428 (22%)
Query: 365 YSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSN 411
++ R++R +A +P P R P +++ +WD+ P +V T+
Sbjct: 141 FARREQRPRSA--SPPPKKREPTPDLTDIVSVLDRKRRLTQWDIKPPGYENV-----TAE 193
Query: 412 QAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVEN 470
QA S + + Q+P+ + A + + +++S L SN R +RL V
Sbjct: 194 QAKLSGMFPLPGA-----PRQQPMDPTKLQAFMNQPG-TVNSASLKPSNSRQAKRLLVSK 247
Query: 471 LPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCC 530
LP SA+E ++ F N L +G+ + + PC+ C + +K VEF A +A+ AL
Sbjct: 248 LPSSATEDSVASFFN--LQLNGLNVIESTDPCVSCQLSNDKSFCVVEFRNASEATVALAL 305
Query: 531 DGCSF---------SGSILKIKRPKEFVEVAIF----------------------IGGIS 559
DG S +G L+I+RPK+++ A+ I G
Sbjct: 306 DGISMEADSATDGAAGRGLEIRRPKDYIVPAVTEELPYEPGVVSSNVVDTPNKLSITGFP 365
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
L+ + V E++ +FG LKA+ + + D F EYVD AI GL+G+ +G
Sbjct: 366 PYLTEEQVTELLTSFGELKAFVLVRDRHTDESRGFVFCEYVDSAANDVAIQGLSGMDLGN 425
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIPK--HALPLLKKPT-------EVLKLKNVFNPEG 668
L +A + G G+ A+ +L T VL+L N+ +
Sbjct: 426 SKLKIQKASI--------GVTQVAGVEMGVAAMSMLAGTTATDSEESRVLQLLNMVTADE 477
Query: 669 FSSLSELEVEEVLEDVRLECARFGSVKSVNVVK-YGDSNISTIQACEGNENTASAGVGQN 727
+ + EE++EDV+ ECA++G+V V V + G S SAGVG+
Sbjct: 478 L--MDNEDYEEIVEDVQEECAKYGTVLEVKVPRPVGGSR-------------QSAGVGKI 522
Query: 728 LTNDETNE 735
ET E
Sbjct: 523 FVKYETKE 530
>gi|346324367|gb|EGX93964.1| splicing factor u2af large subunit [Cordyceps militaris CM01]
Length = 583
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 163/379 (43%), Gaps = 75/379 (19%)
Query: 366 SPRKRRTEAAAKTPSPINRSP----EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEM 421
SP KRR TP + P +++ +WD+ P +V TS QA S +
Sbjct: 177 SPPKRRE----PTPDLTDIIPVLERKRRMTQWDIKPPGYEAV-----TSEQAKMSGMFPL 227
Query: 422 VSS------DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPL 473
+ DP + + +P G SA L N +R RRL V ++P
Sbjct: 228 PGAPRQQQVDPTKLQALMNQPAGGQVSSAGLKANN-----------SRQARRLLVSDIPS 276
Query: 474 SASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGC 533
+E AL+ F N L +G+ + + PC C + +K A +EF DA+ AL DG
Sbjct: 277 GTTEDALVAFFN--LQLNGLNVIEATDPCALCQLSNDKSFAVLEFKNTGDATVALALDGS 334
Query: 534 SFSGSI--LKIKRPKEFVEVA-----IF-----------------IGGISRTLSSKMVME 569
S L I+RPK++V A IF I I L+ V+E
Sbjct: 335 SMVADTPGLSIRRPKDYVMPAVPDEIIFNPEVVSNSVPDTIHKLCITNIPPFLTEDQVLE 394
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQL-VTPKAIAGLNGLKVGGQVLTAVQAV 626
++ AFG KA+ EE AF EYV+ A+ LNG+ VGG+ L A +A
Sbjct: 395 LLAAFGKPKAFVLVKERSTEESRGIAFAEYVEPTNANEPALNTLNGMDVGGKKLKARKAC 454
Query: 627 LDGSIMDNSGNPPFHGIPKHALPLLK------KPTEVLKLKNVFNPEGFSSLSELEVEEV 680
+ G+ + N GI +A+ L T VL+L N+ E L + EE+
Sbjct: 455 VGGTQVANFD----AGI--NAISNLAGQGNGGDATRVLQLLNMVTAEEL--LDNDDYEEI 506
Query: 681 LEDVRLECARFGSVKSVNV 699
EDVR EC+++G V V V
Sbjct: 507 CEDVRDECSKYGKVLDVKV 525
>gi|443731660|gb|ELU16702.1| hypothetical protein CAPTEDRAFT_155651 [Capitella teleta]
Length = 480
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 150/305 (49%), Gaps = 56/305 (18%)
Query: 427 VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNN 486
+ + T P AG ++S R RRL V N+P +E+ +M+F N
Sbjct: 143 IVAPTNIPFAGSAIS-------------------RQARRLYVGNIPFGVTEEMMMDFFNT 183
Query: 487 FLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPK 546
+ + + G+ P I C + +K AF+E+ + ++ + A+ DG +F G LKI+RPK
Sbjct: 184 QMKMAALSQAEGN-PVIACQVNLDKNFAFLEYRSVDETTHAMALDGINFQGQSLKIRRPK 242
Query: 547 EFVEVA------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF 582
++ + IFIGG+ L+ V E++ +FGPLKA+
Sbjct: 243 DYQPLPGIAETPNVAVPGVVSTVVQDSAHKIFIGGLPNYLNEDQVKELLTSFGPLKAFSL 302
Query: 583 --EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL---DGSIMDNSGN 637
+ + AF EY+D +T +A AGLNG+++G + L +A + + +I+ S
Sbjct: 303 VKDSATGLSKGYAFCEYLDISITDQACAGLNGMQLGDKKLIVQRASVGAKNAAII--SSA 360
Query: 638 PPFHGIPKHAL-PLLK--KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV 694
P IP A+ P+ TEVL L N+ PE E E E++LEDV+ EC+++G V
Sbjct: 361 PMQMQIPGLAVNPMAAAGPATEVLCLMNMVLPE--ELEDEEEYEDILEDVKEECSKYGFV 418
Query: 695 KSVNV 699
KS+ +
Sbjct: 419 KSLEI 423
>gi|392589921|gb|EIW79251.1| hypothetical protein CONPUDRAFT_83522 [Coniophora puteana
RWD-64-598 SS2]
Length = 411
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 144/304 (47%), Gaps = 46/304 (15%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL + ++ +E+ L +F N ++ + P + EK AFVEF +AED
Sbjct: 52 RRLYIGSITPDVNEQNLADFFNGKMIEMSIGTGGPGNPVLAVQCNYEKNYAFVEFRSAED 111
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEVA----------------------IFIGGISRT 561
A+AA+ DG F LKI+RPK++ A IF+GG+
Sbjct: 112 ATAAMAFDGIIFINGPLKIRRPKDYGGDAIMAPNFHVPGVVSTNVPDSIHKIFVGGLPPY 171
Query: 562 LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP---CAFIEYVDQLVTPKAIAGLNGLKVGGQ 618
L+ + VME++ +FG LKA++ V E+ P AF EYVD VT AI LNG+++G +
Sbjct: 172 LNEEQVMELLKSFGELKAFNL-VRENGNGPSKGFAFFEYVDSSVTDVAIQSLNGMELGDR 230
Query: 619 VLTAVQAVLDGSIMDNSGNPP------FHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
L VQ G+ G P F IP+ +P TE ++ + N L
Sbjct: 231 YLV-VQRASVGAKPGTPGMIPNLPYDQFPEIPRPIMPAGDGSTEDARILLMLNMVTVDDL 289
Query: 673 S-ELEVEEVLEDVRLECARFGSVKSVNVV--------KYGDSN-ISTIQACEGNENTASA 722
+ E ++ EDV+ EC++ G+V+ + + K+GD+ S A +E +A
Sbjct: 290 QDDDEYGDIYEDVKEECSKHGAVEDLRIPRPIKKDKSKWGDAGQQSQTDAARADE---AA 346
Query: 723 GVGQ 726
GVG+
Sbjct: 347 GVGR 350
>gi|322693990|gb|EFY85833.1| splicing factor, putative [Metarhizium acridum CQMa 102]
Length = 584
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 155/352 (44%), Gaps = 54/352 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P V T+ QA S + + Q+P+ + A + +
Sbjct: 179 KRRMTQWDIKPPGYELV-----TAEQAKLSGMFPLPGA-----PRQQPMDPTKLQAFMKE 228
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
N + S L SN R +RL V N+P SE++L+ F N L +G+ + S PC C
Sbjct: 229 PNGGVSSAGLKASNSRQAKRLIVSNIPQGNSEESLISFFN--LQLNGLNVIESSDPCNLC 286
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSG-----------SILKIKRPKEFVEVAI- 553
++ A +EF A DA+ AL DG + L I+RPK++V AI
Sbjct: 287 QFSTDRSFAVLEFRNAGDATVALALDGINMEADDTMNGDGGEKQGLSIRRPKDYVMPAIP 346
Query: 554 ---------------------FIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP- 591
I I L+ ++E++ AFG KA+ + EE
Sbjct: 347 EEMAYDPNVVSNVVPDTVHKLSITNIPTFLTEDQIIELLAAFGKPKAFVLVKDRSTEESR 406
Query: 592 -CAFIEYVDQ-LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKHA 647
AF EY++ A++ LNG+ VGG+ L +A + + + N G G+
Sbjct: 407 GIAFAEYLEPGSANEPALSALNGMDVGGKKLKVAKASIGPTQVANFDVGITAISGLASQT 466
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+K VL+L N+ PE + E EE+ EDVR EC++FG++ + +
Sbjct: 467 STDAEK-GRVLQLLNMVTPEEL--MDTEEYEEICEDVREECSKFGNILELKI 515
>gi|302411252|ref|XP_003003459.1| splicing factor U2AF 65 kDa subunit [Verticillium albo-atrum
VaMs.102]
gi|261357364|gb|EEY19792.1| splicing factor U2AF 65 kDa subunit [Verticillium albo-atrum
VaMs.102]
Length = 568
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/428 (25%), Positives = 184/428 (42%), Gaps = 95/428 (22%)
Query: 365 YSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSN 411
++ R++R +A +P P R P +++ +WD+ P +V T+
Sbjct: 145 FARREQRPRSA--SPPPKKREPTPDLTDIVSVLDRKRRLTQWDIKPPGYENV-----TAE 197
Query: 412 QAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVEN 470
QA S + + Q+P+ + A + + +++S L SN R +RL V
Sbjct: 198 QAKLSGMFPLPGA-----PRQQPMDPTKLQAFMNQPG-TVNSASLKPSNSRQAKRLLVSK 251
Query: 471 LPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCC 530
LP SA+E+++ F N L +G+ + + PC+ C + +K VEF A +A+ AL
Sbjct: 252 LPSSATEESVASFFN--LQLNGLNVIESTDPCVSCQLSNDKSFCVVEFRNASEATVALAL 309
Query: 531 DGCSF---------SGSILKIKRPKEFVEVAIF----------------------IGGIS 559
DG S +G ++I+RPK+++ A+ I G
Sbjct: 310 DGISMEADSGTDGAAGRGMEIRRPKDYIVPAVTEELPYEPGVVSSNVVDTPNKLSITGFP 369
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
L+ + V E++ +FG LKA+ + + D F EYVD AI GL+G+ +G
Sbjct: 370 PYLTEEQVTELLTSFGELKAFVLVRDRHTDESRGFVFCEYVDSAANDVAIQGLSGMDLGN 429
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIPK--HALPLLKKPT-------EVLKLKNVFNPEG 668
L +A + G G+ A+ +L T VL+L N+ +
Sbjct: 430 SKLKIQKASI--------GVTQVAGVEMGVAAMSMLAGTTATDSEESRVLQLLNMVTADE 481
Query: 669 FSSLSELEVEEVLEDVRLECARFGSVKSVNVVK-YGDSNISTIQACEGNENTASAGVGQN 727
+ + EE++EDV+ ECA++G+V V V + G S SAGVG+
Sbjct: 482 L--MDNEDYEEIVEDVQEECAKYGTVLEVKVPRPVGGSR-------------QSAGVGKI 526
Query: 728 LTNDETNE 735
ET E
Sbjct: 527 FVKYETKE 534
>gi|328862941|gb|EGG12041.1| hypothetical protein MELLADRAFT_41759 [Melampsora larici-populina
98AG31]
Length = 397
Score = 104 bits (259), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 138/292 (47%), Gaps = 51/292 (17%)
Query: 454 VQLTQS-NRPMRRLCVENLPLSASEKALMEFLN------NFLLSSGVQHVHGSL---PCI 503
V TQS R RRL V N+ +A+E + EF N L+ +G S+ P +
Sbjct: 23 VAQTQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLVRNGEDGSMISISENPVV 82
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEV------------ 551
+ EK AFVEF AE+A+ + DG F LKI+RPK++
Sbjct: 83 AVQVNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYTGTEHTSTNHIPGVV 142
Query: 552 ---------AIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH----EEPCAFIEYV 598
IFIGG+ L+ VME++ +FG LK+++ + + AF EYV
Sbjct: 143 STNVPDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGQVSKGFAFCEYV 202
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN-----------PPFHGIPKHA 647
D +T A GLNG+++G + L +A + + N F+G + +
Sbjct: 203 DSDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKENAEGAGGAGMGVGGFNGTNRAS 262
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ T+VL++ N+ NPE + + E +E+LED++ EC+++G ++ V +
Sbjct: 263 ---EGERTKVLQMLNMVNPE--ELVDDEEYKEILEDIKEECSKYGQIEDVKI 309
>gi|398403643|ref|XP_003853288.1| hypothetical protein MYCGRDRAFT_100024 [Zymoseptoria tritici
IPO323]
gi|339473170|gb|EGP88264.1| hypothetical protein MYCGRDRAFT_100024 [Zymoseptoria tritici
IPO323]
Length = 544
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 129/284 (45%), Gaps = 52/284 (18%)
Query: 459 SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEF 518
S R +RL V NLP SA++ +LM+F N L +G+ G+ PCI I + G A +EF
Sbjct: 212 SARQSKRLLVYNLPASATDDSLMDFFN--LQLNGLNVTKGADPCISANISQGNGYALLEF 269
Query: 519 LTAEDASAALCCDGCSFSGSI-------------LKIKRPKEFV---------------- 549
T EDA+ A+ DG + L+IKRPK+++
Sbjct: 270 KTPEDATNAMAMDGIKMEADVDMGNGESNGTSKGLEIKRPKDYIVPTVSDETENTSGLFS 329
Query: 550 ------EVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQL 601
+ I I I L + V+E++ +FG LK + ++ EE AF+EY D
Sbjct: 330 SIVPDTQNKISITNIPVYLQEEQVVELLTSFGQLKNFVLVKDKSTEESRGIAFVEYKDPD 389
Query: 602 VTPK-AIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE---- 656
T K A+ LNG+ +G L A + I SG + L K T+
Sbjct: 390 STTKIALEALNGMDLGDAALKVKLASI--GIQQVSGE---MTVGAMGLIAGTKSTDADNG 444
Query: 657 -VLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
VL L N+ PE + E +E+LEDV+ ECA++G + V +
Sbjct: 445 RVLCLMNMITPEEL--MDADEADEILEDVKEECAKYGELMEVKM 486
>gi|452823555|gb|EME30564.1| U2 snRNP auxiliary factor large subunit, putative isoform 1
[Galdieria sulphuraria]
Length = 522
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 161/352 (45%), Gaps = 78/352 (22%)
Query: 388 KKSAKWDV--APVETYSVPSNVHTSNQAA-----SSNAHEMVSSDPVTSTTQKPLAGISV 440
++S++WDV P E NV+ S + S+ + M+ PV TTQ
Sbjct: 150 RRSSQWDVRKLPWEQSERDPNVNRSAYSVGGLDFSALSQYMI---PVAPTTQP------- 199
Query: 441 SASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFL-LSSGVQHVHGS 499
Q+ + RRL V NLP +E + +F N+ L L+ GV V G
Sbjct: 200 --------------NTQQATKHARRLYVGNLPSDVTESEVADFFNSALYLAKGVD-VPGD 244
Query: 500 LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------- 548
P + +K AF+E +A +A+AA+ DG F G L+++RP ++
Sbjct: 245 -PVQSVYLNLDKRFAFIELNSAAEAAAAIQMDGVLFRGMSLRMRRPNDYNPNIHAPVYPP 303
Query: 549 ------------VEVA-----IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHE 589
+V +FIGG+ L+ + EI+ ++GPL A++ + N
Sbjct: 304 IGFDPSALGVVSTQVPDGPDKVFIGGLPYHLTEDQIKEILSSYGPLNAFNLVKDPNTGLS 363
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA--VLDGSIMDNSGNPPFHGIPKHA 647
+ AF +Y D + AI GLNG+ +G + LT +A V GS+ + +
Sbjct: 364 KGYAFFQYKDPSIVEAAIKGLNGMTMGDKTLTVRRASQVSSGSVE----------LGQSF 413
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
P ++ PT +L+L+N+ PE + + E E+++EDVR E +++G V V +
Sbjct: 414 SPTVRYPTRILELRNMVEPEEL--VDDEEYEDIIEDVREESSKYGEVTEVKI 463
>gi|297706019|ref|XP_002829852.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Pongo abelii]
Length = 352
Score = 103 bits (257), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 79/256 (30%), Positives = 131/256 (51%), Gaps = 35/256 (13%)
Query: 476 SEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF 535
+++A+M+F N + G+ G+ P + I ++K AF+EF + ++ + A+ DG F
Sbjct: 73 TKEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIF 131
Query: 536 SGSILKIKRPKEFVEVA------------------------IFIGGISRTLSSKMVMEIV 571
G LKI+RP ++ + +FIGG+ L+ V E++
Sbjct: 132 QGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELL 191
Query: 572 CAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG 629
+FGPLKA++ + + AF EYVD VT +AIAGLNG+++G + L +A +
Sbjct: 192 TSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGA 251
Query: 630 --SIMDNSGNPPFH----GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+ + P G+ + + PTEVL L N+ PE L + E EE++ED
Sbjct: 252 KNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVED 309
Query: 684 VRLECARFGSVKSVNV 699
VR EC+++G VKS+ +
Sbjct: 310 VRDECSKYGLVKSIEI 325
>gi|95103124|gb|ABF51503.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 2
[Bombyx mori]
Length = 306
Score = 103 bits (256), Expect = 5e-19, Method: Composition-based stats.
Identities = 82/293 (27%), Positives = 133/293 (45%), Gaps = 52/293 (17%)
Query: 381 PINRSPEKK-SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
P RS +K S WDV P + + + QAA +V+ P + P+ G +
Sbjct: 37 PNKRSRRRKPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGST 93
Query: 440 VSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS 499
++ R RRL V N+P +E+ MEF N + SG+ G+
Sbjct: 94 IT-------------------RQARRLYVGNIPFGVTEEETMEFFNQQMHLSGLAQAAGN 134
Query: 500 LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEV-------- 551
P + C I +K AF+EF + ++ + A+ DG +F G LKI+RP ++ +
Sbjct: 135 -PVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPGTENPAI 193
Query: 552 ----------------AIFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCA 593
IFIGG+ L+ V E++ +FG L+A++ + + + A
Sbjct: 194 NVPAGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTGLSKGYA 253
Query: 594 FIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL--DGSIMDNSGNPPFHGIP 644
F EYVD +T +AIAGLNG+++G + L +A + S + +G P P
Sbjct: 254 FAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTLALTGAAPVQRYP 306
>gi|18446992|gb|AAL68087.1| AT16577p [Drosophila melanogaster]
Length = 449
Score = 103 bits (256), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 138/305 (45%), Gaps = 69/305 (22%)
Query: 452 DSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ----HVHGSLPCIGCVI 507
+S ++ R RRL V N+P +++ +M+F N+ +++ G + H + C
Sbjct: 99 ESAAISMITRQARRLYVGNIPFGVTDEEMMQFFNHQIMALGFEAKSSHYMDGNAVLTCQT 158
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------------- 548
EK AF+EF + ++AS AL DG F G LKI+RP ++
Sbjct: 159 NLEKNFAFLEFRSIDEASQALNFDGMVFRGQTLKIRRPHDYQPVPSISVSAMESYRSFRV 218
Query: 549 --VEVA-------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVN 585
+ VA I++GG+ L+ V E++ +FG LK + + N
Sbjct: 219 PAINVAQQPAVTLPVTTIVPDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDTN 278
Query: 586 EDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK 645
+ + AF EY D VT AIAGL+G+ +G + L +++ G + P
Sbjct: 279 TNLNKGFAFFEYCDPSVTDHAIAGLHGMLLGDRRLVVQRSIPGGK----------NAFPG 328
Query: 646 HALPLLK-----------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV 694
H +L+ PTE L L N+ PE L + E E++ D++ ECA+FG V
Sbjct: 329 HLPTVLQVPGISTLQDPGSPTETLCLLNMVRPE--ELLDDEEFEDIRTDIKQECAKFGEV 386
Query: 695 KSVNV 699
+S+ +
Sbjct: 387 RSIKI 391
>gi|361132025|gb|EHL03640.1| putative Splicing factor U2AF 50 kDa subunit [Glarea lozoyensis
74030]
Length = 568
Score = 103 bits (256), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 165/361 (45%), Gaps = 71/361 (19%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
++ +WD+ P + Y NV T+ QA S + + Q+P+ + A +A+
Sbjct: 172 RRLTQWDIKP-QGYE---NV-TAEQAKLSGMFPLPGA-----PRQQPMDPSKLQAIMAQP 221
Query: 448 NVSMDSVQLTQSN-RPMRRLCVENLP-LSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S+ + L SN R +RL V NLP S SE+A++ F N L +G+ V GS PCI
Sbjct: 222 SGSVTNAALKPSNSRQAKRLLVHNLPSTSISEEAIINFFN--LQMNGLNIVEGSDPCISA 279
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI-------------LKIKRPKEFVEVA 552
I ++K A +EF T DA+ AL DG + S L I+RPK+++ A
Sbjct: 280 QISKDKSFALLEFKTPSDATLALAFDGITMDDSEYVNREANGGDTKGLSIRRPKDYIVPA 339
Query: 553 ----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEE 590
I + + L+ + V+E++ +FG LKA+ + E
Sbjct: 340 VSDETPQEPGVVSSVVVDTQNKICMSNVPLYLTDEQVIELLTSFGELKAFVLVKDNSTGE 399
Query: 591 P--CAFIEYVD-QLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK-- 645
AF EY D T A+ GLNG+++G + L +A + GN G+
Sbjct: 400 SRGIAFCEYADPAAATDIAVEGLNGMELGDKHLRVQRASI--------GNTQVSGLEMGV 451
Query: 646 HALPLLKKPTE-------VLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVN 698
+A+ +L T VL+L N+ PE + + EE+ EDV+ EC ++G V +
Sbjct: 452 NAMSMLAGTTSAGLEDGRVLQLLNMVTPEEL--VDNDDYEEICEDVKEECEKYGKVLDMK 509
Query: 699 V 699
V
Sbjct: 510 V 510
>gi|309271453|ref|XP_003085312.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Mus musculus]
Length = 730
Score = 102 bits (255), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 168/352 (47%), Gaps = 53/352 (15%)
Query: 384 RSP--EKKSAK--WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
RSP EKK + WDV P V + + QAA +++++ + + T LA
Sbjct: 338 RSPCHEKKKVRKYWDVPPPGFEHVTPMQYKAMQAAG----QILATALLPTMTPGGLA--- 390
Query: 440 VSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS 499
+ + V + Q+T R RRL V +P +E+++++FLN + G+ G+
Sbjct: 391 ----VTPMPVPVVGSQMT---RQARRLYVGTIPFGITEESMLDFLNTQMHLRGLTQAPGN 443
Query: 500 LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------- 552
P + I +K AF+EF + ++ + AL DG F G LKI+RP ++ +
Sbjct: 444 -PILAVQINLDKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMSGSPS 502
Query: 553 -----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCA 593
+FIGG+ L+ V E++ + G L+A++ + + A
Sbjct: 503 VYVPGVVSTIVPDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYA 562
Query: 594 FIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG-----SIMDNSG-NPPFHGIPKHA 647
F EY+D VT +AIA LNG+ +G + L +A + SI++ + G+
Sbjct: 563 FCEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVGAKNVALSIINQTPVTLQVPGLTSSQ 622
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PT VL L N+ P+ L + E EE+++DVR EC ++G VKS+ +
Sbjct: 623 VQMGGHPTTVLCLMNMVLPK--ELLDDEEYEEIVDDVRDECNKYGLVKSIEI 672
>gi|24659166|ref|NP_611769.2| large subunit 2 [Drosophila melanogaster]
gi|7291545|gb|AAF46969.1| large subunit 2 [Drosophila melanogaster]
gi|201066057|gb|ACH92438.1| FI08027p [Drosophila melanogaster]
Length = 449
Score = 102 bits (255), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 138/305 (45%), Gaps = 69/305 (22%)
Query: 452 DSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ----HVHGSLPCIGCVI 507
+S ++ R RRL V N+P +++ +M+F N+ +++ G + H + C
Sbjct: 99 ESAAISMITRQARRLYVGNIPFGVTDEEMMQFFNHQIMALGFEAKSSHYMDGNAVLTCQT 158
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------------- 548
EK AF+EF + ++AS AL DG F G LKI+RP ++
Sbjct: 159 NLEKNFAFLEFRSIDEASQALNFDGMVFRGQTLKIRRPHDYQPVPSISVSAMESYRSFRV 218
Query: 549 --VEVA-------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVN 585
+ VA I++GG+ L+ V E++ +FG LK + + N
Sbjct: 219 PAINVAQQPAVTLPVTTIVPDSPNKIYVGGLPTCLNQDQVKELLQSFGELKGLNLVMDTN 278
Query: 586 EDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK 645
+ + AF EY D VT AIAGL+G+ +G + L +++ G + P
Sbjct: 279 TNLNKGFAFFEYCDPSVTDHAIAGLHGMLLGDRRLVVQRSIPGGK----------NAFPG 328
Query: 646 HALPLLK-----------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV 694
H +L+ PTE L L N+ PE L + E E++ D++ ECA+FG V
Sbjct: 329 HLPTVLQVPGISTLQDPGSPTETLCLLNMVRPE--ELLDDEEFEDIRTDIKQECAKFGEV 386
Query: 695 KSVNV 699
+S+ +
Sbjct: 387 RSIKI 391
>gi|449496757|ref|XP_004160219.1| PREDICTED: splicing factor U2af large subunit B-like [Cucumis
sativus]
Length = 587
Score = 102 bits (255), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 156/358 (43%), Gaps = 71/358 (19%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
+D+AP T + + AA+ + P T PLA L V
Sbjct: 191 FDMAPPTT-----AILSGATAAAGQIPGTTPAIPGMFPTMFPLA---TGQPFGALPVMPV 242
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+ R RR+ V LP +A+E+++ F + + + G + I EK
Sbjct: 243 QAMTQQATRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVVNVYINHEKK 302
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA---- 552
AFVE + E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 303 FAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGL 362
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFI 595
IF+GG+ + V E++ +FGPL+ F++ +D E + AF
Sbjct: 363 TPGSAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFC 420
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP--------FHGIPKHA 647
Y D VT A A LNG+K+G + LT +A + N P H + A
Sbjct: 421 VYQDLSVTDIACAALNGIKMGDKTLTVRRA-------NQGANQPKPEQESVLLHAQQQIA 473
Query: 648 LP-LLKKP----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
L L+ +P T+VL L V PE +++ + E+++ED+R E +FG++ VNVV
Sbjct: 474 LQKLMLQPGAVSTKVLCLTQVVTPE--ELINDEDYEDIMEDMRGEGGKFGTL--VNVV 527
>gi|309266895|ref|XP_003086891.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial [Mus
musculus]
Length = 493
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 168/352 (47%), Gaps = 53/352 (15%)
Query: 384 RSP--EKKSAK--WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
RSP EKK + WDV P V + + QAA +++++ + + T LA
Sbjct: 101 RSPCHEKKKVRKYWDVPPPGFEHVTPMQYKAMQAAG----QILATALLPTMTPGGLA--- 153
Query: 440 VSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS 499
+ + V + Q+T R RRL V +P +E+++++FLN + G+ G+
Sbjct: 154 ----VTPMPVPVVGSQMT---RQARRLYVGTIPFGITEESMLDFLNTQMHLRGLTQAPGN 206
Query: 500 LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------- 552
P + I +K AF+EF + ++ + AL DG F G LKI+RP ++ +
Sbjct: 207 -PILAVQINLDKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMSGSPS 265
Query: 553 -----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCA 593
+FIGG+ L+ V E++ + G L+A++ + + A
Sbjct: 266 VYVPGVVSTIVPDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYA 325
Query: 594 FIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG-----SIMDNSG-NPPFHGIPKHA 647
F EY+D VT +AIA LNG+ +G + L +A + SI++ + G+
Sbjct: 326 FCEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVGAKNVALSIINQTPVTLQVPGLTSSQ 385
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PT VL L N+ P+ L + E EE+++DVR EC ++G VKS+ +
Sbjct: 386 VQMGGHPTTVLCLMNMVLPK--ELLDDEEYEEIVDDVRDECNKYGLVKSIEI 435
>gi|330803435|ref|XP_003289712.1| hypothetical protein DICPUDRAFT_56283 [Dictyostelium purpureum]
gi|325080222|gb|EGC33787.1| hypothetical protein DICPUDRAFT_56283 [Dictyostelium purpureum]
Length = 501
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 128/262 (48%), Gaps = 31/262 (11%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RR+ V N+P ++ L+EF N +L++ + +V P + C I K AF+EF T E+
Sbjct: 161 RRIYVGNIPPGITDSELIEFFNAAVLAANL-NVKPGPPVVFCQINAPKCFAFIEFRTPEE 219
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF----------------------VEVAIFIGGISRT 561
A+ A+ DG + LKI+RPK++ E IF+GG+
Sbjct: 220 ATNAMRFDGITLKNYTLKIRRPKDYQQSNDPTNTSALPTIVPTNVPDSEHKIFVGGLPSN 279
Query: 562 LSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQV 619
L+ + V ++ A+G LKA++ + N + AF EY+D VT +A A LNG+ + +
Sbjct: 280 LNEEQVKTLLSAYGKLKAFNLVKDTNTGISKGYAFCEYLDPDVTDQACASLNGISLADKN 339
Query: 620 LTAVQAVLDG----SIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSEL 675
L +A + +I + P + ++ KP+ V++L N+ + E +
Sbjct: 340 LIVQRASIVAQTLSTIRSTVPSSPTTSTTQTSIDNNTKPSRVIQLLNLVDKEDL--YDDK 397
Query: 676 EVEEVLEDVRLECARFGSVKSV 697
E + +L DV+ EC FG +S+
Sbjct: 398 EYDNILIDVKEECENFGPTQSL 419
>gi|342873171|gb|EGU75391.1| hypothetical protein FOXB_14096 [Fusarium oxysporum Fo5176]
Length = 661
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 166/376 (44%), Gaps = 63/376 (16%)
Query: 367 PRKRRTEAAAKTPSPINRSP----EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMV 422
P+KR TP N P +++ +WD+ P +V T+ QA S +
Sbjct: 159 PKKRE-----PTPDLTNIVPVLDRKRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLP 208
Query: 423 SSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALM 481
+ Q+P+ + A + + + S L +N R +RL V +P SE+ALM
Sbjct: 209 GA-----PRQQPMDPSKLQAFMNQPGGQVTSAGLKANNSRQSKRLLVSRIPSGTSEEALM 263
Query: 482 EFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI-- 539
F N L +G+ + + PC+ C ++ A +EF A +A+ AL DG S S
Sbjct: 264 SFFN--LQLNGLNVIDTTDPCVLCQFSNDRSFAVIEFKDAPEATVALAMDGISMEASDAS 321
Query: 540 ---------LKIKRPKEFVEVAIF----------------------IGGISRTLSSKMVM 568
L+I+RP+++V A+ I I L+ + ++
Sbjct: 322 NGTDGGHRGLEIRRPRDYVVPAVTEEVSYDSEVVSNIVPDTVNKLSITNIPTFLTEEQII 381
Query: 569 EIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVT-PKAIAGLNGLKVGGQVLTAVQA 625
E++ +FG KA+ + EE AF EY D + P A+ LNG+++GG+ L +A
Sbjct: 382 ELLASFGKPKAFVLVKDRGTEESRGIAFAEYQDPAASNPTALDTLNGMEIGGKKLKVSKA 441
Query: 626 VLDGSIMDN--SGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+ + + N G G+ ++ + VL+L N+ E L + EE+ ED
Sbjct: 442 SIGPTQVANFDVGITAISGLASQTANEVES-SRVLQLLNMVTAEEL--LDNDDYEEICED 498
Query: 684 VRLECARFGSVKSVNV 699
V+ EC++FG + V V
Sbjct: 499 VKEECSKFGKIIDVKV 514
>gi|402906865|ref|XP_003916203.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Papio anubis]
Length = 446
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 155/337 (45%), Gaps = 44/337 (13%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAI---- 553
G+ P + I ++K AF+E + LC ++ + P+ V +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEVSWGNRSGPVLC--------ALAMLTFPEWVVSTVVPDSA 233
Query: 554 ---FIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIA 608
FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT +AIA
Sbjct: 234 HKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIA 293
Query: 609 GLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALPLLKKPTEVLKLKN 662
GLNG+++G + L +A + + + P G+ + + PTEVL L N
Sbjct: 294 GLNGMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMN 353
Query: 663 VFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 354 MVLPE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 388
>gi|76779874|gb|AAI06135.1| U2af2 protein [Mus musculus]
Length = 307
Score = 102 bits (253), Expect = 1e-18, Method: Composition-based stats.
Identities = 78/252 (30%), Positives = 127/252 (50%), Gaps = 35/252 (13%)
Query: 480 LMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI 539
+M+F N + G+ G+ P + I ++K AF+EF + ++ + A+ DG F G
Sbjct: 1 MMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQS 59
Query: 540 LKIKRPKEFVEV------------------------AIFIGGISRTLSSKMVMEIVCAFG 575
LKI+RP ++ + +FIGG+ L+ V E++ +FG
Sbjct: 60 LKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFG 119
Query: 576 PLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL--DGSI 631
PLKA++ + + AF EYVD VT +AIAGLNG+++G + L +A + +
Sbjct: 120 PLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNAT 179
Query: 632 MDNSGNPPFH----GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLE 687
+ P G+ + + PTEVL L N+ PE L + E EE++EDVR E
Sbjct: 180 LSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDE 237
Query: 688 CARFGSVKSVNV 699
C+++G VKS+ +
Sbjct: 238 CSKYGLVKSIEI 249
>gi|405121398|gb|AFR96167.1| rRNA primary transcript binding protein [Cryptococcus neoformans
var. grubii H99]
Length = 655
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/335 (25%), Positives = 148/335 (44%), Gaps = 48/335 (14%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WD APV+ + A + + + P P A + + A+
Sbjct: 224 WDTAPVQFQGI--------SALEAKTTGLFTYGP---GRVPPPAHLGIPATFVA-GAFPP 271
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
S + Q+N RL + + E+ + +F NN + G+ P C I ++
Sbjct: 272 SNPVRQNN----RLYIGGIKEDMQEQQIQDFFNNLMKEKGMADGKED-PVKQCQINNDRN 326
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEV--------------------A 552
AF+E T E A+AAL DG G+ L+++RPK++ +
Sbjct: 327 FAFIELHTPEQATAALELDGVVLDGASLRVRRPKDYAGIDPLLQTFNGVVAPSVADSPNK 386
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH--EEPCAFIEYVDQLVTPKAIAGL 610
+FIGGI L+ + VME++ +FG LK+++ V E + AF EY+D VT AI GL
Sbjct: 387 LFIGGIPTYLNDEQVMELLKSFGELKSFNL-VKESAGVSKGFAFAEYLDPEVTDMAIQGL 445
Query: 611 NGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK------PTEVLKLKNVF 664
+ +G + L +A + + N+ P A+P L + + V+ L N+
Sbjct: 446 HNFALGDRNLVVQRAAVGRNTGVNAPIPGSAAYLSQAIPHLMQNNADAPSSRVMLLLNMV 505
Query: 665 NPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PE ++ + +++ED+ EC+++G ++ V +
Sbjct: 506 TPEEL--YNDDDYNDIIEDINDECSKYGEIEGVRI 538
>gi|449441167|ref|XP_004138355.1| PREDICTED: splicing factor U2af large subunit B-like [Cucumis
sativus]
Length = 587
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 156/358 (43%), Gaps = 71/358 (19%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
+D+AP T + + AA+ + P T PLA L V
Sbjct: 191 FDMAPPTT-----AILSGATAAAGQIPGTTPAIPGMFPTMFPLA---TGQPFGALPVMPV 242
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+ R RR+ V LP +A+E+++ F + + + G + I EK
Sbjct: 243 QAMTQQATRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVVNVYINHEKK 302
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA---- 552
AFVE + E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 303 FAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGL 362
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFI 595
IF+GG+ + V E++ +FGPL+ F++ +D E + AF
Sbjct: 363 TPGSAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFC 420
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP--------FHGIPKHA 647
Y D VT A A LNG+K+G + LT +A + N P H + A
Sbjct: 421 VYQDLSVTDIACAALNGIKMGDKTLTVRRA-------NQGANQPKPEQESVLLHAQQQIA 473
Query: 648 LP-LLKKP----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
L L+ +P T+VL L V PE +++ + E+++ED+R E +FG++ VNVV
Sbjct: 474 LQKLMLQPGAVSTKVLCLTQVVTPE--ELINDEDYEDIMEDMRGEGGKFGTL--VNVV 527
>gi|328766440|gb|EGF76494.1| hypothetical protein BATDEDRAFT_21058 [Batrachochytrium
dendrobatidis JAM81]
Length = 551
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 136/292 (46%), Gaps = 32/292 (10%)
Query: 436 AGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQH 495
AG ++ L+ + + M+ R +RL N+P+ E+ ++ F ++ G+
Sbjct: 207 AGFGINRGLS-MGMGMNGRNQQPIARQFKRLYFGNIPVDCIEERILSFASSSYEKLGLPK 265
Query: 496 VHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA--- 552
G+ + I RE+ AFVEF + E+A+ A+ DG F G+ILK++RPK++ A
Sbjct: 266 DPGN-AAVNAYINRERNYAFVEFRSPEEATRAMALDGSLFDGNILKVRRPKDYNPEAAPD 324
Query: 553 -------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEP 591
IF+G I LS V E++ FG LK + + +
Sbjct: 325 GATQPSIAPATSAQESLDKIFVGAIPTYLSDDQVQELLKTFGELKTFSLIRDSATGLSKG 384
Query: 592 CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLL 651
AF EYVD +T A GLNG+++G + L +A + + S +P L +
Sbjct: 385 FAFCEYVDGQITDAACQGLNGMELGEKKLIVQRASVGSNKNTISAVGQSQLLPMEILATI 444
Query: 652 KKP----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
K T VL L N+ E +S+ + +++L D++ EC +FG++ + +
Sbjct: 445 AKDPCKVTRVLLLLNMVVSEDL--VSDEDYQDILLDIQEECEKFGTILDIAI 494
>gi|116192501|ref|XP_001222063.1| hypothetical protein CHGG_05968 [Chaetomium globosum CBS 148.51]
gi|88181881|gb|EAQ89349.1| hypothetical protein CHGG_05968 [Chaetomium globosum CBS 148.51]
Length = 566
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 172/395 (43%), Gaps = 80/395 (20%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
++ +WD+ P +V T+ QA S + + Q+ + + A + +
Sbjct: 171 RRLTQWDIKPPGYDNV-----TAEQAKLSGMFPLPGA-----PRQQAMDPTKLQAFMNQP 220
Query: 448 NVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
+++S L SN R +RL + NLP S ++++L F N L +G+ + + PC+
Sbjct: 221 GGAVNSAALKPSNSRQSKRLIISNLPASVTDESLTNFFN--LQLNGLNVIETADPCLQAH 278
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSF----------SGSI---LKIKRPKEFVEVAI 553
I E+ A VEF DA+ AL DG S +G+ L I+RPK+++ A+
Sbjct: 279 IAAERAFAMVEFRNNTDATVALALDGISMEADDAHAANGNGTAPQGLHIRRPKDYIVPAV 338
Query: 554 F------------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE 589
+ + L+ VME++ +FG LK++ + E
Sbjct: 339 VEDPNYDPDSDRPSSVVVDSPNKISVTNLPLYLTEDQVMELLVSFGKLKSFVLVKDNGTE 398
Query: 590 EP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHA 647
E AF+EY D VT AI GL+ + +G + L +A + I SG G+ +A
Sbjct: 399 ESRGIAFLEYADPGVTTVAIQGLHNMMLGERALKVQKASI--GITQVSGE---MGV--NA 451
Query: 648 LPLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
+ +L T VL+L N+ P+ + + EE+ +DV+ EC +FG + S+ +
Sbjct: 452 MSMLAGTTSADAGAGRVLQLLNMVTPDEL--MDNDDYEEIRDDVQEECEKFGKILSIKIP 509
Query: 701 KYGDSNISTIQACEGNENTASAGVGQNLTNDETNE 735
+ + SAGVG+ ET E
Sbjct: 510 R------------PAGGSRQSAGVGKIFIKFETPE 532
>gi|148697816|gb|EDL29763.1| mCG68163 [Mus musculus]
Length = 472
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 168/352 (47%), Gaps = 53/352 (15%)
Query: 384 RSP--EKKSAK--WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
RSP EKK + WDV P V + + QAA +++++ + + T LA
Sbjct: 80 RSPCHEKKKVRKYWDVPPPGFEHVTPMQYKAMQAAG----QILATALLPTMTPGGLA--- 132
Query: 440 VSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGS 499
+ + V + Q+T R RRL V +P +E+++++FLN + G+ G+
Sbjct: 133 ----VTPMPVPVVGSQMT---RQARRLYVGTIPFGITEESMLDFLNTQMHLRGLTQAPGN 185
Query: 500 LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------- 552
P + I +K AF+EF + ++ + AL DG F G LKI+RP ++ +
Sbjct: 186 -PILAVQINLDKNFAFLEFRSVDETTQALAFDGIIFQGQSLKIRRPHDYQPLPGMSGSPS 244
Query: 553 -----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCA 593
+FIGG+ L+ V E++ + G L+A++ + + A
Sbjct: 245 VYVPGVVSTIVPDSAHKLFIGGLPNYLNDDQVKELLTSVGILRAFNLVKDSITGLSKGYA 304
Query: 594 FIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG-----SIMDNSG-NPPFHGIPKHA 647
F EY+D VT +AIA LNG+ +G + L +A + SI++ + G+
Sbjct: 305 FCEYMDINVTDQAIAWLNGMHLGDKKLLVQRASVGAKNVALSIINQTPVTLQVPGLTSSQ 364
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PT VL L N+ P+ L + E EE+++DVR EC ++G VKS+ +
Sbjct: 365 VQMGGHPTTVLCLMNMVLPK--ELLDDEEYEEIVDDVRDECNKYGLVKSIEI 414
>gi|156070781|gb|ABU45194.1| unknown [Petunia integrifolia subsp. inflata]
Length = 557
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 154/359 (42%), Gaps = 74/359 (20%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
+D+AP + +P T+ Q ++ S P + PL S L V
Sbjct: 164 FDMAPPTSALLPGATDTAGQVPGAS-----PSIPGMFSNMFPL----TSGQFGALPVMPI 214
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+ R RR+ V LP SA+E+++ F ++ + + G + I EK
Sbjct: 215 QAMTQQATRHARRVYVGGLPPSANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKK 274
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA---- 552
AFVE + E+AS A+ DG F G +K++RP ++ + +A
Sbjct: 275 FAFVEMRSVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGL 334
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFI 595
IF+GG+ + + E++ +FG L+ F++ +D E + AF
Sbjct: 335 TPGSSGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRG--FDLVKDRETGNSKGYAFC 392
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT 655
Y D VT A A LNG+K+G + LT +A N G P P+ LL
Sbjct: 393 VYQDVSVTDIACAALNGIKMGDKTLTVRRA--------NQGTP--QPNPEQESVLLHAQQ 442
Query: 656 EVLKLKNVFNPEGFS----------SLSEL----EVEEVLEDVRLECARFGSVKSVNVV 700
++ K +F P + S+ EL + +++LED+R EC +FG++ +NVV
Sbjct: 443 QIALQKFMFQPGALATKVLCLTQAVSVDELNDDDDYQDILEDMRTECGKFGAL--LNVV 499
>gi|391864554|gb|EIT73849.1| splicing factor U2AF, large subunit [Aspergillus oryzae 3.042]
Length = 538
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 172/385 (44%), Gaps = 80/385 (20%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS- 424
PRKR T + S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 124 PRKREATPDLTEVESVLHR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGAP 176
Query: 425 -----DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASE 477
DP + + +P AG + SASL N +R +RL V NLP SA+
Sbjct: 177 RQQPMDPSRLQAFMSQPGAGTAESASLKPSN-----------SRQAKRLFVSNLPASATG 225
Query: 478 KALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSG 537
+ L+ F N L +G+ +H PCI + ++ A +EF T DA+ AL DG +
Sbjct: 226 ENLLSFFN--LQLNGLNVIHSVDPCISAQVSDDRSFALLEFKTPNDATVALAFDGITMDE 283
Query: 538 SI-------------LKIKRPKEFV----------------EVA-----IFIGGISRTLS 563
S L+++RPK+++ EV I + I +
Sbjct: 284 SEAAGNGAANGAPQGLEVRRPKDYIVPSGNEQEYQEGVLLNEVPDSPNKICVSNIPHYIP 343
Query: 564 SKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLT 621
+ V ++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L
Sbjct: 344 EEPVTMLLKSFGELKSFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLK 403
Query: 622 AVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE-------VLKLKNVFNPEGFSSLSE 674
V+A + I +G + +A+ + K T VL+L N+ PE +
Sbjct: 404 VVRASI--GITQAAG----LDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEEL--MDN 455
Query: 675 LEVEEVLEDVRLECARFGSVKSVNV 699
+ +E+ +DVR ECA++G V + +
Sbjct: 456 DDYDEICDDVREECAKYGQVVELKI 480
>gi|317139209|ref|XP_001817348.2| splicing factor u2af large subunit [Aspergillus oryzae RIB40]
Length = 538
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 172/385 (44%), Gaps = 80/385 (20%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS- 424
PRKR T + S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 124 PRKREATPDLTEVESVLHR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGAP 176
Query: 425 -----DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASE 477
DP + + +P AG + SASL N +R +RL V NLP SA+
Sbjct: 177 RQQPMDPSRLQAFMSQPGAGTAESASLKPSN-----------SRQAKRLFVSNLPASATG 225
Query: 478 KALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSG 537
+ L+ F N L +G+ +H PCI + ++ A +EF T DA+ AL DG +
Sbjct: 226 ENLLSFFN--LQLNGLNVIHSVDPCISAQVSDDRSFALLEFKTPNDATVALAFDGITMDE 283
Query: 538 SI-------------LKIKRPKEFV----------------EVA-----IFIGGISRTLS 563
S L+++RPK+++ EV I + I +
Sbjct: 284 SEAAGNGAANGAPQGLEVRRPKDYIVPSGNEQEYQEGVLLNEVPDSPNKICVSNIPHYIP 343
Query: 564 SKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLT 621
+ V ++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L
Sbjct: 344 EEPVTMLLKSFGELKSFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLK 403
Query: 622 AVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE-------VLKLKNVFNPEGFSSLSE 674
V+A + I +G + +A+ + K T VL+L N+ PE +
Sbjct: 404 VVRASI--GITQAAG----LDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEEL--MDN 455
Query: 675 LEVEEVLEDVRLECARFGSVKSVNV 699
+ +E+ +DVR ECA++G V + +
Sbjct: 456 DDYDEICDDVREECAKYGQVVELKI 480
>gi|13938661|gb|AAH07487.1| U2 small nuclear ribonucleoprotein auxiliary factor (U2AF) 2 [Mus
musculus]
Length = 306
Score = 100 bits (250), Expect = 3e-18, Method: Composition-based stats.
Identities = 78/251 (31%), Positives = 126/251 (50%), Gaps = 35/251 (13%)
Query: 481 MEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSIL 540
M+F N + G+ G+ P + I ++K AF+EF + ++ + A+ DG F G L
Sbjct: 1 MDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSL 59
Query: 541 KIKRPKEFVEV------------------------AIFIGGISRTLSSKMVMEIVCAFGP 576
KI+RP ++ + +FIGG+ L+ V E++ +FGP
Sbjct: 60 KIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGP 119
Query: 577 LKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL--DGSIM 632
LKA++ + + AF EYVD VT +AIAGLNG+++G + L +A + + +
Sbjct: 120 LKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKNATL 179
Query: 633 DNSGNPPFH----GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLEC 688
P G+ + + PTEVL L N+ PE L + E EE++EDVR EC
Sbjct: 180 STINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDEC 237
Query: 689 ARFGSVKSVNV 699
+++G VKS+ +
Sbjct: 238 SKYGLVKSIEI 248
>gi|425773483|gb|EKV11835.1| Splicing factor u2af large subunit [Penicillium digitatum Pd1]
Length = 585
Score = 100 bits (249), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 163/352 (46%), Gaps = 55/352 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+P+ + L
Sbjct: 206 KRRLTQWDMKPPGYENV-----TAEQAKISGMFPLPGA-----PRQQPMDPSRMKDFLNP 255
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
D+ L SN R +RL V N+P S A++ F N L +G+ VH PCI
Sbjct: 256 PTGDSDNAALKPSNSRQSKRLFVYNIPSGVSGDAVIAFFN--LQLNGLNVVHSVDPCISA 313
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGS---ILKIKRPKEFV------------- 549
+ +K A +EF DA+ AL DG + + S L+++RPK+++
Sbjct: 314 QVSEDKTFALLEFKDPNDATVALAFDGITMAESGDKGLEVRRPKDYIVPDGSASQPVQAG 373
Query: 550 ----EVA-----IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYV 598
EV I I I ++ + ++ ++ +FG LK++ + EE AF EYV
Sbjct: 374 VVLNEVPDSPNKICISNIPTYINEEAIIMLLKSFGDLKSFVLVKDAATEESRGIAFYEYV 433
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK----- 653
D T A+ GLNG+++ + L V+A SI + G+ +A+ + K
Sbjct: 434 DPNNTALAVEGLNGMELVDRHLKFVRA----SIGTTQASGLDMGV--NAMQMFAKTTSQD 487
Query: 654 --PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYG 703
T+VL+L N+ + L++ + EE++EDV EC++FG++ + + + G
Sbjct: 488 LETTQVLQLLNMVTLDEL--LNDEDYEEIMEDVSDECSKFGTILGIKIPRRG 537
>gi|425775779|gb|EKV14031.1| Splicing factor u2af large subunit [Penicillium digitatum PHI26]
Length = 585
Score = 100 bits (249), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 163/352 (46%), Gaps = 55/352 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+P+ + L
Sbjct: 206 KRRLTQWDMKPPGYENV-----TAEQAKISGMFPLPGA-----PRQQPMDPSRMKDFLNP 255
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
D+ L SN R +RL V N+P S A++ F N L +G+ VH PCI
Sbjct: 256 PTGDSDNAALKPSNSRQSKRLFVYNIPSGVSGDAVIAFFN--LQLNGLNVVHSVDPCISA 313
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGS---ILKIKRPKEFV------------- 549
+ +K A +EF DA+ AL DG + + S L+++RPK+++
Sbjct: 314 QVSEDKTFALLEFKDPNDATVALAFDGITMAESGDKGLEVRRPKDYIVPDGSASQPVQAG 373
Query: 550 ----EVA-----IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYV 598
EV I I I ++ + ++ ++ +FG LK++ + EE AF EYV
Sbjct: 374 VVLNEVPDSPNKICISNIPTYINEEAIIMLLKSFGDLKSFVLVKDAATEESRGIAFYEYV 433
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK----- 653
D T A+ GLNG+++ + L V+A SI + G+ +A+ + K
Sbjct: 434 DPNNTALAVEGLNGMELVDRHLKFVRA----SIGTTQASGLDMGV--NAMQMFAKTTSQD 487
Query: 654 --PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYG 703
T+VL+L N+ + L++ + EE++EDV EC++FG++ + + + G
Sbjct: 488 LETTQVLQLLNMVTLDEL--LNDEDYEEIMEDVSDECSKFGTILGIKIPRRG 537
>gi|46125343|ref|XP_387225.1| hypothetical protein FG07049.1 [Gibberella zeae PH-1]
Length = 564
Score = 100 bits (249), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 168/386 (43%), Gaps = 71/386 (18%)
Query: 366 SPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQ 412
+P++RR+ +P P R P +++ +WD+ P +V T+ Q
Sbjct: 130 APQERRS----ASPPPKKREPTPDLTNIVSVLERQRRLTQWDIKPPGYDNV-----TAEQ 180
Query: 413 AASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENL 471
A S + + Q+P+ + A + + + S L SN R +RL V +
Sbjct: 181 AKLSGMFPLPGA-----PRQQPMDPSKLQAFMNQPGGQVTSASLKASNSRQSKRLLVSRI 235
Query: 472 PLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCD 531
P SE AL+ F N L +G+ + + PC+ C ++ A +EF A + + AL D
Sbjct: 236 PPGTSEDALIAFFN--LQLNGLNVIDTTDPCVLCQFSNDRSFAVIEFKDAPETTVALALD 293
Query: 532 GCSFSGSI-----------LKIKRPKEFVEVAIF----------------------IGGI 558
G S + L+I+RP+++V A+ I I
Sbjct: 294 GISMEANDASNGADGGHRGLEIRRPRDYVVPAVTEDVAYDPEVVSNVVPDTVNKLSITNI 353
Query: 559 SRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVT-PKAIAGLNGLKV 615
L+ + ++E++ +FG KA+ + EE AF EY D V+ P A+ LNG+ +
Sbjct: 354 PPFLTEEQIIELLASFGKPKAFVLVKDRGTEESRGIAFAEYQDPAVSNPTALDTLNGMDI 413
Query: 616 GGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLS 673
GG+ + +A + + + N G G+ ++ + VL+L N+ E L
Sbjct: 414 GGKQIKVSKASIGPTQVANFDVGITAISGLASQTANEVES-SRVLQLLNMVTAEEL--LD 470
Query: 674 ELEVEEVLEDVRLECARFGSVKSVNV 699
+ EE+ EDVR EC+++G + V V
Sbjct: 471 NDDYEEICEDVREECSKYGKILDVKV 496
>gi|238482353|ref|XP_002372415.1| splicing factor u2af large subunit [Aspergillus flavus NRRL3357]
gi|220700465|gb|EED56803.1| splicing factor u2af large subunit [Aspergillus flavus NRRL3357]
Length = 556
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 172/385 (44%), Gaps = 80/385 (20%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS- 424
PRKR T + S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 142 PRKREATPDLTEVESVLHR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGAP 194
Query: 425 -----DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASE 477
DP + + +P AG + SASL N +R +RL V NLP SA+
Sbjct: 195 RQQPMDPSRLQAFMSQPGAGTAESASLKPSN-----------SRQAKRLFVSNLPASATG 243
Query: 478 KALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSG 537
+ L+ F N L +G+ +H PCI + ++ A +EF T DA+ AL DG +
Sbjct: 244 ENLLSFFN--LQLNGLNVIHSVDPCISAQVSDDRSFALLEFKTPNDATVALAFDGITMDE 301
Query: 538 SI-------------LKIKRPKEFV----------------EVA-----IFIGGISRTLS 563
S L+++RPK+++ EV I + I +
Sbjct: 302 SEAAGNGAANGAPQGLEVRRPKDYIVPSGNEQEYQEGVLLNEVPDSPNKICVSNIPHYIP 361
Query: 564 SKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLT 621
+ V ++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L
Sbjct: 362 EEPVTMLLKSFGELKSFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLK 421
Query: 622 AVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE-------VLKLKNVFNPEGFSSLSE 674
V+A + I +G + +A+ + K T VL+L N+ PE +
Sbjct: 422 VVRASI--GITQAAG----LDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEEL--MDN 473
Query: 675 LEVEEVLEDVRLECARFGSVKSVNV 699
+ +E+ +DVR ECA++G V + +
Sbjct: 474 DDYDEICDDVREECAKYGQVVELKI 498
>gi|119482894|ref|XP_001261475.1| splicing factor u2af large subunit [Neosartorya fischeri NRRL 181]
gi|119409630|gb|EAW19578.1| splicing factor u2af large subunit [Neosartorya fischeri NRRL 181]
Length = 563
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 168/382 (43%), Gaps = 74/382 (19%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T PS + R +++ +WD+ P +V T+ QA S + +
Sbjct: 149 PRKREPTPDLTDVPSVLTR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 200
Query: 426 PVTSTTQKPLAGISVSASLAKLNV-SMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + S D+ L SN R RRL V NLP S + L+ F
Sbjct: 201 ----PRQQPMDPSRLQAFMNQPGGGSADNSALKPSNSRQARRLFVYNLPSVVSSEHLVSF 256
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF------SG 537
N L +G+ +H PCI I + A +EF T D + AL DG + SG
Sbjct: 257 FN--LQLNGLNVIHSVDPCISAQISEDHSFALLEFKTPNDTTVALAFDGITMEEHEPASG 314
Query: 538 SI------LKIKRPKEFV-----------------EVA-----IFIGGISRTLSSKMVME 569
+ L+++RPK+++ EV I + I + + + V
Sbjct: 315 TENGAPKGLEVRRPKDYIVPNGSADQEYQEGVLLNEVPDSPNKICVSNIPQYIPEEPVTM 374
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 375 LLKSFGELKSFVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASI 434
Query: 628 ---DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ +D N A+ + K T VL+L N+ PE L +
Sbjct: 435 GMTQAAGLDMGVN---------AMSMFAKTTSQDLESSRVLQLLNMVTPEEL--LDNDDY 483
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
EE+ +DVR EC+++G V + V
Sbjct: 484 EEICDDVREECSKYGKVLDLKV 505
>gi|410054709|ref|XP_003954504.1| PREDICTED: LOW QUALITY PROTEIN: splicing factor U2AF 65 kDa subunit
[Pan troglodytes]
Length = 394
Score = 99.8 bits (247), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 79/254 (31%), Positives = 129/254 (50%), Gaps = 35/254 (13%)
Query: 478 KALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSG 537
+A+M+F N + G+ G+ P + I ++K AF+EF + ++ + A+ DG F G
Sbjct: 86 EAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQG 144
Query: 538 SILKIKRPKEFVEVA------------------------IFIGGISRTLSSKMVMEIVCA 573
LKI+RP ++ + +FIGG+ L+ V E++ +
Sbjct: 145 QSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTS 204
Query: 574 FGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG-- 629
FGPLKA++ + + AF EYVD VT +AIAGLNG+++G + L +A +
Sbjct: 205 FGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQRASVGAKN 264
Query: 630 SIMDNSGNPPFH----GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVR 685
+ + P G+ + + PTEVL L N+ PE L + E EE++EDVR
Sbjct: 265 ATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVR 322
Query: 686 LECARFGSVKSVNV 699
EC+++G VKS+ +
Sbjct: 323 DECSKYGLVKSIEI 336
>gi|408397958|gb|EKJ77095.1| hypothetical protein FPSE_02739 [Fusarium pseudograminearum CS3096]
Length = 554
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 96/386 (24%), Positives = 167/386 (43%), Gaps = 71/386 (18%)
Query: 366 SPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQ 412
+P++RR+ +P P R P +++ +WD+ P +V T+ Q
Sbjct: 130 APQERRS----ASPPPKKREPTPDLTNIVSVLERQRRLTQWDIKPPGYDNV-----TAEQ 180
Query: 413 AASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENL 471
A S + + Q+P+ + A + + + S L SN R +RL V +
Sbjct: 181 AKLSGMFPLPGA-----PRQQPMDPSKLQAFMNQPGGQVTSAGLKASNSRQSKRLLVSRI 235
Query: 472 PLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCD 531
P SE L+ F N L +G+ + + PC+ C ++ A +EF A + + AL D
Sbjct: 236 PPGTSEDTLIAFFN--LQLNGLNVIDTTDPCVLCQFSNDRSFAVIEFKDAPETTVALALD 293
Query: 532 GCSFSGSI-----------LKIKRPKEFVEVAIF----------------------IGGI 558
G S + L+I+RP+++V A+ I I
Sbjct: 294 GISMEANDASNGADGGHRGLEIRRPRDYVVPAVTEDVAYDPEVVSNVVPDTVNKLSITNI 353
Query: 559 SRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVT-PKAIAGLNGLKV 615
L+ + ++E++ +FG KA+ + EE AF EY D V+ P A+ LNG+ +
Sbjct: 354 PPFLTEEQIIELLASFGKPKAFVLVKDRGTEESRGIAFAEYQDPAVSNPTALDTLNGMDI 413
Query: 616 GGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLS 673
GG+ + +A + + + N G G+ ++ + VL+L N+ E L
Sbjct: 414 GGKQIKVSKASIGPTQVANFDVGITAISGLASQTANEVES-SRVLQLLNMVTAEEL--LD 470
Query: 674 ELEVEEVLEDVRLECARFGSVKSVNV 699
+ EE+ EDVR EC+++G + V V
Sbjct: 471 NDDYEEICEDVREECSKYGKILDVKV 496
>gi|219116422|ref|XP_002179006.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409773|gb|EEC49704.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 325
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 133/283 (46%), Gaps = 41/283 (14%)
Query: 458 QSNRPMRRLCVENLPLSASEKAL-MEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
Q R RRL V NLP +E A+ +EF ++S P + I E+ FV
Sbjct: 2 QQTRHARRLYVGNLPPHITEDAIHVEFRRAIEIASPTPLSED--PVLSTYINHERRFCFV 59
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEF---------------VEVA--------- 552
EF T E A+A + DG G +K+KRP ++ ++V+
Sbjct: 60 EFKTVEMATACMNLDGLHVQGVPVKVKRPNDYNANMAPKIHPSALPPLDVSKLGIVSGTV 119
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNE---DHEEPCAFIEYVDQLVT 603
IFIGG+ L VME++ AFG +KA+H N+ + + F+EY D +T
Sbjct: 120 EDGPNKIFIGGLHYHLQDSQVMELLQAFGKIKAFHLVSNDPESNMSKGYCFVEYADPNIT 179
Query: 604 PKAIAGLNGLKVG-GQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKN 662
P A+ GLNG+ +G G+ LTA A + P++ +P PT VL L N
Sbjct: 180 PIAVQGLNGMDIGNGKALTARLAGDRTGGAGGAAFLAHAMDPQNGVP--NIPTRVLVLHN 237
Query: 663 VFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDS 705
+ E + ++ E + + ++V+ ECA+FG ++ + + + G +
Sbjct: 238 MVTDEDLA--TDTEYQGLFDEVKDECAKFGRLERLEIPRQGPA 278
>gi|83765203|dbj|BAE55346.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 563
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 172/385 (44%), Gaps = 80/385 (20%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS- 424
PRKR T + S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 145 PRKREATPDLTEVESVLHR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGAP 197
Query: 425 -----DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASE 477
DP + + +P AG + SASL N +R +RL V NLP SA+
Sbjct: 198 RQQPMDPSRLQAFMSQPGAGTAESASLKPSN-----------SRQAKRLFVSNLPASATG 246
Query: 478 KALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSG 537
+ L+ F N L +G+ +H PCI + ++ A +EF T DA+ AL DG +
Sbjct: 247 ENLLSFFN--LQLNGLNVIHSVDPCISAQVSDDRSFALLEFKTPNDATVALAFDGITMDE 304
Query: 538 SI-------------LKIKRPKEFV----------------EVA-----IFIGGISRTLS 563
S L+++RPK+++ EV I + I +
Sbjct: 305 SEAAGNGAANGAPQGLEVRRPKDYIVPSGNEQEYQEGVLLNEVPDSPNKICVSNIPHYIP 364
Query: 564 SKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLT 621
+ V ++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L
Sbjct: 365 EEPVTMLLKSFGELKSFVLVKDGSTEESRGIAFCEYADPNATSIAVEGLNGMELGDRHLK 424
Query: 622 AVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE-------VLKLKNVFNPEGFSSLSE 674
V+A + I +G + +A+ + K T VL+L N+ PE +
Sbjct: 425 VVRASI--GITQAAG----LDMGVNAMSMFAKTTSQDLETSRVLQLLNMVTPEEL--MDN 476
Query: 675 LEVEEVLEDVRLECARFGSVKSVNV 699
+ +E+ +DVR ECA++G V + +
Sbjct: 477 DDYDEICDDVREECAKYGQVVELKI 501
>gi|295663747|ref|XP_002792426.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226279096|gb|EEH34662.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 567
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 108/434 (24%), Positives = 185/434 (42%), Gaps = 112/434 (25%)
Query: 361 GLGGYSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNV 407
GG RKR + +P P R P +++ +WD+ P P
Sbjct: 135 AFGGGRDRKR-----SASPPPKKREPTPDLTDVVPILERKRRLTQWDIKP------PGYE 183
Query: 408 H-TSNQAASSNAHEMVSS------DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQ 458
H T+ QA S + + DP + + +P+ GIS + L N
Sbjct: 184 HVTAEQAKLSGMFPLPGAPRQQAVDPSRLQAFMNQPVPGISTNTVLRPSN---------- 233
Query: 459 SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEF 518
+R +RL V N+P SA+E++L++F N L +G+ + G PC+ + +++ +EF
Sbjct: 234 -SRQAKRLFVHNIPPSATEESLVQFFN--LQLNGLNVIKGVDPCVSAQLSKDRSFGLLEF 290
Query: 519 LTAEDASAALCCDGCSF---------------SGSILKIKRPKEFVEVA----------- 552
++ DA+ AL DG + S L ++RPK+++
Sbjct: 291 KSSADATVALAFDGITMEDTGDMDTSNGEANGSNQGLSLRRPKDYILPVGGEEEPHREGV 350
Query: 553 -----------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVD 599
I + I + + V ++ +FG LK++ + + E AF EY+D
Sbjct: 351 VSNVVPDTPNKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYLD 410
Query: 600 QLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT---- 655
+ T A+ LNG+++G + L V+A + G+I + GI +A+ + K T
Sbjct: 411 PMSTDIAVENLNGMELGNKRLKVVRASI-GTIQATGLD---MGI--NAMSMYAKTTSQDI 464
Query: 656 ---EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQA 712
VL+L N+ + L + EE+ +DVR EC+++G V + V +
Sbjct: 465 EAGRVLQLLNMVTTDEL--LDNDDYEEICDDVRDECSKYGQVVELKVPR----------- 511
Query: 713 CEGNENTASAGVGQ 726
GN N SAGVG+
Sbjct: 512 PTGN-NKQSAGVGK 524
>gi|340520531|gb|EGR50767.1| predicted protein [Trichoderma reesei QM6a]
Length = 539
Score = 99.4 bits (246), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 94/353 (26%), Positives = 158/353 (44%), Gaps = 55/353 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P Y ++ T+ QA S + + Q+P+ + A + +
Sbjct: 161 KRRLTQWDIKP-PGY----DLVTAEQAKLSGMFPLPGA-----PRQQPMDPTKLQAFMTQ 210
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S L SN R +RL V N+P +E+AL+ F N L +G+ + PC+ C
Sbjct: 211 PGGQVSSAGLKASNSRQAKRLLVYNVPSGVTEEALIAFFN--LQLNGLNVIETPDPCVLC 268
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSF------SGSI------LKIKRPKEFVEVAI 553
+K A VEF A DA+ AL DG + +G+ L I+RPK++V I
Sbjct: 269 QFSSDKTFAVVEFRNASDATVALALDGITMEADDAQNGTANGGSHGLDIRRPKDYVMPGI 328
Query: 554 ----------------------FIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP 591
I I L+ + ++E++ AFG K++ + EE
Sbjct: 329 PDDIPYDPEVISNVVPDTVHKLCITNIPTFLNEEQIIELLAAFGKPKSFVLVKDRSTEES 388
Query: 592 --CAFIEYVD-QLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKH 646
AF EY+D A+ LNG+ V G+ L +A + + + N G G+
Sbjct: 389 RGIAFTEYLDPSSANEPALNSLNGMDVAGKKLKVTKASIGPTQVANFDVGITAISGLASQ 448
Query: 647 ALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+++ + V++L N+ PE L + EE+ EDV+ EC++FG V + V
Sbjct: 449 TSNDIER-SSVIQLLNMVTPEEL--LDNDDYEEICEDVQDECSKFGKVVELKV 498
>gi|261196608|ref|XP_002624707.1| splicing factor u2af large subunit [Ajellomyces dermatitidis
SLH14081]
gi|239595952|gb|EEQ78533.1| splicing factor u2af large subunit [Ajellomyces dermatitidis
SLH14081]
gi|239609528|gb|EEQ86515.1| splicing factor u2af large subunit [Ajellomyces dermatitidis ER-3]
gi|327350238|gb|EGE79095.1| splicing factor u2af large subunit [Ajellomyces dermatitidis ATCC
18188]
Length = 570
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/427 (25%), Positives = 181/427 (42%), Gaps = 94/427 (22%)
Query: 361 GLGGYSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNV 407
LGG RKR + +P P R P +++ +WD+ P P
Sbjct: 134 ALGGGRERKR-----SASPPPKKREPTPDLTDVVPILERKRRLTQWDIKP------PGYE 182
Query: 408 H-TSNQAASSNAHEMVSSDPVTSTTQKPL-AGISVSASLAKLNVSMDSVQLTQSNRPMRR 465
H T+ QA S + + + L A I ++ S ++V ++R +R
Sbjct: 183 HVTAEQAKLSGMFPLPGAPRQQAVDPSRLQAFIHPPTTITAPGSSTNTVLKPSNSRQAKR 242
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDAS 525
L V NLP SA+E+ L++F N L +G+ + G PC+ + R+K A +EF A D +
Sbjct: 243 LFVHNLPPSATEERLVQFFN--LQLNGLNVIKGVDPCLSAQLSRDKTFALLEFRNAADTT 300
Query: 526 AALCCDGCSF--SGSI-------------LKIKRPKEFVEVA------------------ 552
AL DG + SG + L I+RPK+++ +
Sbjct: 301 VALAFDGITMEDSGEMDTSNGDVDGPSQGLSIRRPKDYILPSAVEEEPQQEGVVSNVVPD 360
Query: 553 ----IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKA 606
I + I + + V ++ +FG LK++ + + E AF EY+D T A
Sbjct: 361 SPNKICVSNIPPFIEEEPVTMLLVSFGELKSFILVKDSETGESRGIAFCEYLDPTSTEIA 420
Query: 607 IAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLK 659
+ LNG+++G + L V+A S+ G+ +A+ + K T VL+
Sbjct: 421 VENLNGMELGNKRLKVVRA----SVGTTQAAGLDMGV--NAMSMYAKTTSQDIEASRVLQ 474
Query: 660 LKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACEGNENT 719
L N+ E + + EE+ +DVR EC+++G V + V + GN N
Sbjct: 475 LLNMVTTEEL--MDNDDYEEICDDVRDECSKYGEVVELKVPR-----------PTGN-NK 520
Query: 720 ASAGVGQ 726
SAGVG+
Sbjct: 521 QSAGVGK 527
>gi|75338884|sp|Q9ZR40.1|U2A2B_NICPL RecName: Full=Splicing factor U2af large subunit B; AltName:
Full=NpU2AF65b; AltName: Full=U2 auxiliary factor 65 kDa
subunit B; AltName: Full=U2 small nuclear
ribonucleoprotein auxiliary factor large subunit B;
Short=U2 snRNP auxiliary factor large subunit B
gi|3850821|emb|CAA77135.1| U2 snRNP auxiliary factor, large subunit [Nicotiana
plumbaginifolia]
Length = 573
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 154/359 (42%), Gaps = 74/359 (20%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
+D+AP + +P + Q +N P PLA S L V
Sbjct: 177 FDMAPPTSAMLPGITAAAGQVPGTN-----PPIPGMFPNMFPLA----SGQFGALPVMPI 227
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+ R RR+ V LP A+E+++ F ++ + + G + I EK
Sbjct: 228 QAMTQQATRHARRVYVGGLPAHANEQSVATFFSHVMSAIGGNTAGPGDAVVNVYINYEKK 287
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA---- 552
AFVE + E+AS A+ DG F G+ K++RP ++ + +A
Sbjct: 288 FAFVEMRSVEEASNAMALDGIIFEGAPCKVRRPSDYNPSLAATLGPSQPNPNLNLAAVGL 347
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFI 595
IF+GG+ + + E++ +FGPL+ F++ +D E + AF
Sbjct: 348 SPGSAGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRG--FDLVKDRETGNSKGYAFC 405
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN----PPFHGIPKHALP-- 649
Y D VT A A LNG+K+G + LT +A N G P + HA
Sbjct: 406 VYQDVSVTDIACAALNGIKMGDKTLTVRRA--------NQGTTQPKPEQESVLLHAQQQI 457
Query: 650 ----LLKKP----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
L+ +P T+VL L V + + + + + +++LED+R EC +FGS+ VNVV
Sbjct: 458 ALQRLMLQPATLATKVLSLTEVISADELN--DDEDYQDILEDMRTECGKFGSL--VNVV 512
>gi|340780291|pdb|2YH0|A Chain A, Solution Structure Of The Closed Conformation Of Human
U2af65 Tandem Rrm1 And Rrm2 Domains
gi|340780292|pdb|2YH1|A Chain A, Model Of Human U2af65 Tandem Rrm1 And Rrm2 Domains With
Eight-Site Uridine Binding
Length = 198
Score = 99.0 bits (245), Expect = 1e-17, Method: Composition-based stats.
Identities = 61/188 (32%), Positives = 98/188 (52%), Gaps = 27/188 (14%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+P +E+A+M+F N + G+ G+ P + I ++K AF+EF + ++
Sbjct: 5 RRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDE 63
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEV------------------------AIFIGGIS 559
+ A+ DG F G LKI+RP ++ + +FIGG+
Sbjct: 64 TTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLP 123
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
L+ V E++ +FGPLKA++ + + AF EYVD VT +AIAGLNG+++G
Sbjct: 124 NYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGD 183
Query: 618 QVLTAVQA 625
+ L +A
Sbjct: 184 KKLLVQRA 191
>gi|426244214|ref|XP_004015921.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Ovis aries]
Length = 471
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 155/354 (43%), Gaps = 57/354 (16%)
Query: 384 RSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGI 438
RSP + K WDV P + + + QAA Q P +
Sbjct: 79 RSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATAL 122
Query: 439 SVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHV 496
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 LPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQA 182
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA---- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 PGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSE 241
Query: 553 --------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEE 590
+FIGG+ L+ V E++ +FGPLKA++ + +
Sbjct: 242 NPSVYVPGVVSTVVPGSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSK 301
Query: 591 PCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP-----FHGIPK 645
AF EYVD VT + + G L A + S ++ N G+
Sbjct: 302 GYAFCEYVDINVTDQVSPAPAHPALLGSPLRAGRGACSSSPFASTINQTPVTLQVPGLMS 361
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + PTEVL L N+ PE L + E EE++EDVR EC ++G VKS+ +
Sbjct: 362 SQVQMGGHPTEVLCLMNMVLPE--ELLDDEEYEEIVEDVRDECGKYGLVKSIEI 413
>gi|341038664|gb|EGS23656.1| hypothetical protein CTHT_0003520 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 584
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 136/295 (46%), Gaps = 58/295 (19%)
Query: 452 DSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQRE 510
+S L +N R +RL V NLP SA+E++L+ F N L +G+ + + PC+ I +
Sbjct: 243 NSAALKPTNSRQAKRLVVRNLPPSATEESLVNFFN--LQLNGLNVIETTDPCLQAHIAPD 300
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSI------------LKIKRPKEFVEVAIF---- 554
+ A +EF + +A+ AL DG S L+I RPK+++ A+
Sbjct: 301 RSFAMLEFRNSSEATVALAFDGISMDADDAGANGAEAVHGGLQITRPKDYIVPAVVEDPN 360
Query: 555 --------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--C 592
+ I L+ VME++ +FG LK++ + +E
Sbjct: 361 YDPDSDVPSSVVIDSPNKISVANIPPYLNEDQVMELLVSFGKLKSFVLVKDNGTQESRGI 420
Query: 593 AFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLK 652
AF+EYVD V+ AI GLN + +G Q L +A + I +G + +A+ +L
Sbjct: 421 AFLEYVDPSVSNVAIQGLNDMPLGEQKLKVKKASI--GITQVAGE-----MSVNAMSMLA 473
Query: 653 KPT--------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
T VL+L N+ PE + + EE+ EDV EC +FG+V S+ +
Sbjct: 474 GTTSTHAEASSRVLQLLNMVTPEEL--MDNDDYEEIREDVLEECKKFGNVLSLKI 526
>gi|159123253|gb|EDP48373.1| splicing factor u2af large subunit [Aspergillus fumigatus A1163]
Length = 567
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 167/382 (43%), Gaps = 74/382 (19%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T PS + R +++ +WD+ P +V T+ QA S + +
Sbjct: 153 PRKREPTPDLTDVPSVLTR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 204
Query: 426 PVTSTTQKPLAGISVSASLAKLNV-SMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + S D+ L SN R RRL V NLP S + L+
Sbjct: 205 ----PRQQPMDPSRLQAFMNQSGGGSADNSALKPSNSRQARRLFVYNLPPGVSSEHLVSL 260
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF------SG 537
N L +G+ +H PCI I + A +EF T DA+ AL DG + SG
Sbjct: 261 FN--LQLNGLNVIHHVDPCISAQISEDHSFALLEFKTPNDATVALAFDGITMEEHEPVSG 318
Query: 538 SI------LKIKRPKEFV-----------------EVA-----IFIGGISRTLSSKMVME 569
+ L+++RPK+++ EV I + I + + + V
Sbjct: 319 AENGAPKGLEVRRPKDYIVPNGSADQEYQEGVLLNEVPDSPNKICVSNIPQYIPEEPVTM 378
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 379 LLKSFGELKSFVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASI 438
Query: 628 ---DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ +D N A+ + K T VL+L N+ PE L +
Sbjct: 439 GMTQAAGLDMGVN---------AMSMFAKTTSQDLESSRVLQLLNMVTPEEL--LDNDDY 487
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
EE+ +DVR EC ++G V + V
Sbjct: 488 EEICDDVREECFKYGKVLDLKV 509
>gi|146324846|ref|XP_748978.2| splicing factor u2af large subunit [Aspergillus fumigatus Af293]
gi|129556630|gb|EAL86940.2| splicing factor u2af large subunit [Aspergillus fumigatus Af293]
Length = 563
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 167/382 (43%), Gaps = 74/382 (19%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T PS + R +++ +WD+ P +V T+ QA S + +
Sbjct: 149 PRKREPTPDLTDVPSVLTR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 200
Query: 426 PVTSTTQKPLAGISVSASLAKLNV-SMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + S D+ L SN R RRL V NLP S + L+
Sbjct: 201 ----PRQQPMDPSRLQAFMNQSGGGSADNSALKPSNSRQARRLFVYNLPPGVSSEHLVSL 256
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF------SG 537
N L +G+ +H PCI I + A +EF T DA+ AL DG + SG
Sbjct: 257 FN--LQLNGLNVIHHVDPCISAQISEDHSFALLEFKTPNDATVALAFDGITMEEHEPVSG 314
Query: 538 SI------LKIKRPKEFV-----------------EVA-----IFIGGISRTLSSKMVME 569
+ L+++RPK+++ EV I + I + + + V
Sbjct: 315 AENGAPKGLEVRRPKDYIVPNGSADQEYQEGVLLNEVPDSPNKICVSNIPQYIPEEPVTM 374
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 375 LLKSFGELKSFVLVKDSSTEESRGIAFCEYADPSATAIAVEGLNGMELGDRHLKVVRASI 434
Query: 628 ---DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ +D N A+ + K T VL+L N+ PE L +
Sbjct: 435 GMTQAAGLDMGVN---------AMSMFAKTTSQDLESSRVLQLLNMVTPEEL--LDNDDY 483
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
EE+ +DVR EC ++G V + V
Sbjct: 484 EEICDDVREECFKYGKVLDLKV 505
>gi|357455537|ref|XP_003598049.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355487097|gb|AES68300.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 627
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/336 (25%), Positives = 148/336 (44%), Gaps = 54/336 (16%)
Query: 413 AASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLP 472
A+ A ++ + P + + + + L V Q+ R RR+ V L
Sbjct: 237 GATGVAGQITGASPAIPGMFPNMFPLPTNQPFSALPVLPVQAMTQQATRHARRVYVGGLS 296
Query: 473 LSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDG 532
+A+E+++ F + + + G + I +K AFVE + E+AS A+ DG
Sbjct: 297 PTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVEMRSVEEASNAMALDG 356
Query: 533 CSFSGSILKIKRPKEF---VEVA------------------------------IFIGGIS 559
F G+ +K++RP ++ + A IF+GG+
Sbjct: 357 IIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSAGGLDGPDRIFVGGVP 416
Query: 560 RTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKV 615
+ + E++ FGPL+ F++ +D E + AF Y D VT A A LNG+K+
Sbjct: 417 YYFTETQIRELLETFGPLRG--FDLVKDRETGNSKGYAFCVYQDLAVTDIACAALNGIKM 474
Query: 616 GGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP------LLKKP----TEVLKLKNVFN 665
G + LT +A + + M P I HA L+ +P T+VL L + +
Sbjct: 475 GDKTLTVRRANQNTNPM--QPKPEQESILMHAQQQIALQKLMLQPALVATKVLCLTHAVS 532
Query: 666 PEGFSSLSELEVEEVLEDVRLECARFGSV-KSVNVV 700
P+ + + EE+L+D+R EC++FG++ VNVV
Sbjct: 533 PDELK--DDEDYEEILDDMRQECSKFGNICNLVNVV 566
>gi|358378060|gb|EHK15743.1| hypothetical protein TRIVIDRAFT_79964 [Trichoderma virens Gv29-8]
Length = 503
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/353 (26%), Positives = 160/353 (45%), Gaps = 55/353 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P Y ++ T+ QA S + + Q+P+ + A + +
Sbjct: 108 KRRLTQWDIKP-PGY----DLVTAEQAKLSGMFPLPGA-----PRQQPMDPTKLQAFMTQ 157
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S L SN R +RL V N+P S ++ AL+ F N L +G+ + S PC+
Sbjct: 158 PGGQVTSAGLKASNSRQAKRLLVSNVPSSVTDDALISFFN--LQLNGLNVIDSSDPCVLS 215
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSF------SGSI------LKIKRPKEFVEVA- 552
++K A +EF A DA+ AL DG + +G+ L I+RPK++V A
Sbjct: 216 QFSQDKAFAVLEFRNASDATVALALDGITMEADDAQNGTANGGNHGLVIRRPKDYVMPAL 275
Query: 553 ---------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP 591
+ I I L+ V+E++ AFG KA+ + EE
Sbjct: 276 PDEMPYDPEVISNVVPDTVHKLCITNIPSFLNEDQVIELLAAFGKPKAFVLVKDRSTEES 335
Query: 592 --CAFIEYVD-QLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKH 646
AF EY++ A+ LNG+ VGG+ L +A + + + N G G+
Sbjct: 336 RGIAFTEYLEPSTANEPALNSLNGMDVGGKKLKVTKASIGPTQVANFDVGITAISGLASQ 395
Query: 647 ALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+++ + V++L N+ PE + + EE+ EDV+ EC++FG V + V
Sbjct: 396 TSNDIER-SSVIQLLNMVTPEEL--MDNDDYEEICEDVQDECSKFGKVVELKV 445
>gi|378731414|gb|EHY57873.1| U2AF domain-containing protein (UHM) kinase 1 [Exophiala
dermatitidis NIH/UT8656]
Length = 574
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/361 (26%), Positives = 155/361 (42%), Gaps = 66/361 (18%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+ + + A +
Sbjct: 175 KRRLTQWDIKPPGYENV-----TAEQAKMSGMFPLPGA-----PRQQQMDPSRLQAFMNL 224
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+ S ++ L SN R RRL V NLP S SE+AL++F N L +G+ PC
Sbjct: 225 PSSSSNNTALKPSNSRQARRLFVHNLPASVSEEALVQFFN--LQLNGLNVTKAVDPCAQA 282
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSF-------------------SGSILKIKRPK 546
I ++ A VEF A DA+ AL DG + + L+I+RPK
Sbjct: 283 NIAEDRSFALVEFKNASDATLALALDGITMPEHHSEMNGNGDANGNGTAAPKGLEIRRPK 342
Query: 547 EFV------------EVA---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN 585
+++ E++ + + + L+ V+E++ AFG +KA+
Sbjct: 343 DYIVPSADEATYAEGEISSEVPDTANKLAVTNLPPFLTDDQVIELLKAFGEVKAFVLVRE 402
Query: 586 EDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD-----GSIMDNSGNP 638
D +E AF EY D T AI GLNG+ + G + +A + G M +
Sbjct: 403 PDSQESRGIAFCEYADPASTAVAIEGLNGMDLAGNSIKVTRASIGYQQAAGLDMGVNAMS 462
Query: 639 PFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVN 698
F G A VL+L N+ PE + + EE+ EDV EC+++G + S+
Sbjct: 463 MFAGTTSDA----HDEGRVLQLLNMVTPEDL--MDNDDYEEICEDVMEECSKYGKILSMK 516
Query: 699 V 699
+
Sbjct: 517 I 517
>gi|258576333|ref|XP_002542348.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902614|gb|EEP77015.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 621
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 165/384 (42%), Gaps = 76/384 (19%)
Query: 366 SPRKRRTEAAAKTPSPINRSP----EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEM 421
SP +R+ E TP + P +++ +WD+ P +V T+ QA S +
Sbjct: 134 SPAQRKKEP---TPDLTDIVPIMERKRRLTQWDIKPPGYENV-----TAEQAKLSGMFPL 185
Query: 422 VSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKAL 480
+ Q+ + + A + + + +S L SN R +RL V NLP S SE L
Sbjct: 186 PGA-----PRQQTVDPSRLQAFMNQPAGNANSTLLKPSNSRQAKRLFVHNLPPSVSEDTL 240
Query: 481 MEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF----- 535
+F N L +G+ + G PCI + + A +EF TA DA+ AL DG S
Sbjct: 241 AQFFN--LQLNGLNVISGVDPCISAQVSSDGKFALLEFKTASDATVALALDGISLEHDDA 298
Query: 536 ------SGSILKIKRPKEFV-----------------EVA-----IFIGGISRTLSSKMV 567
G L +KRPK+++ EV I + I + + V
Sbjct: 299 NGTSSAPGQGLSLKRPKDYIVPSEADDSNRQDGVVSNEVPDSPNKICVTNIPPFIQEEQV 358
Query: 568 MEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
++ +FG LK++ + D AF EYVD T A+ GLNG+++G + L +A
Sbjct: 359 TMLLVSFGELKSFVLVKDSGTDESRGIAFCEYVDPSSTNIAVEGLNGMELGDKRLKVTRA 418
Query: 626 VLDGSI---MDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSEL 675
+ + +D N A+ + K T VL+L N+ E +
Sbjct: 419 SIGATQAAGLDMGVN---------AMSMFAKTTSQDLETGRVLQLLNMVTAEEL--MDSD 467
Query: 676 EVEEVLEDVRLECARFGSVKSVNV 699
E EE+ +DVR EC+++G V + +
Sbjct: 468 EYEEICDDVREECSKYGQVLDLKI 491
>gi|296420976|ref|XP_002840043.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636253|emb|CAZ84234.1| unnamed protein product [Tuber melanosporum]
Length = 540
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/418 (24%), Positives = 170/418 (40%), Gaps = 70/418 (16%)
Query: 358 STSGLGGYSPRKRRTEAAAK----TP-----SPINRSPEKKSAKWDVAPVETYSVPSNVH 408
ST+ Y RKR K TP +PIN ++ WD+ P +V
Sbjct: 119 STAAADAYPVRKRSATPPVKKREPTPDLTDVTPINER-RRRMTMWDIKPQGYENV----- 172
Query: 409 TSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL-NVSMDSVQLTQSNRPMRRLC 467
T+ QA S + + Q P+ + A + + NV+ S ++R +RL
Sbjct: 173 TAEQAKLSGMFPLPGA-----PRQAPMDPTRLHAFMTQPSNVATASTLKPTNSRQAKRLL 227
Query: 468 VENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAA 527
+ N+P E++L++F N L S V G P + K +EF DA+
Sbjct: 228 MSNIPPGTDEESLLQFFNQTLSSLNVT-TGGPDPITSVQLSGSKILGLLEFKNTNDATVC 286
Query: 528 LCCDGCSFSGSILKIKRPKEFVEVA-----------------------IFIGGISRTLSS 564
L G F+G ++IKRP++++ I I I L
Sbjct: 287 LALSGIEFNGGNIEIKRPRDYIVPIVPEDHRHQEPGVISSDVPDTPNKILISEIPEYLQD 346
Query: 565 KMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTA 622
+ V+E++ +FG LKA+ +V ++ + AF EY+D T A+ GLN +++ L
Sbjct: 347 EQVIELLKSFGDLKAFVLVKDVTDETSKGIAFCEYLDPGTTEIAVEGLNAMEIADNTLRV 406
Query: 623 VQAVLD-----GSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEV 677
+A + G M + G L + + VL+L N+ + L E
Sbjct: 407 RRASIGMKQAAGVEMGVNAMAMMAGTTSVDL----EASRVLQLLNMVTADEL--LDPEEY 460
Query: 678 EEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGVGQNLTNDETNE 735
+E+ ED+R EC +FGS+ + V + + +AGVG+ ET E
Sbjct: 461 DEICEDIRDECQKFGSLVDMKVPR------------PSGGSRQAAGVGKIFVRFETQE 506
>gi|156070776|gb|ABU45190.1| unknown [Capsicum frutescens]
Length = 551
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 156/358 (43%), Gaps = 72/358 (20%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
+D+AP + +P + Q +N S P + PLA + L +
Sbjct: 155 FDMAPPTSALLPGATDVTGQVPGAN-----PSIPGMFSNMFPLA----AGQFGALPIMPV 205
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+ R RR+ V LP +A+E+++ F ++ + + G + I EK
Sbjct: 206 QAMTQQATRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKK 265
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA---- 552
AFVE + E+AS A+ DG F G +K++RP ++ + +A
Sbjct: 266 FAFVEMRSVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGL 325
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFI 595
IF+GG+ + + E++ +FG L+ F++ +D E + AF
Sbjct: 326 TPGSSGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRG--FDLVKDRETGNSKGYAFC 383
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP------ 649
Y D VT A A LNG+K+G + LT +A + N NP + HA
Sbjct: 384 VYQDVSVTDIACAALNGIKMGDKTLTVRRA----NQGTNQPNPEQESVLLHAQQQIALQR 439
Query: 650 LLKKP----TEVLKLKNVFNPEGFSSLSEL----EVEEVLEDVRLECARFGSVKSVNV 699
+ +P T+VL L V S+ EL + +++LED+R+EC +FGS+ +V +
Sbjct: 440 FMLQPGALATKVLCLTQVV------SVDELNDDDDYQDILEDMRVECGKFGSLLNVVI 491
>gi|225677913|gb|EEH16197.1| splicing factor U2AF 65 kDa subunit ) [Paracoccidioides
brasiliensis Pb03]
gi|226287345|gb|EEH42858.1| splicing factor U2AF 50 kDa subunit [Paracoccidioides brasiliensis
Pb18]
Length = 567
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 107/433 (24%), Positives = 185/433 (42%), Gaps = 112/433 (25%)
Query: 362 LGGYSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVH 408
GG RKR + +P P R P +++ +WD+ P P H
Sbjct: 136 FGGGRDRKR-----SASPPPKKREPTPDLTDVVPILERKRRLTQWDIKP------PGYEH 184
Query: 409 -TSNQAASSNAHEMVSS------DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQS 459
T+ QA S + + DP + + +P+ G S++ L N
Sbjct: 185 VTAEQAKLSGMFPLPGAPRQQAVDPSRLQAFMNQPVPGTSINTVLRPSN----------- 233
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
+R +RL V N+P SA+E++L++F N L +G+ + G PC+ + +++ +EF
Sbjct: 234 SRQAKRLFVHNIPPSATEESLVQFFN--LQLNGLNVIKGVDPCVSAQLSKDRSFGLLEFK 291
Query: 520 TAEDASAALCCDGCSF---------------SGSILKIKRPKEFVEVA------------ 552
++ DA+ AL DG + S L ++RPK+++
Sbjct: 292 SSADATVALAFDGITMEDTGDMDTSNGEANGSNQGLSLRRPKDYILPVGGEEEPHREGVV 351
Query: 553 ----------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQ 600
I + I + + V ++ +FG LK++ + + E AF EY+D
Sbjct: 352 SNVVPDTPNKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYLDP 411
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT----- 655
+ T A+ LNG+++G + L V+A + G+I + GI +A+ + K T
Sbjct: 412 MSTDIAVENLNGMELGNKRLRVVRASI-GTIQATGLD---MGI--NAMSMYAKTTSQDIE 465
Query: 656 --EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQAC 713
VL+L N+ + L + EE+ +DVR EC+++G V + V +
Sbjct: 466 AGRVLQLLNMVTTDEL--LDNDDYEEICDDVRDECSKYGQVVELKVPR-----------P 512
Query: 714 EGNENTASAGVGQ 726
GN N SAGVG+
Sbjct: 513 TGN-NKQSAGVGK 524
>gi|356509477|ref|XP_003523474.1| PREDICTED: splicing factor U2af large subunit B-like [Glycine max]
Length = 600
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 157/365 (43%), Gaps = 75/365 (20%)
Query: 390 SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTS---TTQKPLAGISVSASLAK 446
++ +D+AP P++ + AS+ A ++ ++P PLA S +
Sbjct: 196 TSGFDMAP------PASAMLT--GASAVAGQITGANPTIPGMFPNMFPLA-TSQMQQFSA 246
Query: 447 LNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
L V Q+ R RR+ V LP +A+E+++ F + + G +
Sbjct: 247 LPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSQVMAKIGGNTAGPGDAVVNVY 306
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------------ 548
I +K AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 307 INHDKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLN 366
Query: 549 ---VEVA------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE---- 589
V + +F+GG+ + + E++ FGPL+ F++ +D E
Sbjct: 367 LGAVGLTPGSAGGLDGPDRVFVGGLPYYFTETQIRELLETFGPLRG--FDLVKDRETGNS 424
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP 649
+ AF Y D VT A A LNG+K+G + LT +A N G P P+
Sbjct: 425 KGYAFCVYQDLAVTDIACAALNGIKMGDKTLTVRRA--------NQGANPQQPKPEQESI 476
Query: 650 LLKKPTEVLKLKNVFNPEGFS----------SLSEL----EVEEVLEDVRLECARFGSVK 695
L+ ++ K + P + S EL + EE+L+D+R EC++FG++
Sbjct: 477 LMHAQQQIALQKLMLQPALVATKVVCLTHAVSSDELKDDEDYEEILDDMRQECSKFGTL- 535
Query: 696 SVNVV 700
VNVV
Sbjct: 536 -VNVV 539
>gi|321479007|gb|EFX89963.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 1
[Daphnia pulex]
Length = 487
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 136/299 (45%), Gaps = 64/299 (21%)
Query: 460 NRPMRRLCVENLPLS-----ASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQA 514
R RRL V N+P A+E+ +M+F N + SG+ G P + C I +K A
Sbjct: 136 TRQARRLYVGNIPFGVSDVRAAEEEMMDFFNQQMHLSGLSQAPGH-PILACQINLDKNFA 194
Query: 515 FVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------------VEVAIFIG 556
F+EF + ++ S A+ DG +F G LKI+RP ++ + G
Sbjct: 195 FLEFRSIDETSQAMAFDGINFKGQTLKIRRPHDYHPTPGGGGGGGGGGGGPETTPGMSSG 254
Query: 557 GISR------------TLSS-------KMVM-----------EIVCAFGPLKAYHF--EV 584
GI+ T+S+ K+ + E++ +FG L+A++ +
Sbjct: 255 GITEKRAGGGGGGGGGTMSTIVPDTPHKLFIGGLPNYLNEEQELLMSFGQLRAFNLVKDT 314
Query: 585 NEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFH--- 641
+ AF EY D VT +AI+GLNG+++G + + +A + P
Sbjct: 315 ATGLSKGYAFCEYADVTVTDQAISGLNGMQLGDKKIIIQRASVGAKNAAAYAQMPVQIQV 374
Query: 642 -GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
G A P +PTEVL L N+ PE + E EE+++D+R EC R G+V+SV +
Sbjct: 375 PGFNLAAGP--GQPTEVLCLLNMVTPEELR--DDEEYEEIVDDIREECNRHGAVRSVEI 429
>gi|367020820|ref|XP_003659695.1| hypothetical protein MYCTH_2297048 [Myceliophthora thermophila ATCC
42464]
gi|347006962|gb|AEO54450.1| hypothetical protein MYCTH_2297048 [Myceliophthora thermophila ATCC
42464]
Length = 567
Score = 97.1 bits (240), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/400 (24%), Positives = 172/400 (43%), Gaps = 80/400 (20%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+ + + A +
Sbjct: 171 KRRLTQWDIKPPGYDNV-----TAEQAKLSGMFPLPGA-----PRQQAMDPTKLQAFMNH 220
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
+++S L +N R +RL V NLP SA+E++L+ F N L +G+ + + PC+
Sbjct: 221 PGGAVNSAALKPTNSRQSKRLIVSNLPPSATEESLVNFFN--LQLNGLNVIETADPCLQA 278
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSI-------------LKIKRPKEFVEVA 552
I ++ A +EF DA+ AL DG + L ++RPK+++ A
Sbjct: 279 HIAPDRSFAMLEFRHNTDATVALALDGITMEAEDADAANGNGAATQGLHLRRPKDYIVPA 338
Query: 553 IF------------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH 588
+ + + L+ VME++ +FG LK++ +
Sbjct: 339 VVEDPNYDPDSDTPSSVVLDSPNKISVTNLPLYLTDDQVMELLVSFGKLKSFVLVKDNGT 398
Query: 589 EEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKH 646
+E AF+EY D T A+ GLN + +G + L +A + I SG G+ +
Sbjct: 399 QESRGIAFLEYADPSATNVAVQGLNNMMLGERALKVQKASI--GITQVSGE---MGV--N 451
Query: 647 ALPLLKKP-------TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
A+ +L + VL+L N+ PE + + EE+ EDV+ EC +FG + S+ +
Sbjct: 452 AMSMLAGTMSADAGGSRVLQLLNMVTPEEL--MDNDDYEEIREDVQEECQKFGKILSLKI 509
Query: 700 VKYGDSNISTIQACEGNENTASAGVGQNLTNDETNEKGER 739
+ G+ SAGVG+ ET E +
Sbjct: 510 PR----------PVGGSRQ--SAGVGKIYIKYETAESATK 537
>gi|357455533|ref|XP_003598047.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355487095|gb|AES68298.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 626
Score = 96.7 bits (239), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 135/290 (46%), Gaps = 55/290 (18%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V L +A+E+++ F + + + G + I +K AFVE
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVE 343
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF---VEVA---------------------- 552
+ E+AS A+ DG F G+ +K++RP ++ + A
Sbjct: 344 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 403
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + + E++ FGPL+ F++ +D E + AF Y D
Sbjct: 404 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 461
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP------LLKKP 654
VT A A LNG+K+G + LT +A + + M P I HA L+ +P
Sbjct: 462 AVTDIACAALNGIKMGDKTLTVRRANQNTNPM--QPKPEQESILMHAQQQIALQKLMLQP 519
Query: 655 ----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
T+VL L + +P+ + + EE+L+D+R EC++FG++ VNVV
Sbjct: 520 ALVATKVLCLTHAVSPDELK--DDEDYEEILDDMRQECSKFGNL--VNVV 565
>gi|302916595|ref|XP_003052108.1| hypothetical protein NECHADRAFT_38412 [Nectria haematococca mpVI
77-13-4]
gi|256733047|gb|EEU46395.1| hypothetical protein NECHADRAFT_38412 [Nectria haematococca mpVI
77-13-4]
Length = 564
Score = 96.7 bits (239), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 108/412 (26%), Positives = 176/412 (42%), Gaps = 75/412 (18%)
Query: 367 PRKRRTEAAAKTPSPINRSP----EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMV 422
P+KR TP N P +++ +WD+ P +V T+ QA S +
Sbjct: 138 PKKRE-----PTPDLTNIVPILERKRRLTQWDIKPPGYDNV-----TAEQAKLSGMFPLP 187
Query: 423 SSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALM 481
+ Q+P+ + A + + + S L +N R +RL V +P SE+AL+
Sbjct: 188 GA-----PRQQPMDPSKLQAFMNQPGGQVTSAGLKANNSRQSKRLLVSKIPSGTSEEALI 242
Query: 482 EFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-----S 536
F N L +G+ + + PCI C ++ A +EF A +A+ AL DG S +
Sbjct: 243 SFFN--LQLNGLNVIDATDPCILCQFSNDRSFAVLEFREASEATVALALDGTSMEPDDAN 300
Query: 537 GSI------LKIKRPKEFVEVAIF----------------------IGGISRTLSSKMVM 568
G+ L+I+RP+++V A+ I I L+ V+
Sbjct: 301 GASNGESRGLEIRRPRDYVVPAVTEEVSYNPDVVSNIVPDTINKLCITNIPPFLAEDQVI 360
Query: 569 EIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVD-QLVTPKAIAGLNGLKVGGQVLTAVQA 625
E++ AFG KA+ + EE AF EY D P A+ LNG+ VGG+ L +A
Sbjct: 361 ELLAAFGKPKAFVLVKDRGTEESRGIAFAEYQDPNAANPTALDTLNGMDVGGKKLKVTKA 420
Query: 626 VLDGSIMDN--SGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+ + + N G G+ ++ + VL+L N+ E L + EE+ ED
Sbjct: 421 SIGPTQVANFDVGITAISGLASQTANDVEG-SRVLQLLNMVTAEEL--LDNDDYEEICED 477
Query: 684 VRLECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGVGQNLTNDETNE 735
V+ EC++FG + + + + + SAGVG+ ET E
Sbjct: 478 VKEECSKFGKIIDMKIPR------------PTGGSRQSAGVGKIFVKYETIE 517
>gi|357455535|ref|XP_003598048.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355487096|gb|AES68299.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 629
Score = 96.7 bits (239), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 135/291 (46%), Gaps = 54/291 (18%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V L +A+E+++ F + + + G + I +K AFVE
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIGGNTAGPGDAVVNVYINHDKKFAFVE 343
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF---VEVA---------------------- 552
+ E+AS A+ DG F G+ +K++RP ++ + A
Sbjct: 344 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 403
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + + E++ FGPL+ F++ +D E + AF Y D
Sbjct: 404 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 461
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP------LLKKP 654
VT A A LNG+K+G + LT +A + + M P I HA L+ +P
Sbjct: 462 AVTDIACAALNGIKMGDKTLTVRRANQNTNPM--QPKPEQESILMHAQQQIALQKLMLQP 519
Query: 655 ----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV-KSVNVV 700
T+VL L + +P+ + + EE+L+D+R EC++FG++ VNVV
Sbjct: 520 ALVATKVLCLTHAVSPDELK--DDEDYEEILDDMRQECSKFGNICNLVNVV 568
>gi|403175591|ref|XP_003888994.1| hypothetical protein PGTG_22302 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171670|gb|EHS64431.1| hypothetical protein PGTG_22302 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 713
Score = 96.3 bits (238), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 132/302 (43%), Gaps = 58/302 (19%)
Query: 454 VQLTQS-NRPMRRLCVENLPLSASEKALMEFLNN-----FLLS----SGVQHVHGSLPCI 503
V TQS R RRL V N+ +A+E + EF N LL+ G+ P +
Sbjct: 326 VAQTQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLARNNEDGMAISISENPVV 385
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEV------------ 551
+ EK AFVEF AE+A+ + DG F LKI+RPK++
Sbjct: 386 AVQVNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYTGPDHAGPTHIPGVV 445
Query: 552 ---------AIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH----EEPCAFIEYV 598
IFIGG+ L+ VME++ +FG LK+++ + + AF EYV
Sbjct: 446 STNVPDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGHVSKGFAFCEYV 505
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP------------FHGIPKH 646
D +T A GLNG+++G + L +A + + N P F G
Sbjct: 506 DPDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKENNPDGQRNNYNQFNNFAGGQAT 565
Query: 647 ALPLLK---------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSV 697
A + T VL++ N+ N E + + E E+LED+R EC ++G ++ V
Sbjct: 566 AAASSVLAAVKSGEGEKTRVLQMLNMVNQEEL--VDDQEYGEILEDIRDECGKYGKIEGV 623
Query: 698 NV 699
+
Sbjct: 624 RI 625
>gi|452823554|gb|EME30563.1| U2 snRNP auxiliary factor large subunit, putative isoform 2
[Galdieria sulphuraria]
Length = 538
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 161/368 (43%), Gaps = 94/368 (25%)
Query: 388 KKSAKWDV--APVETYSVPSNVHTSNQAA-----SSNAHEMVSSDPVTSTTQKPLAGISV 440
++S++WDV P E NV+ S + S+ + M+ PV TTQ
Sbjct: 150 RRSSQWDVRKLPWEQSERDPNVNRSAYSVGGLDFSALSQYMI---PVAPTTQP------- 199
Query: 441 SASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFL-LSSGVQHVHGS 499
Q+ + RRL V NLP +E + +F N+ L L+ GV V G
Sbjct: 200 --------------NTQQATKHARRLYVGNLPSDVTESEVADFFNSALYLAKGVD-VPGD 244
Query: 500 LPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------- 548
P + +K AF+E +A +A+AA+ DG F G L+++RP ++
Sbjct: 245 -PVQSVYLNLDKRFAFIELNSAAEAAAAIQMDGVLFRGMSLRMRRPNDYNPNIHAPVYPP 303
Query: 549 --------------VEVA-------------------IFIGGISRTLSSKMVMEIVCAFG 575
++ +FIGG+ L+ + EI+ ++G
Sbjct: 304 VCQLLTCFLGYIEKFQIGFDPSALGVVSTQVPDGPDKVFIGGLPYHLTEDQIKEILSSYG 363
Query: 576 PLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA--VLDGSI 631
PL A++ + N + AF +Y D + AI GLNG+ +G + LT +A V GS+
Sbjct: 364 PLNAFNLVKDPNTGLSKGYAFFQYKDPSIVEAAIKGLNGMTMGDKTLTVRRASQVSSGSV 423
Query: 632 MDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARF 691
+ + P ++ PT +L+L+N+ PE + + E E+++EDVR E +++
Sbjct: 424 ----------ELGQSFSPTVRYPTRILELRNMVEPEEL--VDDEEYEDIIEDVREESSKY 471
Query: 692 GSVKSVNV 699
G V V +
Sbjct: 472 GEVTEVKI 479
>gi|302820212|ref|XP_002991774.1| hypothetical protein SELMODRAFT_161898 [Selaginella moellendorffii]
gi|300140455|gb|EFJ07178.1| hypothetical protein SELMODRAFT_161898 [Selaginella moellendorffii]
Length = 420
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/329 (27%), Positives = 145/329 (44%), Gaps = 72/329 (21%)
Query: 434 PLAGISVSASLAKLNVSMDSVQLTQ-SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSG 492
P AG S +M + +TQ + R RR+ V LP A+E+ + F + + + G
Sbjct: 32 PFAGTQASLLFFAGLPTMPAQAMTQQATRHARRVYVGGLPPLANEQTIATFFSQVMSAIG 91
Query: 493 VQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF---- 548
+ I +EK AFVE T E+AS A+ DG F G ++++RP ++
Sbjct: 92 GNTAGPGDAVVNVYINQEKKFAFVEMRTVEEASNAMALDGIIFEGVSVRVRRPSDYNPSM 151
Query: 549 ------------VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKA 579
+ +A IF+GG+ L+ + E++ +FGPL+
Sbjct: 152 AATLGPSQPSPHLNLAAVGLTPGAAGGADGPDRIFVGGLPYYLTEGQIKELLESFGPLRG 211
Query: 580 YHFEVNED--HEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLT---AVQAVLDGSIMDN 634
+ + D + + F Y D VT A A LNGLK+G + LT A +V G +
Sbjct: 212 FDLVKDRDTGNSKGYGFCVYQDPAVTDVACAALNGLKMGDKTLTVRRATASVHSGQPKPD 271
Query: 635 SGN-----------------PPFH-----GIPKHALPLLKKPTEVLKLKNVFNPEGFSSL 672
N P++ G+ + + + + PT+V+ LK V +P+
Sbjct: 272 QANVLAQAQQQIALQLALQGAPYYNMMMPGV-DNGMTMPETPTKVVCLKQVVSPD----- 325
Query: 673 SEL----EVEEVLEDVRLECARFGSVKSV 697
EL E EE+LED+R EC ++GSV ++
Sbjct: 326 -ELKEDDEYEEILEDMREECGKYGSVATL 353
>gi|75338883|sp|Q9ZR39.1|U2A2A_NICPL RecName: Full=Splicing factor U2af large subunit A; AltName:
Full=NpU2AF65a; AltName: Full=U2 auxiliary factor 65 kDa
subunit A; AltName: Full=U2 small nuclear
ribonucleoprotein auxiliary factor large subunit A;
Short=U2 snRNP auxiliary factor large subunit A
gi|3850823|emb|CAA77136.1| U2 snRNP auxiliary factor, large subunit [Nicotiana
plumbaginifolia]
Length = 555
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/359 (25%), Positives = 155/359 (43%), Gaps = 74/359 (20%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
+D+AP T +P + Q +N + P + PLA S+ L +
Sbjct: 159 FDMAPPTTALLPGATDAAGQVPGTN-----PAIPGLFSNMFPLA----SSQFGALPMMPV 209
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+ R RR+ V LP +A+E+++ F ++ + + G + I EK
Sbjct: 210 QAMTQQATRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKK 269
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA---- 552
AFVE + E+AS A+ DG F G +K++RP ++ + +A
Sbjct: 270 FAFVEMRSVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGS 329
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFI 595
IF+GG+ + + E++ +FG L+ F++ +D E + AF
Sbjct: 330 TPGSSGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRG--FDLVKDRETGNSKGYAFC 387
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT 655
Y D VT A A LNG+K+G + LT V+ G+ N P+ LL
Sbjct: 388 VYQDVSVTDIACAALNGIKMGDKTLT-VRRANQGTTQPN---------PEQESVLLHAQQ 437
Query: 656 EVLKLKNVFNPEGFSS----LSEL----------EVEEVLEDVRLECARFGSVKSVNVV 700
++ + + P ++ L+E+ + +++LED+R EC +FG++ VNVV
Sbjct: 438 QIALQRFMLQPGALATKVLCLTEVVTVDELNDDDDYQDILEDMRTECEKFGAL--VNVV 494
>gi|121711505|ref|XP_001273368.1| splicing factor u2af large subunit [Aspergillus clavatus NRRL 1]
gi|119401519|gb|EAW11942.1| splicing factor u2af large subunit [Aspergillus clavatus NRRL 1]
Length = 583
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 166/380 (43%), Gaps = 69/380 (18%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T S + R +++ +WD+ P +V T+ QA S + +
Sbjct: 150 PRKREPTPDLTDVQSVLTR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 201
Query: 426 PVTSTTQKPLAGISVSASLAKLNV-SMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + S D+ L SN R +RL V NLP S + L+ F
Sbjct: 202 ----PRQQPMDPSRLQAFMNQPGGGSADNAALKPSNSRQAKRLFVYNLPPGVSNEHLVSF 257
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-------- 535
N L +G+ +H PCI I + A +EF + D + AL DG +
Sbjct: 258 FN--LQLNGLNVIHNVDPCISAQISEDHTFALLEFKSPNDTTVALAFDGITMEEHEAMGA 315
Query: 536 -----SGSILKIKRPKEFV-----------------EVA-----IFIGGISRTLSSKMVM 568
+ L+++RPK++V EV I + I + + + V
Sbjct: 316 GAENGASKGLEVRRPKDYVVPNGSADQEYQEGVLLNEVPDSPNKICVSNIPQYIPEEPVT 375
Query: 569 EIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAV 626
++ +FG LK++ + EE AF EY D TP A+ GLNG+++G + L V+A
Sbjct: 376 MLLKSFGELKSFVLVKDSSTEESRGIAFCEYADPSATPIAVEGLNGMELGDRHLKVVRA- 434
Query: 627 LDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEE 679
SI G+ +A+ + K T VL+L N+ PE + + EE
Sbjct: 435 ---SIGMTQAVGLDMGV--NAMSMFAKTTSQDLESGRVLQLLNMVTPEEL--MDNDDYEE 487
Query: 680 VLEDVRLECARFGSVKSVNV 699
+ +DVR EC+++G V + V
Sbjct: 488 ICDDVREECSKYGKVLDLKV 507
>gi|388520789|gb|AFK48456.1| unknown [Lotus japonicus]
Length = 527
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 137/324 (42%), Gaps = 66/324 (20%)
Query: 435 LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ 494
L GIS + L++ Q+ R RR+ + LP +E+++ F ++ + + G
Sbjct: 151 LFGIS-QPQIGALSLMQVQPMTQQATRHARRVYIGGLPPLTNEQSIATFFSHVMTAIGGN 209
Query: 495 HVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-- 552
+ I EK AF+E T E+AS A+ DG F G ++++RP ++
Sbjct: 210 SAGAGDSVVNVYINHEKKFAFLEMRTVEEASNAMSLDGIVFEGVSVRVRRPTDYNPTLAA 269
Query: 553 -------------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYH 581
IF+GG+ + + + E++ AFGPL++
Sbjct: 270 ALGPCQPSPYLNLSAVGLSGGTIGGTDGLDRIFVGGLPYYFAEEQIRELLQAFGPLRS-- 327
Query: 582 FEVNEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS------- 630
F++ D E + F Y D VT A A LNGLKVG + LT +A + G
Sbjct: 328 FDLVRDKETGNSKGYGFCIYQDPAVTDMACASLNGLKVGDKTLTVRRATVSGHSKTEQEH 387
Query: 631 --------------IMDNSG-NPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSEL 675
++ G N P G+ + + + T+VL L + +
Sbjct: 388 IFAQAQQNITMQKVALEVVGLNIP--GVERVPTTIDESATKVLCLTEAITTDEL--MDNG 443
Query: 676 EVEEVLEDVRLECARFGSVKSVNV 699
E EE++ED+R EC +FG++ +V +
Sbjct: 444 EYEEIVEDMRDECGKFGTLMNVVI 467
>gi|225562835|gb|EEH11114.1| splicing factor u2af large subunit [Ajellomyces capsulatus G186AR]
Length = 571
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 108/427 (25%), Positives = 177/427 (41%), Gaps = 94/427 (22%)
Query: 361 GLGGYSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNV 407
GG RKR + +P P R P +++ +WD+ P P
Sbjct: 135 AFGGGRERKR-----SASPPPKKREPTPDLTDVVPILERKRRLTQWDIKP------PGYE 183
Query: 408 H-TSNQAASSNAHEMVSSDPVTSTTQKPL-AGISVSASLAKLNVSMDSVQLTQSNRPMRR 465
H T+ QA S + + + L A I + S ++V ++R +R
Sbjct: 184 HVTAEQAKLSGMFPLPGAPRQQAVDPSRLQAFIHPPTTSTAPGTSTNTVLKPSNSRQAKR 243
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDAS 525
L V NLP SA+E++L++F N L +G+ + G PC+ + +K A +EF A D +
Sbjct: 244 LFVHNLPSSATEESLVQFFN--LQLNGLNVIKGVDPCVTAQLSNDKTFALLEFRNAADTT 301
Query: 526 AALCCDGCSF---------------SGSILKIKRPKEFVEVA------------------ 552
AL DG + S L I+RPK+++ +
Sbjct: 302 VALAFDGITMEDNDDMDTTNGDSNGSNQGLSIRRPKDYILPSAVEGEPHQEGVVSNVVPD 361
Query: 553 ----IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKA 606
I + I + + V ++ +FG LK++ + + E AF EY D + T A
Sbjct: 362 SPNKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYRDPMSTDIA 421
Query: 607 IAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLK 659
+ LNG+++G + L V+A SI G+ +A+ + K T VL+
Sbjct: 422 VENLNGMELGNKKLKVVRA----SIGTTQAAGLDMGV--NAMSMYAKTTSQDIEASRVLQ 475
Query: 660 LKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACEGNENT 719
L N+ E + + EE+ +DVR EC+++G V + V + GN N
Sbjct: 476 LLNMVTTEEL--IDNDDYEEICDDVRDECSKYGEVVELKVPR-----------PTGN-NK 521
Query: 720 ASAGVGQ 726
SAGVG+
Sbjct: 522 QSAGVGK 528
>gi|348688506|gb|EGZ28320.1| hypothetical protein PHYSODRAFT_309221 [Phytophthora sojae]
Length = 694
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 140/294 (47%), Gaps = 61/294 (20%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFL-LSSGVQHVHGSLPCIGCVIQREKGQAF 515
+Q R RRL V + SE + F N+ + + G + GS+ + I RE+ AF
Sbjct: 350 SQQTRHARRLYVGGIG-EISEPEITAFFNDVIDRALGEKQEGGSV--VSVYINRERHFAF 406
Query: 516 VEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------VEVA--------- 552
VE T E +A + DG S++G LKI+RP ++ + +A
Sbjct: 407 VELRTIELTTACMNLDGVSYNGQPLKIRRPNDYNPATVPKDLGPIPQLNLAALGIVSTTV 466
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
IFIGG+ L+ + V E++ AFGPL+++H E++ + + F EY+D VT
Sbjct: 467 SDGPGKIFIGGLPYHLNEEQVKELLQAFGPLRSFHLVKELSSNLSKGYGFCEYMDINVTD 526
Query: 605 KAIAGLNGLKVGGQVLTA--------VQAVLDGSIMDNSG-----NP--------PFHGI 643
A GLN +++G + LT +AV + N+G +P GI
Sbjct: 527 AACLGLNDMRLGDKTLTVRRAMSQENAKAVASAAGTVNTGLEMGLDPSRAAMQAMSLAGI 586
Query: 644 PKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSV 697
P +LP L P+ V+ L N+ PE E E ++L+D++ EC RFG+V S+
Sbjct: 587 P--SLP-LGTPSRVIVLLNMVTPE--ELEDEEEYADILDDIKGECERFGAVPSL 635
>gi|301117646|ref|XP_002906551.1| splicing factor U2af large subunit, putative [Phytophthora
infestans T30-4]
gi|262107900|gb|EEY65952.1| splicing factor U2af large subunit, putative [Phytophthora
infestans T30-4]
Length = 569
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 147/308 (47%), Gaps = 61/308 (19%)
Query: 456 LTQSNRPMRRLCVENLPLSASEKALMEFLNNFL-LSSGVQHVHGSLPCIGCVIQREKGQA 514
L+Q R RRL V + SE + F N+ + + G + GS+ + I RE+ A
Sbjct: 224 LSQQTRHARRLYVGGIG-EISEPEITAFFNDVIDRALGEKQEGGSV--VSVYINRERHFA 280
Query: 515 FVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------VEVA-------- 552
FVE + E +A + DG S++G LKI+RP ++ + +A
Sbjct: 281 FVELRSIELTTACMNLDGVSYNGQPLKIRRPNDYNPATVPKDLGPIPQLNLAALGIVSTT 340
Query: 553 -------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVT 603
IFIGG+ L+ + V E++ AFGPL+++H E++ + + F EY+D VT
Sbjct: 341 VSDGPGKIFIGGLPYHLNEEQVKELLQAFGPLRSFHLVKELSSNLSKGYGFCEYMDINVT 400
Query: 604 PKAIAGLNGLKVGGQVLTA--------VQAVLDGSIMDNSG-----NPPF--------HG 642
A GLN +++G + LT +AV + N+G +P G
Sbjct: 401 DAACIGLNDMQLGDKTLTVRRAMSQENAKAVASAAGTVNTGLEMGLDPSLAAMQAMSMAG 460
Query: 643 IPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKY 702
IP ++P L P+ V+ L N+ PE E E ++L+D++ EC RFG+V S+ + +
Sbjct: 461 IP--SVP-LGTPSRVIVLLNMVTPE--ELEDEEEYADILDDIKGECERFGAVPSLLLPRP 515
Query: 703 GDSNISTI 710
D +S +
Sbjct: 516 RDGVLSAV 523
>gi|154280004|ref|XP_001540815.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150412758|gb|EDN08145.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 571
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 166/388 (42%), Gaps = 76/388 (19%)
Query: 387 EKKSAKWDVAPVETYSVPSNVH-TSNQAASSNAHEMVSSDPVTSTTQKPL-AGISVSASL 444
+++ +WD+ P P H T+ QA S + + + L A I +
Sbjct: 169 KRRLTQWDIKP------PGYEHVTAEQAKLSGMFPLPGAPRQQAVDPSRLQAFIHPPTTS 222
Query: 445 AKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
S ++V ++R +RL V NLP SA+E++L++F N L +G+ + G PC+
Sbjct: 223 TAPGTSTNTVLKPSNSRQAKRLFVHNLPSSATEESLVQFFN--LQLNGLNVIKGVDPCVT 280
Query: 505 CVIQREKGQAFVEFLTAEDASAALCCDGCSF---------------SGSILKIKRPKEFV 549
+ +K A VEF A D + AL DG + S L I+RPK+++
Sbjct: 281 AQLSNDKTFALVEFRNAADTTVALAFDGITMEDNDEMDTTNGNSNGSNQGLSIRRPKDYI 340
Query: 550 EVA----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED 587
+ I + I + + V ++ +FG LK++ + +
Sbjct: 341 LPSAVEGEPHQEGVVSNVVPDSPNKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSE 400
Query: 588 HEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK 645
E AF EY D + T A+ LNG+++G + L V+A SI G+
Sbjct: 401 TGESRGIAFCEYRDPMSTDIAVENLNGMELGNKKLKVVRA----SIGTTQAAGLDMGV-- 454
Query: 646 HALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVN 698
+A+ + K T VL+L N+ E + + EE+ +DVR EC+++G V +
Sbjct: 455 NAMSMYAKTTSQDIEASRVLQLLNMVTTEEL--IDNDDYEEICDDVRDECSKYGEVVELK 512
Query: 699 VVKYGDSNISTIQACEGNENTASAGVGQ 726
V + GN N SAGVG+
Sbjct: 513 VPR-----------PTGN-NKQSAGVGK 528
>gi|389642205|ref|XP_003718735.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae 70-15]
gi|351641288|gb|EHA49151.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae 70-15]
Length = 620
Score = 94.4 bits (233), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 132/287 (45%), Gaps = 47/287 (16%)
Query: 454 VQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
V L +N R +RL + NLP +E +L+ FLN L +G+ + S PC+ C + +
Sbjct: 282 VTLRPTNSRQSKRLILSNLPAGTTEDSLISFLN--LQLNGLNVIEASDPCLACQMAPDGS 339
Query: 513 QAFVEFLTAEDASAALCCDGCSF-------------SGSILKIKRPKEFVEVAIF----- 554
A VEF + D + A DG S + L ++RPK+++ A+
Sbjct: 340 FAMVEFRSPSDTTVAYALDGISMEAEDAGNGDANGAASKGLAMRRPKDYIVPAVVDDTGY 399
Query: 555 -----------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFI 595
+ + L+ + V+E++ +FG LKA + EE AF
Sbjct: 400 EPGVVSSRVVDTPHKISVTNLPAYLTDEQVVELLSSFGELKALVLAKDSSTEESRGIAFC 459
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG-NPPFHGIPKHALPLLKKP 654
EYVD T AI GLNG+++G + L +A + I SG + + A + + P
Sbjct: 460 EYVDVTNTDVAIEGLNGMELGDKRLKVRKASI--GITQVSGMEMGVNAMSMLAGTVAQDP 517
Query: 655 --TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ VL+L N+ + + + EE+ EDV+ ECA++G+V + V
Sbjct: 518 DLSPVLQLLNMVTADEL--MDNDDYEEICEDVQEECAKYGTVIELKV 562
>gi|240279650|gb|EER43155.1| splicing factor u2af large subunit [Ajellomyces capsulatus H143]
gi|325092783|gb|EGC46093.1| splicing factor u2af large subunit [Ajellomyces capsulatus H88]
Length = 572
Score = 94.4 bits (233), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 106/429 (24%), Positives = 177/429 (41%), Gaps = 97/429 (22%)
Query: 361 GLGGYSPRKRRTEAAAKTPSPINRSP-------------EKKSAKWDVAPVETYSVPSNV 407
GG RKR + +P P R P +++ +WD+ P P
Sbjct: 135 AFGGGRERKR-----SASPPPKKREPTPDLTDVVPILERKRRLTQWDIKP------PGYE 183
Query: 408 HTSNQAASSNAHEMVSSDP----VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPM 463
H + + A + + P V + + + S A S ++V ++R
Sbjct: 184 HVTAEQAKLSGMFPLPGAPRQQAVDPSRLQAFIHPPTTTSTAP-GTSTNTVLKPSNSRQA 242
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
+RL V NLP SA+E++L++F N L +G+ + G PC+ + +K A +EF A D
Sbjct: 243 KRLFVHNLPSSATEESLVQFFN--LQLNGLNVIKGVDPCVTAQLSNDKTFALLEFRNAAD 300
Query: 524 ASAALCCDGCSF---------------SGSILKIKRPKEFVEVA---------------- 552
+ AL DG + S L I+RPK+++ +
Sbjct: 301 TTVALAFDGITMEDNDEMDTTNGDSNGSNQGLSIRRPKDYILPSAVEGEPNQEGVVSNVV 360
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTP 604
I + I + + V ++ +FG LK++ + + E AF EY D + T
Sbjct: 361 PDSPNKICVSNIPPFIEEEPVTMLLVSFGELKSFVLVKDSETGESRGIAFCEYRDPMSTD 420
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EV 657
A+ LNG+++G + L V+A SI G+ +A+ + K T V
Sbjct: 421 IAVENLNGMELGNKKLKVVRA----SIGTTQAAGLDMGV--NAMSMYAKTTSQDIEASRV 474
Query: 658 LKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACEGNE 717
L+L N+ E + + EE+ +DVR EC+++G V + V + GN
Sbjct: 475 LQLLNMVTTEEL--IDNDDYEEICDDVRDECSKYGEVVELKVPR-----------PTGN- 520
Query: 718 NTASAGVGQ 726
N SAGVG+
Sbjct: 521 NKQSAGVGK 529
>gi|259480265|tpe|CBF71237.1| TPA: splicing factor u2af large subunit (AFU_orthologue;
AFUA_7G05310) [Aspergillus nidulans FGSC A4]
Length = 547
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 167/379 (44%), Gaps = 64/379 (16%)
Query: 366 SPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
SP ++R T P + +++ +WD+ P P + + + A +A ++
Sbjct: 130 SPSRKREPTPDLTDVPSVLTRKRRLTQWDIKP------PGYENVTAEQAKLSADDLGMFP 183
Query: 426 PVTSTTQKPLAGISVSASLAKLNV-SMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
+ Q+P+ + A + + S D+ L SN R +RL V NLP +A+ + L+ F
Sbjct: 184 LPGAPRQQPMDPSRLQAFMNQSGGGSADTSALKPSNSRQAKRLFVYNLPPNATVENLVSF 243
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-------- 535
N L +G+ + PCI I + A +EF + D + AL DG +
Sbjct: 244 FN--LQLNGLNVIQSVDPCISAQISDDHSFALLEFKSPNDTTVALALDGITMGEHESNGE 301
Query: 536 --SGSILKIKRPKEFV---------EVA------------IFIGGISRTLSSKMVMEIVC 572
+ L+++RPK+++ E A I + I + + + V ++
Sbjct: 302 NGAAKGLEVRRPKDYIVPNLAEQDLEGASGMKDVPDSPNKICVSNIPQYIPEEPVTMLLK 361
Query: 573 AFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL--- 627
+FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 362 SFGELKSFVLVKDSSTEESRGIAFCEYADPNTTTIAVQGLNGMELGDRHLKVVRASIGMT 421
Query: 628 DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEV 680
+ +D N A+ + K T VL+L N+ PE + + EE+
Sbjct: 422 QAAGLDMGVN---------AMSMFAKTTSQDLESSRVLQLLNMVTPEEL--MDNEDYEEI 470
Query: 681 LEDVRLECARFGSVKSVNV 699
+DVR EC++FG V + +
Sbjct: 471 CDDVRDECSKFGRVLELKI 489
>gi|345569109|gb|EGX51978.1| hypothetical protein AOL_s00043g712 [Arthrobotrys oligospora ATCC
24927]
Length = 569
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 144/313 (46%), Gaps = 54/313 (17%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
+R +RL +NLP +E+ + F ++FL S P I + A +EF
Sbjct: 248 SRQAKRLLCQNLPPMCTEETIYSFFSSFLKSLNAVDSENE-PLITVYLNPTGTMAMLEFR 306
Query: 520 TAEDASAALCCDGCSFSGSILKIK--RPKEFV-------------EVA---------IFI 555
+ A+ L DG F + +KI+ RPK+++ +++ I +
Sbjct: 307 STAYATLCLAFDGMEFDDTEVKIRLSRPKDYIIPQYSESSESHNGDISPNVPDSINKICV 366
Query: 556 GGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGL 613
I L+ + VME++ FGPLK++ +++ +E AF EY+D + AI GLNGL
Sbjct: 367 SNIPTHLADQQVMELLQTFGPLKSFFLVKDKEMDESKGVAFCEYLDPNIAEIAIEGLNGL 426
Query: 614 KVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE-------VLKLKNVFNP 666
+ Q+L +A + + ++G GIP A+ ++ T VL+L N+
Sbjct: 427 DINEQLLNVKRASI--GVKQSAGAE--AGIP--AMTVIAATTSAEMEGGRVLQLLNMVTA 480
Query: 667 EGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSNISTIQACEGNENTASAGVGQ 726
+ L + E EE+LEDV EC +FG + + + + GN+ A+AGVG+
Sbjct: 481 DEL--LDQEEYEEILEDVTDECNKFGPIIDIKIPR-----------PSGNQR-AAAGVGK 526
Query: 727 NLTNDETNEKGER 739
E +E E+
Sbjct: 527 IYVRFEEHESAEK 539
>gi|156070760|gb|ABU45175.1| unknown [Solanum melongena]
Length = 553
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 139/315 (44%), Gaps = 73/315 (23%)
Query: 441 SASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSL 500
+ L V Q+ R RR+ V LP +A+E+++ F ++ + + G
Sbjct: 194 AGQFGALPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGD 253
Query: 501 PCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------ 548
+ I EK AFVE + E+AS A+ DG F G +K++RP ++
Sbjct: 254 AVVNVYINHEKKFAFVEMRSVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQ 313
Query: 549 ----VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED 587
+ +A IF+GG+ + + E++ +FG L+ F++ +D
Sbjct: 314 PSPNLNLAAVGLTPGSSGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRG--FDLVKD 371
Query: 588 HE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG----NPP 639
E + AF Y D VT A A LNG+K+G + LT +A N G NP
Sbjct: 372 RETGNSKGYAFCVYQDVSVTDIACAALNGIKMGDKTLTVRRA--------NQGITQPNPE 423
Query: 640 FHGIPKHALP------LLKKP----TEVLKLKNVFNPEGFSSLSEL----EVEEVLEDVR 685
+ HA + +P T++L L V S+ EL + +++LED+R
Sbjct: 424 QESVLLHAQQQIALQRFMLQPGALATKILCLTQVV------SVDELKDDDDYQDILEDMR 477
Query: 686 LECARFGSVKSVNVV 700
+EC +FG++ +NVV
Sbjct: 478 IECGKFGAL--LNVV 490
>gi|188998293|gb|ACD67872.1| U2 snRNP auxiliary factor large subunit [Solanum melongena]
Length = 554
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 139/315 (44%), Gaps = 73/315 (23%)
Query: 441 SASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSL 500
+ L V Q+ R RR+ V LP +A+E+++ F ++ + + G
Sbjct: 195 AGQFGALPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGD 254
Query: 501 PCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------ 548
+ I EK AFVE + E+AS A+ DG F G +K++RP ++
Sbjct: 255 AVVNVYINHEKKFAFVEMRSVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGPSQ 314
Query: 549 ----VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED 587
+ +A IF+GG+ + + E++ +FG L+ F++ +D
Sbjct: 315 PSPNLNLAAVGLTPGSSGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRG--FDLVKD 372
Query: 588 HE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG----NPP 639
E + AF Y D VT A A LNG+K+G + LT +A N G NP
Sbjct: 373 RETGNSKGYAFCVYQDVSVTDIACAALNGIKMGDKTLTVRRA--------NQGITQPNPE 424
Query: 640 FHGIPKHALP------LLKKP----TEVLKLKNVFNPEGFSSLSEL----EVEEVLEDVR 685
+ HA + +P T++L L V S+ EL + +++LED+R
Sbjct: 425 QESVLLHAQQQIALQRFMLQPGALATKILCLTQVV------SVDELKDDDDYQDILEDMR 478
Query: 686 LECARFGSVKSVNVV 700
+EC +FG++ +NVV
Sbjct: 479 IECGKFGAL--LNVV 491
>gi|440468063|gb|ELQ37246.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae Y34]
gi|440489023|gb|ELQ68704.1| splicing factor U2AF 50 kDa subunit [Magnaporthe oryzae P131]
Length = 640
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 132/287 (45%), Gaps = 47/287 (16%)
Query: 454 VQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
V L +N R +RL + NLP +E +L+ FLN L +G+ + S PC+ C + +
Sbjct: 282 VTLRPTNSRQSKRLILSNLPAGTTEDSLISFLN--LQLNGLNVIEASDPCLACQMAPDGS 339
Query: 513 QAFVEFLTAEDASAALCCDGCSF-------------SGSILKIKRPKEFVEVAIF----- 554
A VEF + D + A DG S + L ++RPK+++ A+
Sbjct: 340 FAMVEFRSPSDTTVAYALDGISMEAEDAGNGDANGAASKGLAMRRPKDYIVPAVVDDTGY 399
Query: 555 -----------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFI 595
+ + L+ + V+E++ +FG LKA + EE AF
Sbjct: 400 EPGVVSSRVVDTPHKISVTNLPAYLTDEQVVELLSSFGELKALVLAKDSSTEESRGIAFC 459
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG-NPPFHGIPKHALPLLKKP 654
EYVD T AI GLNG+++G + L +A + I SG + + A + + P
Sbjct: 460 EYVDVTNTDVAIEGLNGMELGDKRLKVRKASI--GITQVSGMEMGVNAMSMLAGTVAQDP 517
Query: 655 --TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ VL+L N+ + + + EE+ EDV+ ECA++G+V + V
Sbjct: 518 DLSPVLQLLNMVTADEL--MDNDDYEEICEDVQEECAKYGTVIELKV 562
>gi|356517814|ref|XP_003527581.1| PREDICTED: splicing factor U2af large subunit A-like [Glycine max]
Length = 605
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 152/362 (41%), Gaps = 69/362 (19%)
Query: 390 SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNV 449
++ +D+AP + + + Q +N + P PLA + + L V
Sbjct: 201 TSGFDMAPPASAMLAGASAVAGQITGAN-----PTIPGMFPNMFPLATNQMQ-QFSALPV 254
Query: 450 SMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQR 509
Q+ R RR+ V LP +A+E+++ F + + G + I
Sbjct: 255 MPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSQVMAKIGGNTAGPGDAVVNVYINH 314
Query: 510 EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------------- 548
+K AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 315 DKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLNLGA 374
Query: 549 VEVA------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPC 592
V + IF+GG+ + + E++ FGPL+ F++ +D E +
Sbjct: 375 VGLTPGSAGGLDGPDRIFVGGLPYYFTETQIRELLETFGPLRG--FDLVKDRETGNSKGY 432
Query: 593 AFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLK 652
AF Y D VT A A LNG+K+G + LT +A N G P P+ L+
Sbjct: 433 AFCVYQDLAVTDIACAALNGIKMGDKTLTVRRA--------NQGANPQQPKPEQESILMH 484
Query: 653 KPTEVLKLKNVFNPEGFS----------SLSEL----EVEEVLEDVRLECARFGSVKSVN 698
++ K + P + S EL + +E+L+D+R EC++FG++ VN
Sbjct: 485 AQQQIALQKLMLQPALVATKVVCLTHAVSSDELKDDEDYDEILDDMRQECSKFGTL--VN 542
Query: 699 VV 700
VV
Sbjct: 543 VV 544
>gi|302816055|ref|XP_002989707.1| hypothetical protein SELMODRAFT_160385 [Selaginella moellendorffii]
gi|300142484|gb|EFJ09184.1| hypothetical protein SELMODRAFT_160385 [Selaginella moellendorffii]
Length = 421
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 143/333 (42%), Gaps = 79/333 (23%)
Query: 434 PLAGISVSASLAKLNVSMDSVQLTQ-SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSG 492
P AG +M + +TQ + R RR+ V LP A+E+ + F + + + G
Sbjct: 32 PFAGTQARLLFFAGLPTMPAQAMTQQATRHARRVYVGGLPPLANEQTIATFFSQVMSAIG 91
Query: 493 VQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF---- 548
+ I +EK AFVE T E+AS A+ DG F G ++++RP ++
Sbjct: 92 GNTAGPGDAVVNVYINQEKKFAFVEMRTVEEASNAMALDGIIFEGVSVRVRRPSDYNPSM 151
Query: 549 ------------VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKA 579
+ +A IF+GG+ L+ + E++ +FGPL+
Sbjct: 152 AATLGPSQPSPHLNLAAVGLTPGAAGGADGPDRIFVGGLPYYLTEGQIKELLESFGPLRG 211
Query: 580 YHFEVNED--HEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN 637
+ + D + + F Y D VT A A LNGLK+G + LT +A S+ +SG
Sbjct: 212 FDLVKDRDTGNSKGYGFCVYQDPAVTDVACAALNGLKMGDKTLTVRRAT--ASV--HSGQ 267
Query: 638 P--------------------PFHGIPKH---------ALPLLKKPTEVLKLKNVFNPEG 668
P G P + + + + PT+V+ LK V +P+
Sbjct: 268 PKPDQANVLAQAQQQIALQKLALQGAPYYNMMMPGVDNGMTMPETPTKVVCLKQVVSPD- 326
Query: 669 FSSLSEL----EVEEVLEDVRLECARFGSVKSV 697
EL E EE+LED+R EC ++GSV ++
Sbjct: 327 -----ELKEDDEYEEILEDMREECGKYGSVATL 354
>gi|168026451|ref|XP_001765745.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682922|gb|EDQ69336.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 491
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 131/282 (46%), Gaps = 42/282 (14%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP A+E+ + + + + + G + I +EK AFVE
Sbjct: 153 QATRHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYINQEKKFAFVE 212
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
T E+AS A+ DG F G ++++RP ++ + +A
Sbjct: 213 MRTVEEASNAMALDGIMFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNLAAVGLTPGAA 272
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEEPCAFIEYVDQLV 602
IF+GG+ L+ + E++ +FGPL+ + + D + + F Y D V
Sbjct: 273 GGADGPDRIFVGGLPYYLTEIQIKELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAV 332
Query: 603 TPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPL-----LKKPTEV 657
T AIA LNGLK+G + L+ +A G + N H + A+ + L+ T+V
Sbjct: 333 TDVAIAALNGLKMGDKTLSVRRASASGQPKPDQANVLAHAQQQIAIQVFWMSPLETSTKV 392
Query: 658 LKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ L V +P+ + E +++LED++ EC ++G++ V +
Sbjct: 393 VALTQVVSPDELK--DDEEYQDILEDMKEECGKYGNLLRVVI 432
>gi|392572624|gb|EIW65769.1| hypothetical protein TREMEDRAFT_41238 [Tremella mesenterica DSM
1558]
Length = 596
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/348 (25%), Positives = 148/348 (42%), Gaps = 60/348 (17%)
Query: 389 KSAKWDVAPVETYSVP------SNVHTSNQAASSNAHEM-VSSDPVTSTTQKPLAGISVS 441
K + WD+ PV+ + + + T E+ + S + + P A
Sbjct: 172 KHSLWDIRPVQFAGIGAMAAKMTGMFTYGPGRVPPPPELGIPSALIAGSFPPPGA----- 226
Query: 442 ASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLP 501
R +R+ V + S ++ +L+EF N + G P
Sbjct: 227 ----------------NGLRQAKRIYVGGITESMTDASLLEFFNTTMSERGFTLEIPGDP 270
Query: 502 CIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEV---------- 551
+ EK AF+EF +AE+AS+AL D F L++KRPK++ +
Sbjct: 271 IGAVQVNHEKAFAFLEFRSAEEASSALKLDNVMFEDVPLRVKRPKDYTGLDPLQHTMGGA 330
Query: 552 --------AIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEE--PCAFIEYVDQL 601
+FIGG+ L VME++ +FG L++++ + D E AF EY D
Sbjct: 331 QAMSDSPNKLFIGGLPTYLDEAQVMELLKSFGELRSFNLVKDPDSSENKGFAFAEYTDPS 390
Query: 602 VTPKAIAGLNGLKVGGQVLTAVQAVL---DGSIMDNSGNPPFHGIPKHALPLLKK----P 654
T AI+GLN +G ++L +A + G+ G+ F + K A+ + P
Sbjct: 391 NTDMAISGLNNFSLGDRILVVQRAAVGRASGTTDAIPGSESF--LAKSAIFAQENQQSGP 448
Query: 655 T-EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK 701
T V+ L N+ + + E +E+LED+ EC+RFG ++ V V +
Sbjct: 449 TSRVMLLLNMVTADEL--YDDQEYQEILEDITSECSRFGEIEGVRVPR 494
>gi|402223467|gb|EJU03531.1| hypothetical protein DACRYDRAFT_77158 [Dacryopinax sp. DJM-731 SS1]
Length = 392
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 122/271 (45%), Gaps = 41/271 (15%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL + ++ +E+ L FL L G++ + + EK A++EF +D
Sbjct: 57 RRLYIGDITPDTTEENLTAFLKKTLPELGIKVEGEDVGFEEVRVSHEKNYAYIEFSNPDD 116
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------IFIGGIS 559
A+ + DG F G LKI+RP +++ IF+GG+
Sbjct: 117 ATKTMELDGTVFLGQPLKIRRPHDYLSATDLAVVFGGIVPGVVSTNVPDSINKIFVGGLP 176
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
L+ VME++ FG L+A++ + + + AF EY+D VT A GLNG+++G
Sbjct: 177 TYLNEAQVMELLQTFGELRAFNLVKDGSTGVSKGFAFFEYMDPGVTDVACQGLNGMELGD 236
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIP-------KHALPL--LKKPTEVLKLKNVFNPEG 668
+ L +A SI N P +P LP+ P+ +L L N+ E
Sbjct: 237 RYLVVQRA----SIGANPTKPNMPNMPGTLPPPRPAILPVDNTNPPSPILLLLNMVTAE- 291
Query: 669 FSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L + + E++LEDVR E RFG V + +
Sbjct: 292 -ELLQDQDYEDILEDVREEMGRFGPVIDIKI 321
>gi|440792998|gb|ELR14199.1| U2 snRNP auxilliary factor, large subunit, splicing factor
subfamily protein [Acanthamoeba castellanii str. Neff]
Length = 462
Score = 92.8 bits (229), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 116/246 (47%), Gaps = 57/246 (23%)
Query: 501 PCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-------- 552
P + + +K +F+EF T ++A+A + DG + +G LK++RPK++V
Sbjct: 193 PVVSSQLNPDKSFSFIEFSTIDEATAGMALDGITMNGMTLKVRRPKDYVSPPTAQAPASG 252
Query: 553 ------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPC 592
IFIGG+ L+ V E++ AFGPLKA++ + + +
Sbjct: 253 GIHIPGIVSTNVPDSPNKIFIGGLPSYLNEAQVKELLTAFGPLKAFNLVKDTATGNSKGY 312
Query: 593 AFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLK 652
AF EY+D VT +A GLNG+K+G + L VQ G+ D +G L+
Sbjct: 313 AFFEYLDASVTDRACQGLNGMKLGDKTLL-VQRANIGAKQDGTGG------------LIM 359
Query: 653 KPTEVLKLKNVFNPEGFSS--------------LSELEVEEVLEDVRLECARFGSVKSVN 698
P + + N +P S +S+ + +++EDVR EC ++G+V SV
Sbjct: 360 MPMDPSGMLNA-SPSAASLLNLQLLNLVRPEELVSDEDHADIVEDVRQECEKYGNVMSV- 417
Query: 699 VVKYGD 704
V++ D
Sbjct: 418 FVEFAD 423
>gi|67541022|ref|XP_664285.1| hypothetical protein AN6681.2 [Aspergillus nidulans FGSC A4]
gi|40738434|gb|EAA57624.1| hypothetical protein AN6681.2 [Aspergillus nidulans FGSC A4]
Length = 624
Score = 92.8 bits (229), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 167/379 (44%), Gaps = 64/379 (16%)
Query: 366 SPRKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
SP ++R T P + +++ +WD+ P P + + + A +A ++
Sbjct: 130 SPSRKREPTPDLTDVPSVLTRKRRLTQWDIKP------PGYENVTAEQAKLSADDLGMFP 183
Query: 426 PVTSTTQKPLAGISVSASLAKLNV-SMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
+ Q+P+ + A + + S D+ L SN R +RL V NLP +A+ + L+ F
Sbjct: 184 LPGAPRQQPMDPSRLQAFMNQSGGGSADTSALKPSNSRQAKRLFVYNLPPNATVENLVSF 243
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-------- 535
N L +G+ + PCI I + A +EF + D + AL DG +
Sbjct: 244 FN--LQLNGLNVIQSVDPCISAQISDDHSFALLEFKSPNDTTVALALDGITMGEHESNGE 301
Query: 536 --SGSILKIKRPKEFV---------EVA------------IFIGGISRTLSSKMVMEIVC 572
+ L+++RPK+++ E A I + I + + + V ++
Sbjct: 302 NGAAKGLEVRRPKDYIVPNLAEQDLEGASGMKDVPDSPNKICVSNIPQYIPEEPVTMLLK 361
Query: 573 AFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL--- 627
+FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 362 SFGELKSFVLVKDSSTEESRGIAFCEYADPNTTTIAVQGLNGMELGDRHLKVVRASIGMT 421
Query: 628 DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEV 680
+ +D N A+ + K T VL+L N+ PE + + EE+
Sbjct: 422 QAAGLDMGVN---------AMSMFAKTTSQDLESSRVLQLLNMVTPEEL--MDNEDYEEI 470
Query: 681 LEDVRLECARFGSVKSVNV 699
+DVR EC++FG V + +
Sbjct: 471 CDDVRDECSKFGRVLELKI 489
>gi|357470349|ref|XP_003605459.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355506514|gb|AES87656.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 634
Score = 92.4 bits (228), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 135/295 (45%), Gaps = 67/295 (22%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP +A+E+++ F + + + G + I +K AFVE
Sbjct: 294 QATRHARRVYVGGLPPTANEQSVAIFFSQVMANIGGNTAGPGDAVVNVYINHDKKFAFVE 353
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF---------------------VEVA---- 552
+ E+AS A+ DG F G+ +K++RP ++ V +
Sbjct: 354 MRSVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAATLGPSQPNPNLNLGAVGLTPGSA 413
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + + E++ FGPL+ F++ +D E + AF Y D
Sbjct: 414 GGLEGPDRIFVGGLPYYFTETQIRELLETFGPLRG--FDLVKDRETGNSKGYAFCVYADL 471
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN----PPFHGIPKHALP------L 650
VT A A LNG+K+G + LT +A N G P I HA L
Sbjct: 472 AVTDIACAALNGIKMGDKTLTVRRA--------NQGTTQPKPEQESILMHAQQQIALQKL 523
Query: 651 LKKP----TEVLKLKNVFNPEGFSSLSELE-VEEVLEDVRLECARFGSVKSVNVV 700
+ +P T+V+ L N PE L E E EE+++D+R EC++FGS+ VNVV
Sbjct: 524 IFQPALVATKVVCLTNAVAPE---ELKEDEDFEEIIDDMRQECSKFGSL--VNVV 573
>gi|168056046|ref|XP_001780033.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668531|gb|EDQ55136.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 536
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/365 (23%), Positives = 151/365 (41%), Gaps = 72/365 (19%)
Query: 389 KSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL- 447
K++ +D+AP P A M P + P G A L
Sbjct: 131 KTSGFDMAP------PGGTIVPGAAVPGQISGMPPQMPGVFPSMFPFGG----AQFGGLP 180
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
+ ++ Q+ R RR+ V LP A+E+ + + + + + G + I
Sbjct: 181 GMPAQAMTQQQATRHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYI 240
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEV 551
+EK AFVE T E+AS A+ DG F G ++++RP ++ + +
Sbjct: 241 NQEKKFAFVEMRTVEEASNAMSLDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNL 300
Query: 552 A-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEEPC 592
A IF+GG+ L+ + E++ +FGPL+ + + D + +
Sbjct: 301 AAVGLTPGAAGGADGPDRIFVGGLPYYLTEPQIKELLESFGPLRGFDLVKDRDTGNSKGY 360
Query: 593 AFIEYVDQLVTPK-AIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLL 651
F Y D VT AIA LNGLK+G + L+ +A S G P P A L+
Sbjct: 361 GFCVYQDPSVTTDVAIAALNGLKMGDKTLSVRRA----SARYGIGQPK----PDQANVLI 412
Query: 652 KKPTEV---LKLKNVFNPEGFSSLSEL--------------EVEEVLEDVRLECARFGSV 694
++ + LK + + + F++ E +E+LED+R+EC ++G++
Sbjct: 413 HAQQQIALQVTLKMLLHRKTFTAAWTFYAQVVTPNQLEDDEEYQEILEDMRMECGKYGNL 472
Query: 695 KSVNV 699
+V +
Sbjct: 473 LNVVI 477
>gi|302813365|ref|XP_002988368.1| hypothetical protein SELMODRAFT_235524 [Selaginella moellendorffii]
gi|300143770|gb|EFJ10458.1| hypothetical protein SELMODRAFT_235524 [Selaginella moellendorffii]
Length = 353
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 129/287 (44%), Gaps = 64/287 (22%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R R++ V LP +E+ + F N +++ G + I +EK AFVE
Sbjct: 17 QATRHARQVYVGGLPGLVNEQTIATFFNQVMVNVGGNTAGPGDVVVNVYINQEKKFAFVE 76
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
T E+AS A+ DG SF G ++++RP ++ + +A
Sbjct: 77 MRTVEEASNAMALDGISFQGVSVRVRRPSDYNPSVAANLGPSQPSPSLNLAAVGLTPGAG 136
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ L+ + E++ +FGPL+ F++ +D E + F Y D
Sbjct: 137 GGVDGPDRIFVGGLPYYLTEPQIRELLESFGPLRG--FDLVKDRESGNSKGYGFCVYQDP 194
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKL 660
VT A A LNGLK+G + LT +A +G H H L L K L +
Sbjct: 195 NVTDVACAALNGLKMGDRTLTVRRATANGQQAGQD-----H---AHILSLAKS----LTM 242
Query: 661 KNVFNPEGFSSL---------SEL----EVEEVLEDVRLECARFGSV 694
VF EG + + +EL + +E+LED+R EC +FG+V
Sbjct: 243 NGVFPDEGATRVLCLKEAVLEAELIEDEQFDEILEDMRDECGKFGTV 289
>gi|331243454|ref|XP_003334370.1| splicing factor U2AF subunit [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 600
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 132/302 (43%), Gaps = 58/302 (19%)
Query: 454 VQLTQS-NRPMRRLCVENLPLSASEKALMEFLNN-----FLLS----SGVQHVHGSLPCI 503
V TQS R RRL V N+ +A+E + EF N LL+ G+ P +
Sbjct: 213 VAQTQSFARQARRLYVGNILHTANEMNVAEFFNAKMKELGLLARNNEDGMAISISENPVV 272
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEV------------ 551
+ EK AFVEF AE+A+ + DG F LKI+RPK++
Sbjct: 273 AVQVNHEKNYAFVEFRNAEEATHGMSFDGIIFQNQALKIRRPKDYTGPDHAGPTHIPGVV 332
Query: 552 ---------AIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH----EEPCAFIEYV 598
IFIGG+ L+ VME++ +FG LK+++ + + AF EYV
Sbjct: 333 STNVPDSPNKIFIGGLPSYLTDDQVMELLKSFGELKSFNLVKDTSSGGHVSKGFAFCEYV 392
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP------------FHGIPKH 646
D +T A GLNG+++G + L +A + + N P F G
Sbjct: 393 DPDLTDIACQGLNGMELGDRYLVVQRAQIGQNAKKEKENNPDGQRNNYNQFNNFAGGQAT 452
Query: 647 ALPLLK---------KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSV 697
A + T VL++ N+ N E + + E E+LED+R EC ++G ++ V
Sbjct: 453 AAASSVLAAVKSGEGEKTRVLQMLNMVNQEEL--VDDQEYGEILEDIRDECGKYGKIEGV 510
Query: 698 NV 699
+
Sbjct: 511 RI 512
>gi|226532558|ref|NP_001140768.1| uncharacterized protein LOC100272843 [Zea mays]
gi|194701008|gb|ACF84588.1| unknown [Zea mays]
gi|414591744|tpg|DAA42315.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
Length = 583
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 113/259 (43%), Gaps = 42/259 (16%)
Query: 407 VHTSNQAASSNAHEMVSSDPVTST---TQKPLAGISVSASLAKLNVSMDSVQLTQSNRPM 463
V +QA + A +V++ + P+ G+ V +L L Q+ R
Sbjct: 189 VSGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQPQAMTQQATRHA 248
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RR+ V LP +A+E+ + F N + + G + I +K AFVE + E+
Sbjct: 249 RRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEE 308
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF---VEVA---------------------------- 552
AS A+ DG F G+ +KI+RP ++ + A
Sbjct: 309 ASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGSAGGLEGP 368
Query: 553 --IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQLVTPKA 606
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D VT A
Sbjct: 369 DRIFVGGLQYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDLNVTDIA 426
Query: 607 IAGLNGLKVGGQVLTAVQA 625
A LNG+K+G + LT +A
Sbjct: 427 CAALNGIKMGDKTLTVRRA 445
>gi|302795921|ref|XP_002979723.1| hypothetical protein SELMODRAFT_13030 [Selaginella moellendorffii]
gi|300152483|gb|EFJ19125.1| hypothetical protein SELMODRAFT_13030 [Selaginella moellendorffii]
Length = 360
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 129/287 (44%), Gaps = 64/287 (22%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R R++ V LP +E+ + F N +++ G + I +EK AFVE
Sbjct: 24 QATRHARQVYVGGLPGLVNEQTIATFFNQVMVNVGGNTAGPGDVVVNVYINQEKKFAFVE 83
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
T E+AS A+ DG SF G ++++RP ++ + +A
Sbjct: 84 MRTVEEASNAMALDGISFQGVSVRVRRPSDYNPSVAANLGPSQPSPSLNLAAVGLTPGAG 143
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ L+ + E++ +FGPL+ F++ +D E + F Y D
Sbjct: 144 GGVDGPDRIFVGGLPYYLTEPQIRELLESFGPLRG--FDLVKDRESGNSKGYGFCVYQDP 201
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKL 660
VT A A LNGLK+G + LT +A +G H H L L K L +
Sbjct: 202 NVTDVACAALNGLKMGDRTLTVRRATANGQQAGQD-----H---AHILSLAKS----LTM 249
Query: 661 KNVFNPEGFSSL---------SEL----EVEEVLEDVRLECARFGSV 694
VF EG + + +EL + +E+LED+R EC +FG+V
Sbjct: 250 NGVFPDEGATRVLCLKEAVLEAELIEDEQFDEILEDMRDECGKFGTV 296
>gi|452841884|gb|EME43820.1| hypothetical protein DOTSEDRAFT_71600 [Dothistroma septosporum
NZE10]
Length = 433
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/395 (24%), Positives = 162/395 (41%), Gaps = 84/395 (21%)
Query: 368 RKRRTEAAAKTPSPINRSPE----------------KKSAKWDVAPVETYSVPSNVHTSN 411
R+ + A K+ SP R P+ ++ +WD+ P +V T+
Sbjct: 2 RQLERQTARKSASPPPRKPKEPTPDLTEVTSVLERKRRLTQWDIKPPGYENV-----TAE 56
Query: 412 QAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVEN 470
QA S + + Q+P+ + A + + + L S R +RL + N
Sbjct: 57 QAKLSGMFPLPGA-----PRQQPMDPQKLQAFMNQPGGEANKTALKPSTARQSKRLLIYN 111
Query: 471 LPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCC 530
+P SA+E +M+F N L +G+ G+ PCI + ++K A +EF T EDA+ A+
Sbjct: 112 IPASATEDTIMDFFN--LQLNGLNVTRGADPCISAQLSQDKAYALLEFKTPEDATNAMAF 169
Query: 531 DGCSFSGSI---------------LKIKRPKEFV----------------------EVAI 553
DG + L IKRPK+++ + I
Sbjct: 170 DGINMEPEAMVTSGNEDENGGARGLDIKRPKDYIVPVVTDGTENDAGVLSNVVPDTQNKI 229
Query: 554 FIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLN 611
I I + + ME++ +FG LK + + EE AF EY D T A+ L+
Sbjct: 230 SITNIPAYVDEEQTMELLNSFGELKNFVLVKDASTEESRGIAFCEYKDPNSTKVAVESLH 289
Query: 612 GLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVF 664
G+ +G + A + I SG + +A+ L+ VL L N+
Sbjct: 290 GMTLGDAAMKVRLASI--GIQQVSGE-----MSVNAMSLMAGTARADGEGGRVLSLMNMI 342
Query: 665 NPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PE + E +E+LEDV+ ECA++G + V +
Sbjct: 343 TPEEL--MDPDEADEILEDVKEECAKYGPLLDVKM 375
>gi|168028774|ref|XP_001766902.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681881|gb|EDQ68304.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 486
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/352 (24%), Positives = 146/352 (41%), Gaps = 59/352 (16%)
Query: 389 KSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLN 448
K++ +D+AP P + Q M + P P G L
Sbjct: 96 KTSGFDMAP------PGATVVAGQIPG-----MPPAMPGVFPAMFPFGGTQQFGGLP--- 141
Query: 449 VSMDSVQLTQ-SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
M + +TQ + R RR+ V LP A+E+ + + + + + G + I
Sbjct: 142 -GMPAQAMTQQATRHARRVYVGGLPPMANEQTIATYFSQVMAAVGGNTAGPGDAVVNVYI 200
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEV 551
+EK AFVE T E+AS A+ DG F G ++++RP ++ + +
Sbjct: 201 NQEKKFAFVEMRTVEEASNAMALDGIIFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLNL 260
Query: 552 A-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEEPC 592
A IF+GG+ L+ + E++ +FGPL+ + + D + +
Sbjct: 261 AAVGLTPGAAGGADGPDRIFVGGLPYYLTEVQIKELLESFGPLRGFDLVKDRDTGNSKGY 320
Query: 593 AFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLL- 651
F Y D V A A LNG+K+ + L +A G + N H + A+ L
Sbjct: 321 GFCVYQDPSVVDIACATLNGMKMDDKTLNVRRATASGQPKPDQANVLAHAQQQIAIQKLA 380
Query: 652 ---KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
K PT+V+ L V P + E +E++ED+ EC ++G++ VN V
Sbjct: 381 LQAKTPTKVVALTEVVTPNQLE--DDEEYQEIMEDMGTECGKYGTL--VNCV 428
>gi|303274813|ref|XP_003056721.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461073|gb|EEH58366.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 709
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 112/249 (44%), Gaps = 47/249 (18%)
Query: 492 GVQHVHGSLPCIGCVIQR----EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKE 547
G + +H L V+++ +KG +FVE EDA A L DG G++LKI+RPK+
Sbjct: 329 GTEKLHPKLDAPTGVVEKVFISDKGYSFVEGTAVEDACALLALDGIVVRGNLLKIRRPKD 388
Query: 548 FVE-------------------------------VAIFIGGIS-RTLSSKMVMEIVCAFG 575
+V IF+G + R L + EI+ +FG
Sbjct: 389 YVASENPLVVDGTMKDVMKRTFEKIIRPSVPDTNTKIFLGNLGDRELDVLELKEIIASFG 448
Query: 576 PLKAYHFEVNEDH--EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD----G 629
++A E + + A+ EY+D V P A++GL G+ V G+ L A A D G
Sbjct: 449 RVRAMRAETDAHGRIKRGRAWFEYLDPTVAPHAVSGLTGVMVNGRRLVAAFATPDAPDVG 508
Query: 630 SIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECA 689
+ + G + +P A PLL+ VL +NV + + ++ +EDV+LEC
Sbjct: 509 DVREEDGACAYE-VPAAAEPLLRPEQRVLAFENVLA----AGMDAAAMKAAIEDVKLECE 563
Query: 690 RFGSVKSVN 698
G+V S +
Sbjct: 564 SLGNVLSTH 572
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 31/46 (67%)
Query: 898 GCVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
G VFVE+ R E + +AAH HRRLFD R V +EY P+ Y+ F+K
Sbjct: 635 GEVFVEFARVETATVAAHRFHRRLFDGRSVKIEYFPIAEYQRAFAK 680
>gi|453082700|gb|EMF10747.1| hypothetical protein SEPMUDRAFT_48483 [Mycosphaerella populorum
SO2202]
Length = 432
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 162/358 (45%), Gaps = 66/358 (18%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+P+ + A + +
Sbjct: 38 KRRLTQWDIKPPGYDNV-----TAEQAKLSGMFPLPGA-----PRQQPMDPAKLQAFMNQ 87
Query: 447 L-NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
N + S S R +R+ V NLP SA+++++++F N L +G+ G PCI
Sbjct: 88 PGNQASSSALKPSSARQSKRVMVHNLPPSATDESMVDFFN--LQLNGLNITRGVDPCISA 145
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSF-------SGSI------LKIKRPKEFV--- 549
++K A VEF T EDA+ A+ DG + SG+ L+IKRP++++
Sbjct: 146 QCSKDKTYALVEFKTPEDATNAMALDGITMDHDAMDTSGASNGAPKGLQIKRPRDYIVPN 205
Query: 550 -------------------EVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEE 590
+ I I + L+ + + E++ +FG LK++ N+ E
Sbjct: 206 VIDETENESGLLSNTVPDTQNKISITNLPSFLAEEQIQELLMSFGELKSFVLVRNQSSGE 265
Query: 591 P--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHAL 648
AF EY D VT A+ LNG+++ + A + I S + +A+
Sbjct: 266 SRGIAFCEYKDPSVTKVAVDSLNGMELADTAMRVKLASI--GIQQVSSE-----MSVNAM 318
Query: 649 PLL--KKPTE-----VLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L+ K T+ VL L N+ PE + + +E+LEDV+ ECA++G + V +
Sbjct: 319 SLMAGAKSTDADNGRVLALMNMITPEEL--MDPDDADEILEDVKEECAKYGPLLEVKM 374
>gi|255089803|ref|XP_002506823.1| predicted protein [Micromonas sp. RCC299]
gi|226522096|gb|ACO68081.1| predicted protein [Micromonas sp. RCC299]
Length = 554
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 141/300 (47%), Gaps = 49/300 (16%)
Query: 442 ASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLS----SGVQHVH 497
A LA L + +VQ T R RR+ V P + +E L F+ N L++ SG
Sbjct: 199 ARLAALQAAGINVQAT---RHARRVYVGGFPDNTNEPELASFIANALVAIGGASGAYDPD 255
Query: 498 GSLPCIGCV-IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-------- 548
+ C+ V I R+K AFVEF T E+AS A+ DG +GS L+++RP ++
Sbjct: 256 NGMTCVLSVYINRDKLFAFVEFRTVEEASNAMALDGVVMAGSQLRVRRPNDYQPQQAALI 315
Query: 549 --------VEVA----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEV 584
+ +A +++G + L+ V+E++ +FG ++A++ V
Sbjct: 316 GPTTPADSLNLAAVGLIPGVNGQSSGRKLYVGNLPPYLTELQVLELLQSFGAVQAFNLVV 375
Query: 585 NEDHE--EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN----SGNP 638
++D + F EY D A+ GL G+++G +VL +A DG + SG+
Sbjct: 376 DKDTGTLKGYGFFEYADAAADEAAMEGLTGMRLGDKVLNVKRAAYDGGVGQGVGQASGSA 435
Query: 639 PFHGIPKHALPL-LKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSV 697
G +LP + +E ++L N+ E + + E E+LED + ECA FG + V
Sbjct: 436 QAPGFAPGSLPANGESASECVRLTNMVTREELTDPT--EAREILEDTQEECAGFGELTRV 493
>gi|414591745|tpg|DAA42316.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
Length = 538
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 113/259 (43%), Gaps = 42/259 (16%)
Query: 407 VHTSNQAASSNAHEMVSSDPVTST---TQKPLAGISVSASLAKLNVSMDSVQLTQSNRPM 463
V +QA + A +V++ + P+ G+ V +L L Q+ R
Sbjct: 189 VSGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQPQAMTQQATRHA 248
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RR+ V LP +A+E+ + F N + + G + I +K AFVE + E+
Sbjct: 249 RRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEE 308
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF---VEVA---------------------------- 552
AS A+ DG F G+ +KI+RP ++ + A
Sbjct: 309 ASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGSAGGLEGP 368
Query: 553 --IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQLVTPKA 606
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D VT A
Sbjct: 369 DRIFVGGLQYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDLNVTDIA 426
Query: 607 IAGLNGLKVGGQVLTAVQA 625
A LNG+K+G + LT +A
Sbjct: 427 CAALNGIKMGDKTLTVRRA 445
>gi|156070797|gb|ABU45209.1| unknown [Solanum bulbocastanum]
Length = 558
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 154/362 (42%), Gaps = 79/362 (21%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
+D+AP + + + Q + S P + PLA + L +
Sbjct: 161 FDMAPPTSALLSGATDVAGQVPGTTN----PSIPGMFSNMFPLA----AGQFGALPIMPV 212
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+ R RR+ V LP +A+E+++ F ++ + + G + I EK
Sbjct: 213 QAMTQQATRHARRVYVGGLPPTANEQSVATFFSHVMYAIGGNTAGPGDAVVNVYINHEKK 272
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA---- 552
AFVE + E+AS A+ DG F G +K++RP ++ + +A
Sbjct: 273 FAFVEMRSVEEASNAMALDGVVFEGGPVKVRRPSDYNPSLAATLGPSQPSPNLNLAAVGL 332
Query: 553 -------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFI 595
IF+GG+ + + E++ +FG L+ F++ +D E + AF
Sbjct: 333 TPGSSGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRG--FDLVKDRETGNSKGYAFC 390
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG----NPPFHGIPKHALP-- 649
Y D VT A A LNG+K+G + LT +A N G NP + HA
Sbjct: 391 VYQDVSVTDIACAALNGIKMGDKTLTVRRA--------NQGTTQPNPEQESVLLHAQQQI 442
Query: 650 ----LLKKP----TEVLKLKNVFNPEGFSSLSEL----EVEEVLEDVRLECARFGSVKSV 697
+ +P T+VL L V S+ EL + +++LED+R+EC +FG++ +V
Sbjct: 443 ALQRFMLQPGALATKVLCLTEVV------SVDELKDDDDYQDILEDMRIECGKFGALLNV 496
Query: 698 NV 699
+
Sbjct: 497 VI 498
>gi|407929464|gb|EKG22293.1| hypothetical protein MPH_00360 [Macrophomina phaseolina MS6]
Length = 824
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 129/283 (45%), Gaps = 53/283 (18%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
+R +RL V N P ++++ ++ +F N L + + + S PCI I +++ A EF
Sbjct: 18 SRQAKRLFVYNFPAASTDDSIQDFFN--LQLNHLNVISSSDPCISVQISKDRTFALCEFK 75
Query: 520 TAEDASAALCCDGCSFSG------------SILKIKRPKEFVEVA--------------- 552
T ED + AL DG S S +KI RPK+++ A
Sbjct: 76 TPEDTTMALALDGQSMEAEDASNGASNGGHSGIKISRPKDYIVPAQSDDADYQEGVVSNK 135
Query: 553 -------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVT 603
I + I L+ + VM+++ AFG LKA+ + D + AF EYVD T
Sbjct: 136 VKDGPHKICVAQIPVYLTEEQVMDLLSAFGGLKAFTLVKDTGTDQSKGIAFCEYVDPDTT 195
Query: 604 PKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE------- 656
A+ GL+G+++ L +A + I SG G+ +A+ +L +
Sbjct: 196 DPAVEGLDGMEIAQDHLKVKKACV--GIQQASGLE--MGV--NAMSMLAGTSSGDVEQGR 249
Query: 657 VLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
VL L N+ PE + E EE+ EDV EC+++G V+ + +
Sbjct: 250 VLMLLNMVTPEEL--MDPQEYEEIQEDVHEECSKYGKVEELKI 290
>gi|224134362|ref|XP_002327819.1| predicted protein [Populus trichocarpa]
gi|222836904|gb|EEE75297.1| predicted protein [Populus trichocarpa]
Length = 541
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 159/370 (42%), Gaps = 70/370 (18%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAH---EMVSSDPVTSTTQKPLAGISVSASL 444
K+ + +D+AP + +P+ + AA+++A ++ + P + + S
Sbjct: 124 KRMSGFDMAPPSSAILPNAAAAAAAAAAASAAASGQIAGTTPPIPGMFPNMFPLGTSQQF 183
Query: 445 AKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
L V Q+ R RR+ V LP +A+E+++ F + + + G +
Sbjct: 184 GALPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSQVMAAIGGNTAGPGDAVVN 243
Query: 505 CVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF---------------- 548
I EK AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 244 VYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPN 303
Query: 549 -----VEVA------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE-- 589
V +A IF+GG+ + + E++ +FGPL+ F++ +D E
Sbjct: 304 LNLSAVGLAPGSAGGLEGPDRIFVGGLPYYFTESQIRELLESFGPLRG--FDLVKDRETG 361
Query: 590 --EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP-------- 639
+ AF Y D VT A A LNG+K+G + LT +A + N P
Sbjct: 362 NSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRA-------NQGTNQPKPEQENVL 414
Query: 640 FHGIPKHALPLL---------KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECAR 690
H + AL L PT+V+ L V G + E E++LED+R E +
Sbjct: 415 LHAQQQIALQRLMLQPQPQQQPVPTKVVCLTQVVT--GDELKDDDEYEDILEDMRTEAGK 472
Query: 691 FGSVKSVNVV 700
FG + VNVV
Sbjct: 473 FGLL--VNVV 480
>gi|388580158|gb|EIM20475.1| hypothetical protein WALSEDRAFT_55089 [Wallemia sebi CBS 633.66]
Length = 391
Score = 90.5 bits (223), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 132/264 (50%), Gaps = 31/264 (11%)
Query: 461 RPMRRLCVENLPLSASEKALMEFLNNFLLS-SGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
R RR+ + +L + +E+ L + N+ + + V + G P + ++ ++G A++EF
Sbjct: 30 RASRRVFIGDLKPNHNEENLTKLFNDKMSTIDQVAKIPGE-PVVNVTVKHDRGYAYIEFR 88
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------IFIGGISRT 561
++A+ AL DG F G ++IKRP+E ++ IF+G +
Sbjct: 89 NTDEAAYALQFDGTIFQGEGIQIKRPQEVLDELQRKQGHTVSGTVPDSDQKIFVGSLPTF 148
Query: 562 LSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQV 619
L+ + VME++ +FG L++++ E D + AF EY+D +T AI GLNG++VG +
Sbjct: 149 LNDEQVMELLGSFGELRSFNLVKEGTSDVSKGFAFCEYMDPALTDIAIQGLNGMEVGDRK 208
Query: 620 LTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKP----TEVLKLKNVFNPEGFSSLSEL 675
L VQ G M G I LPL + T VL L N+ E +L
Sbjct: 209 LV-VQRSSTGP-MGKIGVGGTSSI-AQILPLASETQAYRTNVLLLLNMVTAEELK--DDL 263
Query: 676 EVEEVLEDVRLECARFGSVKSVNV 699
+ +E+ ED++ EC+++G + + +
Sbjct: 264 DYQEICEDIQEECSQYGEIIKIKI 287
>gi|307108143|gb|EFN56384.1| expressed protein [Chlorella variabilis]
Length = 404
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 55/94 (58%)
Query: 455 QLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQA 514
QL Q RPMRRL V LP + L FLN L++ G+ V G P I C + E+ A
Sbjct: 243 QLNQMTRPMRRLYVGGLPQPCYDFMLTTFLNQALMALGICQVAGKAPIIACQVTPERNFA 302
Query: 515 FVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF 548
F+EF DA+AAL DG F G+ LKIKRPK++
Sbjct: 303 FIEFGDTSDATAALQLDGIPFRGNTLKIKRPKDY 336
>gi|255568277|ref|XP_002525113.1| splicing factor u2af large subunit, putative [Ricinus communis]
gi|223535572|gb|EEF37240.1| splicing factor u2af large subunit, putative [Ricinus communis]
Length = 549
Score = 90.1 bits (222), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 149/354 (42%), Gaps = 58/354 (16%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
+D+AP PS + T A ++ A ++ + P + + L V
Sbjct: 190 FDMAPP-----PSAMLTGAAAVAAAAGQIPGTAPAIPGMFPNMFPLGTGQQFGTLPVMPV 244
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
Q+ R RR+ V LP +A+E+++ F ++ + + G + I EK
Sbjct: 245 QAMTQQATRHARRVYVGGLPPTANEQSVATFFSHVMAAIGGNTAGPGDAVVNVYINHEKK 304
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF---------------------VEV 551
AFVE + E+AS A+ DG F G+ +K++RP ++ V +
Sbjct: 305 FAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPNPNLNLGAVGL 364
Query: 552 A------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFI 595
IF+GG+ + + E++ +FGPL+ F++ +D E + AF
Sbjct: 365 TPGSAGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRG--FDLVKDRETGNSKGYAFC 422
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHA-------- 647
Y D VT A A LNG+K+G + LT V+ G+ N P + HA
Sbjct: 423 VYQDLSVTDIACAALNGIKMGDKTLT-VRRANQGA---NQPKPEQETVLLHAQQQIALQR 478
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK 701
L L PT+V+ L V + + E E++LED+R E +F + K
Sbjct: 479 LMLQPVPTKVVCLTQVVTADELK--DDDEYEDILEDMRTEGGKFAFCSATFCYK 530
>gi|358369529|dbj|GAA86143.1| splicing factor u2af large subunit [Aspergillus kawachii IFO 4308]
Length = 571
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 168/382 (43%), Gaps = 74/382 (19%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 157 PRKREATPDLTDIVSVLDR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 208
Query: 426 PVTSTTQKPLAGISVSASLAK-LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + + D+ L SN R +RL V N+P + + + L+ F
Sbjct: 209 ----PRQQPMDPSRLQAFMNQPAGGNADTSTLKPSNSRQAKRLFVYNIPQTVTGETLLAF 264
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-------- 535
N L +G+ + PCI + ++ A +EF + DA+ AL DG +
Sbjct: 265 FNVQL--NGLNVIESVDPCISAQVAQDHSFALLEFKSPNDATVALAFDGIAMEEHEAAGN 322
Query: 536 -----SGSILKIKRPKEFV----------------EVA-----IFIGGISRTLSSKMVME 569
+ L+++RPK+++ EV I + I + + V
Sbjct: 323 GAANGAAQGLEVRRPKDYIVPGGAEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTM 382
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 383 LLKSFGELKSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASI 442
Query: 628 ---DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ +D N A+ + K T VL+L N+ PE + +
Sbjct: 443 GMTQAAGLDMGVN---------AMSMFAKTTSQDLETSRVLQLLNMVTPEEL--MDPEDY 491
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
EE+ +DVR EC+++G+V + V
Sbjct: 492 EEICDDVRDECSKYGTVVELKV 513
>gi|198456623|ref|XP_001360392.2| GA16338 [Drosophila pseudoobscura pseudoobscura]
gi|198135682|gb|EAL24967.2| GA16338 [Drosophila pseudoobscura pseudoobscura]
Length = 491
Score = 89.4 bits (220), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 149/369 (40%), Gaps = 86/369 (23%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P E Y+ + + AS + SD +P A D
Sbjct: 77 WDVPP-EGYAHVTPMQYKAMQASGQITARIQSD------TQPTA---------------D 114
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNN-FLL---SSGVQHVHGSLPCIGCVIQ 508
+ + R RRL V N+P +E +M F N FLL + G Q + C
Sbjct: 115 TAAIAMVTRQARRLYVGNIPFGVTEDDIMAFFNQQFLLLGDNCGGQLCLDGKAVLSCQAN 174
Query: 509 REKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------VEVAIFIG 556
+K AF+EF + ++A+ A DG SF G +LKI+RP ++ + +G
Sbjct: 175 LDKNFAFIEFRSMQEATQATTFDGISFRGQVLKIRRPHDYHPVGSVGAAAGAGSIPDAVG 234
Query: 557 GI--------------SRTLSSKMVMEIV-------------------------CAFGPL 577
G + +L S+ + +V +FG L
Sbjct: 235 GCASSAAAKSRSSSAETGSLGSQAISNLVPDSPHKIYIGGLPTCLNETQIKELLLSFGQL 294
Query: 578 KAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNS 635
+ ++ + + + AF EYVD L+T + IA LNG+++G + L +++ G N
Sbjct: 295 RGFNLVKDPSTTLSKGYAFFEYVDPLLTEQVIANLNGMQLGDRRLIVQRSIPSGRYAGNQ 354
Query: 636 GNP-PFHGIPKHAL----PLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECAR 690
P G+ +L L T+VL L N+ PE L E E++ D+ EC++
Sbjct: 355 QIPIQVPGLVATSLTGSTAGLNNATQVLCLLNMVLPEEL--LDNEEYEDIRADIEQECSK 412
Query: 691 FGSVKSVNV 699
+G V S+ +
Sbjct: 413 YGEVLSLKI 421
>gi|338762830|gb|AEI98617.1| hypothetical protein 111O18.4 [Coffea canephora]
Length = 570
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 137/311 (44%), Gaps = 69/311 (22%)
Query: 441 SASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSL 500
+ L L V Q+ R RR+ V LP +A+E+++ F ++ + + G
Sbjct: 214 TGQLGALPVMPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSHVMSAIGGNTAGPGD 273
Query: 501 PCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------ 548
+ I EK AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 274 AVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLAATLGPSQ 333
Query: 549 ----VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED 587
+ +A IF+GG+ + + E++ +FGPL+ F++ +D
Sbjct: 334 PNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEGQIRELLESFGPLRG--FDLVKD 391
Query: 588 HE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLT---AVQAVLDG-----SIMDNS 635
E + AF Y D VT A A LNG+K+G + LT A Q V S++
Sbjct: 392 RETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQGVTQPKPEQESVL--- 448
Query: 636 GNPPFHGIPKHALP-LLKKP----TEVLKLKNVFNPEGFSSLSELEVEE----VLEDVRL 686
H + AL L+ +P T+VL L V S EL +E +LED+RL
Sbjct: 449 ----LHAQQQIALQKLMLQPGTLATKVLCLTQVV------SADELRDDEDYADILEDMRL 498
Query: 687 ECARFGSVKSV 697
EC +F V V
Sbjct: 499 ECGKFTLVNLV 509
>gi|317029342|ref|XP_001391373.2| splicing factor u2af large subunit [Aspergillus niger CBS 513.88]
Length = 561
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 168/382 (43%), Gaps = 74/382 (19%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 147 PRKREATPDLTDIVSVLDR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 198
Query: 426 PVTSTTQKPLAGISVSASLAK-LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + + D+ L SN R +RL V N+P S + + L+ F
Sbjct: 199 ----PRQQPMDPSRLQAFMNQPAGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAF 254
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-------- 535
N L +G+ + PCI + ++ A +EF + DA+ AL DG +
Sbjct: 255 FNVQL--NGLNVIQSVDPCISAQVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGN 312
Query: 536 -----SGSILKIKRPKEFV----------------EVA-----IFIGGISRTLSSKMVME 569
+ L+++RPK+++ EV I + I + + V
Sbjct: 313 GAANGAAQGLEVRRPKDYIVPGGAEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTM 372
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 373 LLKSFGELKSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASI 432
Query: 628 ---DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ +D N A+ + K T VL+L N+ PE + +
Sbjct: 433 GMTQAAGLDMGVN---------AMSMFAKTTSQDLETSRVLQLLNMVTPEEL--MDPEDY 481
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
+E+ +DVR EC+++G+V + V
Sbjct: 482 DEICDDVRDECSKYGTVVELKV 503
>gi|66808005|ref|XP_637725.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
discoideum AX4]
gi|60466159|gb|EAL64222.1| RNA-binding region RNP-1 domain-containing protein [Dictyostelium
discoideum AX4]
Length = 671
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 91/188 (48%), Gaps = 27/188 (14%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RR+ V N+P S+ LMEF N +L++ + G P + C I K AF+EF + E+
Sbjct: 268 RRIYVGNIPPGISDSELMEFFNAAVLAANLNTKPGP-PVVFCQINAPKCFAFIEFRSPEE 326
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF------------------------VEVAIFIGGIS 559
A+ A+ DG S LKI+RPK++ E I++GG+
Sbjct: 327 ATNAMRFDGISLKNFTLKIRRPKDYQSTSDNTGGNASLLPSIVPTNVPDSENKIYVGGLP 386
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
LS + V ++ A+G LKA++ + N + AF EY D VT A + LNG+ +
Sbjct: 387 SNLSEEQVKSLLSAYGKLKAFNLVKDTNTGVSKGFAFCEYQDSEVTDVACSKLNGIPLAD 446
Query: 618 QVLTAVQA 625
+ L +A
Sbjct: 447 KTLVVQRA 454
>gi|353240191|emb|CCA72072.1| related to pre-mRNA splicing factor U2AF large chain
[Piriformospora indica DSM 11827]
Length = 403
Score = 88.6 bits (218), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 128/286 (44%), Gaps = 61/286 (21%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL + ++ A E+++ F N+ + + + P + + REK AFVEF +AED
Sbjct: 47 RRLYLGSITPLADEESIALFFNSQMRERKLT-TSSAPPVLAVQVNREKNYAFVEFRSAED 105
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF------VEVA----------------IFIGGISRT 561
A+A + DG F L+++RP ++ VA IFIG +
Sbjct: 106 ATAGMALDGTVFLDGPLRVRRPHDYAGPEAMTGVAGFATLLPATMPDSVNKIFIGNLPTH 165
Query: 562 LSSKMVMEIVCAFGPLKAYH------------------FEVNEDHEEPCAFIEYVDQLVT 603
L+ ++E++ +FG LKA++ +N + AF+EY D VT
Sbjct: 166 LTEDQIVELLKSFGELKAFNLVREHGTNVSKVFTVRITLSMNLTGSQGFAFVEYADPAVT 225
Query: 604 PKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHAL----PLLKKPT---- 655
A LNG+ +G + L +A S+ G P IP A+ P++
Sbjct: 226 DIATESLNGMDLGDKKLVVQRA----SVGAKGGVP----IPPEAMDIPAPIVAVDLNKEA 277
Query: 656 --EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
V+ + N+ E + ++E EE+L+D+R EC+ FG V V V
Sbjct: 278 NGRVVLMLNMVVAEDL--MDDVEYEEILDDIRSECSGFGQVLGVYV 321
>gi|384246661|gb|EIE20150.1| hypothetical protein COCSUDRAFT_30785 [Coccomyxa subellipsoidea
C-169]
Length = 212
Score = 88.6 bits (218), Expect = 2e-14, Method: Composition-based stats.
Identities = 60/204 (29%), Positives = 96/204 (47%), Gaps = 34/204 (16%)
Query: 461 RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLT 520
RP RR+ V LP ++ L +++N ++S+G G P C I EK AF+EF +
Sbjct: 8 RPARRIYVGGLPPETTDADLRQYINELMVSTGGCAATG-YPIASCKIYTEKSYAFLEFRS 66
Query: 521 AEDASAALCCDGCSFSGSILKIKRPKEF-VEVAI-------------------------- 553
E+AS + DG +F S L+++RP + + VA+
Sbjct: 67 VEEASNCMAFDGVAFKDSYLRVRRPNNYDINVAVMLGPTDPDPTMDVSNLDIVKTVVQDS 126
Query: 554 ----FIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAI 607
FIGG+ + V E++ FG LKA++ + N + + AF EY D +T I
Sbjct: 127 PHKLFIGGLPCDWTEDQVKELLLPFGSLKAFNLVMDKNTGNSKGYAFCEYQDIGLTDYVI 186
Query: 608 AGLNGLKVGGQVLTAVQAVLDGSI 631
LNG ++G + LT +A+ G +
Sbjct: 187 QNLNGKQIGNKFLTVKRALQPGPV 210
>gi|395529346|ref|XP_003766777.1| PREDICTED: splicing factor U2AF 65 kDa subunit [Sarcophilus
harrisii]
Length = 462
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 111/255 (43%), Gaps = 50/255 (19%)
Query: 382 INRSPEKKSAK-----WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLA 436
+ RSP + K WDV P + + + QAA Q P
Sbjct: 124 VIRSPRHEKKKKIRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPAT 167
Query: 437 GISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ 494
+ + + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 168 ALLPTMTPDGLAVTPAPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLT 227
Query: 495 HVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA-- 552
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 228 QAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGM 286
Query: 553 ----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDH 588
+FIGG+ L+ V E++ +FGPLKA++ +
Sbjct: 287 SENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGL 346
Query: 589 EEPCAFIEYVDQLVT 603
+ AF EYVD VT
Sbjct: 347 SKGYAFCEYVDINVT 361
>gi|134075845|emb|CAL00224.1| unnamed protein product [Aspergillus niger]
Length = 598
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 168/382 (43%), Gaps = 74/382 (19%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 164 PRKREATPDLTDIVSVLDR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 215
Query: 426 PVTSTTQKPLAGISVSASLAK-LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + + D+ L SN R +RL V N+P S + + L+ F
Sbjct: 216 ----PRQQPMDPSRLQAFMNQPAGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAF 271
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-------- 535
N L +G+ + PCI + ++ A +EF + DA+ AL DG +
Sbjct: 272 FNVQL--NGLNVIQSVDPCISAQVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGN 329
Query: 536 -----SGSILKIKRPKEFV----------------EVA-----IFIGGISRTLSSKMVME 569
+ L+++RPK+++ EV I + I + + V
Sbjct: 330 GAANGAAQGLEVRRPKDYIVPGGAEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTM 389
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 390 LLKSFGELKSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASI 449
Query: 628 ---DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ +D N A+ + K T VL+L N+ PE + +
Sbjct: 450 GMTQAAGLDMGVN---------AMSMFAKTTSQDLETSRVLQLLNMVTPEEL--MDPEDY 498
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
+E+ +DVR EC+++G+V + V
Sbjct: 499 DEICDDVRDECSKYGTVVELKV 520
>gi|429854658|gb|ELA29655.1| splicing factor u2af large subunit [Colletotrichum gloeosporioides
Nara gc5]
Length = 559
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 168/394 (42%), Gaps = 92/394 (23%)
Query: 378 TPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS 424
TP P R P +++ +WD+ P +V T+ QA S + +
Sbjct: 145 TPPPKKREPTPDLTNVTSVLERKRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA 199
Query: 425 DPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + ++S L SN R +RL + NLP SA+E +++ F
Sbjct: 200 -----PRQQPMDPSKLQAIMNQPGGQVNSAALKPSNSRQAKRLLINNLPPSATEDSIVGF 254
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-------- 535
N L +G+ + + PC C + ++ A VEF A +A+ AL DG +
Sbjct: 255 FN--LQLNGLNVIESTDPCTSCQLSKDHSFAVVEFRNASEATVALALDGITMEADDATNG 312
Query: 536 -SGS-ILKIKRPKEFVEVAIF----------------------IGGISRTLSSKMVMEIV 571
+GS L I+RPK+++ A+ I + LS + V E++
Sbjct: 313 AAGSNGLVIRRPKDYIVPAVVDDVPYEPGVVSNIVIDTPNKISIANMPPYLSDEQVTELL 372
Query: 572 CAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG 629
+FG LKA+ ++ EE AF EYV+ T AI GLNG+ +G + L +A +
Sbjct: 373 VSFGELKAFVLVRDKSTEESRGIAFCEYVEPSATDVAIQGLNGMDLGDKKLRVQKASV-- 430
Query: 630 SIMDNSGNPPFHGIPK--HALPLLK-------KPTEVLKLKNVFNPEGFSSLSELEVEEV 680
G G+ A+ +L + T VL+L N+ PE + + EE+
Sbjct: 431 ------GVTQVAGVEMGVAAMSMLAGTTSTDSEETRVLQLLNMVTPEEL--MDNDDYEEI 482
Query: 681 LEDVRLECARFGSV-------------KSVNVVK 701
EDV EC +FG V +S V K
Sbjct: 483 KEDVEEECTKFGKVLDVKIPRPVGGSRQSAGVGK 516
>gi|297840477|ref|XP_002888120.1| hypothetical protein ARALYDRAFT_475241 [Arabidopsis lyrata subsp.
lyrata]
gi|297333961|gb|EFH64379.1| hypothetical protein ARALYDRAFT_475241 [Arabidopsis lyrata subsp.
lyrata]
Length = 589
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 131/299 (43%), Gaps = 47/299 (15%)
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
L L V Q+ R RR+ V LP +A+E+++ F + + + G +
Sbjct: 235 LGALPVLPVQAMTQQATRHARRVYVGGLPPTANEQSVATFFSQVMSAIGGNTAGPGDAVV 294
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------- 548
I EK AFVE + E+AS A+ DG G +K++RP ++
Sbjct: 295 NVYINHEKKFAFVEMRSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNP 354
Query: 549 -VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--H 588
+ +A IF+GG+ + + E++ +FGPL+ ++ + + +
Sbjct: 355 NLNLAAVGLSSGSTGGLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGN 414
Query: 589 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPP--FHGIPKH 646
+ AF Y D VT A A LNG+K+G + LT V+ + G I H +
Sbjct: 415 SKGYAFCVYQDPSVTDIACAALNGIKMGDKTLT-VRRAIQGVIQPKPEQEEVLLHAQQQI 473
Query: 647 ALPLLK-----KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
AL L PT+++ L V + + E +++ED+R E +FG++ VNVV
Sbjct: 474 ALQRLMLQPGGTPTKIVCLTQVVTADDLR--DDEEYADIMEDMRQEGGKFGNL--VNVV 528
>gi|122245120|sp|Q2QKB4.1|U2A2B_WHEAT RecName: Full=Splicing factor U2af large subunit B; AltName:
Full=U2 auxiliary factor 65 kDa subunit B; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit B; Short=U2 snRNP auxiliary factor large
subunit B
gi|68036764|gb|AAY84880.1| U2AF large subunit [Triticum aestivum]
Length = 543
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/293 (26%), Positives = 130/293 (44%), Gaps = 63/293 (21%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP SA+E+++ + N + + G + I +K AFVE
Sbjct: 201 QATRHARRVYVGGLPPSANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 260
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 261 MRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSLAAALGPSQPSSNLNLAAVGLTPGSA 320
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 321 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 378
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN----PPFHGI----------PKH 646
VT A A LNG+K+G + LT +A N G+ P I K
Sbjct: 379 NVTDIACAALNGIKMGDKTLTVRRA--------NQGSAQPRPEQENILLQAQQQVQLQKL 430
Query: 647 ALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ PT+V+ L V + + E E+++ED+RLE ++G++ V +
Sbjct: 431 VYQVGALPTKVVCLTQVVTADELK--DDEEYEDIMEDMRLEAGKYGNLVKVVI 481
>gi|350635494|gb|EHA23855.1| hypothetical protein ASPNIDRAFT_209800 [Aspergillus niger ATCC
1015]
Length = 566
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 168/382 (43%), Gaps = 74/382 (19%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 132 PRKREATPDLTDIVSVLDR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 183
Query: 426 PVTSTTQKPLAGISVSASLAK-LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + + D+ L SN R +RL V N+P S + + L+ F
Sbjct: 184 ----PRQQPMDPSRLQAFMNQPAGGNADTSTLKPSNSRQAKRLFVYNIPESVTGETLLAF 239
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-------- 535
N L +G+ + PCI + ++ A +EF + DA+ AL DG +
Sbjct: 240 FNVQL--NGLNVIQSVDPCISAQVAQDHTFALLEFKSPNDATVALAFDGIAMEEHEAAGN 297
Query: 536 -----SGSILKIKRPKEFV----------------EVA-----IFIGGISRTLSSKMVME 569
+ L+++RPK+++ EV I + I + + V
Sbjct: 298 GAANGAAQGLEVRRPKDYIVPGGAEQEYQEGVLLNEVPDSPNKICVSNIPHYIPEEPVTM 357
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 358 LLKSFGELKSFVLVKDSSTEESRGIAFCEYADPSATTIAVEGLNGMELGDRHLKVVRASI 417
Query: 628 ---DGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEV 677
+ +D N A+ + K T VL+L N+ PE + +
Sbjct: 418 GMTQAAGLDMGVN---------AMSMFAKTTSQDLETSRVLQLLNMVTPEEL--MDPEDY 466
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
+E+ +DVR EC+++G+V + V
Sbjct: 467 DEICDDVRDECSKYGTVVELKV 488
>gi|452979953|gb|EME79715.1| hypothetical protein MYCFIDRAFT_81194 [Pseudocercospora fijiensis
CIRAD86]
Length = 552
Score = 87.0 bits (214), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 155/358 (43%), Gaps = 66/358 (18%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P +V T+ QA S + + Q+P+ + A + +
Sbjct: 158 KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA-----PRQQPMDPQKLQAFMNQ 207
Query: 447 LNVSMDSVQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGC 505
S L S+ R +RL + N+P +A++ +++F N L +G+ G PC+
Sbjct: 208 PGNQASSTALKPSSARQSKRLLIHNIPAAATDDNIVDFFN--LQLNGLNVTRGQDPCVSA 265
Query: 506 VIQREKGQAFVEFLTAEDASAALCCDGCSF-------------SGSILKIKRPKEFV--- 549
I +E G A VEF T EDA+ A+ DG + + L+IKRPK+++
Sbjct: 266 QISKENGYALVEFKTPEDATNAMAFDGINMMPDAMDTNGDSNGTAKGLQIKRPKDYIVPN 325
Query: 550 -------------------EVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEE 590
+ I I + L + E++ +FG L+ + + EE
Sbjct: 326 VTDETENPSGILSGVVPDTQNKISITNLPTFLGEDQIQELLNSFGELRNFVLVKDTSTEE 385
Query: 591 P--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHAL 648
AF EY D VT A+ LNG+++G + A + I G + +A+
Sbjct: 386 SRGIAFCEYKDPSVTKTAVESLNGMELGDAAMKVKLASI--GIQQVPGE-----MSVNAM 438
Query: 649 PLL----KKPTE---VLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L+ + TE VL L N+ PE + E +E+L DV+ E +++G + V +
Sbjct: 439 SLMAGTQAEGTEKGRVLCLMNMITPEEL--MDADEADEILVDVKEEVSKYGPLLDVKM 494
>gi|30696485|ref|NP_176287.3| Splicing factor U2af large subunit B [Arabidopsis thaliana]
gi|209572798|sp|Q8L716.2|U2A2B_ARATH RecName: Full=Splicing factor U2af large subunit B; AltName:
Full=U2 auxiliary factor 65 kDa subunit B; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit B; Short=U2 snRNP auxiliary factor large
subunit B
gi|332195625|gb|AEE33746.1| Splicing factor U2af large subunit B [Arabidopsis thaliana]
Length = 589
Score = 87.0 bits (214), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/308 (26%), Positives = 134/308 (43%), Gaps = 65/308 (21%)
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
L L V Q+ R RR+ V LP +A+E+++ F + + + G +
Sbjct: 235 LGALPVLPVQAMTQQATRHARRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVV 294
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------- 548
I EK AFVE + E+AS A+ DG G +K++RP ++
Sbjct: 295 NVYINHEKKFAFVEMRSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNP 354
Query: 549 ------VEVA------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE- 589
V ++ IF+GG+ + + E++ +FGPL+ F + +D E
Sbjct: 355 NLNLGAVGLSSGSTGGLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRG--FNLVKDRET 412
Query: 590 ---EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKH 646
+ AF Y D VT A A LNG+K+G + LT V+ + G+I P+
Sbjct: 413 GNSKGYAFCVYQDPSVTDIACAALNGIKMGDKTLT-VRRAIQGAIQPK---------PEQ 462
Query: 647 ALPLLKKPTEVLKLKNVFNPEGFSS----LSEL----------EVEEVLEDVRLECARFG 692
LL ++ + +F P G + L+++ E E++ED+R E +FG
Sbjct: 463 EEVLLYAQQQIALQRLMFQPGGTPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFG 522
Query: 693 SVKSVNVV 700
++ VNVV
Sbjct: 523 NL--VNVV 528
>gi|12323333|gb|AAG51641.1|AC018908_7 putative U2 snRNP auxiliary factor; 19096-22891 [Arabidopsis
thaliana]
Length = 568
Score = 86.7 bits (213), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 134/306 (43%), Gaps = 61/306 (19%)
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
L L V Q+ R RR+ V LP +A+E+++ F + + + G +
Sbjct: 214 LGALPVLPVQAMTQQATRHARRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVV 273
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------- 548
I EK AFVE + E+AS A+ DG G +K++RP ++
Sbjct: 274 NVYINHEKKFAFVEMRSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNP 333
Query: 549 ------VEVA------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--H 588
V ++ IF+GG+ + + E++ +FGPL+ ++ + + +
Sbjct: 334 NLNLGAVGLSSGSTGGLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRGFNLVKDRETGN 393
Query: 589 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHAL 648
+ AF Y D VT A A LNG+K+G + LT V+ + G+I P+
Sbjct: 394 SKGYAFCVYQDPSVTDIACAALNGIKMGDKTLT-VRRAIQGAIQPK---------PEQEE 443
Query: 649 PLLKKPTEVLKLKNVFNPEGFSS----LSEL----------EVEEVLEDVRLECARFGSV 694
LL ++ + +F P G + L+++ E E++ED+R E +FG++
Sbjct: 444 VLLYAQQQIALQRLMFQPGGTPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFGNL 503
Query: 695 KSVNVV 700
VNVV
Sbjct: 504 --VNVV 507
>gi|281207514|gb|EFA81697.1| RNA-binding region RNP-1 domain-containing protein [Polysphondylium
pallidum PN500]
Length = 682
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 95/187 (50%), Gaps = 24/187 (12%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+P + +E ++EF N ++++ + G P + C I K AF+EF ++E+
Sbjct: 299 RRLYVGNIPPNVTEAQIVEFFNAAIIAAALTTKPGQ-PVLLCQITTGKSFAFIEFRSSEE 357
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF---------------------VEVAIFIGGISRTL 562
A+ + DG S SG LKI+RPK++ E IF+GG+ TL
Sbjct: 358 ATLGMGLDGISLSGYSLKIRRPKDYQSGSNEPMPTGLSIVSTNVPDSENKIFLGGLPPTL 417
Query: 563 SSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+ + + ++ A G LKA++ + + AF E++D T KA A LNG K G + L
Sbjct: 418 NEEQIKSMLSAIGRLKAFNLVKDTKTGISKGFAFCEFLDPENTDKACAELNGTKFGDKSL 477
Query: 621 TAVQAVL 627
+A L
Sbjct: 478 LVQKASL 484
>gi|359476715|ref|XP_002271463.2| PREDICTED: splicing factor U2af large subunit B-like [Vitis
vinifera]
Length = 568
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 140/313 (44%), Gaps = 59/313 (18%)
Query: 434 PLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGV 493
PLA S L V Q+ R RR+ V L +A+E+++ F + + + G
Sbjct: 208 PLA----SGQFGALPVMPVQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMSAIGG 263
Query: 494 QHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----- 548
+ I EK AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 264 NTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLA 323
Query: 549 -----------VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAY 580
+ +A IF+GG+ + + E++ +FGPL+
Sbjct: 324 ATLGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRG- 382
Query: 581 HFEVNEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS-IMDNS 635
F++ +D E + AF Y D VT A A LNG+K+G + LT +A S
Sbjct: 383 -FDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQGASQPKPEQ 441
Query: 636 GNPPFHGIPKHALP-LLKKP----TEVLKLKNVFNPEGFSSLSELE----VEEVLEDVRL 686
N H + AL L+ +P T+V+ L V N + EL+ E+++ED+R+
Sbjct: 442 ENVLLHAQQQIALQRLMFQPGALATKVVCLTQVVNAD------ELQDDEAYEDIVEDMRI 495
Query: 687 ECARFGSVKSVNV 699
E +FG++ +V +
Sbjct: 496 EGGKFGNLVNVAI 508
>gi|297798226|ref|XP_002866997.1| hypothetical protein ARALYDRAFT_490965 [Arabidopsis lyrata subsp.
lyrata]
gi|297312833|gb|EFH43256.1| hypothetical protein ARALYDRAFT_490965 [Arabidopsis lyrata subsp.
lyrata]
Length = 567
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 134/313 (42%), Gaps = 63/313 (20%)
Query: 438 ISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ S L++ Q+ R RR+ V L +A+E+++ F + + + G
Sbjct: 207 LPTGQSFGGLSMMPIQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAG 266
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------- 548
+ I EK AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 267 PGDAVVNVYINHEKKFAFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLG 326
Query: 549 -------VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEV 584
+ +A IF+GG+ + V E++ +FG LK F++
Sbjct: 327 PSQPSPHLNLAAVGLTPGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGALKG--FDL 384
Query: 585 NEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPF 640
+D E + AF Y D VT A A LNG+K+G + LT +A N G
Sbjct: 385 VKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRA--------NQGT--M 434
Query: 641 HGIPKHALPLLKKPTEVLKLKNVFNPEGFSS----LSEL----------EVEEVLEDVRL 686
P+ LL ++ + + P ++ L+++ E E+++ED+R
Sbjct: 435 QPKPEQESVLLHAQQQIAFQRIMLQPGVMATTVVCLTQVVTEDELRDDEEYEDIMEDMRQ 494
Query: 687 ECARFGSVKSVNV 699
E +FG++ +V +
Sbjct: 495 EGGKFGALTNVVI 507
>gi|406699650|gb|EKD02849.1| splicing factor (U2 snRNP auxiliary factor large subunit)
[Trichosporon asahii var. asahii CBS 8904]
Length = 487
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 104/217 (47%), Gaps = 22/217 (10%)
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEV------------ 551
G I +EK +VEF++++ A F G+ ++ KRPK+FV +
Sbjct: 173 GVEINKEKDYVWVEFVSSDLAQVVFNKKDLDFDGAPIEPKRPKDFVGIDPALGFMGVSGD 232
Query: 552 ---AIFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKA 606
+F+GG+ TL S V E++ FG L+ ++ E N + AF+E++D VT A
Sbjct: 233 PNNKLFVGGLPTTLGSDEVKELLTPFGELRTFNLVKEGNGSVSKGFAFVEFLDPAVTDIA 292
Query: 607 IAGLNGLKVGGQVLTAVQAVLDG---SIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNV 663
I GLNG ++G + L +A G S SG F L +P ++ +
Sbjct: 293 IQGLNGFQLGDRALVVQRAATTGRSASSTGVSGTAQFLA-QSSILEKADEPAPATRVILM 351
Query: 664 FNPEGFSSL-SELEVEEVLEDVRLECARFGSVKSVNV 699
N G L + + ++LED+R EC++FG V+ V +
Sbjct: 352 LNMVGADELYDDQDYADILEDIRDECSKFGEVEGVRI 388
>gi|147840634|emb|CAN68321.1| hypothetical protein VITISV_032193 [Vitis vinifera]
Length = 565
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 140/313 (44%), Gaps = 59/313 (18%)
Query: 434 PLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGV 493
PLA S L V Q+ R RR+ V L +A+E+++ F + + + G
Sbjct: 205 PLA----SGQFGALPVMPVQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMSAIGG 260
Query: 494 QHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----- 548
+ I EK AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 261 NTAGPGDAVVNVYINHEKKFAFVEMRSVEEASNAMALDGIIFEGAPVKVRRPSDYNPSLA 320
Query: 549 -----------VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAY 580
+ +A IF+GG+ + + E++ +FGPL+
Sbjct: 321 ATLGPSQPNPNLNLAAVGLTPGSAGGLEGPDRIFVGGLPYYFTEAQIRELLESFGPLRG- 379
Query: 581 HFEVNEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS-IMDNS 635
F++ +D E + AF Y D VT A A LNG+K+G + LT +A S
Sbjct: 380 -FDLVKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLTVRRANQGASQPKPEQ 438
Query: 636 GNPPFHGIPKHALP-LLKKP----TEVLKLKNVFNPEGFSSLSELE----VEEVLEDVRL 686
N H + AL L+ +P T+V+ L V N + EL+ E+++ED+R+
Sbjct: 439 ENVLLHAQQQIALQRLMFQPGALATKVVCLTQVVNAD------ELQDDEAYEDIVEDMRI 492
Query: 687 ECARFGSVKSVNV 699
E +FG++ +V +
Sbjct: 493 EGGKFGNLVNVAI 505
>gi|302854386|ref|XP_002958701.1| hypothetical protein VOLCADRAFT_121741 [Volvox carteri f.
nagariensis]
gi|300255941|gb|EFJ40221.1| hypothetical protein VOLCADRAFT_121741 [Volvox carteri f.
nagariensis]
Length = 294
Score = 86.3 bits (212), Expect = 8e-14, Method: Composition-based stats.
Identities = 57/183 (31%), Positives = 87/183 (47%), Gaps = 34/183 (18%)
Query: 476 SEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF 535
SE +L + NN ++++G G P I C + EK AF+EF + E+ S A+ DG
Sbjct: 2 SEVSLTQLFNNVMMAAGATTQPGG-PVISCYMNNEKRFAFLEFRSVEETSNAMAFDGLQC 60
Query: 536 SGSILKIKRPKEFVEVA-------------------------------IFIGGISRTLSS 564
G LK++RP ++ A +F+GG+ LS
Sbjct: 61 QGETLKVRRPHDYNPAAAKLLGPTEPSAKINLALLGVVNTLVEDGPNKVFVGGLPGYLSE 120
Query: 565 KMVMEIVCAFGPLKAYHFEVNED--HEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTA 622
+ V +I+ AFGPL+A++ + D + F EY D +T AI GL+ L VGG+ LT
Sbjct: 121 EQVRQILQAFGPLRAFNLVTDRDTGASKGYGFCEYADPNITDVAIQGLSALIVGGKPLTV 180
Query: 623 VQA 625
+A
Sbjct: 181 RRA 183
>gi|402073699|gb|EJT69251.1| splicing factor U2AF 50 kDa subunit [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 623
Score = 86.3 bits (212), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 132/287 (45%), Gaps = 47/287 (16%)
Query: 454 VQLTQSN-RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKG 512
V L +N R +RL V N +E+AL+ F+N L +G+ + + PC+ C + +
Sbjct: 285 VTLKPTNSRQSKRLIVTNFAPGTTEEALVAFMN--LQLNGLNVIESTDPCLLCQMAPDSS 342
Query: 513 QAFVEFLTAEDASAALCCDGCSFSG-------------SILKIKRPKEFVEVAIF----- 554
A +EF + + + AL DG + L+++RPK+++ A+
Sbjct: 343 FAILEFRSPAETTVALALDGITMEAEDTPMEGAANGTPQGLELRRPKDYIVPAVVEDTGY 402
Query: 555 -----------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFI 595
I ++ L+ + V E++ +FG LKA + EE AF
Sbjct: 403 ERGVVSSRVVDTPHKIGITNLAPYLTEEQVTELLVSFGELKALVLVKDSGTEESRGIAFC 462
Query: 596 EYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG-NPPFHGIPKHALPLLKKP 654
EYVD + T AI GLN +++G + L +A + I SG + + A + + P
Sbjct: 463 EYVDPVATDVAIHGLNNMELGEKRLRVKKASI--GITQVSGIEMGINAMSMLAGTVAQDP 520
Query: 655 --TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ VL+L N+ + L + EE+ +DVR EC++FG++ + +
Sbjct: 521 DLSRVLQLLNMVTADEL--LDNDDYEEICDDVREECSKFGTILELKI 565
>gi|224030681|gb|ACN34416.1| unknown [Zea mays]
Length = 425
Score = 85.9 bits (211), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 113/259 (43%), Gaps = 42/259 (16%)
Query: 407 VHTSNQAASSNAHEMVSSDPVTST---TQKPLAGISVSASLAKLNVSMDSVQLTQSNRPM 463
V +QA + A +V++ + P+ G+ V +L L Q+ R
Sbjct: 76 VSGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQPQAMTQQATRHA 135
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RR+ V LP +A+E+ + F N + + G + I +K AFVE + E+
Sbjct: 136 RRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFVEMRSVEE 195
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF---VEVA---------------------------- 552
AS A+ DG F G+ +KI+RP ++ + A
Sbjct: 196 ASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGSAGGLEGP 255
Query: 553 --IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQLVTPKA 606
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D VT A
Sbjct: 256 DRIFVGGLQYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDLNVTDIA 313
Query: 607 IAGLNGLKVGGQVLTAVQA 625
A LNG+K+G + LT +A
Sbjct: 314 CAALNGIKMGDKTLTVRRA 332
>gi|115400045|ref|XP_001215611.1| hypothetical protein ATEG_06433 [Aspergillus terreus NIH2624]
gi|114191277|gb|EAU32977.1| hypothetical protein ATEG_06433 [Aspergillus terreus NIH2624]
Length = 413
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 169/382 (44%), Gaps = 74/382 (19%)
Query: 367 PRKRR-TEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSD 425
PRKR T S ++R +++ +WD+ P +V T+ QA S + +
Sbjct: 4 PRKREATPDLTDVVSVLHR--KRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA- 55
Query: 426 PVTSTTQKPLAGISVSASLAKLNV-SMDSVQLTQS-NRPMRRLCVENLPLSASEKALMEF 483
Q+P+ + A + + S D+ L S +R +RL V N+P + + +AL+ F
Sbjct: 56 ----PRQQPMDPSRLQAFMNQSTTGSADAASLKPSHSRQAKRLFVYNIPPNVTGEALLSF 111
Query: 484 LNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFS------- 536
N L +G+ V PCI + + A +EF + +A+ AL DG +
Sbjct: 112 FN--LQLNGLNVVQSVDPCISAQVSDDHSFALLEFKSPNEATVALAFDGITMDEHASMDG 169
Query: 537 ---GSI--LKIKRPKEFV-----------------EVA-----IFIGGISRTLSSKMVME 569
G + L+++RPK+++ EV I + I + + + V+
Sbjct: 170 AGKGEVKGLEVRRPKDYIVPNGSADQEYQEGVLLNEVPDSPNKICVSNIPQYIQEEAVIM 229
Query: 570 IVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
++ +FG LK++ + EE AF EY D T A+ GLNG+++G + L V+A +
Sbjct: 230 LLKSFGELKSFVLVKDASTEESRGIAFCEYADPTATSIAVEGLNGMEIGDRPLKVVRASI 289
Query: 628 ---DGSIMDNSGNPPFHGIPKHALPLLKKPTE-------VLKLKNVFNPEGFSSLSELEV 677
+ +D N A+ + K T VL+L N+ E + +
Sbjct: 290 GMTQAAGLDMGVN---------AMSMFAKTTSQDLETSRVLQLLNMVTAEEL--IDNDDY 338
Query: 678 EEVLEDVRLECARFGSVKSVNV 699
EE+ EDVR EC+++G V + V
Sbjct: 339 EEICEDVREECSKYGQVLELKV 360
>gi|414591746|tpg|DAA42317.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
Length = 538
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 118/266 (44%), Gaps = 49/266 (18%)
Query: 407 VHTSNQAASSNAHEMVSSDPVTST---TQKPLAGISVSASLAKLN------VSMDSVQLT 457
V +QA + A +V++ + P+ G+ V +L L +++ +T
Sbjct: 189 VSGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPLAIQPQAMT 248
Query: 458 Q-SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
Q + R RR+ V LP +A+E+ + F N + + G + I +K AFV
Sbjct: 249 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 308
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEF---VEVA--------------------- 552
E + E+AS A+ DG F G+ +KI+RP ++ + A
Sbjct: 309 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 368
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVD 599
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 369 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQD 426
Query: 600 QLVTPKAIAGLNGLKVGGQVLTAVQA 625
VT A A LNG+K+G + LT +A
Sbjct: 427 LNVTDIACAALNGIKMGDKTLTVRRA 452
>gi|414591747|tpg|DAA42318.1| TPA: hypothetical protein ZEAMMB73_924732 [Zea mays]
Length = 590
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 118/266 (44%), Gaps = 49/266 (18%)
Query: 407 VHTSNQAASSNAHEMVSSDPVTST---TQKPLAGISVSASLAKLN------VSMDSVQLT 457
V +QA + A +V++ + P+ G+ V +L L +++ +T
Sbjct: 189 VSGFDQAPTQQALPIVAAGVIPGQLPGVTAPIPGVGVLPNLYNLAAGQFNPLAIQPQAMT 248
Query: 458 Q-SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
Q + R RR+ V LP +A+E+ + F N + + G + I +K AFV
Sbjct: 249 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 308
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEF---VEVA--------------------- 552
E + E+AS A+ DG F G+ +KI+RP ++ + A
Sbjct: 309 EMRSVEEASNAMVLDGIMFEGAPVKIRRPTDYNPSLAAALGPSQPNPNLNLSAVGLTPGS 368
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVD 599
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 369 AGGLEGPDRIFVGGLQYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQD 426
Query: 600 QLVTPKAIAGLNGLKVGGQVLTAVQA 625
VT A A LNG+K+G + LT +A
Sbjct: 427 LNVTDIACAALNGIKMGDKTLTVRRA 452
>gi|384500209|gb|EIE90700.1| hypothetical protein RO3G_15411 [Rhizopus delemar RA 99-880]
Length = 490
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/356 (25%), Positives = 137/356 (38%), Gaps = 101/356 (28%)
Query: 381 PINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISV 440
P+++ P +K WD+AP E +++ V T PL G V
Sbjct: 146 PLSKRP-RKLHNWDMAPA-------------------GMEGMTAQQVKMTGLFPLPGQVV 185
Query: 441 ------SASLAKLNVSMDSVQLTQSN----RPMRRLCVENLPLSASEKALMEFLNNFLLS 490
S +N D ++ +N + RRL V +P EK L +F N +
Sbjct: 186 GTRTPQSFQPPPVNNKFDDSRILAANTTATKQARRLYVGQIPPGLEEKPLADFFNATMHQ 245
Query: 491 SGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVE 550
+Q P I EK AFVEF TAE A+A + DG F G LKI+RPK++
Sbjct: 246 LQMQ---DRTPVAAVQINHEKSYAFVEFQTAEQATACMAFDGIMFQGQQLKIRRPKDYQP 302
Query: 551 VA----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDH 588
A IFIGG+ L+ V+E++ +FG K
Sbjct: 303 PAEGDVSMQLPGLVSTNVPDTPNKIFIGGLPVYLNDDQVIELLKSFGDRK---------- 352
Query: 589 EEPCAFIEYVDQLVTPKAIAGLNGLK---VGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK 645
L+ +A G + + G +L A + + D++
Sbjct: 353 ------------LIVQRASVGAKHIPPDYMSGPMLPANYVPVTSAKEDDA---------- 390
Query: 646 HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVK 701
T VL+L N+ PE + E +++ ED+ ECA+FG+V + + K
Sbjct: 391 ---------TRVLQLMNMVTPEELE--DDEEYQDIWEDIAEECAKFGNVLDMKIPK 435
>gi|328865493|gb|EGG13879.1| RapGAP/RanGAP domain-containing protein [Dictyostelium fasciculatum]
Length = 3032
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 92/194 (47%), Gaps = 28/194 (14%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI-GCVIQREKGQAF 515
TQ N+ RRL + N+P + ++ L++F N + ++ + + P + C I K AF
Sbjct: 2569 TQQNKQSRRLYIGNIPPNITDNTLIDFFNTAITAANLHLSSKTGPVVLSCQINSAKNFAF 2628
Query: 516 VEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA----------------------- 552
+EF +AE+A+ A+ DG S LKI+RP ++ A
Sbjct: 2629 LEFRSAEEATNAMGLDGISLFTFSLKIRRPTDYQPPANESSMPSAPVSMSIVSTNVPDSE 2688
Query: 553 --IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIA 608
IFIGGI TL+ + + ++ AFG LKA++ + + AF EY + T I
Sbjct: 2689 NKIFIGGIPTTLNEEQIKSMLLAFGRLKAFNLVKDPKTGSSKGYAFCEYYETEETNDCIN 2748
Query: 609 GLNGLKVGGQVLTA 622
GLNG K G + L
Sbjct: 2749 GLNGTKFGEKSLVV 2762
>gi|226497766|ref|NP_001152419.1| LOC100286059 [Zea mays]
gi|195656099|gb|ACG47517.1| splicing factor U2AF 65 kDa subunit [Zea mays]
Length = 596
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 115/266 (43%), Gaps = 49/266 (18%)
Query: 407 VHTSNQAASSNAHEMVSSDPVTST---TQKPLAGISVSASLAKLN-------VSMDSVQL 456
V +QA +A +V++ + P+ G+ V +L L V
Sbjct: 195 VSGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQPQAMT 254
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
Q+ R RR+ V LP +A+E+ + F N + + G + I +K AFV
Sbjct: 255 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 314
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA-------- 552
E + E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 315 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 374
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVD 599
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 375 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQD 432
Query: 600 QLVTPKAIAGLNGLKVGGQVLTAVQA 625
VT A A LNG+K+G + LT +A
Sbjct: 433 LNVTDIACAALNGIKMGDKTLTVSRA 458
>gi|357160098|ref|XP_003578657.1| PREDICTED: splicing factor U2af large subunit B-like [Brachypodium
distachyon]
Length = 534
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 128/291 (43%), Gaps = 68/291 (23%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP SA+E+ + + N+ + + G + I +K AFVE
Sbjct: 193 QATRHARRVYVGGLPPSANEQTVAIYFNHVMAAIGGNAAGLGDAVVNVYINHDKKFAFVE 252
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 253 MRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSQAAVLGPSQPNPNLNLAAVGLTPGSA 312
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ FGPL+ F++ +D E + AF Y D
Sbjct: 313 GGLEGPDRIFVGGLPYYFTEAQVQELLETFGPLRG--FDIVKDRETGNSKGYAFCVYQDL 370
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK------- 653
VT A A LNG+++G + LT +A N G P+H LL+
Sbjct: 371 AVTDIACAALNGIQLGDRTLTVRRA--------NQGAA--EPRPEHENILLQAQHQAQMK 420
Query: 654 ----------PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV 694
PT+V+ L V + + ++ E +++LED+ E ++G++
Sbjct: 421 KLVYEVGGAIPTKVVCLTQVVSEDDLR--NDEEYKDILEDMTFEGRKYGNL 469
>gi|164657478|ref|XP_001729865.1| hypothetical protein MGL_2851 [Malassezia globosa CBS 7966]
gi|159103759|gb|EDP42651.1| hypothetical protein MGL_2851 [Malassezia globosa CBS 7966]
Length = 473
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 133/293 (45%), Gaps = 56/293 (19%)
Query: 435 LAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLN-----NFLL 489
+A + + S A N SM RRL V + + + L F+N L
Sbjct: 134 MAELEATTSAAAFNASM--------YLETRRLHVSPVSSVKTSQQLRIFINAKMNERLLC 185
Query: 490 SSGVQHVHGSL-PCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF 548
SSG SL PC + ++G A++EF ++AS AL DG +F G L I+RPK +
Sbjct: 186 SSG------SLEPCYAVDMHLDEGYAYLEFRNPDEASNALLLDGVAFLGHRLHIERPKGY 239
Query: 549 V-----------EVAI-------FIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEE 590
V E ++ +IG + L+ + VME++ AFG ++ HF++ D E
Sbjct: 240 VGQDAVPAPGAIETSVPDGPNKLYIGNVPVFLNEQQVMELLKAFGDVR--HFDLIRDPET 297
Query: 591 P----CAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKH 646
AF E+ + VT A GL+GL+VG Q L + ++ S N H +
Sbjct: 298 QRSRGMAFCEFHEDAVTDLACEGLDGLEVGEQRLMVRR-------VNASTNTHTH---ED 347
Query: 647 ALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PT + + N+ + L + E +++ EDV EC+R G+V SV +
Sbjct: 348 TQETSDTPTRAMLMLNMVTTDEL--LDDTEYQDIKEDVHSECSRHGTVTSVYI 398
>gi|223950169|gb|ACN29168.1| unknown [Zea mays]
gi|413920349|gb|AFW60281.1| splicing factor U2AF subunit [Zea mays]
Length = 594
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 115/266 (43%), Gaps = 49/266 (18%)
Query: 407 VHTSNQAASSNAHEMVSSDPVTST---TQKPLAGISVSASLAKLN-------VSMDSVQL 456
V +QA +A +V++ + P+ G+ V +L L V
Sbjct: 193 VSGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQPQAMT 252
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
Q+ R RR+ V LP +A+E+ + F N + + G + I +K AFV
Sbjct: 253 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 312
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA-------- 552
E + E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 313 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 372
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVD 599
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 373 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQD 430
Query: 600 QLVTPKAIAGLNGLKVGGQVLTAVQA 625
VT A A LNG+K+G + LT +A
Sbjct: 431 LNVTDIACAALNGIKMGDKTLTVRRA 456
>gi|195149862|ref|XP_002015874.1| GL11290 [Drosophila persimilis]
gi|194109721|gb|EDW31764.1| GL11290 [Drosophila persimilis]
Length = 487
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 147/369 (39%), Gaps = 86/369 (23%)
Query: 393 WDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMD 452
WDV P E Y+ + + AS + SD +P A D
Sbjct: 73 WDVPP-EGYAHVTPMQYKAMQASGQITARIQSD------TQPTA---------------D 110
Query: 453 SVQLTQSNRPMRRLCVENLPLSASEKALMEFLNN-FLL---SSGVQHVHGSLPCIGCVIQ 508
+ + R RRL V N+P +E +M F N FLL G Q + C
Sbjct: 111 TAAIAMVTRQARRLYVGNIPFGVTEDDIMAFFNQQFLLLGDDCGGQLCLDGKAVLSCQAN 170
Query: 509 REKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF------------VEVAIFIG 556
+K AF+EF + ++A+ A DG SF G +LKI+RP ++ + +G
Sbjct: 171 LDKNFAFIEFRSMQEATQATTFDGISFRGQVLKIRRPHDYHPVGSVGAAAGAGSIPDAVG 230
Query: 557 GI--------------SRTLSSKMVMEIV-------------------------CAFGPL 577
G + +L S+ + +V +FG L
Sbjct: 231 GCASSAAAKSRSSSADTGSLGSQAISNLVPDSPHKIYIGGLPTCLNETQIKELLLSFGQL 290
Query: 578 KAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNS 635
+ ++ + + + AF EYVD L+T + IA LNG+++G + L +++ G
Sbjct: 291 RGFNLVKDPSTTLSKGYAFFEYVDPLLTEQVIANLNGMQLGDRRLIVQRSIPSGRYAGIQ 350
Query: 636 GNP-PFHGIPKHAL----PLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECAR 690
P G+ +L L T+VL L N+ PE L E E++ D+ EC++
Sbjct: 351 QIPIQVPGLVATSLTGSTAGLNNATQVLCLLNMVLPEEL--LDNEEYEDIRADIEQECSK 408
Query: 691 FGSVKSVNV 699
+G V S+ +
Sbjct: 409 YGEVLSLKI 417
>gi|334187224|ref|NP_001190937.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
gi|332661290|gb|AEE86690.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
Length = 551
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 135/313 (43%), Gaps = 63/313 (20%)
Query: 438 ISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ S L++ Q+ R RR+ V L +A+E+++ F + + + G
Sbjct: 191 LPTGQSFGGLSMMPIQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAG 250
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------- 548
+ I EK AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 251 PGDAVVNVYINHEKKFAFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLG 310
Query: 549 -------VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEV 584
+ +A IF+GG+ + V E++ +FG LK F++
Sbjct: 311 PSQPSPHLNLAAVGLTPGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKG--FDL 368
Query: 585 NEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPF 640
+D E + AF Y D VT A A LNG+K+G + LT V+ G+++
Sbjct: 369 VKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLT-VRRANQGTMLQK------ 421
Query: 641 HGIPKHALPLLKKPTEVLKLKNVFNPEGFSS----LSEL----------EVEEVLEDVRL 686
P+ LL ++ + + P ++ L+++ E +++ED+R
Sbjct: 422 ---PEQENVLLHAQQQIAFQRVMLQPGAVATTVVCLTQVVTEDELRDDEEYGDIMEDMRQ 478
Query: 687 ECARFGSVKSVNV 699
E +FG++ +V +
Sbjct: 479 EGGKFGALTNVVI 491
>gi|242803779|ref|XP_002484243.1| splicing factor u2af large subunit [Talaromyces stipitatus ATCC
10500]
gi|218717588|gb|EED17009.1| splicing factor u2af large subunit [Talaromyces stipitatus ATCC
10500]
Length = 543
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/367 (23%), Positives = 154/367 (41%), Gaps = 83/367 (22%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS------DP--VTSTTQKPLAGI 438
+++ +WD+ P +V T+ QA S + + DP + + +P AG
Sbjct: 148 KRRLTQWDIKPPGYDNV-----TAEQAKLSGMFPLPGAPRQQAVDPSRLQALVNQPAAGT 202
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
+ +++L N +R +RL NLP + +E AL+ F N L +G+ + G
Sbjct: 203 TENSALRPAN-----------SRQAKRLFAHNLPPNVTEAALVSFFN--LQLNGLNVIEG 249
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-----------SGSILKIKRPKE 547
PC+ I ++ A +EF A + + AL DG + L+++RPK+
Sbjct: 250 IDPCVSAQISKDHSFALLEFKGANETTVALALDGITMEEHESAATANGGARGLELRRPKD 309
Query: 548 FVEVAI-----------------------FIGGISRTLSSKMVMEIVCAFGPLKAYHF-- 582
++ ++ I I + + V ++ + G LKA+
Sbjct: 310 YIVPSVPEDQQPHQESVISNHVPDSPNKLCITNIPLYIPEEPVTMLLKSIGELKAFVLVK 369
Query: 583 EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSI---MDNSGNPP 639
+ D AF EYVD T A+ LNG+++G + L A + + +D N
Sbjct: 370 DSGTDESRGIAFCEYVDAASTAIAVESLNGMELGDKHLKITHASIGATQAAGLDMGVN-- 427
Query: 640 FHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG 692
A+ + K T +L+L N+ + ++ + EE+LEDV+ EC+++G
Sbjct: 428 -------AMSMFAKTTSADLETSRILQLLNMVTADEL--INNDDYEEILEDVQDECSKYG 478
Query: 693 SVKSVNV 699
V V +
Sbjct: 479 QVLDVKI 485
>gi|15234495|ref|NP_195387.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
gi|75318082|sp|O23212.2|U2A2A_ARATH RecName: Full=Splicing factor U2af large subunit A; AltName:
Full=U2 auxiliary factor 65 kDa subunit A; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit A; Short=U2 snRNP auxiliary factor large
subunit A
gi|18087531|gb|AAL58899.1|AF462805_1 At4g35590/C7A10_670 [Arabidopsis thaliana]
gi|4006898|emb|CAB16828.1| splicing factor-like protein [Arabidopsis thaliana]
gi|7270617|emb|CAB80335.1| splicing factor-like protein [Arabidopsis thaliana]
gi|23506119|gb|AAN28919.1| At4g35590/C7A10_670 [Arabidopsis thaliana]
gi|24030414|gb|AAN41365.1| putative splicing factor [Arabidopsis thaliana]
gi|332661287|gb|AEE86687.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
Length = 573
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 135/313 (43%), Gaps = 63/313 (20%)
Query: 438 ISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ S L++ Q+ R RR+ V L +A+E+++ F + + + G
Sbjct: 213 LPTGQSFGGLSMMPIQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAG 272
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------- 548
+ I EK AFVE + E+AS A+ DG F G+ +K++RP ++
Sbjct: 273 PGDAVVNVYINHEKKFAFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLG 332
Query: 549 -------VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEV 584
+ +A IF+GG+ + V E++ +FG LK F++
Sbjct: 333 PSQPSPHLNLAAVGLTPGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKG--FDL 390
Query: 585 NEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPF 640
+D E + AF Y D VT A A LNG+K+G + LT V+ G+++
Sbjct: 391 VKDRETGNSKGYAFCVYQDLSVTDIACAALNGIKMGDKTLT-VRRANQGTMLQK------ 443
Query: 641 HGIPKHALPLLKKPTEVLKLKNVFNPEGFSS----LSEL----------EVEEVLEDVRL 686
P+ LL ++ + + P ++ L+++ E +++ED+R
Sbjct: 444 ---PEQENVLLHAQQQIAFQRVMLQPGAVATTVVCLTQVVTEDELRDDEEYGDIMEDMRQ 500
Query: 687 ECARFGSVKSVNV 699
E +FG++ +V +
Sbjct: 501 EGGKFGALTNVVI 513
>gi|326428095|gb|EGD73665.1| U2 small nuclear ribonucleoprotein auxiliary factor 2 isoform 2
[Salpingoeca sp. ATCC 50818]
Length = 415
Score = 84.0 bits (206), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 97/209 (46%), Gaps = 28/209 (13%)
Query: 462 PMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTA 521
P R+L VENLP + LM F+N+ + S+ + G P I C + + A +EF +
Sbjct: 34 PTRQLVVENLPEGTRDHDLMSFMNDCIASNKLITQPGQ-PVIKCTLSEDGKSAVLEFRSV 92
Query: 522 EDASAALCCDGCSFSGSILKIKRPKEF------------------------VEVAIFIGG 557
++A+ L D F G+ L+++RP + I+IGG
Sbjct: 93 DEATNGLVFDRERFKGAQLRVRRPDNYEAPKGHITRIPMQSGANVSAVVQDSPYKIYIGG 152
Query: 558 ISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKV 615
I LS + V E++ FG LK++H + + + AF EY+D +T I LN L++
Sbjct: 153 IPPYLSEEQVKELLEPFGQLKSFHLVMDTSTGQSKGFAFCEYMDPAITDTMIGALNDLRI 212
Query: 616 GGQVLTAVQAVLDGSIMDNSGNPPFHGIP 644
G + L +A + G+ N P H IP
Sbjct: 213 GEKRLLVQRASI-GARGGAQQNNPMHSIP 240
>gi|22655131|gb|AAM98156.1| putative U2 snRNP auxiliary factor [Arabidopsis thaliana]
Length = 589
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 133/308 (43%), Gaps = 65/308 (21%)
Query: 444 LAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCI 503
L L V Q+ R R+ V LP +A+E+++ F + + + G +
Sbjct: 235 LGALPVLPVQAMTQQATRHAPRVYVGGLPPTANEQSVSTFFSQVMSAIGGNTAGPGDAVV 294
Query: 504 GCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF--------------- 548
I EK AFVE + E+AS A+ DG G +K++RP ++
Sbjct: 295 NVYINHEKKFAFVEMRSVEEASNAMALDGIILEGVPVKVRRPTDYNPSLAATLGPSQPNP 354
Query: 549 ------VEVA------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE- 589
V ++ IF+GG+ + + E++ +FGPL+ F + +D E
Sbjct: 355 NLNLGAVGLSSGSTGGLEGPDRIFVGGLPYYFTEVQIRELLESFGPLRG--FNLVKDRET 412
Query: 590 ---EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKH 646
+ AF Y D VT A A LNG+K+G + LT V+ + G+I P+
Sbjct: 413 GNSKGYAFCVYQDPSVTDIACAALNGIKMGDKTLT-VRRAIQGAIQPK---------PEQ 462
Query: 647 ALPLLKKPTEVLKLKNVFNPEGFSS----LSEL----------EVEEVLEDVRLECARFG 692
LL ++ + +F P G + L+++ E E++ED+R E +FG
Sbjct: 463 EEVLLYAQQQIALQRLMFQPGGTPTKIVCLTQVVTADDLRDDEEYAEIMEDMRQEGGKFG 522
Query: 693 SVKSVNVV 700
++ VNVV
Sbjct: 523 NL--VNVV 528
>gi|413920348|gb|AFW60280.1| hypothetical protein ZEAMMB73_339264 [Zea mays]
Length = 549
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 115/266 (43%), Gaps = 49/266 (18%)
Query: 407 VHTSNQAASSNAHEMVSSDPVTST---TQKPLAGISVSASLAKLN-------VSMDSVQL 456
V +QA +A +V++ + P+ G+ V +L L V
Sbjct: 193 VSGFDQAPPQHALPIVAAGAIPGQLPGITAPIPGVGVLPNLYNLAAGQFNPLVIQPQAMT 252
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFV 516
Q+ R RR+ V LP +A+E+ + F N + + G + I +K AFV
Sbjct: 253 QQATRHARRVYVGGLPPTANEQTVAIFFNGVMAAIGGNTAGPGDAVLNVYINHDKKFAFV 312
Query: 517 EFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA-------- 552
E + E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 313 EMRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGS 372
Query: 553 ---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVD 599
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 373 AGGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQD 430
Query: 600 QLVTPKAIAGLNGLKVGGQVLTAVQA 625
VT A A LNG+K+G + LT +A
Sbjct: 431 LNVTDIACAALNGIKMGDKTLTVRRA 456
>gi|355727237|gb|AES09128.1| U2 small nuclear RNA auxiliary factor 2 [Mustela putorius furo]
Length = 301
Score = 83.6 bits (205), Expect = 5e-13, Method: Composition-based stats.
Identities = 70/264 (26%), Positives = 113/264 (42%), Gaps = 45/264 (17%)
Query: 368 RKRRTEAAAKTPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPV 427
R + E SP + +K WDV P + + + QAA
Sbjct: 54 RGAKEEHGGLIRSPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG------------ 101
Query: 428 TSTTQKPLAGISVSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLN 485
Q P + + + L V+ V + +Q R RRL V N+P +E+A+M+F N
Sbjct: 102 ----QIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFN 157
Query: 486 NFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRP 545
+ G+ G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP
Sbjct: 158 AQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRP 216
Query: 546 KEFVEV------------------------AIFIGGISRTLSSKMVMEIVCAFGPLKAYH 581
++ + +FIGG+ L+ V E++ +FGPLKA++
Sbjct: 217 HDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVKELLTSFGPLKAFN 276
Query: 582 F--EVNEDHEEPCAFIEYVDQLVT 603
+ + AF EYVD VT
Sbjct: 277 LVKDSATGLSKGYAFCEYVDINVT 300
>gi|296811258|ref|XP_002845967.1| splicing factor U2AF subunit [Arthroderma otae CBS 113480]
gi|238843355|gb|EEQ33017.1| splicing factor U2AF subunit [Arthroderma otae CBS 113480]
Length = 557
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 94/389 (24%), Positives = 162/389 (41%), Gaps = 92/389 (23%)
Query: 378 TPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS 424
TP P R P +++ +WD+ P +V T+ QA S + +
Sbjct: 136 TPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYENV-----TAEQAKLSGMFPLPGA 190
Query: 425 ------DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSAS 476
DP + + P A S + +L K ++R +RL N+P S +
Sbjct: 191 PRQQAVDPSRLQAFINPPTASGSSNNTLLK----------PSNSRQSKRLFAHNIPPSVT 240
Query: 477 EKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF- 535
E L +F N L +G+ + G PC I ++ A +EF TA DA+ AL DG +
Sbjct: 241 EDTLQQFFN--LQLNGLNVISGVDPCQSVQISKDGKFALLEFNTAADATVALAFDGITME 298
Query: 536 ------------SGSI--LKIKRPKEFVEVA----------------------IFIGGIS 559
+G + L I RPK+++ I + I
Sbjct: 299 EHEANQESNGESNGQVKGLSIVRPKDYIVPLPTEEEPRQEGVLSSNVPDSPNKICVSNIP 358
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
+ V ++ +FG LK++ +V D AF EY+D T A+ GLNG+++G
Sbjct: 359 PFIQEDQVTMLLISFGELKSFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGD 418
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFS 670
+ L +A + G++ + G+ +A+ + K T VL+L N+ +
Sbjct: 419 RRLKVNRASI-GTVQAAGLD---MGV--NAMSMFAKTTSQDLETGRVLQLLNMVTADEL- 471
Query: 671 SLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + EE+ EDV+ EC+++G V+ + +
Sbjct: 472 -IDNDDYEEICEDVQDECSKYGVVEELKI 499
>gi|212723502|ref|NP_001131562.1| uncharacterized protein LOC100192903 [Zea mays]
gi|194691860|gb|ACF80014.1| unknown [Zea mays]
gi|195646366|gb|ACG42651.1| splicing factor U2AF 65 kDa subunit [Zea mays]
gi|413920213|gb|AFW60145.1| Splicing factor U2AF subunit [Zea mays]
Length = 539
Score = 83.2 bits (204), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 129/287 (44%), Gaps = 51/287 (17%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP SA+E+ + + N + + G + I +K AFVE
Sbjct: 199 QATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 258
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 259 MRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAGSN 318
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 319 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 376
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDG--------SIMDNSGNPPFHGIPKHALPLLK 652
VT A A LNG+K+G + LT +A SI+ + + + L
Sbjct: 377 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGAL- 435
Query: 653 KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PT+V+ L V + + E E+++ED+RLE ++G++ V +
Sbjct: 436 -PTKVVCLTQVVTADELK--DDEEYEDIMEDMRLEAGKYGNLVKVII 479
>gi|122245119|sp|Q2QKB3.1|U2A2A_WHEAT RecName: Full=Splicing factor U2af large subunit A; AltName:
Full=U2 auxiliary factor 65 kDa subunit A; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit A; Short=U2 snRNP auxiliary factor large
subunit A
gi|68036924|gb|AAY84881.1| U2AF large subunit [Triticum aestivum]
Length = 591
Score = 83.2 bits (204), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 93/205 (45%), Gaps = 39/205 (19%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP A+E+ + F N + + G + I +K AFVE
Sbjct: 266 QATRHARRVYVGGLPPIANEQTVAVFFNQVMAAIGGNTFALGHAVVNVYINHDKKFAFVE 325
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 326 MRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSQAAALGPSQPNPNLNLAAVGLTPGAG 385
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ FGPL+ F++ +D E + AF Y D
Sbjct: 386 GGLEGPDRIFVGGLPYYFTEAQVRELLETFGPLRG--FDIVKDKETGNSKGYAFCLYKDG 443
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQA 625
VT A A LNG+++G + LT +A
Sbjct: 444 TVTDIACAALNGIQLGDRTLTVRRA 468
>gi|315044445|ref|XP_003171598.1| splicing factor U2AF subunit [Arthroderma gypseum CBS 118893]
gi|311343941|gb|EFR03144.1| splicing factor U2AF subunit [Arthroderma gypseum CBS 118893]
Length = 565
Score = 83.2 bits (204), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 94/389 (24%), Positives = 162/389 (41%), Gaps = 92/389 (23%)
Query: 378 TPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS 424
TP P R P +++ +WD+ P +V T+ QA S + +
Sbjct: 144 TPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYENV-----TAEQAKVSGMFPLPGA 198
Query: 425 ------DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSAS 476
DP + + P A S + +L K ++R +RL N+P S +
Sbjct: 199 PRQQAVDPSRLQAFMNPPAASGSNTNTLLK----------PSNSRQSKRLFTHNIPPSVT 248
Query: 477 EKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF- 535
E L +F N L +G+ + G PC I ++ A +EF TA DA+ AL DG +
Sbjct: 249 EDTLQQFFN--LQLNGLNVISGVDPCQSVQISKDGKFALLEFNTAADATVALAFDGITME 306
Query: 536 ------------SGSI--LKIKRPKEFVEVA----------------------IFIGGIS 559
+G + L I RPK+++ I + I
Sbjct: 307 EHEANRENNGESNGEVKGLSIIRPKDYIVPLPTDEEPHQEGVVSSNVPDSPNKICVSNIP 366
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
+ V ++ +FG LK++ +V D AF EY+D T A+ GLNG+++G
Sbjct: 367 PFIQEDQVTMLLVSFGELKSFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGD 426
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFS 670
+ L +A + G++ + G+ +A+ + K T VL+L N+ +
Sbjct: 427 RRLKVNRASI-GTVQAAGLD---MGV--NAMSMFAKTTSQDLETGRVLQLLNMVTADEL- 479
Query: 671 SLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + EE+ EDV+ EC+++G V+ + +
Sbjct: 480 -IDNEDYEEICEDVQEECSKYGVVEELKI 507
>gi|9858779|gb|AAG01126.1|AF273333_11 BAC19.11 [Solanum lycopersicum]
Length = 532
Score = 83.2 bits (204), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/316 (26%), Positives = 140/316 (44%), Gaps = 82/316 (25%)
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
+ + S+ + +M + Q+ R RR+ V LP +A+E+ L L V +V+
Sbjct: 184 TTNPSIPGMFSNMFPLAAGQATRHARRVYVGGLPPTANEQVLKILLK--FSGDAVVNVY- 240
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF---------- 548
I EK AFVE + E+AS A+ DG F G +K++RP ++
Sbjct: 241 --------INHEKKFAFVEMRSVEEASNAMALDGVIFEGGPVKVRRPSDYNPSLAATLGP 292
Query: 549 ------VEVA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN 585
+ +A IF+GG+ + + E++ +FG L+ F++
Sbjct: 293 SQPSPNLNLAAVGLTPGSSGGLEGPDRIFVGGLPYYFTESQIRELLESFGQLRG--FDLV 350
Query: 586 EDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSG----N 637
+D E + AF Y D VT A A LNG+K+G + LT +A N G N
Sbjct: 351 KDRETGNSKGYAFCVYQDVSVTDIACAALNGIKMGDKTLTVRRA--------NQGTTQPN 402
Query: 638 PPFHGIPKHALP------LLKKP----TEVLKLKNVFNPEGFSSLSEL----EVEEVLED 683
P + HA + +P T+VL L V S+ EL + +++LED
Sbjct: 403 PEQESVLLHAQQQIALQRFMLQPGALATKVLCLTEVV------SVDELKDDDDYQDILED 456
Query: 684 VRLECARFGSVKSVNV 699
+R+EC +FG++ +V +
Sbjct: 457 MRIECGKFGALLNVVI 472
>gi|413920212|gb|AFW60144.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
Length = 502
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 129/287 (44%), Gaps = 51/287 (17%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP SA+E+ + + N + + G + I +K AFVE
Sbjct: 199 QATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 258
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 259 MRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAGSN 318
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 319 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 376
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDG--------SIMDNSGNPPFHGIPKHALPLLK 652
VT A A LNG+K+G + LT +A SI+ + + + L
Sbjct: 377 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGAL- 435
Query: 653 KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PT+V+ L V + + E E+++ED+RLE ++G++ V +
Sbjct: 436 -PTKVVCLTQVVTADELK--DDEEYEDIMEDMRLEAGKYGNLVKVII 479
>gi|326475623|gb|EGD99632.1| splicing factor u2af large subunit [Trichophyton tonsurans CBS
112818]
gi|326483752|gb|EGE07762.1| splicing factor U2AF subunit [Trichophyton equinum CBS 127.97]
Length = 565
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 94/389 (24%), Positives = 162/389 (41%), Gaps = 92/389 (23%)
Query: 378 TPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS 424
TP P R P +++ +WD+ P +V T+ QA S + +
Sbjct: 135 TPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYENV-----TAEQAKVSGMFPLPGA 189
Query: 425 ------DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSAS 476
DP + + P A S + +L K ++R +RL N+P S +
Sbjct: 190 PRQQAVDPSRLQAFMNPPAASGSSNNTLLK----------PSNSRQSKRLFAHNIPPSVT 239
Query: 477 EKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF- 535
E L +F N L +G+ + G PC I ++ A +EF TA DA+ AL DG +
Sbjct: 240 EDTLQQFFN--LQLNGLNVISGVDPCQSVQISKDGKFALLEFNTAADATVALAFDGITME 297
Query: 536 ------------SGSI--LKIKRPKEFVEVA----------------------IFIGGIS 559
+G + L I RPK+++ I + I
Sbjct: 298 EHEANRESNGESNGEVKGLTIVRPKDYIVPLPTDEEPRQEGVVSSNVPDSPNKICVSNIP 357
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
+ V ++ +FG LK++ +V D AF EY+D T A+ GLNG+++G
Sbjct: 358 PFIQEDQVTMLLVSFGELKSFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGD 417
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFS 670
+ L +A + G++ + G+ +A+ + K T VL+L N+ +
Sbjct: 418 RRLKVNRASI-GTVQAAGLD---MGV--NAMSMFAKTTSQDLETGRVLQLLNMVTADEL- 470
Query: 671 SLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + EE+ EDV+ EC+++G V+ + +
Sbjct: 471 -IDNEDYEEICEDVQEECSKYGVVEELKI 498
>gi|242069419|ref|XP_002449986.1| hypothetical protein SORBIDRAFT_05g026740 [Sorghum bicolor]
gi|241935829|gb|EES08974.1| hypothetical protein SORBIDRAFT_05g026740 [Sorghum bicolor]
Length = 545
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 129/287 (44%), Gaps = 51/287 (17%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP SA+E+ + + N + + G + I +K AFVE
Sbjct: 205 QATRHARRVYVGGLPPSANEQTVAVYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 264
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 265 MRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAGST 324
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 325 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 382
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDG--------SIMDNSGNPPFHGIPKHALPLLK 652
VT A A LNG+K+G + LT +A SI+ + + + L
Sbjct: 383 TVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESILLQAQQQVQLQKLVYQVGAL- 441
Query: 653 KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PT+V+ L V + + E E+++ED+RLE ++G++ V +
Sbjct: 442 -PTKVVCLTQVVTADELK--DDEEYEDIMEDMRLEAGKYGTLVKVII 485
>gi|357438349|ref|XP_003589450.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355478498|gb|AES59701.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 611
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 132/290 (45%), Gaps = 70/290 (24%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V L +A+E+++ F + + + G G+ G + R
Sbjct: 284 QATRHARRVYVGGLSPTANEQSVATFFSQVMATIG-----GNTAGPGDAVMR-------- 330
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF---VEVA---------------------- 552
+ E+AS A+ DG F G+ +K++RP ++ + A
Sbjct: 331 --SVEEASNAMALDGIIFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLGLVGLSPGSA 388
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + + E++ FGPL+ F++ +D E + AF Y D
Sbjct: 389 GGLDGPDRIFVGGVPYYFTETQIRELLETFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 446
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP------LLKKP 654
VT A A LNG+K+G + LT +A + + M P I HA L+ +P
Sbjct: 447 AVTDIACAALNGIKMGDKTLTVRRANQNTNPMQP--KPEQESILMHAQQQIALQKLMLQP 504
Query: 655 ----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
T+VL L + +P+ + + EE+L+D+R EC++FG++ VNVV
Sbjct: 505 ALVATKVLCLTHAVSPDELK--DDEDYEEILDDMRQECSKFGNL--VNVV 550
>gi|168030966|ref|XP_001767993.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680835|gb|EDQ67268.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 499
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 128/280 (45%), Gaps = 42/280 (15%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
R RR+ V LP A+E+++ F + + + G + I +EK AFVE
Sbjct: 162 TRHARRVYVGGLPPMANEQSVATFFSQVMAAVGGNTAGPGDAVVNVYINQEKRFAFVEMR 221
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEF---------------------VEVA------ 552
T E+AS A+ DG + G ++++RP ++ V +
Sbjct: 222 TVEEASNAMALDGIVYEGVSVRVRRPSDYNPSMAATLGPSQPSSHLNLTAVGLTPGALGG 281
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEEPCAFIEYVDQLVTP 604
IF+GG+ LS + +M+++ +FG L+A+ + D + + F Y D V
Sbjct: 282 ADGPDRIFVGGLPYYLSEEQIMDLLSSFGHLRAFDLVKDRDTGNSKGYGFCVYQDPSVMD 341
Query: 605 KAIAGLNGLKVGGQVLTAVQA---VLDGSIMDNSGNPPFHGIPKHALPLL--KKPTEVLK 659
A A LNGLK+G + LT +A + G + N + AL + + T+V+
Sbjct: 342 IACAALNGLKMGDRTLTVRRASARLRFGQPKPDQSNIIVQAQQQIALQVAAPETATKVIC 401
Query: 660 LKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L V + ++E +E++ED++ EC ++GS+ +V +
Sbjct: 402 LSQVVSIVDLK--DDVEFDEIVEDMKEECGKYGSLLNVVI 439
>gi|344251408|gb|EGW07512.1| Splicing factor U2AF 65 kDa subunit [Cricetulus griseus]
Length = 422
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 67/214 (31%), Positives = 107/214 (50%), Gaps = 34/214 (15%)
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------I 553
F + ++ + A+ DG F G LKI+RP ++ + +
Sbjct: 25 FRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKL 84
Query: 554 FIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLN 611
FIGG+ L+ V E++ +FGPLKA++ + + AF EYVD VT +AIAGLN
Sbjct: 85 FIGGLPNYLNDDQVKELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLN 144
Query: 612 GLKVGGQVLTAVQAVLDG--SIMDNSGNPPFH----GIPKHALPLLKKPTEVLKLKNVFN 665
G+++G + L +A + + + P G+ + + PTEVL L N+
Sbjct: 145 GMQLGDKKLLVQRASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVL 204
Query: 666 PEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PE L + E EE++EDVR EC+++G VKS+ +
Sbjct: 205 PE--ELLDDEEYEEIVEDVRDECSKYGLVKSIEI 236
>gi|312372039|gb|EFR20089.1| hypothetical protein AND_20681 [Anopheles darlingi]
Length = 384
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 132/329 (40%), Gaps = 95/329 (28%)
Query: 383 NRSPEKK-SAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVS 441
NRS +K S WDV P + + + QAA +V+ P + P+ G +++
Sbjct: 82 NRSRRRKPSLYWDVPPPGFEHITPLQYKAMQAAGQIPANIVADTPQAAV---PVVGSTIT 138
Query: 442 ASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLP 501
R RRL V N+P +E+ +MEF N + SG+ G+ P
Sbjct: 139 -------------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLSGLAQAAGN-P 178
Query: 502 CIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-----------VE 550
+ C I +K AF+EF + ++ + A+ D +F G LKI+RP ++ V
Sbjct: 179 VLACQINLDKNFAFLEFRSIDETTQAMAFDSINFKGQSLKIRRPHDYQPMPGMTDSATVN 238
Query: 551 VAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
V G+ T+ +I G L Y +NED + GL
Sbjct: 239 VPEKFSGVISTVVPDSAHKIFI--GGLPNY---LNEDQ------------------VPGL 275
Query: 611 NGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFS 670
S++ +SG P TEVL L N+ P+
Sbjct: 276 -------------------SLVGSSGPP----------------TEVLCLLNMVTPDELK 300
Query: 671 SLSELEVEEVLEDVRLECARFGSVKSVNV 699
E E E++LED+R EC ++G V+SV +
Sbjct: 301 --DEEEYEDILEDIREECNKYGVVRSVEI 327
>gi|145344032|ref|XP_001416543.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576769|gb|ABO94836.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 291
Score = 82.4 bits (202), Expect = 1e-12, Method: Composition-based stats.
Identities = 79/288 (27%), Positives = 121/288 (42%), Gaps = 52/288 (18%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V +PL+ +E + F NN LL+ G + P + I EK AFVE
Sbjct: 3 QATRHARRIYVGGIPLTTNEADVNAFFNNALLAVGGTNGAEGQPVVNVYINVEKKFAFVE 62
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEFV---------------------------- 549
F + E+AS AL DG G ++I+RP ++
Sbjct: 63 FRSVEEASNALALDGIVLDGVPVRIRRPNDYNPSLAHDLGPSMPNPALNLAAIGLDPSAL 122
Query: 550 -------------EVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPC 592
E IFIGG+ L + E++ AFGP++ F++ D E +
Sbjct: 123 QRAGVGGNLLHEHEDRIFIGGLPYFLDEAQIRELLEAFGPIR--QFDLVRDKETGNSKGY 180
Query: 593 AFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLK 652
F+ Y D VT A GLNG+ +G + LT +A + S P +P +
Sbjct: 181 GFVVYEDVSVTDIACQGLNGMTMGDKTLTVRRA--EQSNAPGGVQPGMMNVPPPPPAIAA 238
Query: 653 KPTEVLKLKNVFNPEGFSS---LSELEVEEVLEDVRLECARFGSVKSV 697
PT F+ G + + E E ++ED++ EC + G + SV
Sbjct: 239 PPTNPPSTVVSFDNMGLTEEELADDEEYENIMEDMQEECGKHGEIVSV 286
>gi|327297188|ref|XP_003233288.1| splicing factor u2af large subunit [Trichophyton rubrum CBS 118892]
gi|326464594|gb|EGD90047.1| splicing factor u2af large subunit [Trichophyton rubrum CBS 118892]
Length = 563
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/389 (23%), Positives = 163/389 (41%), Gaps = 92/389 (23%)
Query: 378 TPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS 424
TP P R P +++ +WD+ P +V T+ QA S + +
Sbjct: 135 TPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYENV-----TAEQAKVSGMFPLPGA 189
Query: 425 ------DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSAS 476
DP + + P A S + +L K ++R +RL N+P + +
Sbjct: 190 PRQQAVDPSRLQAFMNPPAASGSGNNTLLK----------PSNSRQSKRLFAHNIPPNVT 239
Query: 477 EKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF- 535
E L +F N L +G+ + G PC I ++ A +EF TA DA+ AL DG +
Sbjct: 240 EDTLQQFFN--LQLNGLNVISGVDPCQSVQISKDGKFALLEFNTAADATVALAFDGITME 297
Query: 536 ------------SGSI--LKIKRPKEFVEVA----------------------IFIGGIS 559
+G++ L I RPK+++ I + I
Sbjct: 298 EHEANRESNGESNGNVKGLTIVRPKDYIVPLPTDEEPRQEGVVSSNVPDSPNKICVSNIP 357
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
+ V ++ +FG LK++ +V D AF EY+D T A+ GLNG+++G
Sbjct: 358 PFIQEDQVTMLLVSFGELKSFVLVKDVGTDESRGIAFCEYLDSASTGIAVEGLNGMELGD 417
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPT-------EVLKLKNVFNPEGFS 670
+ L +A + G++ + G+ +A+ + K T VL+L N+ +
Sbjct: 418 RRLKVNRASI-GTVQAAGLD---MGV--NAMSMFAKTTSQDLETGRVLQLLNMVTADEL- 470
Query: 671 SLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + EE+ EDV+ EC+++G V+ + +
Sbjct: 471 -IDNEDYEEICEDVQEECSKYGVVEELKI 498
>gi|218186084|gb|EEC68511.1| hypothetical protein OsI_36782 [Oryza sativa Indica Group]
Length = 574
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 129/289 (44%), Gaps = 53/289 (18%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP +A+E + + N + + G + I +K AFVE
Sbjct: 232 QATRHARRVYVGGLPPTANEHTVAVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVE 291
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 292 MRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSA 351
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 352 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 409
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK------- 653
VT A A LNG+K+G + LT +A S + + A ++K
Sbjct: 410 NVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESMLLHVQQQA--QMQKLMFQVGG 467
Query: 654 ---PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PT+V+ L V +P+ + E E++++D+R E R+G++ V +
Sbjct: 468 GALPTKVVCLTQVISPDELR--DDEEYEDIVQDMREEGCRYGNLVKVVI 514
>gi|302813497|ref|XP_002988434.1| hypothetical protein SELMODRAFT_24180 [Selaginella moellendorffii]
gi|300143836|gb|EFJ10524.1| hypothetical protein SELMODRAFT_24180 [Selaginella moellendorffii]
Length = 339
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 129/298 (43%), Gaps = 50/298 (16%)
Query: 438 ISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFL-LSSGVQHV 496
+ + A +A+L V + Q+ Q +P RR+ V LP E + F N+ + + G +
Sbjct: 2 VVLPAGIAQLPVVLRMPQMPQITKPARRVYVGGLPAVVDEARIATFFNHAMAVIEGNTYG 61
Query: 497 HGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-------- 548
G + I K AFVE + E+AS A+ DG F GS ++I+RP +
Sbjct: 62 QGGDAVVSVFIDHAKNYAFVEMRSVEEASNAMALDGIIFEGSQVRIRRPSNYNPEHAMLF 121
Query: 549 -------------VEVA----------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN 585
V + IFIGG+ V +++ FG L+A ++
Sbjct: 122 GSSQPSPSLRLDKVGLVYRAHADGPDRIFIGGLPYEWGDAEVRQLLEPFGALRA--LDIV 179
Query: 586 ED----HEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFH 641
+D + F Y + T A A LN + G++L +A ++SGNP
Sbjct: 180 KDSYTRKSKGYGFAVYENPASTDAACAALNQKPLEGKILRVHRAT------NSSGNPALV 233
Query: 642 GIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+P+ + + T V+ L N + E E E E++ED++ EC ++G + SV +
Sbjct: 234 LLPQSS----ELGTRVVCLCNAVSEEMLR--DEKEYAEIIEDMKEECGKYGPLVSVEI 285
>gi|212539736|ref|XP_002150023.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
18224]
gi|210067322|gb|EEA21414.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
18224]
Length = 551
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/367 (22%), Positives = 153/367 (41%), Gaps = 83/367 (22%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS------DP--VTSTTQKPLAGI 438
+++ +WD+ P +V T+ QA S + + DP + + +P A
Sbjct: 156 KRRMTQWDIKPPGYDNV-----TAEQAKLSGMFPLPGAPRQQAVDPSRLQALVNQPSATT 210
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
+ S++L N +R +RL NLP + ++ AL+ F N L +G+ + G
Sbjct: 211 TESSTLRPAN-----------SRQAKRLFAYNLPPNVTDAALISFFN--LQLNGLNVIEG 257
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-----------SGSILKIKRPKE 547
PC+ I ++ A +EF +A+ AL DG S L+++RPK+
Sbjct: 258 IDPCVSSQISKDHAFALLEFKGPNEATVALALDGISMEEHEAAATTNGGARGLELRRPKD 317
Query: 548 FVEVA-----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF-- 582
++ + + + I + + V ++ + G L+A+
Sbjct: 318 YIVPSSPEDQQPYQEGVISNQVPDSPNKLCVTNIPLYIPEEPVTMLLKSIGELRAFVLVK 377
Query: 583 EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA---VLDGSIMDNSGNPP 639
+ D AF EYVD T A+ LNG+++G + L A V + +D N
Sbjct: 378 DSGTDESRGIAFCEYVDATATAIAVESLNGMELGDKHLKITHASIGVTQAAGLDMGVN-- 435
Query: 640 FHGIPKHALPLLKK-------PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG 692
A+ + K T VL+L N+ + ++ + EE+LEDV+ EC+++G
Sbjct: 436 -------AMSMFAKTTSADLETTRVLQLLNMVTADEL--INNEDYEEILEDVQDECSKYG 486
Query: 693 SVKSVNV 699
V + +
Sbjct: 487 QVLDLKI 493
>gi|212539738|ref|XP_002150024.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
18224]
gi|210067323|gb|EEA21415.1| splicing factor u2af large subunit [Talaromyces marneffei ATCC
18224]
Length = 556
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/367 (22%), Positives = 153/367 (41%), Gaps = 83/367 (22%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS------DP--VTSTTQKPLAGI 438
+++ +WD+ P +V T+ QA S + + DP + + +P A
Sbjct: 156 KRRMTQWDIKPPGYDNV-----TAEQAKLSGMFPLPGAPRQQAVDPSRLQALVNQPSATT 210
Query: 439 SVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHG 498
+ S++L N +R +RL NLP + ++ AL+ F N L +G+ + G
Sbjct: 211 TESSTLRPAN-----------SRQAKRLFAYNLPPNVTDAALISFFN--LQLNGLNVIEG 257
Query: 499 SLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF-----------SGSILKIKRPKE 547
PC+ I ++ A +EF +A+ AL DG S L+++RPK+
Sbjct: 258 IDPCVSSQISKDHAFALLEFKGPNEATVALALDGISMEEHEAAATTNGGARGLELRRPKD 317
Query: 548 FVEVA-----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF-- 582
++ + + + I + + V ++ + G L+A+
Sbjct: 318 YIVPSSPEDQQPYQEGVISNQVPDSPNKLCVTNIPLYIPEEPVTMLLKSIGELRAFVLVK 377
Query: 583 EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA---VLDGSIMDNSGNPP 639
+ D AF EYVD T A+ LNG+++G + L A V + +D N
Sbjct: 378 DSGTDESRGIAFCEYVDATATAIAVESLNGMELGDKHLKITHASIGVTQAAGLDMGVN-- 435
Query: 640 FHGIPKHALPLLKK-------PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG 692
A+ + K T VL+L N+ + ++ + EE+LEDV+ EC+++G
Sbjct: 436 -------AMSMFAKTTSADLETTRVLQLLNMVTADEL--INNEDYEEILEDVQDECSKYG 486
Query: 693 SVKSVNV 699
V + +
Sbjct: 487 QVLDLKI 493
>gi|115486373|ref|NP_001068330.1| Os11g0636900 [Oryza sativa Japonica Group]
gi|122248736|sp|Q2R0Q1.2|U2A2A_ORYSJ RecName: Full=Splicing factor U2af large subunit A; AltName:
Full=U2 auxiliary factor 65 kDa subunit A; AltName:
Full=U2 small nuclear ribonucleoprotein auxiliary factor
large subunit A; Short=U2 snRNP auxiliary factor large
subunit A
gi|108864607|gb|ABA94914.2| U2 snRNP auxilliary factor, large subunit, splicing factor family
protein, expressed [Oryza sativa Japonica Group]
gi|113645552|dbj|BAF28693.1| Os11g0636900 [Oryza sativa Japonica Group]
gi|222616290|gb|EEE52422.1| hypothetical protein OsJ_34542 [Oryza sativa Japonica Group]
Length = 574
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 129/289 (44%), Gaps = 53/289 (18%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP +A+E + + N + + G + I +K AFVE
Sbjct: 232 QATRHARRVYVGGLPPTANEHTVAVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVE 291
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 292 MRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSA 351
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 352 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 409
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK------- 653
VT A A LNG+K+G + LT +A S + + A ++K
Sbjct: 410 NVTDIACAALNGIKMGDKTLTVRRANQGASQPRPEQESMLLHVQQQA--QMQKLMFQVGG 467
Query: 654 ---PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PT+V+ L V +P+ + E E++++D+R E R+G++ V +
Sbjct: 468 GALPTKVVCLTQVVSPDELR--DDEEYEDIVQDMREEGCRYGNLVKVVI 514
>gi|302656965|ref|XP_003020217.1| hypothetical protein TRV_05722 [Trichophyton verrucosum HKI 0517]
gi|291184026|gb|EFE39599.1| hypothetical protein TRV_05722 [Trichophyton verrucosum HKI 0517]
Length = 486
Score = 80.1 bits (196), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 93/389 (23%), Positives = 162/389 (41%), Gaps = 92/389 (23%)
Query: 378 TPSPINRSP-------------EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSS 424
TP P R P +++ +WD+ P +V T+ QA S + +
Sbjct: 73 TPPPKKREPTPDLTDVVPILERKRRLTQWDIKPPGYENV-----TAEQAKVSGMFPLPGA 127
Query: 425 ------DP--VTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSAS 476
DP + + P A S + +L K ++R +RL N+P + +
Sbjct: 128 PRQQAVDPSRLQAFMNPPAASGSSNNTLLK----------PSNSRQSKRLFAHNIPPNVT 177
Query: 477 EKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSF- 535
E L +F N L +G+ + G PC I ++ A +EF TA DA+ AL DG +
Sbjct: 178 EDTLQQFFN--LQLNGLNVISGVDPCQSVQISKDGKFALLEFNTAADATVALAFDGITME 235
Query: 536 ------------SGSI--LKIKRPKEFVEVA----------------------IFIGGIS 559
+G + L I RPK+++ I + I
Sbjct: 236 EHEANRESNGESNGDVKGLTIVRPKDYIVPIPTDEEPRQEGVVSSNVPDSPNKICVSNIP 295
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
+ V ++ +FG LK++ +V D AF EY+D T A+ GLNG+++G
Sbjct: 296 PFIQEDQVTMLLVSFGELKSFVLVKDVGTDESRGIAFCEYLDPASTGIAVEGLNGMELGD 355
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE-------VLKLKNVFNPEGFS 670
+ L +A + G++ + G+ +A+ + K T VL+L N+ +
Sbjct: 356 RRLKVNRASI-GTVQAAGLD---MGV--NAMSMFAKTTSQDLETGRVLQLLNMVTADEL- 408
Query: 671 SLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + EE+ EDV+ EC+++G V+ + +
Sbjct: 409 -IDNEDYEEICEDVQEECSKYGVVEELKI 436
>gi|396472864|ref|XP_003839217.1| hypothetical protein LEMA_P028900.1 [Leptosphaeria maculans JN3]
gi|312215786|emb|CBX95738.1| hypothetical protein LEMA_P028900.1 [Leptosphaeria maculans JN3]
Length = 587
Score = 80.1 bits (196), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 114/270 (42%), Gaps = 41/270 (15%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
++ +WD+ P ++ T+ QA S + + LA ++ +
Sbjct: 200 RRMTQWDIKPAGYENI-----TAEQAKLSGMFPLPGAPRAAPMDPSKLAAFMTPSAGSAS 254
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
+ ++ QS +RL V NLP S + LMEF N L +G+ V G PC+ I
Sbjct: 255 SAALAPSAAKQS----KRLYVHNLPSGVSSEELMEFFN--LQLNGLNVVSGQDPCLSAQI 308
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSF---SG----SILKIKRPKEFV----------- 549
K A +EF T EDA+ AL +G S SG S L I+RPK+++
Sbjct: 309 ATSKEYAALEFKTPEDATVALAMNGISMREESGGPDRSGLSIRRPKDYITPTADDNAYTG 368
Query: 550 -EVA---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNE--DHEEPCAFIEY 597
EV+ + I I + + V E+V G LKA+ +E D AF EY
Sbjct: 369 DEVSSVVKDSPNKLSIVNIPTYIEEEQVRELVGTMGKLKAFVLVKDESTDQHRGIAFCEY 428
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
D + I GLN + +G L +A +
Sbjct: 429 ADNEIVDAVIEGLNDIPLGDGNLKVTRATV 458
>gi|413920214|gb|AFW60146.1| hypothetical protein ZEAMMB73_955987 [Zea mays]
Length = 536
Score = 80.1 bits (196), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 95/205 (46%), Gaps = 39/205 (19%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP SA+E+ + + N + + G + I +K AFVE
Sbjct: 199 QATRHARRVYVGGLPPSANEQTVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 258
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 259 MRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPSPNLNLAAVGLTAGSN 318
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 319 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 376
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQA 625
VT A A LNG+K+G + LT +A
Sbjct: 377 TVTDIACAALNGIKMGDKTLTVRRA 401
>gi|308801273|ref|XP_003077950.1| U2 snRNP auxiliary factor, large subunit (ISS) [Ostreococcus tauri]
gi|116056401|emb|CAL52690.1| U2 snRNP auxiliary factor, large subunit (ISS) [Ostreococcus tauri]
Length = 388
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 146/376 (38%), Gaps = 101/376 (26%)
Query: 378 TPSPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAG 437
+P P R P ++ +DV P V N ++ + P G
Sbjct: 2 SPGPYERRPRMRATLFDV-------YPEGVDPKNLPSAGAGY-----------GPPPGMG 43
Query: 438 ISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
S +A A Q+ R RR+ + P A+E+ L F N+ L++ G
Sbjct: 44 FSSAAITA------------QTTRHARRIYLGGCPTMANEQELSSFFNDALVAVG-GTTS 90
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-VEVA---- 552
P + I EK AFVEF + E+ S AL DG G ++I+RP ++ ++A
Sbjct: 91 EEAPVVNVYINLEKKFAFVEFRSVEECSNALALDGVMIQGEPVRIRRPNDYNPQIAQGLG 150
Query: 553 -----------------------------------IFIGGISRTLSSKMVMEIVCAFGPL 577
IFIGG+ L V E++ AFGP+
Sbjct: 151 PSTPNPKLNLQAIGLDPSALARSATTNILQEDPNRIFIGGLPYYLEEPQVRELLEAFGPI 210
Query: 578 KAYHFEVNEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA------VL 627
F++ D E + F+ Y D VT A GLNG+++G + LT +A ++
Sbjct: 211 A--RFDLVRDKENGNSKGYGFVVYQDAAVTDIACQGLNGMQMGEKTLTVRRAEQGRTDLI 268
Query: 628 DGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSEL----EVEEVLED 683
G + A+ P+EV+ N+ E EL E E ++ED
Sbjct: 269 GGQVSVPP---------PPAIAPANPPSEVVSFTNMGITE-----EELADDEEFENIMED 314
Query: 684 VRLECARFGSVKSVNV 699
+ EC ++G + SV +
Sbjct: 315 MNEECGKYGKIISVVI 330
>gi|168021052|ref|XP_001763056.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685868|gb|EDQ72261.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 503
Score = 79.7 bits (195), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 145/353 (41%), Gaps = 58/353 (16%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
+K + +D+AP P S A + + P P G +
Sbjct: 99 RKQSGFDMAP------PGAAVVSGAALAGQIPGIAQPMPGVYPGMFPFGGTQQFGGIP-- 150
Query: 448 NVSMDSVQLTQ-SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
M + +TQ + R RR+ V LP A+E+ + + + + + G +
Sbjct: 151 --GMPAQAMTQQATRHARRVYVGGLPPLANEQTIATYFSQVMAAVGGNTAGPGDAVVNVY 208
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VE 550
I +EK AFVE T E+AS A+ DG F G ++++RP ++ +
Sbjct: 209 INQEKKFAFVEMRTVEEASNAMALDGIMFEGVSVRVRRPSDYNPSMAATLGPSQPSPHLN 268
Query: 551 VA------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEE 590
+A IF+GG+ L+ + E++ +FGPL+ + + D + +
Sbjct: 269 LAAVGLTPGNAAGGADGPDRIFVGGLPYYLTEIQIKELLESFGPLRGFDLVKDRDTGNSK 328
Query: 591 PCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD-GSIMDNSGNPPFHGIPKHALP 649
F Y D V A A LNG+K+ + L +A + N H + A+
Sbjct: 329 GYGFCVYQDPSVVDIACATLNGMKMDDKTLNVRRATARLARPKPDQANVLAHAQQQIAIQ 388
Query: 650 LL--------KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV 694
+L + PT V+ L V +P+ + E +++LED++ EC ++G++
Sbjct: 389 VLVYSWMSPVETPTNVVALTQVVSPDELK--DDEEYQDILEDMKEECGKYGNL 439
>gi|30690730|ref|NP_849509.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
gi|19310597|gb|AAL85029.1| putative splicing factor [Arabidopsis thaliana]
gi|332661289|gb|AEE86689.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
Length = 542
Score = 79.3 bits (194), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 82/315 (26%), Positives = 140/315 (44%), Gaps = 58/315 (18%)
Query: 449 VSMDSVQLT--QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
+SM +Q Q+ R RR+ V L +A+E+++ F + + + G +
Sbjct: 222 LSMMPIQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVY 281
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VE 550
I EK AFVE + E+AS A+ DG F G+ +K++RP ++ +
Sbjct: 282 INHEKKFAFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLN 341
Query: 551 VA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE---- 589
+A IF+GG+ + V E++ +FG LK F++ +D E
Sbjct: 342 LAAVGLTPGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKG--FDLVKDRETGNS 399
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKHA 647
+ AF Y D VT A A LNG+K+G + LT V+ G+++ N H + A
Sbjct: 400 KGYAFCVYQDLSVTDIACAALNGIKMGDKTLT-VRRANQGTMLQKPEQENVLLHAQQQIA 458
Query: 648 LP-LLKKP----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFG----SVKSVN 698
++ +P T V+ L V + + E +++ED+R E +FG + +
Sbjct: 459 FQRVMLQPGAVATTVVCLTQVVTEDELR--DDEEYGDIMEDMRQEGGKFGKRPLNCAIWS 516
Query: 699 VVKYGDSNISTIQAC 713
++KY I +I C
Sbjct: 517 ILKY---KIKSILIC 528
>gi|302511201|ref|XP_003017552.1| hypothetical protein ARB_04434 [Arthroderma benhamiae CBS 112371]
gi|291181123|gb|EFE36907.1| hypothetical protein ARB_04434 [Arthroderma benhamiae CBS 112371]
Length = 501
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/286 (25%), Positives = 128/286 (44%), Gaps = 56/286 (19%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFL 519
+R +RL N+P + +E L +F N L +G+ + G PC I ++ A +EF
Sbjct: 161 SRQSKRLFAHNIPPNVTEDTLQQFFN--LQLNGLNVISGVDPCQSVQISKDGKFALLEFN 218
Query: 520 TAEDASAALCCDGCSF-------------SGSI--LKIKRPKEFVEVA------------ 552
TA DA+ AL DG + +G + L I RPK+++
Sbjct: 219 TAADATVALAFDGITMEEHEANRESNGESNGEVKGLTIVRPKDYIVPIPTDEEPRQEGVV 278
Query: 553 ----------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQ 600
I + I + V ++ +FG LK++ +V D AF EY+D
Sbjct: 279 SSNVPDSPNKICVSNIPPFIQEDQVTMLLVSFGELKSFVLVKDVGTDESRGIAFCEYLDP 338
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTE---- 656
T A+ GLNG+++G + L +A + G++ + G+ +A+ + K T
Sbjct: 339 ASTGIAVEGLNGMELGDRRLKVNRASI-GTVQAAGLD---MGV--NAMSMFAKTTSQDLE 392
Query: 657 ---VLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
VL+L N+ + + + EE+ EDV+ EC+++G V+ + +
Sbjct: 393 TGRVLQLLNMVTADEL--IDNEDYEEICEDVQEECSKYGVVEELKI 436
>gi|42573197|ref|NP_974695.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
gi|332661288|gb|AEE86688.1| Splicing factor U2af large subunit A [Arabidopsis thaliana]
Length = 565
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 134/305 (43%), Gaps = 51/305 (16%)
Query: 449 VSMDSVQLT--QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
+SM +Q Q+ R RR+ V L +A+E+++ F + + + G +
Sbjct: 222 LSMMPIQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVY 281
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VE 550
I EK AFVE + E+AS A+ DG F G+ +K++RP ++ +
Sbjct: 282 INHEKKFAFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLN 341
Query: 551 VA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE---- 589
+A IF+GG+ + V E++ +FG LK F++ +D E
Sbjct: 342 LAAVGLTPGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKG--FDLVKDRETGNS 399
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKHA 647
+ AF Y D VT A A LNG+K+G + LT V+ G+++ N H + A
Sbjct: 400 KGYAFCVYQDLSVTDIACAALNGIKMGDKTLT-VRRANQGTMLQKPEQENVLLHAQQQIA 458
Query: 648 LP-LLKKP----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKY 702
++ +P T V+ L V + + E +++ED+R E +F + + Y
Sbjct: 459 FQRVMLQPGAVATTVVCLTQVVTEDELR--DDEEYGDIMEDMRQEGGKFAFCYKESALTY 516
Query: 703 GDSNI 707
D +
Sbjct: 517 TDRRL 521
>gi|303273844|ref|XP_003056274.1| RNA binding protein [Micromonas pusilla CCMP1545]
gi|226462358|gb|EEH59650.1| RNA binding protein [Micromonas pusilla CCMP1545]
Length = 564
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 95/224 (42%), Gaps = 61/224 (27%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSG--VQ--HVHGSLPCIGCVIQREKG 512
TQ+ R RR+ V LP +A+E + F +N L + G VQ G P + + EK
Sbjct: 157 TQATRHARRIYVGGLPATANEASTATFFSNALAAIGGVVQTAAAAGVEPVLNVYMNHEKK 216
Query: 513 QAFVEFLTAEDASAALCCDGCSFSGSILKIKR----------------PKEFVEVA---- 552
AFVEF T E+ S A+ DG F G L+++R P +++A
Sbjct: 217 FAFVEFRTVEETSNAIALDGVVFDGVSLRVRRPNDYNAAIAATLGPSTPSTDLDLAAIGL 276
Query: 553 -------------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYH 581
+F+GG+ L+ MV E+V AFGP K H
Sbjct: 277 VPGAGGAAGGAGAGGAAGGQNNLSPEDTANRLFVGGLPYFLTEPMVKELVEAFGPTK--H 334
Query: 582 FEVNEDHE----EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLT 621
F + D E + F Y D VT A GL+G+K+G + LT
Sbjct: 335 FMLVMDRETGNSKGYGFFVYQDHAVTDVACQGLHGMKMGEKTLT 378
>gi|108864608|gb|ABG22562.1| U2 snRNP auxilliary factor, large subunit, splicing factor family
protein, expressed [Oryza sativa Japonica Group]
Length = 550
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 94/205 (45%), Gaps = 39/205 (19%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP +A+E + + N + + G + I +K AFVE
Sbjct: 232 QATRHARRVYVGGLPPTANEHTVAVYFNQVMAAVGGNTAGPGDAVLNVYINHDKKFAFVE 291
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA--------- 552
+ E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 292 MRSVEEASNAMALDGIMFEGAPVKVRRPTDYNPSLAAALGPSQPNPNLNLAAVGLTPGSA 351
Query: 553 --------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQ 600
IF+GG+ + V E++ +FGPL+ F++ +D E + AF Y D
Sbjct: 352 GGLEGPDRIFVGGLPYYFTEAQVRELLESFGPLRG--FDLVKDRETGNSKGYAFCVYQDL 409
Query: 601 LVTPKAIAGLNGLKVGGQVLTAVQA 625
VT A A LNG+K+G + LT +A
Sbjct: 410 NVTDIACAALNGIKMGDKTLTVRRA 434
>gi|222423510|dbj|BAH19725.1| AT4G36690 [Arabidopsis thaliana]
Length = 565
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/305 (25%), Positives = 134/305 (43%), Gaps = 51/305 (16%)
Query: 449 VSMDSVQLT--QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
+SM +Q Q+ R RR+ V L +A+E+++ F + + + G +
Sbjct: 222 LSMMPIQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAGPGDAVVNVY 281
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VE 550
I EK AFVE + E+AS A+ DG F G+ +K++RP ++ +
Sbjct: 282 INHEKKFAFVEMRSVEEASNAMSLDGIIFEGAPVKVRRPSDYNPSLAATLGPSQPSPHLN 341
Query: 551 VA-----------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE---- 589
+A IF+GG+ + V E++ +FG LK F++ +D E
Sbjct: 342 LAAVGLTPGASGGLEGPDRIFVGGLPYYFTESQVRELLESFGGLKG--FDLVKDRETGNS 399
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDN--SGNPPFHGIPKHA 647
+ AF Y D VT A A LNG+K+G + LT V+ G+++ N H + A
Sbjct: 400 KGYAFCVYQDLSVTDIACAALNGIKMGDKTLT-VRRANQGTMLQKPEQENVLLHAQQQIA 458
Query: 648 LP-LLKKP----TEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKY 702
++ +P T V+ L V + + E +++ED+R E RF + + Y
Sbjct: 459 FQRVMLQPGAVATTVVCLTQVVTEDELR--DDEEYGDIMEDMRQEGGRFAFCYKESALTY 516
Query: 703 GDSNI 707
D +
Sbjct: 517 TDRRL 521
>gi|449299113|gb|EMC95127.1| hypothetical protein BAUCODRAFT_526859 [Baudoinia compniacensis
UAMH 10762]
Length = 432
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 128/286 (44%), Gaps = 59/286 (20%)
Query: 461 RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLT 520
R +RL N+P + +E + +F N L +G+ G PCI + ++ A ++F T
Sbjct: 101 RQSKRLFAYNIPPNVNESMISDFFN--LQLNGLNVTRGVDPCISAQLSQDLTYALLDFKT 158
Query: 521 AEDASAALCCDGCS-------FSGSI------LKIKRPKEFVEVAIF------------- 554
+EDA+ A+ DG + +GS L I+RPK+++ A+
Sbjct: 159 SEDATNAMALDGITMPEHMEVMNGSANGNSQGLIIQRPKDYIVPAVVDDTEHEAGVLSST 218
Query: 555 ---------IGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVD-QLV 602
I I L+ + V E++ +FG LK + + D AF EY D +
Sbjct: 219 VPDTQFKISITHIPSYLTEEQVQELLVSFGELKNFVLVKDAGTDQSRGIAFCEYKDAKNT 278
Query: 603 TPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHG-IPKHALPLL--------KK 653
T A+ LNG+++G L +A + G G + +A+ ++ +
Sbjct: 279 TDIAVESLNGMELGDSHLKVQRASI--------GTQQVGGEMTVNAMSMMASAAGGADRD 330
Query: 654 PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ VL L N+ PE + E +E+LEDV+ ECA++G++ V +
Sbjct: 331 ASRVLCLMNMITPEEL--MDADEADEILEDVKEECAKYGAIIDVKM 374
>gi|193848546|gb|ACF22733.1| U2AF large subunit [Brachypodium distachyon]
Length = 569
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 129/295 (43%), Gaps = 63/295 (21%)
Query: 456 LTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAF 515
Q+ R RR+ V LP +A+E+++ + N + + G + I +K AF
Sbjct: 227 FPQATRHARRVYVGGLPPTANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAF 286
Query: 516 VEFLTAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA------- 552
VE + E+AS A+ DG F G+ +K++RP ++ + +A
Sbjct: 287 VEMRSVEEASNAMALDGILFEGAPVKVRRPTDYNPSLASALGPSQPSSNLNLAAVGLTPG 346
Query: 553 ----------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYV 598
IF+GG+ + V E++ +FG L+ F++ +D E + AF Y
Sbjct: 347 SAGGLEGPDRIFVGGLPYYFTEAQVRELLESFGSLRG--FDLVKDRETGNSKGYAFCVYQ 404
Query: 599 DQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGN----PPFHGI----------P 644
D VT A A LNG+K+G + LT +A N G+ P I
Sbjct: 405 DLNVTDIACAALNGIKMGDKTLTVRRA--------NQGSAQPRPEQENILLQAQQQVQLQ 456
Query: 645 KHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
K + PT+V+ L V + + E E+++ED+RLE ++G++ V +
Sbjct: 457 KLVYQVGALPTKVICLTQVVTADELK--DDEEYEDIMEDMRLEAGKYGTLVKVVI 509
>gi|403331270|gb|EJY64574.1| RNA-binding proteins (RRM domain) [Oxytricha trifallax]
Length = 565
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 83/184 (45%), Gaps = 32/184 (17%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V +P S S+ +++FL L +G G P I EK F+E + E+
Sbjct: 190 RRLYVGGVPTSQSDVQVVQFLTQTLRKAGGILEEGD-PVIKSQNNPEKRYTFLELRSVEE 248
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF--------------------------VE---VAIF 554
AS + DG F S L+I+RP+++ VE + IF
Sbjct: 249 ASTMIQLDGIKFMDSTLRIRRPEDYDKYPQIPPRRPIPQIDTAALGIISTKVEETPLKIF 308
Query: 555 IGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNG 612
+GG+ + + + + ++ +G LK++H N D AF EY D+ AI LNG
Sbjct: 309 VGGLPKEFNEEQIKNLLLRYGQLKSFHLVKHTNIDQSRGFAFCEYTDEKGVQNAIQFLNG 368
Query: 613 LKVG 616
LK+G
Sbjct: 369 LKIG 372
>gi|449680331|ref|XP_002158219.2| PREDICTED: splicing factor U2AF 50 kDa subunit-like, partial [Hydra
magnipapillata]
Length = 259
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/208 (25%), Positives = 98/208 (47%), Gaps = 33/208 (15%)
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------------------I 553
F + E+ + A+ DG G LKI+RPK++ + +
Sbjct: 1 FRSVEETTLAMAFDGIMLQGQALKIRRPKDYQPIPGISEMQATHIPGVVSTVVSDTINKV 60
Query: 554 FIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLN 611
F+GG+ L+ V E++ FG L++++ + + AF EYVD +T AIAG+N
Sbjct: 61 FVGGLPNYLNEDQVKELLSTFGDLRSFNLVKDSATGLSKGYAFCEYVDIGITDVAIAGMN 120
Query: 612 GLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSS 671
G+++G + L +A + M N P + K T +L L N+ +
Sbjct: 121 GMQLGDKKLVVQRASVGSKTMTAQLNIPGFDLSKEI-----TATNILCLMNMVVADEL-- 173
Query: 672 LSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + + +E+ ED+R EC+++G ++S+ +
Sbjct: 174 IDDEDYDEIFEDIREECSKYGRIRSMQI 201
>gi|224003073|ref|XP_002291208.1| U2 snRNP auxillary splicing factor, U2AF subunit [Thalassiosira
pseudonana CCMP1335]
gi|220972984|gb|EED91315.1| U2 snRNP auxillary splicing factor, U2AF subunit, partial
[Thalassiosira pseudonana CCMP1335]
Length = 352
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 124/296 (41%), Gaps = 66/296 (22%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGV---------QHVHGSL---PCIGCVIQREK 511
RRL V N+P S++ L F + + SS + H H + P I I RE+
Sbjct: 5 RRLYVGNIP-DVSDEQLHHFFRDAIRSSIILDNNSEAHSSHKHQYVDNDPIISVYINRER 63
Query: 512 GQAFVEFLTAEDASAALCCDGCSFSG-SILKIKRPKEFVEVA------------------ 552
AF+EF T E +A + DG G +KIKRP ++
Sbjct: 64 RFAFLEFKTMEITTACMALDGLDVMGRGKVKIKRPNDYNPAVAPMLNASTMPVLDTGKLG 123
Query: 553 ------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNE---DHEEPCAFIEY 597
IF+GG+ L V+E++ AFG +KA++ N+ D + F+EY
Sbjct: 124 IISMTVHDGPNKIFVGGLPYHLVDSQVLELLSAFGAVKAFNLVKNDPMSDTSKGYCFVEY 183
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMD--------------NSGNPPFHGI 643
D VT A GLNG+ +GG Q ++ M + G PP
Sbjct: 184 CDPNVTQIAAMGLNGMDMGGGKQPYQQPIVVKDPMAVANAAASALDQAFGSGGVPPLA-- 241
Query: 644 PKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
P A+P T +L L N+ E ++ + + E E+VR E ++G++ S+ +
Sbjct: 242 PPTAMP-GSTVTRILVLLNMVMDEDLATAEDRKFLE--EEVREEVGKYGTLLSMKI 294
>gi|255088499|ref|XP_002506172.1| predicted protein [Micromonas sp. RCC299]
gi|226521443|gb|ACO67430.1| predicted protein [Micromonas sp. RCC299]
Length = 628
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 99/228 (43%), Gaps = 43/228 (18%)
Query: 510 EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF-------------------VE 550
+KG F+E ED A L +G +G + + +RPK++ V
Sbjct: 310 DKGFGFLETTALEDVPAILALNGVRVNGGVTRFRRPKDYDPDNNPLVRDGSYRDVFQRVF 369
Query: 551 VAI------------FIGGI-SRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPC---AF 594
A+ F+GG+ R L+ ++EIV +FG L A+ E D C A+
Sbjct: 370 TAVLSDEVVDSPTKVFVGGVEPRALTKLDLLEIVSSFGALTAFRCET--DGAGLCRGFAW 427
Query: 595 IEYVD-QLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPF-HGIPKHALPLLK 652
+EY + + V KA+AGL+G ++ G+ + A A G + + +P PLLK
Sbjct: 428 MEYAEGESVAAKAVAGLSGYQLRGKPIAAALATPRAEAARTRGERRYTYEVPPEVEPLLK 487
Query: 653 KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
P VL NV G S + ED EC FG+V S +VV
Sbjct: 488 PPQRVLAFSNVLVRGG----SREDRAAATEDTLAECDGFGNVLSTHVV 531
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 29/45 (64%)
Query: 899 CVFVEYMRAEASCMAAHSLHRRLFDDRIVAVEYIPLNLYRARFSK 943
C+FVE+ R E + +AAH+ HRR +D R+V + PL Y+ F K
Sbjct: 550 CLFVEFARVETATIAAHAFHRREYDGRVVECRFFPLTEYQRMFGK 594
>gi|241166827|ref|XP_002409934.1| splicing factor u2af large subunit, putative [Ixodes scapularis]
gi|215494685|gb|EEC04326.1| splicing factor u2af large subunit, putative [Ixodes scapularis]
Length = 444
Score = 75.9 bits (185), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 128/280 (45%), Gaps = 44/280 (15%)
Query: 460 NRPMRRLCVENLPLSASEKA--------LMEFLNNFLLSSGVQHVHGSLPCIGCVIQREK 511
R RRL V N+P SE + +M++ N + + G G+ P + C I +K
Sbjct: 112 TRQARRLYVGNIPFGCSEASRPLLLREEMMDYFNAQMHACGFSQAPGN-PVLACQINLDK 170
Query: 512 GQAFVEFLTAEDASAALCC------DGCS--FSGSILKIKR--PKE----FVEVA----- 552
AF+E + CC D S FSG+ L P + F+
Sbjct: 171 NFAFLEVSALDTDLGTPCCPTFVLYDYLSPPFSGNCLGAGNGTPGDTWLGFLSGVISTVV 230
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTP 604
IFIGG+ L+ V E++ +FG L+A++ + + AF EYV+ T
Sbjct: 231 QDSPHKIFIGGLPNYLNEDQVRELLMSFGQLRAFNLVKDSATGLSKGYAFCEYVEVTTTD 290
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFHGIPKHALPL---LKKPTEVLK 659
+AI GLNG+++G + L +A + S M+ S + P I L L PTEVL
Sbjct: 291 QAIMGLNGMQLGDKKLIVQRASVGAKNSQMNVSRDAPVQ-IQVPGLQLQGGAGPPTEVLC 349
Query: 660 LKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
L N+ PE E E E++LED+ EC ++G VKS+ +
Sbjct: 350 LMNLVCPE--ELKDEEEYEDILEDIHEECNKYGVVKSIEI 387
>gi|330931856|ref|XP_003303563.1| hypothetical protein PTT_15819 [Pyrenophora teres f. teres 0-1]
gi|311320368|gb|EFQ88342.1| hypothetical protein PTT_15819 [Pyrenophora teres f. teres 0-1]
Length = 578
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 111/271 (40%), Gaps = 42/271 (15%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
++ +WD+ P ++ T+ QA S + + LA ++
Sbjct: 190 RRMTQWDIKPAGYENI-----TAEQAKLSGMFPLPGAPRAAPMDPSKLAAFISPSAGTAT 244
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
++ + QS +RL V NLP + + +MEF NN L +G+ V G+ PC+ I
Sbjct: 245 AAALATSAAKQS----KRLYVHNLPSGCTSQEIMEFFNNQL--NGLNVVSGNDPCLSAHI 298
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSG-------SILKIKRPKEFV----------- 549
K A +EF EDA+ AL +G S S L I+RPK+++
Sbjct: 299 ATSKEYAALEFKAPEDATLALAMNGISMRDEGGAPDRSGLSIRRPKDYITPTADENAYPP 358
Query: 550 --EVA---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIE 596
EV+ + I I + + + E+V G LKA+ + D AF E
Sbjct: 359 GDEVSSVVKDSPNKLSIVNIPTYIEEEQIRELVETMGKLKAFILVKDTGTDQHRGIAFCE 418
Query: 597 YVDQLVTPKAIAGLNGLKVGGQVLTAVQAVL 627
Y D + I GLN + +G L +A +
Sbjct: 419 YADNEIIDAVIEGLNDIPLGDGNLKVSRATV 449
>gi|387193280|gb|AFJ68695.1| splicing factor U2AF 65 kDa subunit [Nannochloropsis gaditana
CCMP526]
Length = 424
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 125/293 (42%), Gaps = 63/293 (21%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
QS R RR+ V AS+ ++ F N +++ ++ + P + + R+K AF+E
Sbjct: 84 QSTRHARRVYVGGNFGDASDFEVLAFFNQ-IINESLERPSPAGPVVAIQVNRQKHFAFLE 142
Query: 518 FLTAE-DASAALCCDGCSFSGSILKIKRPKEF--------------VEVA---------- 552
+ S + DG F G+ LK+KRP ++ ++VA
Sbjct: 143 LNSVPLTTSVIMQLDGVPFRGNPLKVKRPTDYHPELLPLDTPPPPTLKVANFRALQASGA 202
Query: 553 ----------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--- 587
IF+GG+ ++ V E++ AFGPL+ F++ +D
Sbjct: 203 LPMASTGLTAPGANSVPDSPYKIFVGGLPYHVTDDQVRELLSAFGPLRG--FDLKKDPAT 260
Query: 588 -HEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKH 646
+ F EY+D V AI GL+G+ +GG+ LT A+ +
Sbjct: 261 GMSKGYGFCEYIDHAVGDVAIQGLHGMDLGGKTLTVKYALASQQLQQQQSMQQMLLSTTP 320
Query: 647 ALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
A T+VL L N+ P+ + E +E++EDVR E A+FG V S+ +
Sbjct: 321 A-------TKVLVLANMVTPDELK--DDQEYQEIVEDVREEVAKFGEVLSLVI 364
>gi|159487587|ref|XP_001701804.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
reinhardtii]
gi|158281023|gb|EDP06779.1| U2 snRNP auxiliary factor, large subunit [Chlamydomonas
reinhardtii]
Length = 309
Score = 74.3 bits (181), Expect = 3e-10, Method: Composition-based stats.
Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 34/176 (19%)
Query: 483 FLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKI 542
F N +++SG G P + C + +K AF+E E+ S A+ DG G +LK+
Sbjct: 1 FFNQIMMASGATTQPGP-PVMSCFMNNDKRFAFLEMRCVEETSNAMAFDGIQCQGEVLKV 59
Query: 543 KRPKEFVEVA-------------------------------IFIGGISRTLSSKMVMEIV 571
+RP ++ A ++IGG+ LS + V +I+
Sbjct: 60 RRPHDYNPAAAKLLGPTDPSPKVNLALLGVINTLVEDGPNKVYIGGLPACLSEEQVRQIL 119
Query: 572 CAFGPLKAYHFEVNED--HEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQA 625
AFG LKA++ ++ + + + F EY D VT AI GL+ L + G+ LTA +A
Sbjct: 120 QAFGTLKAFNLVLDRETGNSKGYGFCEYADPSVTDSAIQGLSALIIQGKPLTARRA 175
>gi|169602913|ref|XP_001794878.1| hypothetical protein SNOG_04461 [Phaeosphaeria nodorum SN15]
gi|160706286|gb|EAT88221.2| hypothetical protein SNOG_04461 [Phaeosphaeria nodorum SN15]
Length = 594
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 106/260 (40%), Gaps = 41/260 (15%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
E++ +WD+ P ++ T+ QA S + + LA ++
Sbjct: 206 ERRMTQWDIKPAGYENI-----TAEQAKLSGMFPLPGAPRAAPMDPSKLAAFMSPSAGTA 260
Query: 447 LNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
++ QS +RL V NLP + + L+EF N L +G+ V G PC+
Sbjct: 261 SAAALAPGASKQS----KRLYVHNLPSGTTSEELLEFFN--LQLNGLNVVSGQDPCLSAQ 314
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSFSG-------SILKIKRPKEFV---------- 549
I K A +EF T EDA+ AL G S S L I+RPK+++
Sbjct: 315 IASSKTYAALEFKTPEDATVALAMSGISMRDDGGGPDRSGLSIRRPKDYITPSADENAYP 374
Query: 550 --EVA---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIE 596
EV+ + I I + + + E+V G L A+ +++ + AF E
Sbjct: 375 GDEVSSVVKDSPNKLSIVNIPTFIEEEQIRELVETMGKLNAFVLVKDISSEQHRGIAFCE 434
Query: 597 YVDQLVTPKAIAGLNGLKVG 616
Y D V I GLN + +G
Sbjct: 435 YADNEVVNAVIEGLNDITLG 454
>gi|307106441|gb|EFN54687.1| hypothetical protein CHLNCDRAFT_53018 [Chlorella variabilis]
Length = 247
Score = 73.6 bits (179), Expect = 5e-10, Method: Composition-based stats.
Identities = 59/179 (32%), Positives = 90/179 (50%), Gaps = 25/179 (13%)
Query: 461 RPMRRLCVENLPLSASEKALMEFLNNF-----LLSSGVQHVHGSLPCIGCVIQREKGQAF 515
RP +R+ V NLP + SE L + +N LL +G+ V ++KG AF
Sbjct: 76 RPAKRVYVGNLPAAVSEAELRQAVNELMGNGDLLFNGMHQV------------QDKGYAF 123
Query: 516 VEFLTAEDASAALCCDGCSFSGSILK---IKRPKEFVEVA---IFIGGISRTLSSKMVME 569
VEF + E+AS A+ DG F S LK ++ K V+ + +FIGG+ S V E
Sbjct: 124 VEFRSVEEASNAMALDGVKFHDSYLKLVGLEVVKTVVQDSPHKLFIGGLPCDWSEDQVKE 183
Query: 570 IVCAFGPLKAYHFEVNE--DHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAV 626
++ FG LKA++ ++ + + AF E++D VT I LNG + LT +A+
Sbjct: 184 MLMPFGQLKAFNLVMDRGTGNSKGYAFAEFMDVHVTDIVIQNLNGKPCNTKFLTVKRAL 242
>gi|358417046|ref|XP_001256277.3| PREDICTED: splicing factor U2AF 65 kDa subunit-like [Bos taurus]
Length = 330
Score = 72.8 bits (177), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 111/258 (43%), Gaps = 37/258 (14%)
Query: 380 SPINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGIS 439
SP + +K WDV P + + + QAA Q P +
Sbjct: 79 SPRHEKKKKVRKYWDVPPPGFEHITPMQYKAMQAAG----------------QIPATALL 122
Query: 440 VSASLAKLNVSMDSVQL--TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ + L V+ V + +Q R RRL V N+P +E+A+M+F N + G+
Sbjct: 123 PTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAP 182
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEV------ 551
G+ P + I ++K AF+EF + ++ + A+ DG F G LKI+RP ++ +
Sbjct: 183 GN-PVLAVQINQDKNFAFLEFRSVDETTQAMAFDGIIFQGQSLKIRRPHDYQPLPGMSEN 241
Query: 552 -AIFIGGISR---TLSSKMVMEIVCAFGPLKAYHFEVNEDHEEPCA----FIEYVDQLVT 603
++++ G+ R L SK C G A E +E E A F+E+
Sbjct: 242 PSVYVPGLCRPGQCLGSKG----SCRGGGDAAGTGEASEPMCEARAGASIFVEFTSVFDC 297
Query: 604 PKAIAGLNGLKVGGQVLT 621
KA+ GL G K +V+
Sbjct: 298 QKAMQGLTGRKFANRVVV 315
>gi|189204129|ref|XP_001938400.1| splicing factor U2AF 65 kDa subunit [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985499|gb|EDU50987.1| splicing factor U2AF 65 kDa subunit [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 572
Score = 72.8 bits (177), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 81/184 (44%), Gaps = 33/184 (17%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
+RL V NLP + + +MEF NN L +G+ V G+ PC+ I K A +EF ED
Sbjct: 251 KRLYVHNLPSGCTSQEIMEFFNNQL--NGLNVVSGNDPCLSAHIATSKEYAALEFKAPED 308
Query: 524 ASAALCCDGCSFSG-------SILKIKRPKEFV-------------EVA---------IF 554
A+ AL G S S L I+RPK+++ EV+ +
Sbjct: 309 ATLALAMTGISMRDEGGAPDRSGLSIRRPKDYITPTADENAYPPGDEVSSVVKDSPNKLS 368
Query: 555 IGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNG 612
I I + + + E+V G LKA+ + D AF EY D + I GLN
Sbjct: 369 IVNIPTYIEEEQIRELVETMGKLKAFILVKDTGTDQHRGIAFCEYADNEIIDAVIEGLND 428
Query: 613 LKVG 616
+ +G
Sbjct: 429 IPLG 432
>gi|167515386|ref|XP_001742034.1| RNA-binding region RNP-1 containing protein [Monosiga brevicollis
MX1]
gi|163778658|gb|EDQ92272.1| RNA-binding region RNP-1 containing protein [Monosiga brevicollis
MX1]
Length = 431
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 147/368 (39%), Gaps = 83/368 (22%)
Query: 385 SPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASL 444
+PE A WDV P ++P V+ D V + AG+ +SA L
Sbjct: 36 APEPSEA-WDVPPPGYENMPPKVY---------------KDYVCTY----FAGLPISAEL 75
Query: 445 AKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
L SM + LT R RRL V +P A++ L EF N L G+ G+ P +
Sbjct: 76 PGLRSSMPN-PLT---RGARRLYVGGIPNGANDMELAEFFNMQLTQQGLTIGPGA-PVVS 130
Query: 505 CVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVA------------ 552
I EK AF+E + E+A++ + D F G+ L+I+RPK++ A
Sbjct: 131 AQINEEKSFAFLELRSPEEATSCIAFDNIMFMGNQLRIRRPKDYQAPAGGTSEVPKVDMP 190
Query: 553 -------------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVN 585
+ + I + + + E+ FG + + E
Sbjct: 191 MPRPMPMPTPMPMPTPMPMLVPSGRLNVTNIPLAMDEEQLRELFSVFGTIASLELRKEPE 250
Query: 586 EDHEEPCAFIEY----VDQLVTPKAIAGLNGLKVGGQVLTAVQAV-------LDGSIMDN 634
D A +E+ D L K AGL + GQ L Q V L S +
Sbjct: 251 TDKFAGDAIVEFDTRAPDFLNQVK--AGLEDIDFEGQKLKVEQVVRWWSYCGLRASYIAP 308
Query: 635 S---GNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARF 691
S +P G A+P ++ TEVL L N+ E + E ++++ED+R EC +F
Sbjct: 309 SLVNASPFVGGAAAPAVPDVEA-TEVLVLMNMVTKEELQ--DDEEYKDIMEDIREECGKF 365
Query: 692 GSVKSVNV 699
G++ + +
Sbjct: 366 GNITDLKI 373
>gi|451849636|gb|EMD62939.1| hypothetical protein COCSADRAFT_200575 [Cochliobolus sativus
ND90Pr]
Length = 576
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/261 (24%), Positives = 107/261 (40%), Gaps = 42/261 (16%)
Query: 387 EKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAK 446
+++ +WD+ P ++ T+ QA S + + LA ++
Sbjct: 187 QRRMTQWDIKPAGYENI-----TAEQAKLSGMFPLPGAPRAAPMDPSKLAAFISPSTGTA 241
Query: 447 LNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCV 506
++ + QS +RL V NLP + + +MEF N L +G+ V G PC+
Sbjct: 242 TAAALATSAAKQS----KRLYVHNLPSGCTSQEIMEFFNTQL--NGLNVVSGPDPCVSAH 295
Query: 507 IQREKGQAFVEFLTAEDASAALCCDGCSFSG-------SILKIKRPKEFV---------- 549
I K A +EF EDA+ AL +G S + L I+RPK+++
Sbjct: 296 IATSKEYAALEFKAPEDATLALAMNGISMRDDGGAPDRAGLSIRRPKDYITPTADENAYP 355
Query: 550 ---EVA---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFI 595
EV+ + I I + + + E+V G LKA+ + + D AF
Sbjct: 356 PGDEVSSVVKDSPNKLSIVNIPTYIEEEQIRELVETMGKLKAFILVKDTSTDQHRGIAFC 415
Query: 596 EYVDQLVTPKAIAGLNGLKVG 616
EY D + I GLN + +G
Sbjct: 416 EYADNEIIDAVIEGLNDIPLG 436
>gi|255082091|ref|XP_002508264.1| RNA binding protein [Micromonas sp. RCC299]
gi|226523540|gb|ACO69522.1| RNA binding protein [Micromonas sp. RCC299]
Length = 493
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/93 (43%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 457 TQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQ-HVHGSLPCIGCVIQREKGQAF 515
Q R RRL V +LP +++AL F NN +++SG G + I EKG AF
Sbjct: 111 AQHTRQSRRLYVGSLPKPVNDEALHAFFNNAMVNSGAAIDPSGGPSVVNTTITHEKGFAF 170
Query: 516 VEFLTAEDASAALCCDGCSFSGSILKIKRPKEF 548
+EF EDA +AL DG F+GS L IKRPK++
Sbjct: 171 IEFRRLEDAESALMFDGIVFNGSKLIIKRPKDY 203
>gi|195393580|ref|XP_002055432.1| GJ19364 [Drosophila virilis]
gi|194149942|gb|EDW65633.1| GJ19364 [Drosophila virilis]
Length = 476
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 73/161 (45%), Gaps = 23/161 (14%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K S WDV P + + + QA+ +V P T+ P+ G +++
Sbjct: 39 KPSLYWDVPPPGFEHITPMQYKAMQASGQIPASVVPDTPQTAV---PVVGSTIT------ 89
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
R RRL V N+P +E+ +MEF N + G+ GS P + C I
Sbjct: 90 -------------RQARRLYVGNIPFGVTEEEMMEFFNQQMHLVGLAQAAGS-PVLACQI 135
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEF 548
+K AF+EF + ++ + A+ DG + G LKI+RP ++
Sbjct: 136 NLDKNFAFLEFRSIDETTQAMAFDGINLKGQDLKIRRPHDY 176
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 563 SSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+S+ V E++ +FG L+A++ + + AF EYVD +T ++IAGLNG+++G + L
Sbjct: 279 TSRFVKELLLSFGKLRAFNLVKDAATGLSKGYAFCEYVDLSITDQSIAGLNGMQLGDKKL 338
Query: 621 TAVQAVLDGSIMDNSGNPP---FHGIPKHALPLLK-KPTEVLKLKNVFNPEGFSSLSELE 676
+A + N+ N +P + + PTEVL L N+ P+ E E
Sbjct: 339 IVQRASVGAKNAQNAANTSQSVMLQVPGLSTVVTSGPPTEVLCLLNMVTPDELR--DEEE 396
Query: 677 VEEVLEDVRLECARFGSVKSVNV 699
E++LED++ EC ++G V+SV +
Sbjct: 397 YEDILEDIKEECTKYGVVRSVEI 419
>gi|156085070|ref|XP_001610018.1| RNA recognition motif (RRM)-containing protein [Babesia bovis]
gi|154797270|gb|EDO06450.1| RNA recognition motif (RRM)-containing protein [Babesia bovis]
Length = 383
Score = 69.7 bits (169), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 118/268 (44%), Gaps = 37/268 (13%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLL----SSG--VQHVHGSLPCIGCVIQREKGQAFVE 517
RRL + NLP + KAL+EFL+ L SG VQ H S I ++G F+E
Sbjct: 59 RRLYIGNLPSGTTYKALVEFLSAALRLPNDDSGQTVQVPHISKTEI---FNEDQGYCFLE 115
Query: 518 FLTAEDASAALCCDGCSFSGSILKIKRPKEF------VEVAIFIGGISRTLSSKMVMEIV 571
F T E A A DG +F G +LKI+RP ++ + +F+ I T+S V E++
Sbjct: 116 FSTPELADACFKLDGINFKGKLLKIRRPIDYGTTSSSEDTKVFVQNIPPTMSEAEVKELL 175
Query: 572 CAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD- 628
G +K+ + ++ + F E+ D A+ LNG +G VL+ A
Sbjct: 176 EKHGKIKSSNLVKDLKTGQNKGYGFFEFDDSRAAKMAVCHLNGHIIGKNVLSVKHAAFSY 235
Query: 629 ----GSIMDNSGNPPFHGIPKHAL--PLL-----------KKPTEVLKLKNVFNPEGFSS 671
G + D + + + L PLL KP+ +++L N+ E
Sbjct: 236 FAAGGKLTDCKATNLPNSVTQSILSNPLLGLQMQSGRRIGSKPSRIVQLINIVFHEDL-- 293
Query: 672 LSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + EV + + E ++G ++ + +
Sbjct: 294 IQDKRYHEVKDAIMEEAKKYGHLEDIVI 321
>gi|145523992|ref|XP_001447829.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415351|emb|CAK80432.1| unnamed protein product [Paramecium tetraurelia]
Length = 419
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/292 (23%), Positives = 121/292 (41%), Gaps = 68/292 (23%)
Query: 411 NQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPM---RRLC 467
+Q + ++ ++P+ Q PLA + ++ L + M +Q ++ + R+L
Sbjct: 99 DQVSLGEFETILPANPL----QNPLASLEALQAMTPL-IQMQRLQQLRAQADVKADRKLY 153
Query: 468 VENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAA 527
V NLP ++ K L +FLN LL GV H C C I F+EF + E+A+
Sbjct: 154 VGNLPPNSQPKELQDFLNQTLLKMGVSSEHAGSIC-NCWIDSNGHFGFIEFRSPEEATQG 212
Query: 528 LCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED 587
F G LKI RPK F+ + +++ + S+ AF PL +
Sbjct: 213 FILKDVIFKGHQLKIGRPKSFLTS---LAAVNQQMMSEQ------AFNPLSS-------- 255
Query: 588 HEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHA 647
+ G+ D+S F + K
Sbjct: 256 ---------------------------------------MKGNDKDSSSLISFRPL-KLM 275
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
++ PT +L +KNV E ++ E E ++++D++ EC++FG+VK++ +
Sbjct: 276 ARYVQIPTNILVIKNVLTLEDV-TIDE-EFNDIMDDIKEECSKFGTVKNIII 325
>gi|124511860|ref|XP_001349063.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
gi|23498831|emb|CAD50908.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
Length = 1125
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 125/299 (41%), Gaps = 73/299 (24%)
Query: 461 RPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLT 520
+ R L V N+P + ++++LN+ LL + C+ I+ + AFVEF
Sbjct: 648 KTARELYVGNIPQHIDIQEIVKYLNSCLLILYNKENENENICLKACIRGDTHYAFVEFRN 707
Query: 521 AEDASAALCCDGCSFSGSILKIKRPKEF-VE---------------------------VA 552
+D S + +G +F G+ L+I RPK F +E
Sbjct: 708 IQDTSNCMLLNGINFYGNNLRIGRPKTFPIEYHSLIPQATIPAIDNYYLSQGLIGLRSFI 767
Query: 553 IF-----------------------IGGISRTLSSKMVMEIVCAFGPLKAYHFEV-NEDH 588
IF + IS+ + + E++ AFG +K + F +E
Sbjct: 768 IFCKNEEKMKNDGLPVNMIKLQKLCVSNISKNNDTSKIKELLEAFGEIKNFEFFYGDETS 827
Query: 589 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPK--- 645
+ + +EYV+ T AI L T+ + + I++ +P + I K
Sbjct: 828 DTYISLVEYVN---TENAIQAHKILNQN----TSYKIQFEHEIIN---DPHINNIIKNKY 877
Query: 646 ----HALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
+++ L+ PT+V+ L + E S SE ++++ED+++EC ++G K++ VV
Sbjct: 878 MKTENSILSLQVPTKVIVLNKIATFEELSDSSEY--KDIVEDIKIECDKYG--KTLEVV 932
>gi|47217926|emb|CAG02209.1| unnamed protein product [Tetraodon nigroviridis]
Length = 600
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 79/143 (55%), Gaps = 16/143 (11%)
Query: 567 VMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQ 624
V E++ +FGPLKA++ + + AF EYVD T +A+AGLNG+++G + L +
Sbjct: 407 VKELLTSFGPLKAFNLVKDGATSLSKGYAFCEYVDVGATDQAVAGLNGMQLGDKKLIVQR 466
Query: 625 AVLDGSIMDNSGNPPFHGIPKHAL--PLLKK------PTEVLKLKNVFNPEGFSSLSELE 676
A S+ + NP L P L++ PTEVL L N+ PE + + +
Sbjct: 467 A----SVGAKNANPSAAAEAPVTLQVPGLQRLQTSGVPTEVLCLLNMVVPEEL--VDDED 520
Query: 677 VEEVLEDVRLECARFGSVKSVNV 699
EE+LEDVR EC ++G V+S+ +
Sbjct: 521 YEEILEDVREECCKYGGVRSIEI 543
>gi|449707077|gb|EMD46798.1| RNA recognition motif domain containing protein [Entamoeba
histolytica KU27]
Length = 712
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 66/318 (20%)
Query: 450 SMDSVQLTQS-----NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
++DSVQ++Q R +R+ V N+ S SEK +++ N + + + I
Sbjct: 123 AIDSVQISQQLNVHQERAAKRIYVGNINSSTSEKDIVDAFNEAMRRGDYVDKNDTRDIIT 182
Query: 505 CV-IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRT-- 561
+ + E+ AF+EF T E+A AL DG + G+ +K++RPK++ V FI G+S+
Sbjct: 183 HIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGASVKVRRPKDYNPVLPFISGLSQLME 242
Query: 562 -------------------LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP-----CAFIEY 597
++ + + + + PLK + + D P C F EY
Sbjct: 243 PGTTNPRESILYMGNIPLQMTDEQIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLF-EY 301
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFHGIPKHALPL----- 650
+ K I +G+ +GG + V DG + S N F + H L
Sbjct: 302 QNPEYKEK-ILTFDGINLGGNKIEVCSGV-DGFKHLPKASLNELFSKMFPHTTDLVIGTL 359
Query: 651 -------------LKKPTE---------VLKLKNVFNPEGFSSLSELEVEEVLEDVRLEC 688
+ KP+E ++ + N+ PE + + E+++D+R C
Sbjct: 360 LNSSVGYSTVFEKILKPSEKIEDQHVSRIIIIFNMVYPEDLT--DQQRYIELIDDIRFVC 417
Query: 689 ARFGSVKSVNVVKYGDSN 706
+G V+S+++ + + N
Sbjct: 418 QEYGEVESISIPRPTEEN 435
>gi|407043289|gb|EKE41863.1| U2 snRNP auxiliary factor large subunit, putative [Entamoeba
nuttalli P19]
Length = 628
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 66/318 (20%)
Query: 450 SMDSVQLTQS-----NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
++DSVQ++Q R +R+ V N+ S SEK +++ N + + + I
Sbjct: 123 AIDSVQISQQLNVHQERAAKRIYVGNINSSTSEKDIVDAFNEAMRRGDYVDKNDTRDIIT 182
Query: 505 CV-IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRT-- 561
+ + E+ AF+EF T E+A AL DG + G+ +K++RPK++ V FI G+S+
Sbjct: 183 HIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGASVKVRRPKDYNPVLPFISGLSQLME 242
Query: 562 -------------------LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP-----CAFIEY 597
++ + + + + PLK + + D P C F EY
Sbjct: 243 PGTTNPRESILYMGNIPLQMTDEQIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLF-EY 301
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFHGIPKHALPL----- 650
+ K I +G+ +GG + V DG + S N F + H L
Sbjct: 302 QNPEYKEK-ILTFDGINLGGNKIEVCSGV-DGFKHLPKASLNELFSKMFPHTTDLVIGTL 359
Query: 651 -------------LKKPTE---------VLKLKNVFNPEGFSSLSELEVEEVLEDVRLEC 688
+ KP+E ++ + N+ PE + + E+++D+R C
Sbjct: 360 LNSSVGYSTVFEKILKPSEKIEDQHVSRIIIIFNMVYPEDLT--DQQRYIELIDDIRFVC 417
Query: 689 ARFGSVKSVNVVKYGDSN 706
+G V+S+++ + + N
Sbjct: 418 QEYGEVESISIPRPTEEN 435
>gi|67475980|ref|XP_653619.1| U2 snRNP auxiliary factor large subunit [Entamoeba histolytica
HM-1:IMSS]
gi|56470591|gb|EAL48233.1| U2 snRNP auxiliary factor large subunit, putative [Entamoeba
histolytica HM-1:IMSS]
Length = 712
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 66/318 (20%)
Query: 450 SMDSVQLTQS-----NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
++DSVQ++Q R +R+ V N+ S SEK +++ N + + + I
Sbjct: 123 AIDSVQISQQLNVHQERAAKRIYVGNINSSTSEKDIVDAFNEAMRRGDYVDKNDTRDIIT 182
Query: 505 CV-IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRT-- 561
+ + E+ AF+EF T E+A AL DG + G+ +K++RPK++ V FI G+S+
Sbjct: 183 HIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGASVKVRRPKDYNPVLPFISGLSQLME 242
Query: 562 -------------------LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP-----CAFIEY 597
++ + + + + PLK + + D P C F EY
Sbjct: 243 PGTTNPRESILYMGNIPLQMTDEQIRKKLENLNPLKNFFVIRDPDLGAPQGKCYCLF-EY 301
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDG--SIMDNSGNPPFHGIPKHALPL----- 650
+ K I +G+ +GG + V DG + S N F + H L
Sbjct: 302 QNPEYKEK-ILTFDGINLGGNKIEVCSGV-DGFKHLPKASLNELFSKMFPHTTDLVIGTL 359
Query: 651 -------------LKKPTE---------VLKLKNVFNPEGFSSLSELEVEEVLEDVRLEC 688
+ KP+E ++ + N+ PE + + E+++D+R C
Sbjct: 360 LNSSVGYSTVFEKILKPSEKIEDQHVSRIIIIFNMVYPEDLT--DQQRYIELIDDIRFVC 417
Query: 689 ARFGSVKSVNVVKYGDSN 706
+G V+S+++ + + N
Sbjct: 418 QEYGEVESISIPRPTEEN 435
>gi|68068227|ref|XP_676023.1| U2 snRNP auxiliary factor [Plasmodium berghei strain ANKA]
gi|56495523|emb|CAI00540.1| U2 snRNP auxiliary factor, putative [Plasmodium berghei]
Length = 630
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/328 (20%), Positives = 138/328 (42%), Gaps = 78/328 (23%)
Query: 449 VSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLS----SGVQHVHGS---LP 501
+S +S +L + ++ R+L + NLP ++ ++ ++EF NN + S S ++ G LP
Sbjct: 247 LSRNSYEL-EGDKKQRKLYIGNLPPNSKQEEIVEFFNNTISSIIKGSSLEVKIGDVQLLP 305
Query: 502 CIGC-VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFV----------- 549
+ C + + F+EF T + L D S++ L+I RP +++
Sbjct: 306 VVKCEIFNADSRFCFLEFRTMDITWLCLKLDSMSYNNYCLRINRPHDYMPPPEGDPALTV 365
Query: 550 ----------------EVA-----------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF 582
++A ++I + L +M+++ FG LK ++
Sbjct: 366 VFPDIDMGLLESFKPPKIAPVRSTGDDDNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNI 425
Query: 583 --EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS---------- 630
++N + F EY D T AI LNG G +L +A + +
Sbjct: 426 IKDLNTGLNKGYGFFEYEDSSCTQVAIHALNGFVCGKNILNVKKATFNKNSNNAPNSNNI 485
Query: 631 IMDNSGNPPFHGIP-----------------KHALPLLKKPTEVLKLKNVFNPEGFSSLS 673
++ N+ + P +P + + + +K + V++L N E +
Sbjct: 486 VLANNVDVPVSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVIQLTNAVFQEDL--II 543
Query: 674 ELEVEEVLEDVRLECARFGSVKSVNVVK 701
+ + +E+L+DV+ E ++G ++S+ + K
Sbjct: 544 DSQYDEILKDVKEEAEKYGPLQSIVIPK 571
>gi|224125466|ref|XP_002329812.1| predicted protein [Populus trichocarpa]
gi|222870874|gb|EEF08005.1| predicted protein [Populus trichocarpa]
Length = 321
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 101/237 (42%), Gaps = 60/237 (25%)
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA----------- 552
T E+AS A+ DG F G ++++RP ++ + +A
Sbjct: 3 TVEEASNAMTLDGIIFEGVAVRVRRPTDYNPSLAATLGPSQPSPLLNLAAVGLVPGTISG 62
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEEPCAFIEYVDQLVTP 604
+F+GG+ + + E++ +FGPL+ + + D + + F Y D VT
Sbjct: 63 AEGPDRVFVGGLPYYFTETQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTD 122
Query: 605 KAIAGLNGLKVGGQVLTA------------------VQAVLDGSIMDNSGNPPFHGIPKH 646
A A LNGLK+G + LT QA +I + +P
Sbjct: 123 IACAALNGLKMGDKTLTVRRGTESGGQSRSEQENILAQAQQHIAIQKMALQAGVMNLPGV 182
Query: 647 ALPLLKK---PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
+PL + P++VL L E + + E EE+LED+R EC +FG++ +NVV
Sbjct: 183 GIPLAESSHSPSKVLCLTEAIAMEVLAD--DEEYEEILEDMREECCKFGTL--INVV 235
>gi|452001453|gb|EMD93912.1| hypothetical protein COCHEDRAFT_1020092 [Cochliobolus
heterostrophus C5]
Length = 352
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 81/184 (44%), Gaps = 33/184 (17%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
+RL V NLP + + +MEF N L +G+ V G PC+ I K A +EF ED
Sbjct: 31 KRLYVHNLPSGCTSQEIMEFFNTQL--NGLNVVSGPDPCVSAHIATSKEYAALEFKAPED 88
Query: 524 ASAALCCDGCSFSG-------SILKIKRPKEFV-------------EVA---------IF 554
A+ AL +G S + L I+RPK+++ EV+ +
Sbjct: 89 ATLALAMNGISMRDDGGAPDRAGLSIRRPKDYITPTADENAYPPGDEVSSVVKDSPNKLS 148
Query: 555 IGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNG 612
I I + + + E+V G LKA+ + + D AF EY D + I GLN
Sbjct: 149 IVNIPTYIEEEQIRELVETMGKLKAFILVKDTSTDQHRGIAFCEYADNEIIDAVIEGLND 208
Query: 613 LKVG 616
+ +G
Sbjct: 209 IPLG 212
>gi|224077136|ref|XP_002305148.1| predicted protein [Populus trichocarpa]
gi|222848112|gb|EEE85659.1| predicted protein [Populus trichocarpa]
Length = 296
Score = 67.0 bits (162), Expect = 5e-08, Method: Composition-based stats.
Identities = 61/237 (25%), Positives = 102/237 (43%), Gaps = 60/237 (25%)
Query: 520 TAEDASAALCCDGCSFSGSILKIKRPKEF----------------VEVA----------- 552
T E+AS A+ DG F G ++++RP ++ + +A
Sbjct: 3 TVEEASNAMALDGIIFEGVAVRVRRPTDYNPSLAATLGPSQPSPLLNLAAVGLVPGTISG 62
Query: 553 ------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEEPCAFIEYVDQLVTP 604
+F+GG+ + + E++ +FGPL+ + + D + + F Y D VT
Sbjct: 63 AEGPDRVFVGGLPYYFTEIQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTD 122
Query: 605 KAIAGLNGLKVGGQVLTAVQAVLDG------------------SIMDNSGNPPFHGIPKH 646
A A LNGLK+G + LT +A G +I + +P
Sbjct: 123 IACAALNGLKMGDKTLTVRRATESGGQSKSEQENILAQAQQHIAIQKMALQAGVMNLPGV 182
Query: 647 ALPLLKK---PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
+PL + P++VL L E + + E EE+LED+R EC +FG++ +NVV
Sbjct: 183 GIPLAESAYTPSKVLCLTEAITMEVLA--DDEEYEEILEDMREECCKFGTL--INVV 235
>gi|347968829|ref|XP_003436304.1| AGAP002908-PC [Anopheles gambiae str. PEST]
gi|333467822|gb|EGK96709.1| AGAP002908-PC [Anopheles gambiae str. PEST]
Length = 250
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 80/145 (55%), Gaps = 8/145 (5%)
Query: 560 RTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGG 617
R+ + + V E++ +FG LKA++ + + AF EYV+ VT +AIAGLNG+++G
Sbjct: 52 RSRNCEQVKELLLSFGQLKAFNLVKDAATGLGKGYAFAEYVEYTVTDQAIAGLNGMQLGD 111
Query: 618 QVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKK---PTEVLKLKNVFNPEGFSSLSE 674
+ L +A + + + P I L L+ PTEVL L N+ P+ E
Sbjct: 112 KKLIVQRASVGAKNSNAAVVAPVQ-IQVPGLSLVGSSGPPTEVLCLLNMVTPDELK--DE 168
Query: 675 LEVEEVLEDVRLECARFGSVKSVNV 699
E E++LED+R EC ++G V+SV +
Sbjct: 169 EEYEDILEDIREECNKYGVVRSVEI 193
>gi|167395950|ref|XP_001741817.1| hexokinase [Entamoeba dispar SAW760]
gi|165893477|gb|EDR21726.1| hexokinase, putative [Entamoeba dispar SAW760]
Length = 974
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 130/311 (41%), Gaps = 66/311 (21%)
Query: 450 SMDSVQLTQS-----NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIG 504
++DSVQ++Q R +R+ V N+ S SEK +++ N + + I
Sbjct: 341 AIDSVQISQQLNVHQERAAKRIYVGNINSSTSEKDIVDAFNEAMRRGDYVDKNDPRDIIT 400
Query: 505 CV-IQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRT-- 561
+ + E+ AF+EF T E+A AL DG + G+ +K++RPK++ V FI G+S+
Sbjct: 401 HIEVNYERSYAFLEFRTLEEAVKALSLDGLTIKGASVKVRRPKDYNPVLPFISGLSQLME 460
Query: 562 -------------------LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP-----CAFIEY 597
++ + + + + PLK + + D P C F EY
Sbjct: 461 PGTTNPRESILYMGNIPLQMTDEQIRKKLENLNPLKKFFVIRDPDLGAPQGKCYCLF-EY 519
Query: 598 VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS--IMDNSGNPPFHGIPKHALPL----- 650
+ K I +G+ +GG + V DG + S N F + H L
Sbjct: 520 QNPEYKEK-ILTFDGINLGGNKIEVCSGV-DGFKHLPKASLNELFSKMFPHTTDLVIGTL 577
Query: 651 -------------LKKPTE---------VLKLKNVFNPEGFSSLSELEVEEVLEDVRLEC 688
+ KP+E ++ + N+ PE + + E+++D+R C
Sbjct: 578 LNSSVGYSTVFEKILKPSEKIEDQHVSRIIVIFNMVYPEDL--IDQQRYIELIDDIRFVC 635
Query: 689 ARFGSVKSVNV 699
+G V+S+++
Sbjct: 636 QEYGEVESISI 646
>gi|82540696|ref|XP_724646.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23479360|gb|EAA16211.1| splicing factor-like protein, putative [Plasmodium yoelii yoelii]
Length = 714
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/319 (20%), Positives = 130/319 (40%), Gaps = 77/319 (24%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLS----SGVQHVHGS---LPCIGC-VIQR 509
+ ++ R+L + NLP ++ ++ ++EF NN + S S ++ G LP + C +
Sbjct: 339 EGDKKQRKLYIGNLPPNSKQEEIVEFFNNTISSIIKGSSLEVKIGDVQLLPVVKCEIFNA 398
Query: 510 EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFV-------------------- 549
+ F+EF T + L D S++ L+I RP +++
Sbjct: 399 DSRFCFLEFRTMDITWLCLKLDSMSYNNYCLRINRPHDYMPPPEGDPALTVVFPDIDMGL 458
Query: 550 -------EVA-----------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHE 589
++A ++I + L +M+++ FG LK ++ ++N
Sbjct: 459 LESFKPPKIAPVRSTGDDDNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNIIKDLNTGLN 518
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS----------IMDNSGNPP 639
+ F EY D T AI LNG G +L +A + + + N+ + P
Sbjct: 519 KGYGFFEYEDSSCTQVAIHALNGFVCGKNILNVKKATFNKNSNNAPNSNNIALANNVDVP 578
Query: 640 FHGIP-----------------KHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLE 682
+P + + + +K + V++L N E + + +E+L+
Sbjct: 579 VSLLPNSISQKILSNSIIGLQIQASRKIGEKSSRVIQLTNAVFQEDL--IINSQYDEILK 636
Query: 683 DVRLECARFGSVKSVNVVK 701
DV+ E ++G ++S+ + K
Sbjct: 637 DVKEEAEKYGPLQSIVIPK 655
>gi|449689952|ref|XP_004212193.1| PREDICTED: splicing factor U2AF 65 kDa subunit-like, partial [Hydra
magnipapillata]
Length = 210
Score = 64.3 bits (155), Expect = 3e-07, Method: Composition-based stats.
Identities = 38/108 (35%), Positives = 57/108 (52%), Gaps = 5/108 (4%)
Query: 455 QLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQA 514
Q Q+ RR+ + N+P SE +++F N + S + G+ P + C I EK A
Sbjct: 100 QGAQTTWQARRIYLGNIPFGISEDLMVDFFNAKMRESDIARQPGN-PVLACQINLEKNFA 158
Query: 515 FVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTL 562
F+EF + E+ + A+ DG G LKI+RPK++ I GIS TL
Sbjct: 159 FLEFRSVEETTLAMAFDGIMLQGQALKIRRPKDYQP----IPGISGTL 202
>gi|357446501|ref|XP_003593528.1| Splicing factor U2af large subunit B [Medicago truncatula]
gi|355482576|gb|AES63779.1| Splicing factor U2af large subunit B [Medicago truncatula]
Length = 593
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 81/169 (47%), Gaps = 28/169 (16%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHE----EPCAFIEYVDQLVTPKAIA 608
IF+GG+ + + E++ AFGPL++ F++ D E + F Y D VT A A
Sbjct: 371 IFVGGLPYYFTEVQMRELLQAFGPLRS--FDIVRDKETGNSKGYGFCIYQDPAVTDIACA 428
Query: 609 GLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKH------ALPLL----------- 651
LNGLK+G + LT +A + N F +H AL ++
Sbjct: 429 ALNGLKMGDKTLTVRRATVSAHSKPEEDN-IFARAQQHIAMQKIALEVVGLNIPGVPTND 487
Query: 652 KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
+ PT+VL L E + E EE+LED+R EC +FG++ VNVV
Sbjct: 488 ESPTKVLCLTEAVTTEQLTDNGEY--EEILEDMRDECRKFGTL--VNVV 532
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 67/150 (44%), Gaps = 12/150 (8%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K+++ +D+AP T P T + AH M+ +T GIS +L+ +
Sbjct: 114 KRTSGFDMAPPATGVTP----TVSGQMPGIAH-MIQG----ATQNFSPYGISQIGALSLM 164
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
V Q+ R RR+ V LP A+E+++ F + +++ G + I
Sbjct: 165 QVQ---PMTQQATRHARRVYVGGLPPFANEQSIASFFSQVMIAIGGNSAGSGDSVVNVYI 221
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSG 537
EK AFVE T E+AS A+ DG F G
Sbjct: 222 NHEKKFAFVEMRTVEEASNAMALDGIVFEG 251
>gi|159163083|pdb|1U2F|A Chain A, Solution Structure Of The First Rna-Binding Domain Of
Hu2af65
Length = 90
Score = 62.8 bits (151), Expect = 8e-07, Method: Composition-based stats.
Identities = 30/85 (35%), Positives = 50/85 (58%), Gaps = 1/85 (1%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+P +E+A+M+F N + G+ G+ P + I ++K AF+EF + ++
Sbjct: 2 RRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDE 60
Query: 524 ASAALCCDGCSFSGSILKIKRPKEF 548
+ A+ DG F G LKI+RP ++
Sbjct: 61 TTQAMAFDGIIFQGQSLKIRRPHDY 85
>gi|429328959|gb|AFZ80718.1| hypothetical protein BEWA_001250 [Babesia equi]
Length = 711
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 112/256 (43%), Gaps = 30/256 (11%)
Query: 472 PLSASEKALMEFLNNFLLS-SGVQHVHGSLPCIGC-VIQREKGQAFVEFLTAEDASAALC 529
PL + +++F N L++ S ++G +P + + +E+G F+EF TAE A
Sbjct: 357 PLGIRLQDVVDFFNGALMTMSTSIDINGPMPVMKTEIFNQEQGYCFLEFTTAEYADLCYK 416
Query: 530 CDGCSFSGSILKIKRPKEFV------EVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFE 583
DG +G LK++RP +F + IF+ I + S + + +++ A G LK +
Sbjct: 417 LDGIQCNGYSLKLRRPIDFSSSMSSEDTKIFVQNIPESFSEEDIRKLLEAHGKLKTCNLV 476
Query: 584 VNEDHE--EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLD-----GSIMDNSG 636
++ + F EY +A+ LNG + VL+ A G D
Sbjct: 477 IDPFTRLNKGYGFFEYESSSSAKEAVIHLNGHVIQNNVLSVKHAAFSSFAAGGKPADCRA 536
Query: 637 NPPFHGIPKHAL--PLL-----------KKPTEVLKLKNVFNPEGFSSLSELEVEEVLED 683
+ + PLL +P+ V++L NV PE L + E+L++
Sbjct: 537 SSIITSVSHCVFSNPLLGLQMQNGRKKGSEPSRVVQLLNVVYPEDI--LDDKNYREMLKE 594
Query: 684 VRLECARFGSVKSVNV 699
++ E ++G ++ + +
Sbjct: 595 IKEEAQKYGPLEEIYI 610
>gi|403224363|dbj|BAM42493.1| splicing factor [Theileria orientalis strain Shintoku]
Length = 377
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 121/292 (41%), Gaps = 39/292 (13%)
Query: 445 AKLNVSMDSVQLTQ---------SNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQH 495
K + + D+V LT+ + + +RL + NLP ++EF N LL+ +
Sbjct: 26 GKWDTAEDAVSLTELKAKTSEEENKKRQKRLYIGNLPAGMKLGDVVEFFNGALLAMVPSN 85
Query: 496 VHGSLPCIGC--VIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFVE--- 550
P + + E+G F+EF T E A DG + +G LKI+RP +F +
Sbjct: 86 QTTKDPLVSKTEIYNPEQGYCFLEFKTPELTDLAFKLDGITCNGYSLKIRRPIDFTQGNQ 145
Query: 551 ---VAIFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPK 605
IFI ++ ++ + E++ G LK ++ + + F EY D
Sbjct: 146 LEDTKIFIQNVATDVTEAELRELLEKHGKLKLFNLIKDPITGASKGYGFFEYEDSRSAKM 205
Query: 606 AIAGLNGLKVGGQVLTAVQAV-----------------LDGSIMDNSGNPPFHGIPKHAL 648
A+ LNG + VL+ A L SI + N P G+
Sbjct: 206 AVLHLNGQALKQNVLSVKHAAFGYFASGGKPIDCKASNLPNSITQSILNNPLLGLQLQNS 265
Query: 649 PLL-KKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
++ KPT V++L N+ E LS+ E++ + E ++G++ + V
Sbjct: 266 KIVGAKPTRVVQLLNMVFSEDL--LSDYNYNEIVRLTKEEAGKYGALDEIVV 315
>gi|114794658|pdb|2HZC|A Chain A, Crystal Structure Of The N-terminal Rrm Of The U2af Large
Subunit
Length = 87
Score = 61.2 bits (147), Expect = 2e-06, Method: Composition-based stats.
Identities = 30/82 (36%), Positives = 48/82 (58%), Gaps = 1/82 (1%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
RRL V N+P +E+A+M+F N + G+ G+ P + I ++K AF+EF + ++
Sbjct: 7 RRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGN-PVLAVQINQDKNFAFLEFRSVDE 65
Query: 524 ASAALCCDGCSFSGSILKIKRP 545
+ A+ DG F G LKI+RP
Sbjct: 66 TTQAMAFDGIIFQGQSLKIRRP 87
>gi|145536694|ref|XP_001454069.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421813|emb|CAK86672.1| unnamed protein product [Paramecium tetraurelia]
Length = 426
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 64/275 (23%), Positives = 118/275 (42%), Gaps = 50/275 (18%)
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDAS 525
+ V N+PL+ K L E+ N + S + + I K +E + E
Sbjct: 98 MIVSNVPLNVQLKELEEYFNTLITSLDPKITERPIKAIE--YGATKSWVVLECSSKEAKR 155
Query: 526 AALCCDGCSFSGSI-LKIKRPKEFVE---------------------VAIFIGGISRTLS 563
A + D F + +K+++P++F+E +++GG+ L
Sbjct: 156 ALVTQDQVQFVNNCKIKVEKPRKFLERILNPQAKEAELNADQKQEDNTRLYLGGLPTYLR 215
Query: 564 SKMVMEIVCAFGPLKAYHF----EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVGGQV 619
+ VM+++ +FG K ++ N + + F EY T KA+ LN L++G +
Sbjct: 216 DEDVMKLIQSFGTTKYFNLVKDTTSNTEISKGYCFFEYEKTASTAKALKALNNLQIGDKK 275
Query: 620 LTAVQAV--------LDGSIM---DNSGNPPFH---GIPKHALPLLKKPTEVLKLKNVFN 665
L + + GS + D P IP+ AL P++V++ N+ +
Sbjct: 276 LKICKKINGRDQPSNYAGSFLASCDLLRIPQVQQMLTIPQSALI----PSKVVQFLNMCS 331
Query: 666 PEGFSSLSELEV-EEVLEDVRLECARFGSVKSVNV 699
E L E ++ EE++ED+R EC RFG ++ + +
Sbjct: 332 IE---DLYEDDIYEELMEDIRSECIRFGQIEKIEI 363
>gi|298713809|emb|CBJ27181.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1141
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 112/270 (41%), Gaps = 51/270 (18%)
Query: 460 NRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQ-----A 514
N+ MR L V LP S L +FLN + +Q++ + VI+ G A
Sbjct: 745 NKVMRELHVGGLPHGVSGVQLQDFLN-----AAMQYLKIATSAGNPVIRIAMGPDGTNFA 799
Query: 515 FVEFLTAEDASAALC-CDGCSFSGSILKIKRPKEFVEV------------AIFIGGISRT 561
F+E T E+ +A L G LK RPK + + + +
Sbjct: 800 FIELRTEEETNATLGRMSGIQCGTGHLKFGRPKAHAAGATAVAPKKEESSVLMVMNLPDS 859
Query: 562 LSSKMVMEIVCAFGPLKAYHF-EVNEDHEEPCAFIEYVDQLVTPKAIAGLNGLKVG-GQV 619
L+ V E++ FG LK ++ + + + A EY D A++GL+GL VG G++
Sbjct: 860 LTDDHVRELLSPFGELKKFNLLKDSSGKSKGTAVFEYTDMENGQLALSGLSGLPVGKGKL 919
Query: 620 LTA-VQAVLDGSIM---------DNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGF 669
+ V A++ +++ D N +PT V++L N+ E
Sbjct: 920 MVQRVPAMMAATLLKPVKVKEVEDEQDN--------------VEPTCVVRLSNMVEVEEL 965
Query: 670 SSLSELEVEEVLEDVRLECARFGSVKSVNV 699
+ + E E+ DV EC ++G VKS V
Sbjct: 966 AD--DTEYAEIKGDVVEECEQYGKVKSAEV 993
>gi|124810295|ref|XP_001348830.1| U2 snRNP auxiliary factor, putative [Plasmodium falciparum 3D7]
gi|23497731|gb|AAN37269.1|AE014827_12 U2 snRNP auxiliary factor, putative [Plasmodium falciparum 3D7]
Length = 833
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 98/507 (19%), Positives = 194/507 (38%), Gaps = 132/507 (26%)
Query: 284 LKRRRSRSREREDRNRRSLSLSPRAQKRTSYYDREHEGLPSHSTKISSGRQHSDIDSSRV 343
L RRRSR ++ DRN+ S HS ISS +H + R
Sbjct: 311 LDRRRSRYKDSRDRNKFSY----------------------HSRSISSDEEHYSKNKKRT 348
Query: 344 TGNGLSGHYRRHDGSTSGLGGYSPRKRRTEAAAKTPSPINRSPEK--KSAKWDVAPVETY 401
+ + S+S + R R++ ++ T N S +K K +KWD
Sbjct: 349 KRRKSRHNIDKDSYSSSSNSTHRRRNRKSSSSNNTRKDDNNSIQKKRKKSKWD------- 401
Query: 402 SVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLT---- 457
T ++ +N S V+ QK + ++V+ +L N + +
Sbjct: 402 -------TVDECLLNNNLLNNS---VSGIFQK--SSLTVTGNLIAQNNKITDLSRNPYEQ 449
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLS----SGVQHVHGS---LPCIGC-VIQR 509
+++ R+L + N+P ++ ++ +++F NN +L+ S + G +P I C +
Sbjct: 450 DTDKKQRKLYIGNIPPNSKQEDVVDFFNNSILAVIKDSSLDVKIGDVQLMPVIKCEIFNS 509
Query: 510 EKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFV-------------------- 549
+ F+EF T + L D ++ L+I RP +++
Sbjct: 510 DSRFCFLEFRTVQITWLCLKLDSIPYNNYCLRIGRPHDYIPPPEGDPAFTTVFTDINMDV 569
Query: 550 ------------------EVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHE 589
E ++I + L + + +++ FG LKA++ ++N
Sbjct: 570 FEKLRPSKPVNVKTSSDEENRLYIQNLPHDLKDEQIKDLLEQFGDLKAFNIIKDLNTGLN 629
Query: 590 EPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGS------------------I 631
+ F EY D T AI LNG G +L +A +
Sbjct: 630 KGYGFFEYEDSSCTQLAIHALNGFVCGQNILNVKKATFNKQPTTITTNNNMNNQNPNFIA 689
Query: 632 MDNSGNPPFHGIP-----------------KHALPLLKKPTEVLKLKNVFNPEGFSSLSE 674
+ N+ + P +P + + + +K ++V++L N E + +
Sbjct: 690 LPNNSDVPVTLLPSSISQKILSNSIIGLQVQASRKIGEKSSKVVQLTNAVFQEDL--IVD 747
Query: 675 LEVEEVLEDVRLECARFGSVKSVNVVK 701
+ EE+L++V+ E ++G+++++ + K
Sbjct: 748 SQYEEILKEVKEEAEKYGTLQNIVIPK 774
>gi|440290938|gb|ELP84237.1| splicing factor u2af large subunit, putative [Entamoeba invadens
IP1]
Length = 623
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/298 (22%), Positives = 129/298 (43%), Gaps = 59/298 (19%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLL-SSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAE 522
RR+ V N+ + +E+ ++E N+ + V S + + EK AF+EF T +
Sbjct: 124 RRVYVGNINTTTTEQDIVEAFNDAMRRGDYVDKNDKSDIIVSTEVNYEKSYAFIEFRTFD 183
Query: 523 DASAALCCDGCSFSGSILKIKRPKEFVEVAIFI---------------------GGISRT 561
A AL DG + G+ +K++RPK+F V FI G I
Sbjct: 184 QAVKALSLDGLTIKGASVKVRRPKDFNPVLPFISSLSQLMEVGTTKPRDGVMYMGNIPLQ 243
Query: 562 LSSKMVMEIVCAFGPLKAYHFEVNEDHEEP-----CAFIEYVDQLVTPKAIAGLNGLKVG 616
+S + + + + PLK Y + P C F EY + K +A NG+ +G
Sbjct: 244 MSDEQIQKKLENLNPLKKYVVVRDPSLGAPQGKCYCLF-EYQNPEYKDKVLA-FNGIILG 301
Query: 617 G---QVLTAVQA----------------------VLDGSIMDNS--GNPPFHGIPKHALP 649
G +V + ++ ++ +++++S + F + ++
Sbjct: 302 GDKIEVCSGLEGFKHFPTAALNELCMKMFPQRTDIITATLLNSSVGYSDVFERVLHNSED 361
Query: 650 LLK-KPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGDSN 706
L + + T V+ L N+F PE + +E E+++D+R C +G V S+++ + ++N
Sbjct: 362 LSQYECTRVIVLFNMFFPEDLN--NEQRYIELVDDIREACIAYGEVISISIPRPTETN 417
>gi|297736736|emb|CBI25913.3| unnamed protein product [Vitis vinifera]
Length = 6467
Score = 58.5 bits (140), Expect = 2e-05, Method: Composition-based stats.
Identities = 50/169 (29%), Positives = 77/169 (45%), Gaps = 27/169 (15%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEEPCAFIEYVDQLVTPKAIAGL 610
IF+GG+ + + + E++ +FGPL+ + + D + + F Y D VT A A L
Sbjct: 6240 IFVGGLPYYFTEEQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 6299
Query: 611 NGLKVGGQVLTAVQAVLDG----SIMDN-----------------SGNPPFHGIPKHALP 649
NGLK+G + LT +A + S DN +G G
Sbjct: 6300 NGLKMGDKTLTVRRATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLNLPGAGMAFTA 6359
Query: 650 LLKKPTEVLKLKNVFNPEGFSSLSELEV-EEVLEDVRLECARFGSVKSV 697
+ + PT+VL L V N L + E EE+LED+R E +FG++ V
Sbjct: 6360 IAETPTKVLCLTEVIN---IDELRDDEAYEEILEDMRDEGGKFGALVHV 6405
>gi|70946422|ref|XP_742927.1| U2 snRNP auxiliary factor [Plasmodium chabaudi chabaudi]
gi|56522174|emb|CAH84932.1| U2 snRNP auxiliary factor, putative [Plasmodium chabaudi chabaudi]
Length = 561
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 85/421 (20%), Positives = 166/421 (39%), Gaps = 101/421 (23%)
Query: 366 SPRKRRTEAAAKTPSPINR----------SPEKKSAKWDVAPVETYSVPSNVHTSNQAAS 415
S R+R+ A K ++ P++K +KWD +V ++ +N
Sbjct: 98 STRERKKNARDKNDISMSEEDSKKENKEIKPKRKKSKWD-------TVDESLLANNMLID 150
Query: 416 SNAHEMVSSDPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSA 475
SN V S L G + + + +L + ++ R+L + NLP ++
Sbjct: 151 SNNLSGVLQYQRLSLNGNLLPGNKMP------QLGRNPYEL-EGDKKQRKLYIGNLPPNS 203
Query: 476 SEKALMEFLNNFLLS----SGVQHVHGS---LPCIGC-VIQREKGQAFVEFLTAEDASAA 527
++ ++EF NN L S S ++ G LP + C + + F+EF T + +
Sbjct: 204 KQEEIVEFFNNTLSSIIKGSSLEVKIGDVQLLPVVKCEIFNPDSRFCFLEFRTMDITWLS 263
Query: 528 LCCDGCSFSGSILKIKRPKEFV---------------------------EVA-------- 552
L D S++ L+I RP +++ ++A
Sbjct: 264 LKLDSMSYNNYCLRINRPHDYMPPPEGDPALTVVFPDIDMGLLESFKPPKIAPVRSTGDD 323
Query: 553 ---IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAI 607
++I + L +M+++ FG LK ++ ++N + F EY D T AI
Sbjct: 324 DNKLYIQNLPHDLKDDQIMDLLGQFGKLKGFNIIKDLNTGLNKGYGFFEYEDSSCTQVAI 383
Query: 608 AGLNGLKVGGQVLTAVQAVLDGSI----------MDNSGNPPFHGIP------------- 644
LNG G +L +A + + + N+ + P +P
Sbjct: 384 HALNGFVCGKNILNVKKATFNKNPNNIPNPNNIALANNVDVPVSLLPNSISQKILSNSII 443
Query: 645 ----KHALPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVV 700
+ + + +K + V++L N E + + EE+L+DV+ E ++G+++S+ +
Sbjct: 444 GLQIQASRKIGEKSSRVIQLTNAVFQEDL--IINSQYEEILKDVKEEAEKYGTLQSIVIP 501
Query: 701 K 701
K
Sbjct: 502 K 502
>gi|320590609|gb|EFX03052.1| splicing factor u2af large subunit [Grosmannia clavigera kw1407]
Length = 420
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 95/221 (42%), Gaps = 40/221 (18%)
Query: 515 FVEFLTAEDASAALCCDGCSFSG--------SILKIKRPKEFVEVAIF------------ 554
VEF DA+ AL +G S S L I+RPK+++ A+
Sbjct: 1 MVEFKEPIDATVALALNGISMEAEDASGSGQSGLSIQRPKDYIVPAVVDYSVYHPGVVSN 60
Query: 555 ----------IGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP--CAFIEYVDQLV 602
I I LS + V E++ +FG L+A+ + EE AF EY +
Sbjct: 61 VVIDTPFKIAITNIPSYLSDEQVTELLVSFGELRAFVLLKDRSTEESRGVAFCEYTEPQS 120
Query: 603 TPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKN 662
T AI GLNG+ +G + L VQ G S + + A + ++ ++V ++
Sbjct: 121 TDVAIQGLNGMDLGDRKLR-VQKASIGITQVTSVEMGVNAMSLLAGTISQEASDVSRVVQ 179
Query: 663 VFNPEGFSSLSEL----EVEEVLEDVRLECARFGSVKSVNV 699
+ N + EL + E++ EDV ECA+FG V + V
Sbjct: 180 LLN---MVTAEELVNNDDYEDICEDVTEECAKFGPVMGLKV 217
>gi|359497129|ref|XP_003635431.1| PREDICTED: splicing factor U2af large subunit B-like, partial
[Vitis vinifera]
Length = 238
Score = 56.6 bits (135), Expect = 6e-05, Method: Composition-based stats.
Identities = 50/169 (29%), Positives = 77/169 (45%), Gaps = 27/169 (15%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED--HEEPCAFIEYVDQLVTPKAIAGL 610
IF+GG+ + + + E++ +FGPL+ + + D + + F Y D VT A A L
Sbjct: 11 IFVGGLPYYFTEEQIRELLESFGPLRGFDLVKDRDTGNSKGYGFCVYQDPAVTDIACAAL 70
Query: 611 NGLKVGGQVLTAVQAVLDG----SIMDN-----------------SGNPPFHGIPKHALP 649
NGLK+G + LT +A + S DN +G G
Sbjct: 71 NGLKMGDKTLTVRRATVGSGQAKSEQDNILAQAQQHIAIQKIALQAGGLNLPGAGMAFTA 130
Query: 650 LLKKPTEVLKLKNVFNPEGFSSLSELEV-EEVLEDVRLECARFGSVKSV 697
+ + PT+VL L V N L + E EE+LED+R E +FG++ V
Sbjct: 131 IAETPTKVLCLTEVIN---IDELRDDEAYEEILEDMRDEGGKFGALVHV 176
>gi|325179530|emb|CCA13927.1| splicing factor U2af large subunit putative [Albugo laibachii Nc14]
Length = 833
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 79/160 (49%), Gaps = 24/160 (15%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED----HEEPCAFIEYVDQLVTPKAIA 608
+ + I L+ + V E+V FG L+ F++ +D AF EY + VT +A+
Sbjct: 637 LLMSNIPGVLAEEQVKELVQPFGELR--FFKLIKDPITGQSTGTAFFEYQENQVTTEALN 694
Query: 609 GLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLL----KKPTEVLKLKNVF 664
GL+GL +GG L+ +A P P+ A+ + ++P VL++ N+
Sbjct: 695 GLDGLDIGGVKLSVRRA------------PDATKYPQIAVLMPGAAGEEPGPVLRMANMV 742
Query: 665 NPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNVVKYGD 704
+ + ++ E ++ EDV EC RFG++ ++++ + D
Sbjct: 743 SEDELK--NDEEFADLKEDVEEECKRFGTIIALDIPRSQD 780
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 79/187 (42%), Gaps = 4/187 (2%)
Query: 410 SNQAASSNAHEMVSSDPVTSTTQ--KPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLC 467
+N + + H VS D S TQ + +S+ +AK+ S ++ + +P R L
Sbjct: 416 ANSSPEPHEHPPVSGDGTPSITQLMQQYPTLSLQDIIAKMQASNVTMAAAVAMKPARELY 475
Query: 468 VENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAA 527
V NLP + + L EFL + G+ G+ P + I + AF E + E+ + A
Sbjct: 476 VGNLPATITGPQLQEFLGTIIQQVGLSTQPGN-PILSVWISTDGHFAFCEMRSVEECNLA 534
Query: 528 LCCDGCSFSGSILKIKRPKEFVEVAIFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNED 587
L + G LK RP+ F+ + +S + +V + C P+ +V
Sbjct: 535 LLLNQLPLLGQPLKFGRPRSFMGPPQPMPIVSARTQTALV-NLGCTPNPVWFASPDVTSF 593
Query: 588 HEEPCAF 594
+P F
Sbjct: 594 GSDPMGF 600
>gi|348681357|gb|EGZ21173.1| hypothetical protein PHYSODRAFT_488481 [Phytophthora sojae]
Length = 640
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 71/161 (44%), Gaps = 26/161 (16%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
+ + I L+ V E+V FG LK++ + A EY D VT +A+ GL
Sbjct: 444 LIMSNIPVVLTEDQVKELVEPFGALKSFTLVKDTATGASMGSALFEYEDDSVTAQAVEGL 503
Query: 611 NGLKVGGQVLTAVQ---AVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPE 667
NGL +GG +L +VQ A P F +P+ VLK+ N+
Sbjct: 504 NGLSIGG-ILLSVQCQPASGAALPAAPGATPNFE----------DQPSAVLKMANMV--- 549
Query: 668 GFSSLSEL----EVEEVLEDVRLECARFGSVKSVNVVKYGD 704
S+ EL E ++ EDV EC RFG+V + + + D
Sbjct: 550 ---SIDELRDDDEYADLAEDVEEECKRFGNVTGLEIPRPKD 587
>gi|124360614|gb|ABN08613.1| RNA-binding region RNP-1 (RNA recognition motif) [Medicago
truncatula]
Length = 257
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 69/155 (44%), Gaps = 12/155 (7%)
Query: 388 KKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISVSASLAKL 447
K+++ +D+AP T P T + AH M+ +T GIS +L+ +
Sbjct: 114 KRTSGFDMAPPATGVTP----TVSGQMPGIAH-MIQG----ATQNFSPYGISQIGALSLM 164
Query: 448 NVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVI 507
V + Q T R RR+ V LP A+E+++ F + +++ G + I
Sbjct: 165 QVQPMTQQAT---RHARRVYVGGLPPFANEQSIASFFSQVMIAIGGNSAGSGDSVVNVYI 221
Query: 508 QREKGQAFVEFLTAEDASAALCCDGCSFSGSILKI 542
EK AFVE T E+AS A+ DG F S +
Sbjct: 222 NHEKKFAFVEMRTVEEASNAMALDGIVFEASTFIL 256
>gi|76154831|gb|AAX26240.2| SJCHGC03157 protein [Schistosoma japonicum]
Length = 258
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/172 (30%), Positives = 85/172 (49%), Gaps = 36/172 (20%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
IF+GG+ L+ V E++ +FGPLK ++ + + + AF EYVD VT A AGL
Sbjct: 41 IFVGGLPYYLNEDQVKELLLSFGPLKGFNLVKDGSTGLSKGYAFCEYVDSNVTDHACAGL 100
Query: 611 NGLKVG--------------------GQVLTAVQAVLDGSIMDNSGNPPF---HGIPKHA 647
NG+++G Q L + + +G++ + +G+ G P
Sbjct: 101 NGMQLGDKKLIVQRASVGAKHTTGVLPQCLLQMSGLEEGAVQNTTGSGNLTVRSGGP--- 157
Query: 648 LPLLKKPTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
PTEVL L N+ E + E E+++EDVR EC+++G V+S+ +
Sbjct: 158 ------PTEVLCLMNMI--ETSELEDDEEYEDIVEDVRAECSKYGVVRSLEI 201
>gi|50552688|ref|XP_503754.1| YALI0E09889p [Yarrowia lipolytica]
gi|49649623|emb|CAG79345.1| YALI0E09889p [Yarrowia lipolytica CLIB122]
Length = 601
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/284 (22%), Positives = 114/284 (40%), Gaps = 58/284 (20%)
Query: 460 NRPMRRLCVENLPLSASEK-ALMEFLNNFL-----------LSSGVQHVHGSLPCIGCVI 507
+R RRL + +P + A+ F +F+ + GV + H LP
Sbjct: 262 SRVARRLILSGIPADQIDTVAIKSFFTDFIEGLELQGSKERIVDGV-YKHPRLP------ 314
Query: 508 QREKGQAFVEFLTAEDASAALCCDG--CSFSGSILKIKRPKEFV---------------- 549
+ VEF +AE A+ AL G ++SG + I+RP ++
Sbjct: 315 -----EVLVEFFSAEMATLALALSGLGINYSGPPISIRRPSNYICPTPERSEVSRRSLDE 369
Query: 550 --EVA---------IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIE 596
EVA I + I + V ++ +FG L A+ ++ A ++
Sbjct: 370 EKEVASVVEDSNTKIIVWDIPFNVEEDQVRQLTASFGELSAFQLIRQLPSRESAGIALVD 429
Query: 597 YVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKP-T 655
Y D V A++GL+G +GG+ L + A + + S N GI + +P +
Sbjct: 430 YKDPEVVKDAVSGLSGQVIGGKNLKVMLACEGPTQLSCSSNNGLKGIVTVMNDVKSRPES 489
Query: 656 EVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSVKSVNV 699
V+ L N+ + L ++ E+ E V EC ++G + V +
Sbjct: 490 SVIVLFNLVTLDEL--LDDVAYREITEQVESECLKYGGGEEVQI 531
>gi|70954273|ref|XP_746191.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526725|emb|CAH88205.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 686
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/289 (22%), Positives = 114/289 (39%), Gaps = 63/289 (21%)
Query: 464 RRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAED 523
R L V N+P + + +++FLN LL + C+ I+ + AFVEF + +D
Sbjct: 254 RELYVGNIPQNIDIQEIVKFLNTCLLILYNKENENESICLKACIRGDTRYAFVEFRSLQD 313
Query: 524 ASAALCCDGCSFSGSILKIKRPKEFV----------------------------EVAIF- 554
S + +G F + L+I RPK F IF
Sbjct: 314 TSNCMLLNGIYFYSNNLRIGRPKTFPAEYTKLIPPATIPPIDTYYLSQGLIGIKAFVIFH 373
Query: 555 ----------------------IGGISRTLSSKMVMEIVCAFGPLKAY-HFEVNEDHEEP 591
+ IS+ + + E++ AFG ++++ FE E+ +
Sbjct: 374 QNRDETKNEYLPVDMIKLQKLCVSNISKNNETNKIKELLEAFGEIQSFEFFEGEENSDTY 433
Query: 592 CAFIEY--VDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALP 649
+EY V+ + I N ++ + V D +I N + K+A+
Sbjct: 434 ICLVEYNNVENAIQAHKILNQN---TSYRIQFEYEIVNDPTI--NQLVKKKYMQTKNAIL 488
Query: 650 LLKKPTEVLKLKNVFNPEGFSSLSELE-VEEVLEDVRLECARFGSVKSV 697
+ PT+V+ L + F LS E +E+ ED+++EC ++G V V
Sbjct: 489 SQQIPTKVVVLSKI---ATFDELSNPEDYKEISEDIKIECEKYGPVLEV 534
>gi|110736516|dbj|BAF00225.1| RRM-containing protein [Arabidopsis thaliana]
Length = 574
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 8 KEKHGKRGEPSLADSNEGSAARTRPFSFDEIMIRRKTKNLSENIKEDAMDVSNIPGDCTI 67
K+K + G +L + EGSAARTRP S +EI +RRK K EN+KE+ + + + G +
Sbjct: 6 KQKENRDGHETLVN-EEGSAARTRPTSVEEIRLRRKRKESLENVKEETVGAARLLGYDLV 64
Query: 68 ENVSDDHKS 76
E SD H+S
Sbjct: 65 EKASDYHES 73
>gi|221058957|ref|XP_002260124.1| splicing factor [Plasmodium knowlesi strain H]
gi|193810197|emb|CAQ41391.1| splicing factor, putative [Plasmodium knowlesi strain H]
Length = 970
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 74/183 (40%), Gaps = 39/183 (21%)
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQ----REKGQAFVEFLTA 521
+ V NL L A E+ + EF + V G + I C+ + KG A+VEF T
Sbjct: 595 VLVLNLDLKADERDIYEFFS---------EVAGKVRDIQCIKDQRSGKSKGVAYVEFYTQ 645
Query: 522 EDASAALCCDGCSFSGSILKIKRPK-----------------EFVEVAIFIGGISRTLSS 564
E AL +G +KI+ + + + ++IGG+ L +
Sbjct: 646 EAVVKALSANGYMLKNRPIKIQSSQAEKNRAAKAAKHQPIDPNDIPIKLYIGGLVGPLGN 705
Query: 565 KMVMEIVCAFGPLKAYHFEVNEDHEEP----C---AFIEYVDQLVTPKAIAGLNGLKVGG 617
E+ F P EV E H +P C FI++ +A+ LNG+++ G
Sbjct: 706 ISEQELKQLFNPFGEI-LEV-EIHRDPYTGKCKGFGFIQFFRASEAIEAMGVLNGMEIAG 763
Query: 618 QVL 620
+ L
Sbjct: 764 REL 766
>gi|110741990|dbj|BAE98934.1| splicing factor like protein [Arabidopsis thaliana]
Length = 341
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 48/105 (45%)
Query: 438 ISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVH 497
+ S L++ Q+ R RR+ V L +A+E+++ F + + + G
Sbjct: 213 LPTGQSFGGLSMMPIQAMTQQATRHARRVYVGGLSPTANEQSVATFFSQVMAAVGGNTAG 272
Query: 498 GSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKI 542
+ I EK AFVE + E+AS A+ DG F G+ +K+
Sbjct: 273 PGDAVVNVYINHEKKFAFVEMRSVEEASNAMSLDGIIFEGAPVKV 317
>gi|124513200|ref|XP_001349956.1| splicing factor, putative [Plasmodium falciparum 3D7]
gi|23615373|emb|CAD52364.1| splicing factor, putative [Plasmodium falciparum 3D7]
Length = 864
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 46/193 (23%), Positives = 78/193 (40%), Gaps = 41/193 (21%)
Query: 455 QLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQ----RE 510
++ ++ R + V NL L A E+ + EF V G + I C+ +
Sbjct: 478 EMEEAKRDDLTVLVLNLDLKADERDIYEFF---------SEVAGKVRDIQCIKDQRSGKS 528
Query: 511 KGQAFVEFLTAEDASAALCCDGCSFSGSILKIKR------------------PKEFVEVA 552
KG A+VEF T E AL +G +KI+ P + + +
Sbjct: 529 KGVAYVEFYTQEAVIKALAANGMMLKNRPIKIQSSQAEKNRAAKAAKHQPIDPND-IPLK 587
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVNEDHEEP-------CAFIEYVDQLVTPK 605
++IGG+ LS+ E+ F P +V E H +P FI++ +
Sbjct: 588 LYIGGLLGPLSNITEQELKQLFNPFGDI-LDV-EIHRDPYTGKSKGFGFIQFHKASEAIE 645
Query: 606 AIAGLNGLKVGGQ 618
A+ +NG++V G+
Sbjct: 646 ALTVMNGMEVAGR 658
>gi|213408691|ref|XP_002175116.1| splicing factor U2AF subunit [Schizosaccharomyces japonicus yFS275]
gi|212003163|gb|EEB08823.1| splicing factor U2AF subunit [Schizosaccharomyces japonicus yFS275]
Length = 511
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 81/352 (23%), Positives = 138/352 (39%), Gaps = 58/352 (16%)
Query: 381 PINRSPEKKSAKWDVAPVETYSVPSNVHTSNQAASSNAHEMVSSDPVTSTTQKPLAGISV 440
P+N+ K+S WD+ P +V T++QA S + + S T P +
Sbjct: 114 PLNQWKRKRSM-WDMKPPGYENV-----TADQAKMSGLFPLPGAP--RSATADPEKLAAF 165
Query: 441 SASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGV------Q 494
+ S A ++ ++R RRL V+ LP + L + +S +
Sbjct: 166 ARSTAGSIIAPPPPIQPGASRQARRLKVKELPAEFEVEDLKNVFEESISTSSFHKDRDTK 225
Query: 495 HVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKRPKEFV----- 549
HV PC + + A +E T EDA+ F + I R + ++
Sbjct: 226 HVTAIYPC------KTERYAIIELATPEDATFIWGARKLKFKNETVLIDRLEGYIVPQIS 279
Query: 550 -EVA------------------IFIGGISRTLSSKMVMEIVCAFGPLKAYHFEVN--EDH 588
EVA ++IG + L+ + E++ FG L++ N +
Sbjct: 280 SEVAQKRPKNDLNQKVLDSADKVYIGSLPLYLNEDQISELLKPFGELQSLFLAKNSADMT 339
Query: 589 EEPCAFIEYVDQLVTPKAIAGLNGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHAL 648
AF EY+ A+ GLN ++ G L VQ G I +P G+ AL
Sbjct: 340 SRGYAFCEYISSESATAAVQGLNNMEFGDTRLM-VQFACVG-IQQPVPSPRSVGM--AAL 395
Query: 649 PLLKK------PTEVLKLKNVFNPEGFSSLSELEVEEVLEDVRLECARFGSV 694
L K PT VL++ N+ + + +L + E++ + V+ +C +G V
Sbjct: 396 IELSKSSTEAAPTRVLQIHNLLDAD--ETLDTEDYEDIRKSVQNKCNEYGQV 445
>gi|301121478|ref|XP_002908466.1| splicing factor U2af large subunit, putative [Phytophthora
infestans T30-4]
gi|262103497|gb|EEY61549.1| splicing factor U2af large subunit, putative [Phytophthora
infestans T30-4]
Length = 597
Score = 48.1 bits (113), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 68/158 (43%), Gaps = 21/158 (13%)
Query: 553 IFIGGISRTLSSKMVMEIVCAFGPLKAYHF--EVNEDHEEPCAFIEYVDQLVTPKAIAGL 610
+ + I L+ + V E+V FG LK++ + A EY D V +A+ GL
Sbjct: 402 LIMSNIPVVLAEEQVKELVEPFGKLKSFTLVKDSATGASLGSALFEYEDSDVAAQAVEGL 461
Query: 611 NGLKVGGQVLTAVQAVLDGSIMDNSGNPPFHGIPKHALPLLKKPTEVLKLKNVFNPEGFS 670
NGL +GG +L S+ + A +P+ VLK+ N+
Sbjct: 462 NGLSIGGILL---------SVQRQPASSAAALPSAAAANPEDQPSAVLKMANMV------ 506
Query: 671 SLSEL----EVEEVLEDVRLECARFGSVKSVNVVKYGD 704
S+ EL E ++ EDV EC RFG V + + + D
Sbjct: 507 SIDELRDDEEYADLAEDVEEECKRFGGVTGMEIPRPKD 544
Score = 42.7 bits (99), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 56/125 (44%), Gaps = 1/125 (0%)
Query: 425 DPVTSTTQKPLAGISVSASLAKLNVSMDSVQLTQSNRPMRRLCVENLPLSASEKALMEFL 484
+P + +S+ +AK+ S ++ + +P R L V NLP + + L EFL
Sbjct: 208 NPTITQLMAQYPTMSLQDIIAKMQASNVTMAAAVAQKPARELYVGNLPPNVTGPQLQEFL 267
Query: 485 NNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDASAALCCDGCSFSGSILKIKR 544
+ + G+ G+ P I + AF E + E+ + AL + S G LK R
Sbjct: 268 STIIQQVGLTTQPGN-PIINTWTSTDGHFAFCEMRSVEECNLALLLNQLSLLGQPLKFGR 326
Query: 545 PKEFV 549
P+ F+
Sbjct: 327 PRSFM 331
>gi|357155772|ref|XP_003577233.1| PREDICTED: splicing factor U2af large subunit B-like [Brachypodium
distachyon]
Length = 446
Score = 48.1 bits (113), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 43/84 (51%)
Query: 458 QSNRPMRRLCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVE 517
Q+ R RR+ V LP +A+E+++ + N + + G + I +K AFVE
Sbjct: 198 QATRHARRVYVGGLPPTANEQSVAIYFNQVMAAIGGNTAGPGDAVLNVYINHDKKFAFVE 257
Query: 518 FLTAEDASAALCCDGCSFSGSILK 541
+ E+AS A+ DG F G+ +K
Sbjct: 258 MRSVEEASNAMALDGILFEGAPVK 281
>gi|82595543|ref|XP_725893.1| splicing factor [Plasmodium yoelii yoelii 17XNL]
gi|23481068|gb|EAA17458.1| putative splicing factor [Plasmodium yoelii yoelii]
Length = 698
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 76/189 (40%), Gaps = 41/189 (21%)
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQ----REKGQAFVEFLTA 521
+ V NL L A E+ + EF + V G + I C+ + KG A+VEF T
Sbjct: 320 VLVLNLDLKADERDIYEFFS---------EVAGKVRDIQCIKDQRSGKSKGVAYVEFYTQ 370
Query: 522 EDASAALCCDGCSFSGSILKIKR------------------PKEFVEVAIFIGGISRTLS 563
+ AL +G +K++ P + + + ++IGG+ LS
Sbjct: 371 DSVIKALSVNGYMLKNRPIKVQSSQAEKNRAAKATKHHPIDPND-IPLKLYIGGLLGPLS 429
Query: 564 SKMVMEIVCAFGPLKAYHFEVNEDHEEP-------CAFIEYVDQLVTPKAIAGLNGLKVG 616
+ E+ F P +V E H +P FI++ +A+ +NG++V
Sbjct: 430 NITEQELKQLFNPFGDI-LDV-EIHRDPYTGKSKGFGFIQFHKASEAIEAMTVMNGMEVA 487
Query: 617 GQVLTAVQA 625
G+ + A
Sbjct: 488 GREIKVSYA 496
>gi|219521919|ref|NP_001083001.2| U2-associated SR140 protein-like [Danio rerio]
Length = 874
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 47/79 (59%), Gaps = 3/79 (3%)
Query: 236 RKHRNRDDEKNRDKSAAKKVDLGKGHDL--KVSERKEKKESPKSRHGNSRLKRRRSRSRE 293
+K ++DD++ RDK + K+ D K + K ER + ++ KSR R ++ R RSR+
Sbjct: 768 QKEFDKDDQEKRDKHSQKQKDRSKKDERRDKGEERSKTRDKEKSRKSEDR-EKSRGRSRD 826
Query: 294 REDRNRRSLSLSPRAQKRT 312
+EDR R+ S SPR KR+
Sbjct: 827 KEDRRERTKSRSPRKSKRS 845
>gi|325183203|emb|CCA17661.1| nucleolin putative [Albugo laibachii Nc14]
Length = 701
Score = 39.7 bits (91), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 62/162 (38%), Gaps = 15/162 (9%)
Query: 466 LCVENLPLSASEKALMEFLNNFLLSSGVQHVHGSLPCIGCVIQREKGQAFVEFLTAEDAS 525
+ + LP A+E + E F V+H+ G R G+A + F A +A
Sbjct: 438 IYILGLPWEATEAQVRE---RFESCGTVEHIDMQTQGDG----RPSGKARIRFSCASEAE 490
Query: 526 AALCCDGCSFSGSILKIKRPKEFVE-----VAIFIGGISRTLSSKMVMEIVCAFGPLKAY 580
AAL DG F G LKI+ E +E F+G + V E G +
Sbjct: 491 AALELDGSDFGGRWLKIQLANEILEKPENCTTAFVGNLPWDADDNSVCEFFSQCGEIGEC 550
Query: 581 HFEVNEDHEE--PCAFIEYVDQLVTPKAIAGLNGLKVGGQVL 620
+ + E ++E+ +AI LNG G+ L
Sbjct: 551 RLLSDRETGEFRGIGYVEFTSTEAADEAIK-LNGADFNGRSL 591
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.309 0.127 0.353
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,475,973,677
Number of Sequences: 23463169
Number of extensions: 626864437
Number of successful extensions: 2543874
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3839
Number of HSP's successfully gapped in prelim test: 28243
Number of HSP's that attempted gapping in prelim test: 2125222
Number of HSP's gapped (non-prelim): 187881
length of query: 943
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 791
effective length of database: 8,792,793,679
effective search space: 6955099800089
effective search space used: 6955099800089
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 82 (36.2 bits)