BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004093
(774 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q12996|CSTF3_HUMAN Cleavage stimulation factor subunit 3 OS=Homo sapiens GN=CSTF3 PE=1
SV=1
Length = 717
Score = 396 bits (1018), Expect = e-109, Method: Compositional matrix adjust.
Identities = 252/752 (33%), Positives = 377/752 (50%), Gaps = 87/752 (11%)
Query: 21 YNVETAEILANSALHLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATK 80
Y+++ IL A + P+ +A YE+L++ FP++ +FWK Y+EA + N D +
Sbjct: 30 YDLDAWSILIREAQNQPIDKARKTYERLVAQFPSS----GRFWKLYIEAEIKAKNYDKVE 85
Query: 81 QLFSRCLLICLQVPLWRCYIRFIRKVYEKKGT--EGQEETRKAFDFMLSHVGSDISSGPI 138
+LF RCL+ L + LW+CY+ ++R E KG +E+ +A+DF L +G +I S I
Sbjct: 86 KLFQRCLMKVLHIDLWKCYLSYVR---ETKGKLPSYKEKMAQAYDFALDKIGMEIMSYQI 142
Query: 139 WLEYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLA 198
W++YI FLK + A+ + E+QR+ A+R+ YQR V P ++EQLW+DY +E ++ LA
Sbjct: 143 WVDYINFLKGVEAVGSYAENQRITAVRRVYQRGCVNPMINIEQLWRDYNKYEEGINIHLA 202
Query: 199 KGLLSEYQSKYTSARAVYRERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGN 258
K ++ + Y +AR V +E + + +D N +VPP + +E QQ WK+ + +EK N
Sbjct: 203 KKMIEDRSRDYMNARRVAKEYETVMKGLDRNAPSVPPQNTPQEAQQVDMWKKYIQWEKSN 262
Query: 259 PQRI-DTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSI--------------D 303
P R D KR++F YEQCL+ L H+PDIWY+ A + +S + D
Sbjct: 263 PLRTEDQTLITKRVMFAYEQCLLVLGHHPDIWYEAAQYLEQSSKLLAEKGDMNNAKLFSD 322
Query: 304 AAIKVFQRALKALPDSEMLRY-AFAELEESRGAIAAAKKLYESLLTDSVNTTALAHIQFI 362
A +++RA+ L ML Y A+A+ EESR +Y LL L +IQ++
Sbjct: 323 EAANIYERAISTLLKKNMLLYFAYADYEESRMKYEKVHSIYNRLLAIEDIDPTLVYIQYM 382
Query: 363 RFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMH 422
+F RR EG+++ R F AR+ +HVYV ALM + KD +A +FE GLK++
Sbjct: 383 KFARRAEGIKSGRMIFKKAREDTRTRHHVYVTAALMEYYCSKDKSVAFKIFELGLKKYGD 442
Query: 423 EPAYILEYADFLSRLNDDRNIRALFERALS--SLPPEESIEVWKRFTQFEQMYGDLDSTL 480
P Y+L Y D+LS LN+D N R LFER L+ SLPPE+S E+W RF FE GDL S L
Sbjct: 443 IPEYVLAYIDYLSHLNEDNNTRVLFERVLTSGSLPPEKSGEIWARFLAFESNIGDLASIL 502
Query: 481 KVEQRRKEALSRTGEEGASALEDSLQDVVSRYSFMDLWPCSSKDLDHLVRQEWLVKNINK 540
KVE+RR A E +AL +V RY FMDL+PCS+ +L L K+++
Sbjct: 503 KVEKRRFTAFKEEYEGKETAL------LVDRYKFMDLYPCSASELKALG-----YKDVS- 550
Query: 541 KVDKSALSNGPGIVDKGPSGLTSNSTTSATVIYPDTSQMVIYDPRQKPGIGISPSTTATG 600
+ +A+ P + L PDT QM+ + PR G+ P
Sbjct: 551 RAKLAAIIPDPVVAPSIVPVLKDEVDRKPEYPKPDTQQMIPFQPRHLAPPGLHPVP---- 606
Query: 601 ASSALNALSNPMVATGGGGIMNPFDEMLKAASPAIFAFLANLP---AVEGPTPNVDIVLS 657
GG+ PA + LP +GP VD ++
Sbjct: 607 -----------------GGVF--------PVPPAAVVLMKLLPPPICFQGPFVQVDELME 641
Query: 658 ICLQSDIPTGQMGKSPTTYPTPIPTGAARSASGISGSNKSHPTPSGSSLKQSKDKQSLKR 717
I + IP I TG A + + G+ P S + L +++KR
Sbjct: 642 IFRRCKIPNT------VEEAVRIITGGAPELA-VEGNG---PVESNAVL-----TKAVKR 686
Query: 718 KDIGQDDDETTTVQSQPQPRDFFRIRQMKKAR 749
+ D+DE P D +R RQ K+ R
Sbjct: 687 PNEDSDEDEEKGAVVPPV-HDIYRARQQKRIR 717
>sp|Q99LI7|CSTF3_MOUSE Cleavage stimulation factor subunit 3 OS=Mus musculus GN=Cstf3 PE=1
SV=1
Length = 717
Score = 396 bits (1018), Expect = e-109, Method: Compositional matrix adjust.
Identities = 252/752 (33%), Positives = 377/752 (50%), Gaps = 87/752 (11%)
Query: 21 YNVETAEILANSALHLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATK 80
Y+++ IL A + P+ +A YE+L++ FP++ +FWK Y+EA + N D +
Sbjct: 30 YDLDAWSILIREAQNQPIDKARKTYERLVAQFPSS----GRFWKLYIEAEIKAKNYDKVE 85
Query: 81 QLFSRCLLICLQVPLWRCYIRFIRKVYEKKGT--EGQEETRKAFDFMLSHVGSDISSGPI 138
+LF RCL+ L + LW+CY+ ++R E KG +E+ +A+DF L +G +I S I
Sbjct: 86 KLFQRCLMKVLHIDLWKCYLSYVR---ETKGKLPSYKEKMAQAYDFALDKIGMEIMSYQI 142
Query: 139 WLEYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLA 198
W++YI FLK + A+ + E+QR+ A+R+ YQR V P ++EQLW+DY +E ++ LA
Sbjct: 143 WVDYINFLKGVEAVGSYAENQRITAVRRVYQRGCVNPMINIEQLWRDYNKYEEGINIHLA 202
Query: 199 KGLLSEYQSKYTSARAVYRERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGN 258
K ++ + Y +AR V +E + + +D N +VPP + +E QQ WK+ + +EK N
Sbjct: 203 KKMIEDRSRDYMNARRVAKEYETVMKGLDRNAPSVPPQNTPQEAQQVDMWKKYIQWEKSN 262
Query: 259 PQRI-DTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSI--------------D 303
P R D KR++F YEQCL+ L H+PDIWY+ A + +S + D
Sbjct: 263 PLRTEDQTLITKRVMFAYEQCLLVLGHHPDIWYEAAQYLEQSSKLLAEKGDMNNAKLFSD 322
Query: 304 AAIKVFQRALKALPDSEMLRY-AFAELEESRGAIAAAKKLYESLLTDSVNTTALAHIQFI 362
A +++RA+ L ML Y A+A+ EESR +Y LL L +IQ++
Sbjct: 323 EAANIYERAISTLLKKNMLLYFAYADYEESRMKYEKVHSIYNRLLAIEDIDPTLVYIQYM 382
Query: 363 RFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMH 422
+F RR EG+++ R F AR+ +HVYV ALM + KD +A +FE GLK++
Sbjct: 383 KFARRAEGIKSGRMIFKKAREDARTRHHVYVTAALMEYYCSKDKSVAFKIFELGLKKYGD 442
Query: 423 EPAYILEYADFLSRLNDDRNIRALFERALS--SLPPEESIEVWKRFTQFEQMYGDLDSTL 480
P Y+L Y D+LS LN+D N R LFER L+ SLPPE+S E+W RF FE GDL S L
Sbjct: 443 IPEYVLAYIDYLSHLNEDNNTRVLFERVLTSGSLPPEKSGEIWARFLAFESNIGDLASIL 502
Query: 481 KVEQRRKEALSRTGEEGASALEDSLQDVVSRYSFMDLWPCSSKDLDHLVRQEWLVKNINK 540
KVE+RR A E +AL +V RY FMDL+PCS+ +L L K+++
Sbjct: 503 KVEKRRFTAFREEYEGKETAL------LVDRYKFMDLYPCSASELKALG-----YKDVS- 550
Query: 541 KVDKSALSNGPGIVDKGPSGLTSNSTTSATVIYPDTSQMVIYDPRQKPGIGISPSTTATG 600
+ +A+ P + L PDT QM+ + PR G+ P
Sbjct: 551 RAKLAAIIPDPVVAPSIVPVLKDEVDRKPEYPKPDTQQMIPFQPRHLAPPGLHPVP---- 606
Query: 601 ASSALNALSNPMVATGGGGIMNPFDEMLKAASPAIFAFLANLP---AVEGPTPNVDIVLS 657
GG+ PA + LP +GP VD ++
Sbjct: 607 -----------------GGVF--------PVPPAAVVLMKLLPPPICFQGPFVQVDELME 641
Query: 658 ICLQSDIPTGQMGKSPTTYPTPIPTGAARSASGISGSNKSHPTPSGSSLKQSKDKQSLKR 717
I + IP I TG A + + G+ P S + L +++KR
Sbjct: 642 IFRRCKIPNT------VEEAVRIITGGAPELA-VEGNG---PVESSAVL-----TKAVKR 686
Query: 718 KDIGQDDDETTTVQSQPQPRDFFRIRQMKKAR 749
+ D+DE P D +R RQ K+ R
Sbjct: 687 PNEDSDEDEEKGAVVPPV-HDIYRARQQKRIR 717
>sp|Q5RDW9|CSTF3_PONAB Cleavage stimulation factor subunit 3 OS=Pongo abelii GN=CSTF3 PE=2
SV=1
Length = 717
Score = 395 bits (1014), Expect = e-109, Method: Compositional matrix adjust.
Identities = 251/752 (33%), Positives = 376/752 (50%), Gaps = 87/752 (11%)
Query: 21 YNVETAEILANSALHLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATK 80
Y+++ L A + P+ +A YE+L++ FP++ +FWK Y+EA + N D +
Sbjct: 30 YDLDAWSTLIREAQNQPIDKARKTYERLVAQFPSS----GRFWKLYIEAEIKAKNYDKVE 85
Query: 81 QLFSRCLLICLQVPLWRCYIRFIRKVYEKKGT--EGQEETRKAFDFMLSHVGSDISSGPI 138
+LF RCL+ L + LW+CY+ ++R E KG +E+ +A+DF L +G +I S I
Sbjct: 86 KLFQRCLMKVLHIDLWKCYLSYVR---ETKGKLPSYKEKMAQAYDFALDKIGMEIMSYQI 142
Query: 139 WLEYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLA 198
W++YI FLK + A+ + E+QR+ A+R+ YQR V P ++EQLW+DY +E ++ LA
Sbjct: 143 WVDYINFLKGVEAVGSYAENQRITAVRRVYQRGCVNPMINIEQLWRDYNKYEEGINIHLA 202
Query: 199 KGLLSEYQSKYTSARAVYRERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGN 258
K ++ + Y +AR V +E + + +D N +VPP + +E QQ WK+ + +EK N
Sbjct: 203 KKMIEDRSRDYMNARRVAKEYETVMKGLDRNAPSVPPQNTPQEAQQVDMWKKYIQWEKSN 262
Query: 259 PQRI-DTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSI--------------D 303
P R D KR++F YEQCL+ L H+PDIWY+ A + +S + D
Sbjct: 263 PLRTEDQTLITKRVMFAYEQCLLVLGHHPDIWYEAAQYLEQSSKLLAEKGDMNNAKLFSD 322
Query: 304 AAIKVFQRALKALPDSEMLRY-AFAELEESRGAIAAAKKLYESLLTDSVNTTALAHIQFI 362
A +++RA+ L ML Y A+A+ EESR +Y LL L +IQ++
Sbjct: 323 EAANIYERAISTLLKKNMLLYFAYADYEESRMKYEKVHSIYNRLLAIEDIDPTLVYIQYM 382
Query: 363 RFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMH 422
+F RR EG+++ R F AR+ +HVYV ALM + KD +A +FE GLK++
Sbjct: 383 KFARRAEGIKSGRMIFKKAREDTRTRHHVYVTAALMEYYCSKDKSVAFKIFELGLKKYGD 442
Query: 423 EPAYILEYADFLSRLNDDRNIRALFERALS--SLPPEESIEVWKRFTQFEQMYGDLDSTL 480
P Y+L Y D+LS LN+D N R LFER L+ SLPPE+S E+W RF FE GDL S L
Sbjct: 443 IPEYVLAYIDYLSHLNEDNNTRVLFERVLTSGSLPPEKSGEIWARFLAFESNIGDLASIL 502
Query: 481 KVEQRRKEALSRTGEEGASALEDSLQDVVSRYSFMDLWPCSSKDLDHLVRQEWLVKNINK 540
KVE+RR A E +AL +V RY FMDL+PCS+ +L L K+++
Sbjct: 503 KVEKRRFTAFKEEYEGKETAL------LVDRYKFMDLYPCSASELKALG-----YKDVS- 550
Query: 541 KVDKSALSNGPGIVDKGPSGLTSNSTTSATVIYPDTSQMVIYDPRQKPGIGISPSTTATG 600
+ +A+ P + L PDT QM+ + PR G+ P
Sbjct: 551 RAKLAAIIPDPVVAPSIVPVLKDEVDRKPEYPKPDTQQMIPFQPRHLAPPGLHPVP---- 606
Query: 601 ASSALNALSNPMVATGGGGIMNPFDEMLKAASPAIFAFLANLP---AVEGPTPNVDIVLS 657
GG+ PA + LP +GP VD ++
Sbjct: 607 -----------------GGVF--------PVPPAAVVLMKLLPPPICFQGPFVQVDELME 641
Query: 658 ICLQSDIPTGQMGKSPTTYPTPIPTGAARSASGISGSNKSHPTPSGSSLKQSKDKQSLKR 717
I + IP I TG A + + G+ P S + L +++KR
Sbjct: 642 IFRRCKIPNT------VEEAVRIITGGAPELA-VEGNG---PVESNAVL-----TKAVKR 686
Query: 718 KDIGQDDDETTTVQSQPQPRDFFRIRQMKKAR 749
+ D+DE P D +R RQ K+ R
Sbjct: 687 PNEDSDEDEEKGAVVPPV-HDIYRARQQKRIR 717
>sp|P25991|SUF_DROME Protein suppressor of forked OS=Drosophila melanogaster GN=su(f)
PE=1 SV=2
Length = 765
Score = 371 bits (952), Expect = e-101, Method: Compositional matrix adjust.
Identities = 242/783 (30%), Positives = 385/783 (49%), Gaps = 109/783 (13%)
Query: 21 YNVETAEILANSALHLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATK 80
Y++E+ ++ A P+ + +YE L++VFPT A++WK Y+E M + +
Sbjct: 31 YDIESWSVMIREAQTRPIHEVRSLYESLVNVFPTT----ARYWKLYIEMEMRSRYYERVE 86
Query: 81 QLFSRCLLICLQVPLWRCYIRFIRKVYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWL 140
+LF RCL+ L + LW+ Y+ ++++ T +E+ +A+DF L +G D+ S IW
Sbjct: 87 KLFQRCLVKILNIDLWKLYLTYVKETKSGLSTH-KEKMAQAYDFALEKIGMDLHSFSIWQ 145
Query: 141 EYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKG 200
+YI FL+ + A+ E+Q++ A+R+ YQ+AVVTP +EQLWKDY FE +++ +++
Sbjct: 146 DYIYFLRGVEAVGNYAENQKITAVRRVYQKAVVTPIVGIEQLWKDYIAFEQNINPIISEK 205
Query: 201 LLSEYQSKYTSARAVYRERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGNPQ 260
+ E Y +AR V +E + + + ++ N+ AVPPT + +E +Q WKR +T+EK NP
Sbjct: 206 MSLERSKDYMNARRVAKELEYHTKGLNRNLPAVPPTLTKEEVKQVELWKRFITYEKSNPL 265
Query: 261 RI-DTASSNKRIIFTYEQCLMYLYHYPDIWYD---------------------------- 291
R DTA +R++F EQCL+ L H+P +W+
Sbjct: 266 RTEDTALVTRRVMFATEQCLLVLTHHPAVWHQASQFLDTSARVLTEKGVRTSVENISPIL 325
Query: 292 -------------YATWNAKSGS-----IDAAIKVFQRALKA-LPDSEMLRYAFAELEES 332
+A W AK D + +R++ L + +L +A+A+ EE
Sbjct: 326 CVPVVNQIEWVMAFAWWWAKDVQAAKIFADECANILERSINGVLNRNALLYFAYADFEEG 385
Query: 333 RGAIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVY 392
R +Y LL L ++Q+++F RR EG+++AR F AR+ YH++
Sbjct: 386 RLKYEKVHTMYNKLLQLPDIDPTLVYVQYMKFARRAEGIKSARSIFKKAREDVRSRYHIF 445
Query: 393 VAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALS 452
VA ALM + KD ++A +FE GLKRF P Y++ Y D+LS LN+D N R LFER LS
Sbjct: 446 VAAALMEYYCSKDKEIAFRIFELGLKRFGGSPEYVMCYIDYLSHLNEDNNTRVLFERVLS 505
Query: 453 S--LPPEESIEVWKRFTQFEQMYGDLDSTLKVEQRRKEALSRTGE-EGASALEDSLQDVV 509
S L P +S+EVW RF +FE GDL S +KVE+RR E EG + +V
Sbjct: 506 SGGLSPHKSVEVWNRFLEFESNIGDLSSIVKVERRRSAVFENLKEYEGKETAQ-----LV 560
Query: 510 SRYSFMDLWPCSSKDLDHLVRQEWLVKNINKKVDKSALSNGPGIVDKGPSGLTSNSTTSA 569
RY F+DL+PC+S +L + E V I KV A S G V+ ++S +
Sbjct: 561 DRYKFLDLYPCTSTELKSIGYAE-NVGIILNKVGGGAQSQNTGEVE-------TDSEATP 612
Query: 570 TVIYPDTSQMVIYDPRQKPGIGISPSTTATGASSALNALSNPMVATGGGGIMNPFDEMLK 629
+ PD SQM+ + PR G P GG P
Sbjct: 613 PLPRPDFSQMIPFKPRPCAHPGAHP--------------------LAGGVFPQP------ 646
Query: 630 AASPAIFAFLANLP---AVEGPTPNVDIVLSICLQSDIPTGQMGKSPTTYPTPIPTGAAR 686
PA+ A A LP + GP +V+++ I ++ ++P + +P A+
Sbjct: 647 ---PALAALCATLPPPNSFRGPFVSVELLFDIFMRLNLPDSAPQPNGDNELSPKIFDLAK 703
Query: 687 SASGISGSNKSHPTPSGSSLKQSKDKQSLKRKDI--GQDDDETTTVQSQPQPRDFFRIRQ 744
S I T + + ++ S +R+ + G DD + + P D +R+RQ
Sbjct: 704 SVHWIVD------TSTYTGVQHSVTAVPPRRRRLLPGGDDSDDELQTAVPPSHDIYRLRQ 757
Query: 745 MKK 747
+K+
Sbjct: 758 LKR 760
>sp|O14233|RNA14_SCHPO mRNA 3'-end-processing protein rna14 OS=Schizosaccharomyces pombe
(strain 972 / ATCC 24843) GN=rna14 PE=3 SV=1
Length = 733
Score = 271 bits (694), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/481 (30%), Positives = 253/481 (52%), Gaps = 44/481 (9%)
Query: 45 YEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFIR 104
YEQ+L FP ++ + W Y+ + +A N+ A + LFSRCL+ L V LW Y+ +IR
Sbjct: 94 YEQMLRPFP----YVPRVWVDYISSELAFNDFHAVELLFSRCLVKVLSVDLWTLYLSYIR 149
Query: 105 KVYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQRMIAI 164
++ + + +A++F+++ +G DI SGPIW E++ FL+S PA + E+ Q++ +
Sbjct: 150 RINPDGEGQSRSTITQAYEFVINTIGVDILSGPIWSEFVDFLRSGPANSTWEQQQKLDHV 209
Query: 165 RKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERKKYCE 224
R+ YQRA+ TP H++E+LW+DY+ FENSV+R A+ ++E Y +ARA RE E
Sbjct: 210 RRIYQRAITTPIHNIEKLWRDYDAFENSVNRATARKFVAEKSPVYMAARAAMRELSNLTE 269
Query: 225 EIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGNPQRIDTASS-NKRIIFTYEQCLMYLY 283
+ + E + W + +E+ +P + + RI + +EQ ++Y+
Sbjct: 270 GLRVYDFTFERKYTKVERIAYSRWMNWIKWEQSDPLDLQHGTMLQNRIAYAFEQAMLYVP 329
Query: 284 HYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLY 343
P IW D ++ A++ +R ++ P + +L +AE EE+ + + Y
Sbjct: 330 LCPQIWLDGFSYFLSISDEQRALQTIRRGMRYCPSNFVLHVRYAEHEEANNRTSEIRSTY 389
Query: 344 ESLL---------------------TDS------------------VNTTALAHIQFIRF 364
ESL+ TD V +LA I
Sbjct: 390 ESLIAALAREISQLDSKASSSSESSTDGNPQEKKLPEHLVKRKSRLVRQYSLAWCCLINA 449
Query: 365 LRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEP 424
+RRTEGV+AAR F ARK+P ++ +Y+A A+M +DP +A +FE G++ F P
Sbjct: 450 IRRTEGVKAARAIFTKARKAPYQSHEIYIASAMMEHHCSRDPVIASRIFELGMRHFGDVP 509
Query: 425 AYILEYADFLSRLNDDRNIRALFERALSSLPPEESIEVWKRFTQFEQMYGDLDSTLKVEQ 484
AY+ +Y +L +ND+ N RALFE+A+ + +E+ +++++ +E YGDL++ + + Q
Sbjct: 510 AYVYKYLSYLIAINDETNARALFEKAIPRIAADEAKPIYQKWLDYESNYGDLNAAIALSQ 569
Query: 485 R 485
R
Sbjct: 570 R 570
>sp|Q6C8L8|RNA14_YARLI mRNA 3'-end-processing protein RNA14 OS=Yarrowia lipolytica (strain
CLIB 122 / E 150) GN=RNA14 PE=3 SV=1
Length = 806
Score = 241 bits (616), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 153/483 (31%), Positives = 247/483 (51%), Gaps = 51/483 (10%)
Query: 44 IYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFI 103
IYE+ L+++P + A+ W +Y+ M +QLF RCL + LW Y+ ++
Sbjct: 229 IYERFLALYPLS----AEIWIEYITLEMDNGEFKRLEQLFGRCLTRLPNLKLWNIYLTYV 284
Query: 104 RKVY-----EKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEES 158
R+V K TE + KAF+F L HVG D SG +W EY+ F+KS PA EE
Sbjct: 285 RRVNVLSSESDKITEARTNIIKAFEFYLDHVGIDRESGNVWFEYLDFIKSKPATTTWEEQ 344
Query: 159 QRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRE 218
Q+ RK Y++A+ P +++ LW Y NFE S+++ A+ ++E +AR
Sbjct: 345 QKNDLTRKIYRKAIGIPLNNLSILWTAYTNFEYSLNKATARKFINEKSGSCQNARQ---- 400
Query: 219 RKKYCEEIDWNML------AVPPTGSYKEEQQWIAWKRLLTFEKGNPQRIDT-ASSNKRI 271
C+ + N++ +VP +G ++E Q AWK+ + +EK NP D A +NKR+
Sbjct: 401 ----CQTVLENLMRGLDRSSVPKSGP-RDEFQVRAWKKWIDWEKSNPLGTDNKAETNKRL 455
Query: 272 IFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDA-AIKVFQRALKALPDSEMLRYAFAELE 330
++ +Q +M L P+IW+ A + + A++ + L P+S +L + AE
Sbjct: 456 LYCLKQAVMSLQFVPEIWFLAAEYCFDDPLLKTEALQFLKDGLSLNPNSSLLAFRLAEYY 515
Query: 331 ESRGAIAAAKKLY----ESLLTD--------------------SVNT-TALAHIQFIRFL 365
E + +Y ESL + +NT ++A+ ++ +
Sbjct: 516 EREADAEKMRTIYDEHIESLGKERQALIEAQGDPEAEPTAEIIKLNTQISIAYSVCMKAV 575
Query: 366 RRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPA 425
+R EG++ R F AR + TYH+YVA ALM F +K+P +A NVFE GLK A
Sbjct: 576 KRFEGIKPGRMVFKKARNTGFATYHIYVASALMEFHHNKNPTVATNVFELGLKYCGSNAA 635
Query: 426 YILEYADFLSRLNDDRNIRALFERALSSLPPEESIEVWKRFTQFEQMYGDLDSTLKVEQR 485
Y+ Y DFL L+DD N RALFE+ + L P ++ + K +FE +G++ S +K++ R
Sbjct: 636 YVQHYLDFLISLHDDTNARALFEKTIPLLGPSDAASLIKSMIKFESDFGEITSVVKLQDR 695
Query: 486 RKE 488
++
Sbjct: 696 LRQ 698
>sp|P0CO12|RNA14_CRYNJ mRNA 3'-end-processing protein RNA14 OS=Cryptococcus neoformans
var. neoformans serotype D (strain JEC21 / ATCC MYA-565)
GN=RNA14 PE=3 SV=1
Length = 1064
Score = 194 bits (493), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 141/508 (27%), Positives = 229/508 (45%), Gaps = 90/508 (17%)
Query: 71 MAVNNDDATKQLFSRCL------LICLQVPLWRCYIRFIRKVYEKKGTEG-------QEE 117
+A++N + +F+ L V +W Y+ +IR+ + TEG +
Sbjct: 324 LALSNFAEVEAIFASTLKGSAGITTAADVSIWAAYLHYIRR--QNPLTEGSANAADVRST 381
Query: 118 TRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTH 177
+A++F L G D SG IW EYI F+ S PA N + + +RK YQRAV P +
Sbjct: 382 ITEAYEFALRECGFDRESGDIWDEYIKFVASGPATNQWDTQAKNDNLRKIYQRAVCIPLN 441
Query: 178 HVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERKKYCEEIDWNMLAVPPTG 237
++E LWK Y+NFE+S+++ AK L+E Y +AR RE + + I +L PT
Sbjct: 442 NIEALWKSYDNFESSLNKLTAKKYLAEKSPAYMTARTALRELRALSDPIPKPILPPYPTF 501
Query: 238 SYKEEQQWIAWKRLLTFEKGNPQRIDTAS-SNKRIIFTYEQCLMYLYHYPDIWYDYATWN 296
+ ++ Q AWK L +E+GNP I+ RI + +CL + H+P++W+ A++
Sbjct: 502 TEQDRQVVGAWKACLRWEEGNPLVIENHELLQSRIGYALRKCLGEMRHFPELWHYAASYY 561
Query: 297 AKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLLT-------- 348
+K G D A ++ + + A P S +L +A+AEL+E R A LY +L++
Sbjct: 562 SKLGKQDEAAEILEAGVNACPKSFLLTFAYAELQEERKAFPTCHSLYTTLISKLNPEVDE 621
Query: 349 ------------------------------DSVNTTA--LAHIQFIRFLRRTEGVEAARK 376
DS++ ++ IQ + R G A++
Sbjct: 622 LRQNVAREIDIARGPPIPGSEKAAVAAAVGDSIDADGNDISDIQRLVEEREQRGALVAQR 681
Query: 377 YFLDARKSPNFTYHVYVAYALMAFCQD-------------KDPKLAHNVFEA-------- 415
D + V++ Y A + K P L VFEA
Sbjct: 682 RGKDIEELMVGISVVWIMYMRFARRAEGIKAARGVFGKARKSPHLTWQVFEASALMEYHT 741
Query: 416 -------------GLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIEV 462
GLK+F + Y+++Y FL +NDD N RALFER++ + +++ +
Sbjct: 742 NKDAAVAIRIFELGLKQFSEDVDYVIKYLQFLLSINDDNNARALFERSVVRIMGDKARPL 801
Query: 463 WKRFTQFEQMYGDLDSTLKVEQRRKEAL 490
W + ++E YGDL + K+E R E
Sbjct: 802 WDAWARYEYTYGDLSAVHKLEARMSEVF 829
>sp|P0CO13|RNA14_CRYNB mRNA 3'-end-processing protein RNA14 OS=Cryptococcus neoformans
var. neoformans serotype D (strain B-3501A) GN=RNA14
PE=3 SV=1
Length = 1064
Score = 194 bits (493), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 141/508 (27%), Positives = 229/508 (45%), Gaps = 90/508 (17%)
Query: 71 MAVNNDDATKQLFSRCL------LICLQVPLWRCYIRFIRKVYEKKGTEG-------QEE 117
+A++N + +F+ L V +W Y+ +IR+ + TEG +
Sbjct: 324 LALSNFAEVEAIFASTLKGSAGITTAADVSIWAAYLHYIRR--QNPLTEGSANAADVRST 381
Query: 118 TRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTH 177
+A++F L G D SG IW EYI F+ S PA N + + +RK YQRAV P +
Sbjct: 382 ITEAYEFALRECGFDRESGDIWDEYIKFVASGPATNQWDTQAKNDNLRKIYQRAVCIPLN 441
Query: 178 HVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERKKYCEEIDWNMLAVPPTG 237
++E LWK Y+NFE+S+++ AK L+E Y +AR RE + + I +L PT
Sbjct: 442 NIEALWKSYDNFESSLNKLTAKKYLAEKSPAYMTARTALRELRALSDPIPKPILPPYPTF 501
Query: 238 SYKEEQQWIAWKRLLTFEKGNPQRIDTAS-SNKRIIFTYEQCLMYLYHYPDIWYDYATWN 296
+ ++ Q AWK L +E+GNP I+ RI + +CL + H+P++W+ A++
Sbjct: 502 TEQDRQVVGAWKACLRWEEGNPLVIENHELLQSRIGYALRKCLGEMRHFPELWHYAASYY 561
Query: 297 AKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLLT-------- 348
+K G D A ++ + + A P S +L +A+AEL+E R A LY +L++
Sbjct: 562 SKLGKQDEAAEILEAGVNACPKSFLLTFAYAELQEERKAFPTCHSLYTTLISKLNPEVDE 621
Query: 349 ------------------------------DSVNTTA--LAHIQFIRFLRRTEGVEAARK 376
DS++ ++ IQ + R G A++
Sbjct: 622 LRQNVAREIDIARGPPIPGSEKAAVAAAVGDSIDADGNDISDIQRLVEEREQRGALVAQR 681
Query: 377 YFLDARKSPNFTYHVYVAYALMAFCQD-------------KDPKLAHNVFEA-------- 415
D + V++ Y A + K P L VFEA
Sbjct: 682 RGKDIEELMVGISVVWIMYMRFARRAEGIKAARGVFGKARKSPHLTWQVFEASALMEYHT 741
Query: 416 -------------GLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIEV 462
GLK+F + Y+++Y FL +NDD N RALFER++ + +++ +
Sbjct: 742 NKDAAVAIRIFELGLKQFSEDVDYVIKYLQFLLSINDDNNARALFERSVVRIMGDKARPL 801
Query: 463 WKRFTQFEQMYGDLDSTLKVEQRRKEAL 490
W + ++E YGDL + K+E R E
Sbjct: 802 WDAWARYEYTYGDLSAVHKLEARMSEVF 829
>sp|Q4PCV8|RNA14_USTMA mRNA 3'-end-processing protein RNA14 OS=Ustilago maydis (strain 521
/ FGSC 9021) GN=RNA14 PE=3 SV=1
Length = 945
Score = 166 bits (420), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 166/330 (50%), Gaps = 30/330 (9%)
Query: 44 IYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFI 103
+Y++ VFP A+ W Y++ +A +N + +F++CL V LW+ Y+ +
Sbjct: 214 LYDRFFKVFPNQ----ARQWLAYLDLELAHSNFAQVEAIFNQCLRTTPSVDLWKFYLSYT 269
Query: 104 RKVYEKKGTEGQEE------TRK----AFDFMLSHVGSDISSGPIWLEYITFLKSLPALN 153
R+V + G EE TR+ A++F L +G+D SG IW +YI +K A
Sbjct: 270 RRVNPLAPSTGAEEDPAREQTRRVLEGAYEFALRFIGNDKDSGSIWTDYILLIKEREARG 329
Query: 154 AQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSAR 213
+E Q+M +R+ YQRAV P ++E +WKDY+ +EN +++ AK L+E Y +AR
Sbjct: 330 GWKEGQKMDDLRRVYQRAVSVPLTNIETIWKDYDAYENGLNKLTAKKFLAERSPAYMTAR 389
Query: 214 AVYRERKKYCEEIDWNML---------------AVPPTGSYKEEQQWIAWKRLLTFEKGN 258
V R+ K Y + + +L A P +E QQ AW L +E+ N
Sbjct: 390 RVLRDLKAYSDPLVKPLLPRVPVWTTSALAGDAAQDPAQWQRERQQADAWIEYLKWEESN 449
Query: 259 PQRI-DTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALP 317
P + D A R+ Y + MYL YP++WY + + +D A + ++A P
Sbjct: 450 PLLLEDVAILQARVTSAYRRATMYLRFYPEVWYLASRYLVSILRVDEAATWLKNGMEACP 509
Query: 318 DSEMLRYAFAELEESRGAIAAAKKLYESLL 347
S +L +A+AEL E+R + + +++ LL
Sbjct: 510 GSFLLHFAYAELGEARKSTSDCAAVFDGLL 539
Score = 126 bits (317), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 121/230 (52%), Gaps = 22/230 (9%)
Query: 296 NAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLLTDSVNTTA 355
N G +D + +R KA E+L+ EL+ AAAK E L +A
Sbjct: 577 NEGDGDVDVDGEERERERKA---GELLQIRKQELQ------AAAKPEIEGL----KEASA 623
Query: 356 LAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFEA 415
L I+++ FLRRTEG+ AR F ARKSP+ T+ V+ A ALM + KD +A VFE
Sbjct: 624 LVWIKYMHFLRRTEGIRPARSVFSRARKSPHCTWQVFEASALMEYHCSKDAVVATKVFEL 683
Query: 416 GLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIEVWKRFTQFEQMYGD 475
LK F + A+++ Y DFL +NDD N RALFER + + PE + +W+R+ ++E +GD
Sbjct: 684 ALKTFGSDEAFVVRYLDFLISMNDDSNARALFERVIGTFAPERARPIWERWAKYEYNFGD 743
Query: 476 LDSTLKVEQRRKEALSRTGEEGASALEDSLQDVVSRYSFMDLWPCSSKDL 525
+ K+E R E E + + ++R S+MDL +DL
Sbjct: 744 SVAIQKLESRLAETYPD---------EPATKRFIARSSYMDLDLVGPRDL 784
>sp|P25298|RNA14_YEAST mRNA 3'-end-processing protein RNA14 OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=RNA14 PE=1 SV=2
Length = 677
Score = 155 bits (391), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/499 (24%), Positives = 225/499 (45%), Gaps = 64/499 (12%)
Query: 39 AQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQ---VPL 95
A+ +YEQ + FP F + W ++ +A + + +++ ++CL L+ + L
Sbjct: 59 AKVREVYEQFHNTFP----FYSPAWTLQLKGELARDEFETVEKILAQCLSGKLENNDLSL 114
Query: 96 WRCYIRFIRKVYEKKGTEGQEETR----KAFDFMLSHVG-SDISSGPIWLEYITFLKSLP 150
W Y+ +IR+ + G +E R KAF ++ + S W EY+ FL+
Sbjct: 115 WSTYLDYIRR--KNNLITGGQEARAVIVKAFQLVMQKCAIFEPKSSSFWNEYLNFLEQWK 172
Query: 151 ALNAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYT 210
N EE QR+ +R+ Y++ + P ++E++W Y +E ++ A+ + E ++Y
Sbjct: 173 PFNKWEEQQRIDMLREFYKKMLCVPFDNLEKMWNRYTQWEQEINSLTARKFIGELSAEYM 232
Query: 211 SARAVYRERKKYCEEIDW---------NMLAVPPTGS----YKEEQQWIAWKRLLTFEKG 257
AR++Y+E + N +P G+ ++ Q W+ W + +E+
Sbjct: 233 KARSLYQEWLNVTNGLKRASPINLRTANKKNIPQPGTSDSNIQQLQIWLNW---IKWERE 289
Query: 258 NPQRIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALP 317
N + ++RI + Y+Q + Y+ ++WYDY+ + +++ + AL A P
Sbjct: 290 NKLMLSEDMLSQRISYVYKQGIQYMIFSAEMWYDYSMYISENSDRQ---NILYTALLANP 346
Query: 318 DSEMLRYAFAELEE---------------SRGAIAAAKKL-------------YES-LLT 348
DS L + +E E ++ ++ KK+ YE LL
Sbjct: 347 DSPSLTFKLSECYELDNDSESVSNCFDKCTQTLLSQYKKIASDVNSGEDNNTEYEQELLY 406
Query: 349 DSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPN-FTYHVYVAYALMAFCQDKDPK 407
++ ++R G+ AAR F RK T+ VYV A + F D K
Sbjct: 407 KQREKLTFVFCVYMNTMKRISGLSAARTVFGKCRKLKRILTHDVYVENAYLEFQNQNDYK 466
Query: 408 LAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESI-EVWKRF 466
A V E GLK F ++ YI +Y DFL LN D I+ LFE ++ + + E++K+
Sbjct: 467 TAFKVLELGLKYFQNDGVYINKYLDFLIFLNKDSQIKTLFETSVEKVQDLTQLKEIYKKM 526
Query: 467 TQFEQMYGDLDSTLKVEQR 485
+E +G+L++ +E+R
Sbjct: 527 ISYESKFGNLNNVYSLEKR 545
>sp|Q7S1Y0|RNA14_NEUCR mRNA 3'-end-processing protein rna-14 OS=Neurospora crassa (strain
ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC
987) GN=rna-14 PE=3 SV=1
Length = 1167
Score = 154 bits (390), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 91/316 (28%), Positives = 165/316 (52%), Gaps = 32/316 (10%)
Query: 44 IYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFI 103
+YE+ L++FP A A W +Y++ +++NN + +F++CL+ V LW Y+ +I
Sbjct: 285 VYERFLAIFPQA----ADIWVEYLDLELSLNNFPQAEGIFAKCLMTTPNVNLWTRYLDYI 340
Query: 104 RKVYEKKGTEGQ--EETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPAL---NAQEES 158
R+ + + GQ + +A++F++ ++G D SG IW EYI F+K P + ++
Sbjct: 341 RRRNDLNDSTGQARQTVSQAYEFVIDNIGLDKDSGKIWAEYIQFIKFGPGTVGGSQWQDQ 400
Query: 159 QRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRE 218
Q+M +RKAYQRA+ P +V LWK+Y+ FE +++ + LSE Y SA++
Sbjct: 401 QKMDQLRKAYQRAICVPISNVNTLWKEYDQFEMGLNKLTGRKYLSEKSPSYMSAKSANTA 460
Query: 219 RKKYCEEID-WNMLAVPPTGSYKEEQQWIA----WKRLLTFEKGNPQRIDTASS-----N 268
+ ++ N+ +PP + +Q+++ WK+ + +EK +P +
Sbjct: 461 LEHITRGLNRTNLPRLPPAPGFDGDQEFMEQVEIWKKWIAWEKSDPLDLKDDKDQPGLYQ 520
Query: 269 KRIIFTYEQCLMYLYHYPDIWYDYATWN------------AKSGSIDAAIKVFQRALKAL 316
KRI++ Y Q LM L +P++W D A W K G+ + ++ R ++A
Sbjct: 521 KRILYVYNQALMALRFWPEMWVDAAQWCFDNNITTVENKVTKDGNAN-GVEFLIRGIEAN 579
Query: 317 PDSEMLRYAFAELEES 332
P+S +L + A+ ES
Sbjct: 580 PESVLLAFKHADHIES 595
Score = 63.2 bits (152), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 76/168 (45%), Gaps = 13/168 (7%)
Query: 333 RGAIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEG-------VEAARKYFLDARKSP 385
+ I A +K Y + + I IR +RR +G + RK F DAR
Sbjct: 669 KAPIEAIQKGYAAQTQLLSRMISFVWIALIRAMRRVQGKGGLNVPLGGMRKAFHDARARG 728
Query: 386 NFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRA 445
T VY A A + + KDP +F+ G K F + + LE +L +D N R
Sbjct: 729 RLTSDVYAAVAQLEWTIYKDPA-GGKIFDRGAKLFPEDENFTLENIKYLHSRDDHTNARV 787
Query: 446 LFERALSSL--PPE---ESIEVWKRFTQFEQMYGDLDSTLKVEQRRKE 488
LFE ++ L PE ++ +++ F ++E +G+L K+E+R E
Sbjct: 788 LFETVVNRLTQKPELVHKAKPLYQYFHKYESQFGELAQVTKLEKRMAE 835
>sp|Q6FU45|RNA14_CANGA mRNA 3'-end-processing protein RNA14 OS=Candida glabrata (strain
ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65)
GN=RNA14 PE=3 SV=1
Length = 646
Score = 154 bits (389), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 129/517 (24%), Positives = 225/517 (43%), Gaps = 63/517 (12%)
Query: 44 IYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQ---VPLWRCYI 100
+Y +L FP + W ++++ + N D ++L ++CL L+ + LW Y+
Sbjct: 54 LYAELHERFP----LYSPLWTMHLQSELQRNEFDTVEKLLAQCLAGDLENNDLSLWSTYL 109
Query: 101 RFIRKVYEKKGTEGQEETR----KAFDFMLSHVGS-DISSGPIWLEYITFLKSLPALNAQ 155
++R+ + G +E R KAF ++ + + + W +Y+ FL +N
Sbjct: 110 DYVRR--KNNLITGGQEARAVVIKAFKLVMDKCATFEPKASSFWNDYLGFLHQWKPMNKW 167
Query: 156 EESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAV 215
EE QR+ IR+ Y++ + P +E++W Y +E + A+ + E + Y AR++
Sbjct: 168 EEQQRLDMIREVYKKMLCVPFDKLEKMWNQYTLWEQETNTLTARKFIGELSADYMKARSI 227
Query: 216 YRERKKYCEEI---------DWNMLAVP----PTGSYKEEQQWIAWKRLLTFEKGNPQRI 262
Y+E I N +P P + Q AW + + +EK N +
Sbjct: 228 YQELLNVTANIRRTSPLNLRTANKNNIPQYVLPCKK-NDHTQLEAWLKWIAWEKENKLEL 286
Query: 263 DTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIK-VFQRALKALPDSEM 321
+ R+ + Y+Q + L P+IWYDY + + DAA K + + AL+A P S
Sbjct: 287 TEDALKDRVTYVYKQAIQQLLFEPEIWYDYVMYEFDN---DAARKNILKVALQANPTSPT 343
Query: 322 LRYAFAELEESRGAIAAAKKLYESLL--------TDSVN------------TTALAHIQF 361
L + AE E + +E + D+ N T + +
Sbjct: 344 LTFKLAECYEVENKSEEVQNCFEKTIDELLRQYKNDNGNDELSSDIIWERKTLTYIYCIY 403
Query: 362 IRFLRRTEGVEAARKYFLDARK-SPNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRF 420
+ ++R G+ AAR F RK T+ +YV A + F D K A V E GLK F
Sbjct: 404 MNTMKRLSGLSAARAVFGKCRKLKKAMTHDIYVENAYLEFQNQNDHKTASKVLELGLKYF 463
Query: 421 MHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESI-EVWKRFTQFEQMYGDLDST 479
+ YI +Y DFLS LN ++ LFE ++ + + +++ + +E YG+L++
Sbjct: 464 GDDGEYINKYMDFLSLLNRGSQMKTLFETSIEKVEDLRQLKKIYVKMIGYESKYGNLNNV 523
Query: 480 LKVEQRRKEALSRTGEEGASALEDSLQDVVSRYSFMD 516
++E+R E T D +Q +RY D
Sbjct: 524 YQLEKRFFEKFPDT---------DLIQLFSTRYKIQD 551
>sp|Q5B3I8|RNA14_EMENI mRNA 3'-end-processing protein rna14 OS=Emericella nidulans (strain
FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139)
GN=rna14 PE=3 SV=2
Length = 1075
Score = 154 bits (388), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 155/306 (50%), Gaps = 20/306 (6%)
Query: 38 VAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWR 97
+ A +YE+ L VFP + A+ W Y +N +Q+F+R LL V LW
Sbjct: 272 IDSARDVYERFLKVFPLS----AEMWVAYATMESELNELFRLEQIFNRTLLTIPAVQLWT 327
Query: 98 CYIRFIRKVYEKKGTEGQEETRK----AFDFMLSHVGSDISSGPIWLEYITFLKSLPAL- 152
Y+ ++R+ T+ + RK A++ L H+G D SG IW +YI F++S P
Sbjct: 328 VYLDYVRR-RNPLSTDTTGQARKVISSAYELALQHIGMDKESGSIWADYIQFIRSGPGNV 386
Query: 153 --NAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYT 210
+ ++ Q+M +RKAYQRA+ P V LWK+Y+ FE +++ + L E Y
Sbjct: 387 GGSGWQDQQKMDLLRKAYQRAICVPMQAVNTLWKEYDQFEMGLNKLTGRKFLQEQSPSYM 446
Query: 211 SARAVYRERKKYCEEIDWNMLA-VPPT----GSYKEEQQWIAWKRLLTFEKGNP---QRI 262
+AR+ Y E + + +++ L +PP G ++ QQ WKR + +EKG+P +
Sbjct: 447 TARSSYTELQNFTRDLNRTTLPRLPPVPGSEGDFEYLQQIEIWKRWINWEKGDPLVLKED 506
Query: 263 DTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEML 322
D + R+++ Y+Q LM L P+IW++ A + + + + + A P+S +L
Sbjct: 507 DLTAYKGRVVYVYKQALMALRFLPEIWFEAADFCFLNDMETEGNEFLKNGIDANPESCLL 566
Query: 323 RYAFAE 328
+ A+
Sbjct: 567 AFKRAD 572
Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 75/147 (51%), Gaps = 12/147 (8%)
Query: 353 TTALAHIQFIRFLRRTEG------VEAARKYFLDARKSPNFTYHVYVAYALMAFCQDKDP 406
T + A I +R +RR +G V +R+ F DARK T VY+A AL+ + KDP
Sbjct: 673 TVSFAWIALMRAMRRIQGKGKPGEVPGSRQVFADARKRGRITSDVYIASALIEYHCYKDP 732
Query: 407 KLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLP--PE---ESIE 461
A +FE G K F + + LEY L +ND N RA+FE + L PE ++
Sbjct: 733 A-ATKIFERGAKLFPEDENFALEYLKHLIDINDIINARAVFEMTVRKLAANPENVHKTKP 791
Query: 462 VWKRFTQFEQMYGDLDSTLKVEQRRKE 488
++ ++E YGDL + +E R +E
Sbjct: 792 IFAFLHEYESRYGDLVQVINLETRMRE 818
>sp|Q2UKV8|RNA14_ASPOR mRNA 3'-end-processing protein rna14 OS=Aspergillus oryzae (strain
ATCC 42149 / RIB 40) GN=rna14 PE=3 SV=1
Length = 1078
Score = 152 bits (384), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 93/305 (30%), Positives = 155/305 (50%), Gaps = 18/305 (5%)
Query: 38 VAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWR 97
+ A +YE+ L+ FP F A+ W Y +N +Q+F+R LL V LW
Sbjct: 282 IDSAREVYERFLTAFP----FSAEQWVAYATMESELNELYRLEQIFNRTLLTIPDVQLWT 337
Query: 98 CYIRFIRKVYE-KKGTEGQEE--TRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPAL-- 152
Y+ ++R+ T GQ A+D L +VG D SG IW +Y+ F++S P
Sbjct: 338 VYLDYVRRRNPLTTDTTGQSRRIISSAYDLALQYVGVDKDSGSIWTDYVQFIRSGPGNVG 397
Query: 153 -NAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTS 211
+ ++ Q+M +RKAYQ+A+ PT V LWK+Y+ FE +++ + L E Y +
Sbjct: 398 GSGWQDQQKMDLLRKAYQKAICVPTQAVNNLWKEYDQFEMGLNKLTGRKFLQEQSPAYMT 457
Query: 212 ARAVYRERKKYCEEIDWNMLA-VPPT----GSYKEEQQWIAWKRLLTFEKGNP---QRID 263
AR+ Y E + +++ L +PP G + QQ WKR + +EKG+P + D
Sbjct: 458 ARSSYTELQNITRDLNRTTLPRLPPVLGSDGDIEFGQQVDIWKRWIKWEKGDPLVLKEED 517
Query: 264 TASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLR 323
A+ R+I+ Y+Q LM L P+IW++ A + + + + + ++A P+S +L
Sbjct: 518 QAAFKARVIYVYKQALMALRFLPEIWFEAAEFCFLNDMENEGNEFLKNGIEANPESCLLA 577
Query: 324 YAFAE 328
+ A+
Sbjct: 578 FKRAD 582
Score = 75.9 bits (185), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 76/147 (51%), Gaps = 12/147 (8%)
Query: 353 TTALAHIQFIRFLRRTEG------VEAARKYFLDARKSPNFTYHVYVAYALMAFCQDKDP 406
T + A I +R +RR +G + +R+ F DARK T VY+A AL+ + KDP
Sbjct: 684 TVSFAWIALMRAMRRIQGKGKPGEMPGSRQVFADARKRGRITSDVYIASALIEYHCYKDP 743
Query: 407 KLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFE---RALSSLPP--EESIE 461
A +FE G K F + + LEY L +ND N RA+FE R L+S P ++
Sbjct: 744 A-ATKIFERGAKLFPEDENFALEYLKHLIDINDVINARAVFEMTVRKLASNPENVHKTKP 802
Query: 462 VWKRFTQFEQMYGDLDSTLKVEQRRKE 488
++ ++E YGDL + +E R +E
Sbjct: 803 IFAFLHEYESRYGDLVQVINLENRMRE 829
>sp|Q759Y6|RNA14_ASHGO mRNA 3'-end-processing protein RNA14 OS=Ashbya gossypii (strain
ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056)
GN=RNA14 PE=3 SV=1
Length = 661
Score = 151 bits (381), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/487 (24%), Positives = 226/487 (46%), Gaps = 50/487 (10%)
Query: 38 VAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCL---LICLQVP 94
VA+ ++ QL +FP SF+ W ++ + + L ++CL L+ +
Sbjct: 69 VAEIREVFGQLHELFPLE-SFL---WTIHLNWELEQEESGQVETLLAKCLSGELMNNDIY 124
Query: 95 LWRCYIRFIRKVYEKKGTEGQEETR----KAFDFMLSHVGS-DISSGPIWLEYITFLKSL 149
LW Y+ ++R+ + G EE R KA++ ++ + S W +Y+ FL+
Sbjct: 125 LWSTYLGYVRR--KNNTVTGGEEARGTVLKAYELVMEKCAVFEPRSMQFWQDYLQFLEQW 182
Query: 150 PALNAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKY 209
++ EE R+ +RK Y+R + P +E+ W+ Y +E V++ A+ + E + Y
Sbjct: 183 KPVSKWEEQSRVEILRKLYKRLLCLPVESLERYWEKYTQWEQEVNQLTARKFIGELSASY 242
Query: 210 TSARAVYRERKKYCEEIDWNM---------LAVPPTGSYKEEQQWIAWKRLLTFEKGNPQ 260
+AR++Y+E + + ++ +P G Y E Q I W + + +E N
Sbjct: 243 MNARSLYQEWSNLTKGLRRSLPTKLNQATQQNLPAPGQYDEYQLQI-WTKWIQWELDNKL 301
Query: 261 RIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSE 320
+ +R+ + + Q + ++ P+IWY+YA + + + KV + A++ P S
Sbjct: 302 DLPEVVLRQRVEYVHRQAVQHMCFAPEIWYNYAMFVDE----NEHEKVLEIAVRCNPGSL 357
Query: 321 MLRYAFAELEESRGAIAAAKKLYE------SLLTDSVNTTAL--------------AHIQ 360
L + AE E I A ++ ++ S+ +N T + A+
Sbjct: 358 SLTFKLAEYLELNNKIEALEERFQHCIARISMELQVMNDTTMDPDKILRQTRKLTFAYCV 417
Query: 361 FIRFLRRTEGVEAARKYFLDARK-SPNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKR 419
++ ++R G+ AARK F RK + +Y +YV A M + + D V E GLK
Sbjct: 418 YMTTMKRVTGLSAARKVFSKCRKLKKDISYEIYVENAYMEYYNNSDVTTPCRVLEFGLKY 477
Query: 420 FMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESI-EVWKRFTQFEQMYGDLDS 478
F YI +Y DFL + D I++LFE + + + + E++K+ +E +G+L++
Sbjct: 478 FQDNGNYINKYLDFLILVKQDAQIKSLFESCIDKIYNLDQLKEIYKKVINYESKFGNLNN 537
Query: 479 TLKVEQR 485
++E+R
Sbjct: 538 VYELERR 544
>sp|Q4WXX4|RNA14_ASPFU mRNA 3'-end-processing protein rna14 OS=Neosartorya fumigata
(strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100)
GN=rna14 PE=3 SV=1
Length = 1029
Score = 150 bits (380), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 153/299 (51%), Gaps = 18/299 (6%)
Query: 44 IYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFI 103
++E+ L VFP F A+ W Y + +N+ +Q+F+R LL V LW Y+ ++
Sbjct: 289 VFERFLKVFP----FAAEQWVAYAKMESELNDLYRLEQIFNRTLLTIPDVQLWSVYLDYV 344
Query: 104 RKVYE-KKGTEGQEE--TRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPAL---NAQEE 157
R+ T GQ A++ H+G D SG IW +Y+ F+KS P + ++
Sbjct: 345 RRRNPLTTDTTGQARRIISSAYELAFQHIGVDKDSGSIWSDYVQFIKSGPGNVGGSGWQD 404
Query: 158 SQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYR 217
Q+M +RKAYQ+A+ PT V LWK+Y+ FE +++ + L E Y +AR+ Y
Sbjct: 405 QQKMDLLRKAYQKAICVPTQAVNTLWKEYDQFEMGLNKLTGRKFLQEQSPAYMTARSSYT 464
Query: 218 ERKKYCEEIDWNMLA-VPPT----GSYKEEQQWIAWKRLLTFEKGNP---QRIDTASSNK 269
E + ++ L +PP G + QQ WKR + +EKG+P + D A+
Sbjct: 465 ELQNITRDLIRTTLPRLPPVPGSDGDIEFTQQVDIWKRWIKWEKGDPLVLKEEDPAAFKG 524
Query: 270 RIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAE 328
R+++ Y+Q LM L P++W+D A + + + ++ ++A P+S +L + A+
Sbjct: 525 RVVYVYKQALMALRFLPEMWFDAAEFCFLNDLESEGNEFLKQGMEANPESCLLAFKRAD 583
Score = 74.7 bits (182), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 81/164 (49%), Gaps = 12/164 (7%)
Query: 336 IAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEG------VEAARKYFLDARKSPNFTY 389
I A +K + + T + A I +R +RR +G +R+ F DARK T
Sbjct: 668 IDAVRKAHAIQIGILSKTISFAWIALMRAMRRIQGKGKPGETPGSRQVFADARKRGRITS 727
Query: 390 HVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFE- 448
VY+A AL+ + KDP A +FE G K F + + LEY L +ND N RA+FE
Sbjct: 728 DVYIASALIEYHCYKDPA-ATKIFERGAKLFPDDENFALEYLKHLIDINDIINARAVFEM 786
Query: 449 --RALSSLPP--EESIEVWKRFTQFEQMYGDLDSTLKVEQRRKE 488
R L+S P ++ ++ ++E YGDL + +E R +E
Sbjct: 787 TVRKLASNPDNVHKTKPIFAFLHEYESRYGDLVQVINLENRMRE 830
>sp|Q4IR09|RNA14_GIBZE mRNA 3'-end-processing protein RNA14 OS=Gibberella zeae (strain
PH-1 / ATCC MYA-4620 / FGSC 9075 / NRRL 31084) GN=RNA14
PE=3 SV=1
Length = 997
Score = 142 bits (359), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 144/283 (50%), Gaps = 18/283 (6%)
Query: 27 EILANSALHLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRC 86
E++A+ P+ +A Y + + +FP A A W +++E + NN +QLF RC
Sbjct: 185 ELIASHRDFSPLEKARSTYNRFVEIFPQA----ADKWVEWIELELKYNNFVEVEQLFGRC 240
Query: 87 LLICLQVPLWRCYIRFIRK---VYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYI 143
L+ V LW Y+ +IR+ + + + ++++F++ ++G D SG IW +Y+
Sbjct: 241 LMQVPNVKLWTVYLDYIRRRNDLNNDPSGQARRTVTQSYEFVIDNIGVDRDSGNIWQQYV 300
Query: 144 TFLKSLPAL---NAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKG 200
F+K+ P ++ Q+M +R Y+RAV P V LWK+Y+ FE +++ +
Sbjct: 301 QFVKNGPGQIDGTDWQDRQKMDQLRGIYRRAVAVPMSTVNNLWKEYDQFEMGLNKMTGRK 360
Query: 201 LLSEYQSKYTSARAVYRERKKYCEEID-WNMLAVPPTGSYKEEQQWI----AWKRLLTFE 255
+ E Y SA++ +D N+ +PP + +Q++ WK+ + +E
Sbjct: 361 FIQERSPVYMSAKSANIALDNITRHLDRTNLPRLPPAPGFNGDQEFRDQVEMWKKWIAWE 420
Query: 256 KGNP---QRIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATW 295
K +P + + + N+R++ Y+Q LM L +P+IW D A W
Sbjct: 421 KEDPLVLKSDEPKAYNQRVLHVYKQALMALRFWPEIWVDAAEW 463
Score = 66.2 bits (160), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 79/163 (48%), Gaps = 11/163 (6%)
Query: 336 IAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAA-----RKYFLDARKSPNFTYH 390
I A +K Y + T + I R +RR +G + RK F DAR+ T
Sbjct: 582 ILAIQKGYAAETQLLSRTISYVWIALARAMRRIQGKGSQAEGGLRKVFTDARQKGRLTSD 641
Query: 391 VYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERA 450
VYVA AL+ KDP + +FE G + F ++ +++EY +L +D N R +FE
Sbjct: 642 VYVAVALLESVVYKDP-VGAKIFERGARLFPNDEMFMIEYLKYLHSKDDTTNARVVFETC 700
Query: 451 LSSL--PPE---ESIEVWKRFTQFEQMYGDLDSTLKVEQRRKE 488
++ L P+ ++ ++ F ++E YG+L K+E R E
Sbjct: 701 INRLVSNPDTLAKAKLLYAYFHKYESQYGELSQISKLEDRMAE 743
>sp|Q6CII8|RNA14_KLULA mRNA 3'-end-processing protein RNA14 OS=Kluyveromyces lactis
(strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 /
NRRL Y-1140 / WM37) GN=RNA14 PE=1 SV=1
Length = 661
Score = 137 bits (346), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 115/490 (23%), Positives = 221/490 (45%), Gaps = 53/490 (10%)
Query: 82 LFSRCLLICL---QVPLWRCYIRFIRKVYEKKGTEGQEETR----KAFDFMLSHVGS-DI 133
+ +RCL L + LW YI ++RK + G EE R +AF ++ +
Sbjct: 107 VLARCLSKELGNNDLSLWLSYITYVRK--KNDIITGGEEARNIVIQAFQVVVDKCAIFEP 164
Query: 134 SSGPIWLEYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSV 193
S W EY+ FL+ +N EE QR+ IRK Y+ + P +E +W+ Y +E V
Sbjct: 165 KSIQFWNEYLHFLEHWKPVNKFEEQQRVQYIRKLYKTLLCQPMDCLESMWQRYTQWEQDV 224
Query: 194 SRQLAKGLLSEYQSKYTSARAVYRERKKYCEEIDWNM---------LAVPPTGSYKEEQQ 244
++ A+ + E ++Y +AR++Y++ + + N+ +P Y + QQ
Sbjct: 225 NQLTARRHIGELSAQYMNARSLYQDWLNITKGLKRNLPITLNQATESNLPKPNEY-DVQQ 283
Query: 245 WIAWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDA 304
+ W + +E N + R+ + Y Q ++ P+IW++ A + + +
Sbjct: 284 LLIWLEWIRWESDNKLELSDDLHKARMTYVYMQAAQHVCFAPEIWFNMANYQGEKNTDST 343
Query: 305 AI-KVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL------------TDSV 351
I K + + +P+S +L ++ +E E I + S + D
Sbjct: 344 VITKYLKLGQQCIPNSAVLAFSLSEQYELNTKIPEIETTILSCIDRIHLDLAALMEDDPT 403
Query: 352 NTTALAHIQ---------FIRFLRRTEGVEAARKYFLDARKSPNF-TYHVYVAYALMAFC 401
N +A+ ++ ++ ++R +G+ A+RK F R+ T +Y+ A + +
Sbjct: 404 NESAINQLKSKLTYVYCVYMNTMKRIQGLAASRKIFGKCRRLKKLVTPDIYLENAYIEYH 463
Query: 402 QDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIE 461
KD K A V E GLK F + YI +Y DFL +N++ +++LFE ++ + ++
Sbjct: 464 ISKDTKTACKVLELGLKYFATDGEYINKYLDFLIYVNEESQVKSLFESSIDKISDSHLLK 523
Query: 462 -VWKRFTQFEQMYGDLDSTLKVEQRRKEALSRTGEEGASALEDSLQDVVSRYSFMDLWPC 520
++++ FE G L+S +E+R E + L++ ++Y +D+
Sbjct: 524 MIFQKVIFFESKVGSLNSVRTLEKRFFEKFPEVNK---------LEEFTNKYKVLDVNYL 574
Query: 521 SSKDLDHLVR 530
+LD++VR
Sbjct: 575 QRLELDYMVR 584
>sp|Q6BJD8|RNA14_DEBHA mRNA 3'-end-processing protein RNA14 OS=Debaryomyces hansenii
(strain ATCC 36239 / CBS 767 / JCM 1990 / NBRC 0083 /
IGC 2968) GN=RNA14 PE=3 SV=2
Length = 740
Score = 133 bits (335), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 156/317 (49%), Gaps = 18/317 (5%)
Query: 46 EQLLSVFPTAVS-FIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFIR 104
EQ+ SVF ++ F W Y+ + +QLFS+CL I V L R Y+ ++R
Sbjct: 51 EQVKSVFNKYLNIFNFDQWCNYINYQLNRGEFQEVEQLFSKCLPITDHVELCRLYVSYVR 110
Query: 105 KVYEKKGTEGQEETR----KAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQR 160
+ + G E+ R +AF+F ++ VG DISSG +W +Y+ FLK+ E+ Q+
Sbjct: 111 RTND--VITGGEKARGIVVQAFEFAVTKVGIDISSGDLWNDYLDFLKAWTPAATWEQQQK 168
Query: 161 MIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERK 220
IR+ Y+R +V PT +EQ+W Y +EN V+ A ++E S++ AR+ E
Sbjct: 169 TDLIRRVYKRFLVIPTEKIEQVWSTYTKWENEVNASSANKFIAEKSSEFMDARSWNTEWH 228
Query: 221 KYCEEIDWNMLAVPPTGSYKEEQQWI-----AWKRLLTFEKGNPQRI-DTASSNKRIIFT 274
E V P G + + + W + + E+ N + D +S +RI +
Sbjct: 229 NATERSL--RREVIPIGIHNDNNNLVHTQLQLWYKWIALERENKLNLKDDSSVQQRIEYV 286
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSG---SIDAAIKVFQRALKALPDSEMLRYAFAELEE 331
Y+Q +M L P++W+ + + +S + + I++ AL P S +L + +E+ E
Sbjct: 287 YKQAIMALPFVPELWFKFNKFWLRSNEEANSNKCIELLNEALVLNPRSYLLTFQLSEMYE 346
Query: 332 SRGAIAAAKKLYESLLT 348
I A + Y++L+T
Sbjct: 347 KDNTINKATETYDNLIT 363
Score = 64.3 bits (155), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 60/110 (54%), Gaps = 6/110 (5%)
Query: 355 ALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFT---YHVYVAYALMAFCQDKDPKLAHN 411
L + + + +R+ G++ AR F AR NF Y YV ALM + D + K A
Sbjct: 445 TLVYTKLMMACKRSRGIKEARGVFKQARN--NFEAIGYEFYVENALMEYHSD-NLKTASK 501
Query: 412 VFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIE 461
+FE G+K F + ++L Y DFL +N +I+ LFE+ L++L + +IE
Sbjct: 502 IFELGMKHFKKQGEFLLAYLDFLIMINKGESIKVLFEQGLTALLQDVNIE 551
>sp|Q5AM44|RNA14_CANAL mRNA 3'-end-processing protein RNA14 OS=Candida albicans (strain
SC5314 / ATCC MYA-2876) GN=RNA14 PE=3 SV=1
Length = 791
Score = 124 bits (312), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/318 (27%), Positives = 156/318 (49%), Gaps = 17/318 (5%)
Query: 40 QAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCY 99
Q +++ L +F F W +Y++ + + + + LF +CL I V L R Y
Sbjct: 48 QVRNTFDKYLKIF----KFDGASWCKYIKYELNRDEKEKVENLFQQCLGITDNVELCRLY 103
Query: 100 IRFIRKV--YEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEE 157
+ ++R V + G + + +AF+F ++ VG DI+S +W +YI FL+S E+
Sbjct: 104 VDYVRGVTDFVTGGEKARGVVVQAFEFAINKVGIDITSESLWQDYIQFLQSWNPNANWEQ 163
Query: 158 SQRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYR 217
Q++ IRK Y++ + PT ++E W Y +EN ++ A +SE ++ AR+
Sbjct: 164 QQKIDLIRKVYKKFLTIPTENIEVSWSQYTKWENELNPATASKFISEKSGEFMLARSWNT 223
Query: 218 ERKKYCEEIDWNMLAVPPTGSYKEE---QQWIAWKRLLTFEKGNPQRI-DTASSNKRIIF 273
E + D ++ G + +E +Q W R L EK N + D ++KRI +
Sbjct: 224 EFNRIT---DKSLKRNLNPGDHNDEDVVKQLKYWLRWLELEKENKLELKDETVNDKRIQY 280
Query: 274 TYEQCLMYLYHYPDIWYDYATW---NAKSGSIDAAIKVFQRALKAL-PDSEMLRYAFAEL 329
Y+Q L P+IW+ Y + + G++ +I++ + AL P S +L + AEL
Sbjct: 281 VYKQATYALPFVPEIWFQYVKYLLVQNEEGNLQESIRLLKEGGLALNPKSMLLTFQLAEL 340
Query: 330 EESRGAIAAAKKLYESLL 347
E + AK ++++LL
Sbjct: 341 YERDNSFNNAKIVFKNLL 358
Score = 54.7 bits (130), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 67/130 (51%), Gaps = 9/130 (6%)
Query: 333 RGAIAAAKKLY-----ESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNF 387
R ++A +K+L + L+D++ L +++ + +R+EG++ AR F ARK +
Sbjct: 458 RISLADSKQLLSFENEQKRLSDAI---TLTYVKSMIASKRSEGIKEARNVFKQARKFTDI 514
Query: 388 TYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALF 447
Y +++ AL+ DK A +F+ G K F ++L Y D+L +ND +R +
Sbjct: 515 GYQIFIESALLEHYSDK-KSTALKIFDLGKKNFATNGKFLLNYLDYLIMINDVDTMRTVI 573
Query: 448 ERALSSLPPE 457
+ + ++ E
Sbjct: 574 QSSDANFTKE 583
>sp|Q86UA1|PRP39_HUMAN Pre-mRNA-processing factor 39 OS=Homo sapiens GN=PRPF39 PE=1 SV=3
Length = 669
Score = 80.9 bits (198), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 120/541 (22%), Positives = 213/541 (39%), Gaps = 105/541 (19%)
Query: 35 HLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCL-LICLQV 93
HL A+ A +++ +P + +WK+Y + +N + +++ R L I L V
Sbjct: 110 HLMAARKA--FDRFFIHYP----YCYGYWKKYADLEKRHDNIKPSDEVYRRGLQAIPLSV 163
Query: 94 PLWRCYIRFIRKVYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALN 153
LW YI F+++ + E R F+ + G+D S +W YI N
Sbjct: 164 DLWIHYINFLKETLDPGDPETNNTIRGTFEHAVLAAGTDFRSDRLWEMYI---------N 214
Query: 154 AQEESQRMIAIRKAYQRAVVTPT----HHVEQLWKDYENFENSVSRQLAKG--------- 200
+ E + + Y R + PT HH ++ E+ +N++ R L G
Sbjct: 215 WENEQGNLREVTAIYDRILGIPTQLYSHHFQRF---KEHVQNNLPRDLLTGEQFIQLRRE 271
Query: 201 ------------------------------LLSEYQSKYTSARAVYRERKKYCEE----- 225
L++E ++ +++E Y E
Sbjct: 272 LASVNGHSGDDGPPGDDLPSGIEDITDPAKLITEIENMRHRIIEIHQEMFNYNEHEVSKR 331
Query: 226 ------IDWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCL 279
I V P E+ Q WK L FE N +++R++ +E+C+
Sbjct: 332 WTFEEGIKRPYFHVKPL----EKAQLKNWKEYLEFEIEN-------GTHERVVVLFERCV 380
Query: 280 MYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALK-ALPDSEMLRYAFAELEESRGAIAA 338
+ Y + W YA + ++ SI+ VF RA LP M+ +A EE +G I
Sbjct: 381 ISCALYEEFWIKYAKY-MENHSIEGVRHVFSRACTIHLPKKPMVHMLWAAFEEQQGNINE 439
Query: 339 AKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDA---RKSPNFTYHVYVAY 395
A+ + ++ + V A+ ++ + RR +E A DA KS N + V
Sbjct: 440 ARNILKT-FEECVLGLAMVRLRRVSLERRHGNLEEAEHLLQDAIKNAKSNNESSFYAVKL 498
Query: 396 ALMAF-CQDKDPKLAHNVFEAGLKRFMHEPAYI----LEYADFLSRLNDDRNIRALFERA 450
A F Q PK + EA + + Y+ +EY+ L + ++ NI F++A
Sbjct: 499 ARHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNLLEMEYSGDLKQ--NEENILNCFDKA 556
Query: 451 L-SSLPPEESIEVWKRFTQFEQMYG-DLDSTLKVEQ------RRKEALSRTGEEGASALE 502
+ SLP + I +R +F + +G D++ L + +++L R E G+ E
Sbjct: 557 VHGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQTLLKEQDSLKRKAENGSEEPE 616
Query: 503 D 503
+
Sbjct: 617 E 617
>sp|Q8K2Z2|PRP39_MOUSE Pre-mRNA-processing factor 39 OS=Mus musculus GN=Prpf39 PE=2 SV=3
Length = 665
Score = 79.0 bits (193), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 119/540 (22%), Positives = 210/540 (38%), Gaps = 103/540 (19%)
Query: 35 HLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCL-LICLQV 93
HL A+ A +++ +P + +WK+Y + +N + +++ R L I L V
Sbjct: 108 HLMAARKA--FDKFFVHYP----YCYGYWKKYADLEKRHDNIKQSDEVYRRGLQAIPLSV 161
Query: 94 PLWRCYIRFIRKVYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALN 153
LW YI F+++ E E R F+ + G+D S +W YI N
Sbjct: 162 DLWIHYINFLKETLEPGDQETNTTIRGTFEHAVLAAGTDFRSDKLWEMYI---------N 212
Query: 154 AQEESQRMIAIRKAYQRAVVTPT----HHVEQLWKDYENFENSVSRQLAKG--------- 200
+ E + + Y R + PT HH ++ E+ +N++ R L G
Sbjct: 213 WENEQGNLREVTAVYDRILGIPTQLYSHHFQRF---KEHVQNNLPRDLLTGEQFIQLRRE 269
Query: 201 -----------------------------LLSEYQSKYTSARAVYRERKKYCEE------ 225
L++E ++ +++E Y E
Sbjct: 270 LASVNGHSGDDGPPGDDLPSGIEDISPAKLITEIENMRHRIIEIHQEMFNYNEHEVSKRW 329
Query: 226 -----IDWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLM 280
I V P + ++ W K L FE N +++R++ +E+C++
Sbjct: 330 TFEEGIKRPYFHVKPLEKAQPKKNW---KEYLEFEIEN-------GTHERVVVLFERCVI 379
Query: 281 YLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKA-LPDSEMLRYAFAELEESRGAIAAA 339
Y + W YA + ++ SI+ VF RA LP M +A EE +G I A
Sbjct: 380 SCALYEEFWIKYAKY-MENHSIEGVRHVFSRACTVHLPKKPMAHMLWAAFEEQQGNINEA 438
Query: 340 KKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDA---RKSPNFTYHVYVAYA 396
+ + + + V A+ ++ + RR +E A DA KS N + + A
Sbjct: 439 RIILRT-FEECVLGLAMVRLRRVSLERRHGNMEEAEHLLQDAIKNAKSNNESSFYAIKLA 497
Query: 397 LMAF-CQDKDPKLAHNVFEAGLKRFMHEPAYI----LEYADFLSRLNDDRNIRALFERAL 451
F Q PK + EA K + Y+ +EY+ L + ++ NI F++A+
Sbjct: 498 RHLFKIQKNLPKSRKVLLEAIEKDKENTKLYLNLLEMEYSCDLKQ--NEENILNCFDKAI 555
Query: 452 -SSLPPEESIEVWKRFTQFEQMYG-DLDSTLKVEQ------RRKEALSRTGEEGASALED 503
SLP + I +R +F + +G D++ L + ++ L R E G+ E+
Sbjct: 556 HGSLPIKMRITFSQRKVEFLEDFGSDVNKLLNAYDEHQTLLKEQDTLKRKAENGSEEPEE 615
>sp|O74970|PRP39_SCHPO Pre-mRNA-processing factor 39 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=prp39 PE=3 SV=1
Length = 612
Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/349 (23%), Positives = 143/349 (40%), Gaps = 71/349 (20%)
Query: 44 IYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLL-ICLQVPLWRCYIRF 102
+Y++ L +P + +WK+Y + V +A++ ++ R + I V LW Y F
Sbjct: 60 VYDRFLGKYP----LLFGYWKKYADFEFFVAGAEASEHIYERGIAGIPHSVDLWTNYCAF 115
Query: 103 IRKVYEKKGTEGQ-EETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQRM 161
K T G E R+ F + VG D S P W +Y+ F +E +R
Sbjct: 116 ------KMETNGDANEVRELFMQGANMVGLDFLSHPFWDKYLEF---------EERQERP 160
Query: 162 IAIRKAYQRAVVTPTHHVEQLWKDYENFENS------------------VSRQLAKGLLS 203
+ + +R + P H + ++ + S V+R+ AK ++S
Sbjct: 161 DNVFQLLERLIHIPLHQYARYFERFVQVSQSQPIQQLLPPDVLASIRADVTREPAK-VVS 219
Query: 204 EYQSKYTSARA---VYRERKKYCEEIDWNMLAVPPTGSYK------------------EE 242
+ T R + RE + I + + K +E
Sbjct: 220 AGSKQITVERGELEIEREMRARIYNIHLQIFQKVQLETAKRWTFESEIKRPYFHVKELDE 279
Query: 243 QQWIAWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATW-NAKSGS 301
Q + W++ L FE + +RI YE+CL+ Y + W+ YA W +A+
Sbjct: 280 AQLVNWRKYLDFE-------EVEGDFQRICHLYERCLITCALYDEFWFRYARWMSAQPDH 332
Query: 302 IDAAIKVFQRA--LKALPDSEMLRYAFAELEESRGAIAAAKKLYESLLT 348
++ +++RA + A +R +A EES+G IA+AK +Y+S+LT
Sbjct: 333 LNDVSIIYERASCIFASISRPGIRVQYALFEESQGNIASAKAIYQSILT 381
>sp|Q4KLU2|PRP39_XENLA Pre-mRNA-processing factor 39 OS=Xenopus laevis GN=prpf39 PE=2 SV=1
Length = 641
Score = 67.0 bits (162), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 82/396 (20%), Positives = 154/396 (38%), Gaps = 77/396 (19%)
Query: 35 HLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCL-LICLQV 93
HL A+ A ++ L+ +P + +WK+Y + NN +++ R + I L V
Sbjct: 85 HLFAARKA--FDAFLAHYP----YCYGYWKKYADLEKKNNNILEADEVYRRGIQAITLSV 138
Query: 94 PLWRCYIRFIRKVYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALN 153
LW Y+ F+++ + E R F+ + G D S +W YI N
Sbjct: 139 DLWMHYLNFLKETLDPADPETSLTLRGTFEHAVVSAGLDFRSDKLWEMYI---------N 189
Query: 154 AQEESQRMIAIRKAYQRAVVTPTHHVEQLWK-DYENFENSVSRQLAKGLLSEYQ------ 206
+ E + + Y R + PT Q + ++ F+ + L + L+ +
Sbjct: 190 WETEQGNLSGVTSIYSRLLGIPT----QFYSLHFQRFKEHIQGHLPREFLTSEKFIELRK 245
Query: 207 -------------------------SKYTSARAVYRER--KKYCEEIDWNMLAVPPTGSY 239
+K T+ R R + + E + N V ++
Sbjct: 246 ELASMTLHGGTNDDIPSGLEEIKDPAKRTTEVENMRHRIIEVHQEIFNLNEHEVSKIWNF 305
Query: 240 KEE-------------QQWIAWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYHYP 286
+EE Q WK L FE N SN+RI+ +E+C++ Y
Sbjct: 306 EEEIKRPYFHVKPLEKAQLNNWKEYLEFELEN-------GSNERIVILFERCVIACACYE 358
Query: 287 DIWYDYATWNAKSGSIDAAIKVFQRALKA-LPDSEMLRYAFAELEESRGAIAAAKKLYES 345
+ W YA + ++ S++ V+ RA L M+ +A EE +G + A+++ ++
Sbjct: 359 EFWIKYAKY-MENHSVEGVRHVYNRACHVHLAKKPMVHLLWAAFEEQQGNLEEARRILKN 417
Query: 346 LLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDA 381
+ T ++ A+ ++ + RR V+ A +A
Sbjct: 418 IET-AIEGLAMVRLRRVNLERRHGNVKEAEHLLEEA 452
>sp|P63155|CRNL1_RAT Crooked neck-like protein 1 OS=Rattus norvegicus GN=Crnkl1 PE=2
SV=1
Length = 690
Score = 60.1 bits (144), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 96/211 (45%), Gaps = 21/211 (9%)
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRG 334
YE+ L Y +W YA K+ ++ A ++ RA+ LP Y + +EE G
Sbjct: 104 YERALDVDYRNITLWLKYAEMEMKNRQVNHARNIWDRAITTLPRVNQFWYKYTYMEEMLG 163
Query: 335 AIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAAR----KYFLDARKSPNFTYH 390
+A A++++E + A +I F R + VE AR ++ L N
Sbjct: 164 NVAGARQVFERWMEWQPEEQAWH--SYINFELRYKEVERARTIYERFVLVHPAVKN---- 217
Query: 391 VYVAYALMAFCQDKDPKLAH--NVFEAGLKRF----MHEPAYILEYADFLSRLNDDRNIR 444
++ YA ++K AH V+E ++ F M E Y+ +A F + +R
Sbjct: 218 -WIKYARF---EEKHAYFAHARKVYERAVEFFGDEHMDEHLYVA-FAKFEENQKEFERVR 272
Query: 445 ALFERALSSLPPEESIEVWKRFTQFEQMYGD 475
+++ AL + +E+ E++K +T FE+ +GD
Sbjct: 273 VIYKYALDRISKQEAQELFKNYTIFEKKFGD 303
Score = 40.4 bits (93), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 105/455 (23%), Positives = 178/455 (39%), Gaps = 82/455 (18%)
Query: 38 VAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWR 97
V A I+++ ++ P + +FW +Y + N +Q+F R + + W
Sbjct: 131 VNHARNIWDRAITTLPR----VNQFWYKYTYMEEMLGNVAGARQVFERWMEWQPEEQAWH 186
Query: 98 CYIRFIRKVYEKKGTEGQEETRKAFD-FMLSHVGSDISSGPIWLEYITFLKSLPALNAQE 156
YI F + E E R ++ F+L H + W++Y F +E
Sbjct: 187 SYINFELRYKE------VERARTIYERFVLVH-----PAVKNWIKYARF---------EE 226
Query: 157 ESQRMIAIRKAYQRAV--VTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARA 214
+ RK Y+RAV H E L+ + FE E Q ++ R
Sbjct: 227 KHAYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFE-------------ENQKEFERVRV 273
Query: 215 VYRERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEK--GNPQRIDTASSNKRII 272
+Y KY +D S +E Q+ +K FEK G+ + I+ +KR
Sbjct: 274 IY----KYA--LD--------RISKQEAQE--LFKNYTIFEKKFGDRRGIEDIIVSKRR- 316
Query: 273 FTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLR--------- 323
F YE+ + H D W+DY D +V++RA+ +P + R
Sbjct: 317 FQYEEEVKANPHNYDAWFDYLRLVESDAEADTVREVYERAIANVPPIQEKRHWKRYIYLW 376
Query: 324 --YAFAELEESRGAIAAAKKLYES---LLTDSVNTTALAHIQFIRFLRRTEGVEAARKYF 378
YA E E++ +++Y++ L+ T A + + +F R + + AR+
Sbjct: 377 VNYALYEELEAKDP-ERTRQVYQASLELIPHKKFTFAKMWLYYAQFEIRQKNLPFARRAL 435
Query: 379 -LDARKSP-NFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSR 436
K P N + Y+ L D+ KL E G + +++A+ +
Sbjct: 436 GTSIGKCPKNKLFKGYIELELQLREFDRCRKLYEKFLEFGPENCTS----WIKFAELETI 491
Query: 437 LNDDRNIRALFERALSSLPPEESIEV-WKRFTQFE 470
L D RA++E A+S P + EV WK + FE
Sbjct: 492 LGDIERARAIYELAISQ-PRLDMPEVLWKSYIDFE 525
>sp|P63154|CRNL1_MOUSE Crooked neck-like protein 1 OS=Mus musculus GN=Crnkl1 PE=2 SV=1
Length = 690
Score = 60.1 bits (144), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 96/211 (45%), Gaps = 21/211 (9%)
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRG 334
YE+ L Y +W YA K+ ++ A ++ RA+ LP Y + +EE G
Sbjct: 104 YERALDVDYRNITLWLKYAEMEMKNRQVNHARNIWDRAITTLPRVNQFWYKYTYMEEMLG 163
Query: 335 AIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAAR----KYFLDARKSPNFTYH 390
+A A++++E + A +I F R + VE AR ++ L N
Sbjct: 164 NVAGARQVFERWMEWQPEEQAWH--SYINFELRYKEVERARTIYERFVLVHPAVKN---- 217
Query: 391 VYVAYALMAFCQDKDPKLAH--NVFEAGLKRF----MHEPAYILEYADFLSRLNDDRNIR 444
++ YA ++K AH V+E ++ F M E Y+ +A F + +R
Sbjct: 218 -WIKYARF---EEKHAYFAHARKVYERAVEFFGDEHMDEHLYVA-FAKFEENQKEFERVR 272
Query: 445 ALFERALSSLPPEESIEVWKRFTQFEQMYGD 475
+++ AL + +E+ E++K +T FE+ +GD
Sbjct: 273 VIYKYALDRISKQEAQELFKNYTIFEKKFGD 303
Score = 40.4 bits (93), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 105/455 (23%), Positives = 178/455 (39%), Gaps = 82/455 (18%)
Query: 38 VAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWR 97
V A I+++ ++ P + +FW +Y + N +Q+F R + + W
Sbjct: 131 VNHARNIWDRAITTLPR----VNQFWYKYTYMEEMLGNVAGARQVFERWMEWQPEEQAWH 186
Query: 98 CYIRFIRKVYEKKGTEGQEETRKAFD-FMLSHVGSDISSGPIWLEYITFLKSLPALNAQE 156
YI F + E E R ++ F+L H + W++Y F +E
Sbjct: 187 SYINFELRYKE------VERARTIYERFVLVH-----PAVKNWIKYARF---------EE 226
Query: 157 ESQRMIAIRKAYQRAV--VTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARA 214
+ RK Y+RAV H E L+ + FE E Q ++ R
Sbjct: 227 KHAYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFE-------------ENQKEFERVRV 273
Query: 215 VYRERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEK--GNPQRIDTASSNKRII 272
+Y KY +D S +E Q+ +K FEK G+ + I+ +KR
Sbjct: 274 IY----KYA--LD--------RISKQEAQE--LFKNYTIFEKKFGDRRGIEDIIVSKRR- 316
Query: 273 FTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLR--------- 323
F YE+ + H D W+DY D +V++RA+ +P + R
Sbjct: 317 FQYEEEVKANPHNYDAWFDYLRLVESDAEADTVREVYERAIANVPPIQEKRHWKRYIYLW 376
Query: 324 --YAFAELEESRGAIAAAKKLYES---LLTDSVNTTALAHIQFIRFLRRTEGVEAARKYF 378
YA E E++ +++Y++ L+ T A + + +F R + + AR+
Sbjct: 377 VNYALYEELEAKDP-ERTRQVYQASLELIPHKKFTFAKMWLYYAQFEIRQKNLPFARRAL 435
Query: 379 -LDARKSP-NFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSR 436
K P N + Y+ L D+ KL E G + +++A+ +
Sbjct: 436 GTSIGKCPKNKLFKGYIELELQLREFDRCRKLYEKFLEFGPENCTS----WIKFAELETI 491
Query: 437 LNDDRNIRALFERALSSLPPEESIEV-WKRFTQFE 470
L D RA++E A+S P + EV WK + FE
Sbjct: 492 LGDIERARAIYELAISQ-PRLDMPEVLWKSYIDFE 525
>sp|Q9BZJ0|CRNL1_HUMAN Crooked neck-like protein 1 OS=Homo sapiens GN=CRNKL1 PE=1 SV=4
Length = 848
Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 96/211 (45%), Gaps = 21/211 (9%)
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRG 334
YE+ L Y +W YA K+ ++ A ++ RA+ LP Y + +EE G
Sbjct: 265 YERALDVDYRNITLWLKYAEMEMKNRQVNHARNIWDRAITTLPRVNQFWYKYTYMEEMLG 324
Query: 335 AIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAAR----KYFLDARKSPNFTYH 390
+A A++++E + A +I F R + V+ AR ++ L N
Sbjct: 325 NVAGARQVFERWMEWQPEEQAWH--SYINFELRYKEVDRARTIYERFVLVHPDVKN---- 378
Query: 391 VYVAYALMAFCQDKDPKLAH--NVFEAGLKRF----MHEPAYILEYADFLSRLNDDRNIR 444
++ YA ++K AH V+E ++ F M E Y+ +A F + +R
Sbjct: 379 -WIKYARF---EEKHAYFAHARKVYERAVEFFGDEHMDEHLYVA-FAKFEENQKEFERVR 433
Query: 445 ALFERALSSLPPEESIEVWKRFTQFEQMYGD 475
+++ AL + +++ E++K +T FE+ +GD
Sbjct: 434 VIYKYALDRISKQDAQELFKNYTIFEKKFGD 464
Score = 48.1 bits (113), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 104/462 (22%), Positives = 179/462 (38%), Gaps = 96/462 (20%)
Query: 38 VAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWR 97
V A I+++ ++ P + +FW +Y + N +Q+F R + + W
Sbjct: 292 VNHARNIWDRAITTLPR----VNQFWYKYTYMEEMLGNVAGARQVFERWMEWQPEEQAWH 347
Query: 98 CYIRF---------IRKVYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITFLKS 148
YI F R +YE+ F+L H D+ + W++Y F
Sbjct: 348 SYINFELRYKEVDRARTIYER--------------FVLVH--PDVKN---WIKYARF--- 385
Query: 149 LPALNAQEESQRMIAIRKAYQRAV--VTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQ 206
+E+ RK Y+RAV H E L+ + FE E Q
Sbjct: 386 ------EEKHAYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFE-------------ENQ 426
Query: 207 SKYTSARAVYRERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEK--GNPQRIDT 264
++ R +Y KY +D S ++ Q+ +K FEK G+ + I+
Sbjct: 427 KEFERVRVIY----KYA--LD--------RISKQDAQE--LFKNYTIFEKKFGDRRGIED 470
Query: 265 ASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLR- 323
+KR F YE+ + H D W+DY +A +V++RA+ +P + R
Sbjct: 471 IIVSKRR-FQYEEEVKANPHNYDAWFDYLRLVESDAEAEAVREVYERAIANVPPIQEKRH 529
Query: 324 ----------YAFAELEESRGAIAAAKKLYES---LLTDSVNTTALAHIQFIRFLRRTEG 370
YA E E++ +++Y++ L+ T A I + +F R +
Sbjct: 530 WKRYIYLWINYALYEELEAKDP-ERTRQVYQASLELIPHKKFTFAKMWILYAQFEIRQKN 588
Query: 371 VEAARKYF-LDARKSP-NFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYIL 428
+ AR+ K P N + VY+ L D+ KL E G + +
Sbjct: 589 LSLARRALGTSIGKCPKNKLFKVYIELELQLREFDRCRKLYEKFLEFGPENCTS----WI 644
Query: 429 EYADFLSRLNDDRNIRALFERALSSLPPEESIEVWKRFTQFE 470
++A+ + L D RA++E A+S + +WK + FE
Sbjct: 645 KFAELETILGDIDRARAIYELAISQPRLDMPEVLWKSYIDFE 686
>sp|P87312|CLF1_SCHPO Pre-mRNA-splicing factor cwf4 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=cwf4 PE=1 SV=1
Length = 674
Score = 58.5 bits (140), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 99/212 (46%), Gaps = 19/212 (8%)
Query: 285 YPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYE 344
Y +W Y K+ +I+ A +F RA+ LP + L Y + +EE G I ++++E
Sbjct: 103 YIPLWLKYIECEMKNRNINHARNLFDRAVTQLPRVDKLWYKYVYMEEMLGNITGCRQVFE 162
Query: 345 SLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDK 404
L + + +IR RR E AR + H V L ++
Sbjct: 163 RWLKWEPDENCW--MSYIRMERRYHENERARGIY-----ERFVVVHPEVTNWLRWARFEE 215
Query: 405 DPKLAHNVFEAGL-------KRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPE 457
+ A NV + L + F++E +I +A F R + R +F+ A+ +P
Sbjct: 216 ECGNAANVRQVYLAAIDALGQEFLNERFFIA-FAKFEIRQKEYERARTIFKYAIDFMPRS 274
Query: 458 ESIEVWKRFTQFEQMYGD---LDSTLKVEQRR 486
+S+E++K +T FE+ +GD ++ST+ +++RR
Sbjct: 275 KSMELYKEYTHFEKQFGDHLGVESTV-LDKRR 305
Score = 39.7 bits (91), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 102/507 (20%), Positives = 190/507 (37%), Gaps = 101/507 (19%)
Query: 54 TAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFIRKVYEKKGTE 113
T + + K W +YV + N +Q+F R L W YIR R+ +E +
Sbjct: 132 TQLPRVDKLWYKYVYMEEMLGNITGCRQVFERWLKWEPDENCWMSYIRMERRYHENERAR 191
Query: 114 GQEETRKAFDFMLSHVGSDISSGPIWLEY-----------ITFLKSLPALNAQEESQRMI 162
G E F++ H ++++ W + +L ++ AL + ++R
Sbjct: 192 GIYER-----FVVVH--PEVTNWLRWARFEEECGNAANVRQVYLAAIDALGQEFLNERFF 244
Query: 163 ------AIR-KAYQRAVVTPTHHVE--------QLWKDYENFENSVSRQLA--------- 198
IR K Y+RA + ++ +L+K+Y +FE L
Sbjct: 245 IAFAKFEIRQKEYERARTIFKYAIDFMPRSKSMELYKEYTHFEKQFGDHLGVESTVLDKR 304
Query: 199 ----KGLLSEYQSKY----------TSARAVYRERKKYCEEIDWNMLAVPPTGSYKEEQQ 244
+ LL + Y SA + R+ Y + I VP ++
Sbjct: 305 RLQYEKLLKDSPYDYDTWLDLLKLEESAGDINTIRETYEKAI----AKVPEVVEKNAWRR 360
Query: 245 WI-AWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYH----YPDIWYDYATWNAKS 299
++ W FE+ + + +D A Y++ L + H + +W YA + +
Sbjct: 361 YVYIWLNYCLFEEIDVKDVDRARK------VYQEALKLIPHKKFTFAKLWLMYAMFELRQ 414
Query: 300 GSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLLTDSVNTTA--LA 357
ID A K RAL P ++ R + E E++ + LYE + A L
Sbjct: 415 RKIDVARKTLGRALGMCPKPKLFR-GYIEFEDAIKQFDRCRILYEKWILYDPEACAPWLG 473
Query: 358 HIQFIRFLRRTEGVEAARKYFLDA--RKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFEA 415
+ L ++ A ++ ++P + Y+ + ++ + A ++++
Sbjct: 474 YAALETKLGDSDRARALYNLAVNQPILETPELVWKAYIDFEF----EEMEYGKARSIYQQ 529
Query: 416 GLKRFMHEPAYILEYADF-LSRLNDDRN---------------IRALFERALSSLP---- 455
L+ H +I +A+F ++ L DD R +FE AL+ L
Sbjct: 530 LLRTAPHVKVWI-SFANFEIAHLEDDDEEPPNEEVASPTAVVRARNVFENALAHLRQQGL 588
Query: 456 PEESIEVWKRFTQFEQMYGDLDSTLKV 482
EE + + + + QFE M+G D+ V
Sbjct: 589 KEERVVLLEAWKQFEAMHGTEDTRKHV 615
Score = 33.5 bits (75), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 18/65 (27%), Positives = 29/65 (44%)
Query: 38 VAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWR 97
V +A +Y++ L + P AK W Y + D ++ R L +C + L+R
Sbjct: 379 VDRARKVYQEALKLIPHKKFTFAKLWLMYAMFELRQRKIDVARKTLGRALGMCPKPKLFR 438
Query: 98 CYIRF 102
YI F
Sbjct: 439 GYIEF 443
>sp|Q9HF03|CLF1_CRYNH Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var.
grubii serotype A (strain H99 / ATCC 208821 / CBS 10515
/ FGSC 9487) GN=CLF1 PE=3 SV=1
Length = 724
Score = 58.2 bits (139), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 95/198 (47%), Gaps = 15/198 (7%)
Query: 287 DIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESL 346
D+W Y K+ +I+ A +F RA+ LP + L Y + LEE ++ A++++E
Sbjct: 110 DLWIKYTDMELKARNINHARNLFDRAITLLPRVDALWYKYVYLEELLLNVSGARQIFERW 169
Query: 347 LTDSVNTTAL-AHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDK- 404
+ N A ++I+ + A + ++ R P +VA+A F +D+
Sbjct: 170 MQWEPNDKAWQSYIKLEERYNELDRASAIYERWIACRPIPK----NWVAWA--KFEEDRG 223
Query: 405 DPKLAHNVFEAGLKRFMHEPAYILE-------YADFLSRLNDDRNIRALFERALSSLPPE 457
P A VF+ L+ F E + + +A +RL + R +++ AL+ LP
Sbjct: 224 QPDKAREVFQTALEFFGDEEEQVEKAQSVFAAFARMETRLKEFERARVIYKFALARLPRS 283
Query: 458 ESIEVWKRFTQFEQMYGD 475
+S ++ ++T+FE+ +GD
Sbjct: 284 KSASLYAQYTKFEKQHGD 301
>sp|P0CO10|CLF1_CRYNJ Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var.
neoformans serotype D (strain JEC21 / ATCC MYA-565)
GN=CLF1 PE=3 SV=1
Length = 726
Score = 56.6 bits (135), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 49/198 (24%), Positives = 94/198 (47%), Gaps = 15/198 (7%)
Query: 287 DIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESL 346
D+W Y K+ +I+ A +F RA+ LP + L Y + LEE ++ A++++E
Sbjct: 110 DLWIKYTDMELKARNINHARNLFDRAITLLPRVDALWYKYVYLEELLLNVSGARQIFERW 169
Query: 347 LTDSVNTTAL-AHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDK- 404
+ N A ++I+ + A + ++ R P +V +A F +D+
Sbjct: 170 MQWEPNDKAWQSYIKLEERYNELDRASAIYERWIACRPIPK----NWVTWA--KFEEDRG 223
Query: 405 DPKLAHNVFEAGLKRFMHEPAYILE-------YADFLSRLNDDRNIRALFERALSSLPPE 457
P A VF+ L+ F E + + +A +RL + R +++ AL+ LP
Sbjct: 224 QPDKAREVFQTALEFFGDEEEQVEKAQSVFAAFARMETRLKEFERARVIYKFALARLPRS 283
Query: 458 ESIEVWKRFTQFEQMYGD 475
+S ++ ++T+FE+ +GD
Sbjct: 284 KSASLYAQYTKFEKQHGD 301
Score = 39.3 bits (90), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 71/307 (23%), Positives = 123/307 (40%), Gaps = 54/307 (17%)
Query: 233 VPPTGSYKEEQQWI-AWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYH----YPD 287
VPP + +++I W + FE+ + + D A Y+ + + H +
Sbjct: 368 VPPALEKRYWRRYIYLWLQYAAFEEIDTKDYDRARD------VYKAAVKLVPHKTFTFAK 421
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL 347
+W YA + + + AA KV + P ++ + ELE + LYE L
Sbjct: 422 LWLAYAYFEIRRLDVSAARKVLGAGIGMCPKPKLFT-GYIELEMRLREFDRVRTLYEKFL 480
Query: 348 TDSVNTTALAHIQFIRFLRRTEGVEAARKYF-LDARKS---PNFTYHVYVAYALMAFCQD 403
T + ++ A IQ+ + E E R F L ++S P + Y+ + +
Sbjct: 481 TYDPSLSS-AWIQWTQVESAVEDFERVRAIFELAVQQSLDMPEIVWKAYIDFE----AGE 535
Query: 404 KDPKLAHNVFEAGLKRFMHEPAYI----LEYADFLSRLNDDRN-----------IRALFE 448
+ + A N++E L+R H +I +E A ++D N R +FE
Sbjct: 536 GERERARNLYERLLERTSHVKVWISYALMEIATLGGGEDEDGNEIEGEAGDADLARQVFE 595
Query: 449 RALSSLPPEES-------IEVWKRFTQFEQMYGDLDSTLKVEQ-----RRKEALSRTGEE 496
R L + +E WK FEQ +GD ++ KVE R++ R E+
Sbjct: 596 RGYKDLRAKGEKEDRAVLLESWK---SFEQEHGDEETLAKVEDMLPTTRKR---WRKAED 649
Query: 497 GASALED 503
G+ LE+
Sbjct: 650 GSGELEE 656
>sp|P0CO11|CLF1_CRYNB Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var.
neoformans serotype D (strain B-3501A) GN=CLF1 PE=3 SV=1
Length = 726
Score = 56.6 bits (135), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 49/198 (24%), Positives = 94/198 (47%), Gaps = 15/198 (7%)
Query: 287 DIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESL 346
D+W Y K+ +I+ A +F RA+ LP + L Y + LEE ++ A++++E
Sbjct: 110 DLWIKYTDMELKARNINHARNLFDRAITLLPRVDALWYKYVYLEELLLNVSGARQIFERW 169
Query: 347 LTDSVNTTAL-AHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDK- 404
+ N A ++I+ + A + ++ R P +V +A F +D+
Sbjct: 170 MQWEPNDKAWQSYIKLEERYNELDRASAIYERWIACRPIPK----NWVTWA--KFEEDRG 223
Query: 405 DPKLAHNVFEAGLKRFMHEPAYILE-------YADFLSRLNDDRNIRALFERALSSLPPE 457
P A VF+ L+ F E + + +A +RL + R +++ AL+ LP
Sbjct: 224 QPDKAREVFQTALEFFGDEEEQVEKAQSVFAAFARMETRLKEFERARVIYKFALARLPRS 283
Query: 458 ESIEVWKRFTQFEQMYGD 475
+S ++ ++T+FE+ +GD
Sbjct: 284 KSASLYAQYTKFEKQHGD 301
Score = 39.3 bits (90), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 71/307 (23%), Positives = 123/307 (40%), Gaps = 54/307 (17%)
Query: 233 VPPTGSYKEEQQWI-AWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYH----YPD 287
VPP + +++I W + FE+ + + D A Y+ + + H +
Sbjct: 368 VPPALEKRYWRRYIYLWLQYAAFEEIDTKDYDRARD------VYKAAVKLVPHKTFTFAK 421
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL 347
+W YA + + + AA KV + P ++ + ELE + LYE L
Sbjct: 422 LWLAYAYFEIRRLDVSAARKVLGAGIGMCPKPKLFT-GYIELEMRLREFDRVRTLYEKFL 480
Query: 348 TDSVNTTALAHIQFIRFLRRTEGVEAARKYF-LDARKS---PNFTYHVYVAYALMAFCQD 403
T + ++ A IQ+ + E E R F L ++S P + Y+ + +
Sbjct: 481 TYDPSLSS-AWIQWTQVESAVEDFERVRAIFELAVQQSLDMPEIVWKAYIDFE----AGE 535
Query: 404 KDPKLAHNVFEAGLKRFMHEPAYI----LEYADFLSRLNDDRN-----------IRALFE 448
+ + A N++E L+R H +I +E A ++D N R +FE
Sbjct: 536 GERERARNLYERLLERTSHVKVWISYALMEIATLGGGEDEDGNEIEGEAGDADLARQVFE 595
Query: 449 RALSSLPPEES-------IEVWKRFTQFEQMYGDLDSTLKVEQ-----RRKEALSRTGEE 496
R L + +E WK FEQ +GD ++ KVE R++ R E+
Sbjct: 596 RGYKDLRAKGEKEDRAVLLESWK---SFEQEHGDEETLAKVEDMLPTTRKR---WRKAED 649
Query: 497 GASALED 503
G+ LE+
Sbjct: 650 GSGELEE 656
>sp|Q5BDX1|CLF1_EMENI Pre-mRNA-splicing factor clf1 OS=Emericella nidulans (strain FGSC
A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) GN=clf1
PE=3 SV=2
Length = 673
Score = 52.0 bits (123), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 92/204 (45%), Gaps = 19/204 (9%)
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL 347
+W Y ++ +I+ A + RA+ LP + L Y + +EE+ G I ++++E +
Sbjct: 108 LWIRYIESEMRNRNINHARNLLDRAVTILPRVDKLWYKYVYMEETLGNIPGTRQVFERWM 167
Query: 348 TDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTY-----HVYVAYALMAFCQ 402
+ + A + +I+ +R E AR F FT ++ +A +
Sbjct: 168 SWEPDEGAWS--AYIKLEKRYNEFERARAIF------QRFTIVHPEPRNWIKWARFE-EE 218
Query: 403 DKDPKLAHNVF----EAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEE 458
L V+ E + FM E +I YA F ++L + RA+++ AL LP +
Sbjct: 219 YGTSDLVREVYGLAVETLGEDFMDEKLFIA-YARFETKLKEYERARAIYKYALDRLPRSK 277
Query: 459 SIEVWKRFTQFEQMYGDLDSTLKV 482
SI + K +T FE+ +GD + V
Sbjct: 278 SITLHKAYTTFEKQFGDREGVENV 301
Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 106/461 (22%), Positives = 178/461 (38%), Gaps = 80/461 (17%)
Query: 48 LLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFIRKVY 107
LL T + + K W +YV + N T+Q+F R + W YI+
Sbjct: 128 LLDRAVTILPRVDKLWYKYVYMEETLGNIPGTRQVFERWMSWEPDEGAWSAYIKL----- 182
Query: 108 EKKGTEGQEETRKAFD-FMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQRMIAIRK 166
EK+ E E R F F + H W+++ F +EE +R+
Sbjct: 183 EKRYNEF-ERARAIFQRFTIVH-----PEPRNWIKWARF---------EEEYGTSDLVRE 227
Query: 167 AYQRAVVTPTHHV--EQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERKKYCE 224
Y AV T E+L+ Y FE + +Y ARA+Y KY
Sbjct: 228 VYGLAVETLGEDFMDEKLFIAYARFETKLK-------------EYERARAIY----KYA- 269
Query: 225 EIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEK--GNPQRIDTASSNKRIIFTYEQCLMYL 282
+ +P + S K TFEK G+ + ++ KR + EQ L
Sbjct: 270 -----LDRLPRSKSI------TLHKAYTTFEKQFGDREGVENVILAKRRVQYEEQLKENL 318
Query: 283 YHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLR-----------YAFAELEE 331
+Y D+W+D+A +SG + V++RA+ +P S+ R YA E E
Sbjct: 319 RNY-DVWFDFARLEEQSGDPERVRDVYERAIAQIPPSQEKRHWRRYIYLWIFYALWEEME 377
Query: 332 SRGAIAAAKKLYES---LLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDA----RKS 384
++ I A+++Y L+ T A + +F R ++AARK A K
Sbjct: 378 AKD-IDRARQVYTECLKLIPHKKFTFAKVWLMKAQFEVRQLNLQAARKTLGQAIGMCPKD 436
Query: 385 PNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIR 444
F ++ + L F + ++E ++ ++YA+ L+D R
Sbjct: 437 KLFRGYIDLERQLFEFVR------CRTLYEKQIEWNPSNSQSWIQYAELERGLDDTERAR 490
Query: 445 ALFERALSSLPPEESIEVWKRFTQFEQMYGDLDSTLKVEQR 485
A++E + + VWK + FE G+ + ++ +R
Sbjct: 491 AIYELGIDQPTLDMPELVWKAYIDFEDDEGEYERERQLYER 531
Score = 38.5 bits (88), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 59/278 (21%), Positives = 109/278 (39%), Gaps = 35/278 (12%)
Query: 233 VPPTGSYKEEQQWI-AWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYH----YPD 287
+PP+ + +++I W +E+ + ID A Y +CL + H +
Sbjct: 351 IPPSQEKRHWRRYIYLWIFYALWEEMEAKDIDRARQ------VYTECLKLIPHKKFTFAK 404
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL 347
+W A + + ++ AA K +A+ P ++ R + +LE + LYE +
Sbjct: 405 VWLMKAQFEVRQLNLQAARKTLGQAIGMCPKDKLFR-GYIDLERQLFEFVRCRTLYEKQI 463
Query: 348 TDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQDK-DP 406
+ + + + IQ+ R + E AR + P V A + F D+ +
Sbjct: 464 EWNPSNSQ-SWIQYAELERGLDDTERARAIYELGIDQPTLDMPELVWKAYIDFEDDEGEY 522
Query: 407 KLAHNVFEAGLKRFMHEPAYILEYADFLSRLND----------------DRNIRALFERA 450
+ ++E L++ H +I YA F + D R RA+FERA
Sbjct: 523 ERERQLYERLLQKTDHVKVWI-NYARFEINVPDEEEEEEEEERPISDEAKRRARAVFERA 581
Query: 451 LSSLP----PEESIEVWKRFTQFEQMYGDLDSTLKVEQ 484
EE +E+ + FE +G + K+E+
Sbjct: 582 HRVFKEKELKEERVELLNAWRAFEHTHGSPEDIDKIEK 619
>sp|Q4WT84|CLF1_ASPFU Pre-mRNA-splicing factor clf1 OS=Neosartorya fumigata (strain ATCC
MYA-4609 / Af293 / CBS 101355 / FGSC A1100) GN=clf1 PE=3
SV=1
Length = 676
Score = 50.8 bits (120), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 105/460 (22%), Positives = 175/460 (38%), Gaps = 78/460 (16%)
Query: 48 LLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFIRKVY 107
LL T + + KFW +YV + N T+Q+F R + W YI+
Sbjct: 128 LLDRAVTILPRVDKFWYKYVYMEETLGNIQGTRQVFERWMSWEPDEGAWSAYIKL----- 182
Query: 108 EKKGTEGQEETRKAFD-FMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQRMIAIRK 166
EK+ E E R F F + H W+++ F +EE +R+
Sbjct: 183 EKRYNEF-ERARAIFQRFTIVH-----PEPRNWIKWARF---------EEEYGTSDLVRE 227
Query: 167 AYQRAVVTPTHHV--EQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERKKYCE 224
Y A+ T E+L+ Y FE + +Y ARA+Y KY
Sbjct: 228 VYGMAIETLGEDFMDEKLFIAYAKFEAKLK-------------EYERARAIY----KYA- 269
Query: 225 EIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEK--GNPQRIDTASSNKRIIFTYEQCLMYL 282
+ +P + + K TFEK G+ + ++ +KR + YE+ L
Sbjct: 270 -----LDRLPRSKAMALH------KAYTTFEKQFGDREGVEDVILSKRRV-QYEEQLKEN 317
Query: 283 YHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLR-----------YAFAELEE 331
D+W+D+A SG D +++RA+ +P S+ R YA E E
Sbjct: 318 PRNYDVWFDFARLEETSGDPDRVRDIYERAIAQIPPSQEKRHWRRYIYLWIFYAIWEEME 377
Query: 332 SRGAIAAAKKLYESLLTDSVNTTALAHIQFIR--FLRRTEGVEAARKYFLDA----RKSP 385
++ A + E L A I ++ F R ++AARK A K
Sbjct: 378 AKDVDRARQIYTECLKLIPHKKFTFAKIWLLKAQFDIRQMDLQAARKTLGQAIGMCPKDK 437
Query: 386 NFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRA 445
F ++ + L F + ++E ++ ++YA+ L+D RA
Sbjct: 438 LFRGYIDLERQLFEFVR------CRTLYEKQIEWNPANSQSWIKYAELERGLDDSERARA 491
Query: 446 LFERALSSLPPEESIEVWKRFTQFEQMYGDLDSTLKVEQR 485
+FE + + VWK + FE+ G+ D ++ +R
Sbjct: 492 IFELGIDQPMLDMPELVWKAYIDFEEYEGEYDRVRQLYER 531
Score = 48.9 bits (115), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/204 (23%), Positives = 91/204 (44%), Gaps = 19/204 (9%)
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL 347
+W Y ++ +I+ A + RA+ LP + Y + +EE+ G I ++++E +
Sbjct: 108 LWIRYIESEMRNRNINHARNLLDRAVTILPRVDKFWYKYVYMEETLGNIQGTRQVFERWM 167
Query: 348 TDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTY-----HVYVAYALMAFCQ 402
+ + A + +I+ +R E AR F FT ++ +A +
Sbjct: 168 SWEPDEGAWS--AYIKLEKRYNEFERARAIF------QRFTIVHPEPRNWIKWARFE-EE 218
Query: 403 DKDPKLAHNVFEAGLKR----FMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEE 458
L V+ ++ FM E +I YA F ++L + RA+++ AL LP +
Sbjct: 219 YGTSDLVREVYGMAIETLGEDFMDEKLFIA-YAKFEAKLKEYERARAIYKYALDRLPRSK 277
Query: 459 SIEVWKRFTQFEQMYGDLDSTLKV 482
++ + K +T FE+ +GD + V
Sbjct: 278 AMALHKAYTTFEKQFGDREGVEDV 301
>sp|Q7KRW8|PRP39_DROME Pre-mRNA-processing factor 39 OS=Drosophila melanogaster GN=CG1646
PE=1 SV=1
Length = 1066
Score = 50.8 bits (120), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 116/270 (42%), Gaps = 25/270 (9%)
Query: 241 EEQQWIAWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATW----N 296
E Q WK L FE R +R++ +E+CL+ Y + W +
Sbjct: 701 ERAQLKNWKDYLDFEIEKGDR-------ERVLVLFERCLIACALYDEFWLKMLRYLESLE 753
Query: 297 AKSGSIDAAIKVFQRALK-ALPDSEMLRYAFAELEESRGAIAAAKKLYESLLTDSVNTTA 355
+SG +D V++RA + PD L +A EE + A ++ + + N
Sbjct: 754 DQSGVVDLVRDVYRRACRIHHPDKPSLHLMWAAFEECQMNFDDAAEILQRIDQRCPNLLQ 813
Query: 356 LAHIQFIRFLRRTEGVEAAR---KYFLDARKSPNFTYHVYVAYA--LMAFCQDKDPKLAH 410
L++ + I RR ++ R K+++++ K+ + + YA L C D D LA
Sbjct: 814 LSY-RRINVERRRGALDKCRELYKHYIESTKNKGIAGSLAIKYARFLNKICHDLDAGLA- 871
Query: 411 NVFEAGLKRFMHEPAYILEYADF-LSRLN-DDRNIRALFER--ALSSLPPEESIEVWKRF 466
+ L+R L+ D L R D++ + + ++ A + + P++ + +R
Sbjct: 872 -ALQQALERDPANTRVALQMIDLCLQRPKVDEQEVVEIMDKFMARADIEPDQKVLFAQRK 930
Query: 467 TQFEQMYGDLDSTLKVEQRR-KEALSRTGE 495
+F + +G L+ QR ++AL++ E
Sbjct: 931 VEFLEDFGSTARGLQDAQRALQQALTKAKE 960
>sp|Q527H0|CLF1_MAGO7 Pre-mRNA-splicing factor CLF1 OS=Magnaporthe oryzae (strain 70-15 /
ATCC MYA-4617 / FGSC 8958) GN=CLF1 PE=3 SV=1
Length = 691
Score = 50.4 bits (119), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 96/219 (43%), Gaps = 22/219 (10%)
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL 347
+W Y KS +I+ A + RA+ LP + L Y + +EE G I ++++E +
Sbjct: 108 LWIRYVEAELKSRNINFARNLLDRAVTHLPRVDKLWYKYVWVEEMLGNIPGVRQVFERWM 167
Query: 348 TDSVNTTALAHIQFIRFLRRTEGVEAARKYFLD---ARKSPNFTYHVYVAYALMAFCQDK 404
+ A + FI+ +R + AR+ F P + + + D+
Sbjct: 168 EWQPDEAAWS--AFIKLEQRYGEYDRAREIFTRFTMVHPEPR-NWIKWSKFEEEYGTSDR 224
Query: 405 DPKLAHNVFEAGLKR-------FMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPE 457
VFE ++ F+ E +I YA + ++L+D RA+++ L +LP
Sbjct: 225 ----VREVFERAIEELSKYGDEFVEERLFIA-YARYEAKLHDLDRARAIYKFGLENLPRS 279
Query: 458 ESIEVWKRFTQFEQMYGDL----DSTLKVEQRRKEALSR 492
+++ + K +T FE+ YGD D L +R E L R
Sbjct: 280 KAMLLHKEYTTFEKQYGDREGVEDVVLSKRRRHYEDLVR 318
Score = 38.5 bits (88), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 52/226 (23%), Positives = 95/226 (42%), Gaps = 43/226 (19%)
Query: 287 DIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYA---------FAELEESRGAI- 336
D+W+DYA SG ID +V+++A+ +P ++ R+ FA EE+
Sbjct: 325 DVWFDYARLEEASGDIDRTREVYEKAIAQVPPTQAKRHWRRYIYLWIFFALWEETEAKNP 384
Query: 337 AAAKKLYES---LLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYV 393
A+++Y++ L+ T A + F R + AARK A
Sbjct: 385 ERARQVYDTCLKLIPHRTFTFAKVWMHKAHFEIRQGDLAAARKTLGRA------------ 432
Query: 394 AYALMAFC-QDKDPK----LAHNVFEAGLKRFMHE------PAYI---LEYADFLSRLND 439
+ C +D+ K + ++E G R ++E PA +++A+ L+D
Sbjct: 433 ----IGMCPKDRLFKGYIEMEQKLYEFGRCRILYEKHIAYNPANCSTWVKWAELERGLDD 488
Query: 440 DRNIRALFERALSSLPPEESIEVWKRFTQFEQMYGDLDSTLKVEQR 485
RA+ + ++ + VWK + FE+ G+ D T + +R
Sbjct: 489 LDRARAILDMGIAQPVLDMPEVVWKSYIDFEEEEGEYDKTRSLYER 534
>sp|Q1JPZ7|PRP39_DANRE Pre-mRNA-processing factor 39 OS=Danio rerio GN=prpf39 PE=2 SV=2
Length = 752
Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/275 (24%), Positives = 118/275 (42%), Gaps = 28/275 (10%)
Query: 241 EEQQWIAWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSG 300
E+ Q W+ L FE N + +R++ +E+CL+ Y + W YA + +S
Sbjct: 424 EKTQLNNWREYLDFELEN-------GTPERVVVLFERCLIACALYEEFWIKYAKY-LESY 475
Query: 301 SIDAAIKVFQRALKA-LPDSEMLRYAFAELEESRGAIAAAKKLYESLLTDSVNTTALAHI 359
S +A ++++A LP + +A EE +G+I A+ + +++ SV A+ +
Sbjct: 476 STEAVRHIYKKACTVHLPKKPNVHLLWAAFEEQQGSIDEARSILKAVEV-SVPGLAMVRL 534
Query: 360 QFIRFLRRTEGVEAARKYFLDA-----RKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFE 414
+ + RR +E A DA S + Y V +A L+ K A V
Sbjct: 535 RRVSLERRHGNMEEAEALLQDAITNGRNSSESSFYSVKLARQLVKV--QKSIGRAKKVLL 592
Query: 415 AGLKRFMHEPAYILEYADFLSRLNDDRN---IRALFERAL-SSLPPEESIEVWKRFTQFE 470
+++ P L + + +N I A F+RAL SS+ E I +R F
Sbjct: 593 EAVEKDETNPKLYLNLLELEYSGDVQQNEAEIIACFDRALSSSMALESRITFSQRKVDFL 652
Query: 471 QMYGDLDSTLKV--EQRRK-----EALSRTGEEGA 498
+ +G +TL EQ ++ E+ R E G+
Sbjct: 653 EDFGSDINTLMAAYEQHQRLLAEQESFKRKAENGS 687
Score = 47.8 bits (112), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 69/335 (20%), Positives = 134/335 (40%), Gaps = 41/335 (12%)
Query: 62 FWKQYVEAYMAVNNDDATKQLFSRCL-LICLQVPLWRCYIRFIRKVYEKKGTEGQEETRK 120
+WK+Y + +++ R L I L V LW YI F+R+ + E + R
Sbjct: 202 YWKKYADIERKHGYIQMADEVYRRGLQAIPLSVDLWLHYITFLRENQDTSDGEAESRIRA 261
Query: 121 AFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTHHVE 180
+++ + G+D S +W YI + + E ++ + Y R + PT
Sbjct: 262 SYEHAVLACGTDFRSDRLWEAYIAW---------ETEQGKLANVTAIYDRLLCIPT---- 308
Query: 181 QLW-KDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERKKYCEEIDWNMLA----VPP 235
QL+ + ++ F++ V K LSE ++ S R K + D A +PP
Sbjct: 309 QLYSQHFQKFKDHVQSNNPKHFLSE--EEFVSLRVELANANKPSGDEDAETEAPGEELPP 366
Query: 236 TGSYKEEQQWIAWKRLLTFEKGNPQRIDTA----SSNKRII---FTYEQCLMYLYHYP-- 286
E KR+ E + I+T + N+ + + +E+ + Y +
Sbjct: 367 GT----EDLPDPAKRVTEIENMRHKVIETRQEMFNHNEHEVSKRWAFEEGIKRPYFHVKA 422
Query: 287 ------DIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAK 340
+ W +Y + ++G+ + + +F+R L A E +A+ ES + A +
Sbjct: 423 LEKTQLNNWREYLDFELENGTPERVVVLFERCLIACALYEEFWIKYAKYLESY-STEAVR 481
Query: 341 KLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAAR 375
+Y+ T + H+ + F + ++ AR
Sbjct: 482 HIYKKACTVHLPKKPNVHLLWAAFEEQQGSIDEAR 516
>sp|Q5K654|CLF1_PARBR Pre-mRNA-splicing factor CLF1 OS=Paracoccidioides brasiliensis
GN=CLF1 PE=3 SV=1
Length = 677
Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 110/491 (22%), Positives = 190/491 (38%), Gaps = 94/491 (19%)
Query: 40 QAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQV-PLWRC 98
+A ++E+ L V PTAV W +Y+EA M N + + L R + I +V LW
Sbjct: 90 RARSVFERALDVDPTAVVL----WIRYIEAEMKTRNINHARNLLDRAVTIYSRVDKLWYK 145
Query: 99 YIRFIRKVYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEES 158
Y VY ++ TR+ F+ +S + + G YI K +
Sbjct: 146 Y------VYMEEMLGNIPGTRQVFERWMSWEPDEGAWGA----YIKLEKRYNEFDR---- 191
Query: 159 QRMIAIRKAYQRAVVTPTHHVEQLWKDYENFENSVS-------------RQLAKGLLSE- 204
+R ++R T H + W + FE L + + E
Sbjct: 192 -----VRAIFER--FTVVHPEPKNWIKWARFEEEYGTSDMVREVYGLAIETLGEDFMDEK 244
Query: 205 -------YQSK---YTSARAVYRERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTF 254
Y++K + ARA+Y KY + +P S K TF
Sbjct: 245 LFIAYARYEAKLKEFERARAIY----KYA------LDRLPRAKSVALH------KAYTTF 288
Query: 255 EK--GNPQRIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRA 312
EK G+ + ++ +KR + EQ +Y DIW+D+ SG ++ V++RA
Sbjct: 289 EKQFGDREGVEDVILSKRRVQYEEQIKENPKNY-DIWFDFVRLEESSGDVERVRDVYERA 347
Query: 313 LKALPDSEMLR-----------YAFAELEESRGAIAAAKKLYES---LLTDSVNTTALAH 358
+ +P S+ R YA E E++ + A ++Y+ L+ T A
Sbjct: 348 IAQMPPSQEKRHWRRYIYLWIFYALWEELEAKD-MERAHQIYQECIRLIPHKKFTFAKIW 406
Query: 359 IQFIRFLRRTEGVEAARKYFLDA----RKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFE 414
+ +F R ++AARK A K F ++ + L F + +FE
Sbjct: 407 LMKAQFEIRQMDLQAARKTLGHAIGACPKDKLFKGYIDLERQLFEFVR------CRKLFE 460
Query: 415 AGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIEVWKRFTQFEQMYG 474
++ +++A+ L+D RA++E +S + +WK + FE+ G
Sbjct: 461 KQIEWSPSNCQAWIKFAELERGLDDIDRARAIYELGISQPVLDMPELLWKSYIDFEEYEG 520
Query: 475 DLDSTLKVEQR 485
+ D T + +R
Sbjct: 521 EYDRTRALYER 531
>sp|Q6C186|CLF1_YARLI Pre-mRNA-splicing factor CLF1 OS=Yarrowia lipolytica (strain CLIB
122 / E 150) GN=CLF1 PE=3 SV=1
Length = 676
Score = 46.6 bits (109), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 89/199 (44%), Gaps = 18/199 (9%)
Query: 287 DIWYDYATWNAKSG-SIDAAIKVFQRALKALPDSE----------MLRYAFAELEESRGA 335
D W+ Y T +SG D ++F+RA+ +P ++YA E E++
Sbjct: 322 DTWFSYITLGQESGLEADQIREIFERAVSNVPPHSKRLWRRYIFLWIKYAIWEELENK-E 380
Query: 336 IAAAKKLYE---SLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVY 392
+ A+++Y+ S++ T A + + +F R + ARK +Y
Sbjct: 381 VEKAREIYKTCISIIPHKKFTFAKVWLLWAKFEIRHGNLPEARKILGRGLGMSGGKPALY 440
Query: 393 VAY-ALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERAL 451
Y AL A ++ D +++ +++F A +EYA+ L D+ RA+FE A+
Sbjct: 441 KGYIALEAKLREFDR--CRKLYDKYVEKFAEFAAPWMEYAELEQMLGDEERARAIFELAV 498
Query: 452 SSLPPEESIEVWKRFTQFE 470
S E VWKRF +FE
Sbjct: 499 SQPEMEMPELVWKRFIEFE 517
Score = 42.4 bits (98), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 46/204 (22%), Positives = 84/204 (41%), Gaps = 22/204 (10%)
Query: 284 HYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLY 343
H P W Y K +I+ A + RA+ LP + L + + EE+ G IA + ++
Sbjct: 99 HVP-TWIRYIQCELKEKNINHARNLLDRAVTLLPRVDKLWFTYVATEETLGNIAGCRAVF 157
Query: 344 ESLLTDSVNTTALAHI----QFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMA 399
E + TA A + R R G+ R+Y +P ++ + + + A
Sbjct: 158 ERWMHWRPPVTAWAAYVNMEKRYREFDRARGI--LRRYVTVHPGAP--AWNKWAKFEMEA 213
Query: 400 FCQDKDPKLAHNVFEAGLKRFMH---------EPAYILEYADFLSRLNDDRNIRALFERA 450
+D V+ G+ + + + + +A F +R + RAL+
Sbjct: 214 GNRD----TVREVYALGIDTLVEMAHGGVDFLDESLLAGWASFETRHREYERARALYTYG 269
Query: 451 LSSLPPEESIEVWKRFTQFEQMYG 474
L LP +S +++ +T FE+ YG
Sbjct: 270 LEKLPKSKSAKLYADYTAFEKQYG 293
>sp|Q54XP4|CRNL1_DICDI Crooked neck-like protein 1 OS=Dictyostelium discoideum GN=crnkl1
PE=3 SV=1
Length = 705
Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/210 (20%), Positives = 92/210 (43%), Gaps = 17/210 (8%)
Query: 274 TYEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESR 333
+E+ L + P +W YA K+ +I+ A ++ RA+ LP L + + +E+
Sbjct: 97 VFERFLDIDHRIPTVWIKYAEMEMKNKNINLARNIWDRAVCLLPRVSQLWFKYTFMEDML 156
Query: 334 GAIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYV 393
G AA+ ++E + A +++F +R + E R F H Y+
Sbjct: 157 GNYPAARAIFERWMQWKPEPQAWN--SYLKFEQRLKLFENTRLIF-----EKYILVHPYI 209
Query: 394 AYALMAFCQDKDP----KLAHNVFEAGLKRFMHEPA----YILEYADFLSRLNDDRNIRA 445
+ + + ++ + A +F+ ++ F+ E + +A F + + R
Sbjct: 210 K-TWIKYTKFEERLGNIENARTIFQRAIE-FLGEDGNDEQLFIAFAKFEEKYKEIERARV 267
Query: 446 LFERALSSLPPEESIEVWKRFTQFEQMYGD 475
+++ A+ +P + +++ FT FE+ +GD
Sbjct: 268 IYKYAIDHVPKSRAKDLFDTFTNFEKQHGD 297
Score = 39.7 bits (91), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 9/172 (5%)
Query: 326 FAELEESRGAIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSP 385
+A EES+ + A+ ++E L D + I++ + + + AR + A
Sbjct: 81 YAAWEESQKDLTRARSVFERFL-DIDHRIPTVWIKYAEMEMKNKNINLARNIWDRAVCLL 139
Query: 386 NFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRA 445
++ Y M P A +FE ++ + EP Y F RL N R
Sbjct: 140 PRVSQLWFKYTFMEDMLGNYPA-ARAIFERWMQ-WKPEPQAWNSYLKFEQRLKLFENTRL 197
Query: 446 LFERALSSLPPEESIEVWKRFTQFEQMYGDLDSTLKVEQRRKEALSRTGEEG 497
+FE+ + P I+ W ++T+FE+ G++++ + QR A+ GE+G
Sbjct: 198 IFEKYILVHP---YIKTWIKYTKFEERLGNIENARTIFQR---AIEFLGEDG 243
Score = 36.2 bits (82), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 64/283 (22%), Positives = 115/283 (40%), Gaps = 61/283 (21%)
Query: 41 AAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYI 100
A I+++ + + P +++ W +Y + N A + +F R + + W Y+
Sbjct: 128 ARNIWDRAVCLLPR----VSQLWFKYTFMEDMLGNYPAARAIFERWMQWKPEPQAWNSYL 183
Query: 101 RFIRKVYEKKGTEGQEETRKAFD-FMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQ 159
+F +++ + E TR F+ ++L H W++Y F + L + E+
Sbjct: 184 KFEQRL------KLFENTRLIFEKYILVH-----PYIKTWIKYTKFEERLGNI----ENA 228
Query: 160 RMIAIRKAYQRAV--VTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYR 217
R I +QRA+ + + EQL+ + FE E + AR +Y
Sbjct: 229 RTI-----FQRAIEFLGEDGNDEQLFIAFAKFE-------------EKYKEIERARVIY- 269
Query: 218 ERKKYCEEIDWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGNPQRI---DTASSNKRIIFT 274
KY ID VP + + + FEK + RI D KR F
Sbjct: 270 ---KYA--ID----HVPKSRAKD------LFDTFTNFEKQHGDRIGIEDVVLGKKR--FQ 312
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALP 317
YE+ + DIW+DY +G I+ ++++R++ LP
Sbjct: 313 YEEEIKKNSKNYDIWFDYLKMEEINGEIEKTREIYERSIGNLP 355
>sp|Q14690|RRP5_HUMAN Protein RRP5 homolog OS=Homo sapiens GN=PDCD11 PE=1 SV=3
Length = 1871
Score = 45.1 bits (105), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 78/193 (40%), Gaps = 42/193 (21%)
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALP---DSEMLRY--AFAELEESRGAIAAAKKL 342
+W Y ++ ++ I+ A V +RALK + + E L A LE G+ + K+
Sbjct: 1621 LWLQYMAFHLQATEIEKARAVAERALKTISFREEQEKLNVWVALLNLENMYGSQESLTKV 1680
Query: 343 YESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQ 402
+E R ++ E + K FL H+ YA Q
Sbjct: 1681 FE------------------RAVQYNEPL----KVFL----------HLADIYAKSEKFQ 1708
Query: 403 DKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIEV 462
+ A ++ LKRF E A ++Y FL R + + +RAL LP +E ++V
Sbjct: 1709 E-----AGELYNRMLKRFRQEKAVWIKYGAFLLRRSQAAASHRVLQRALECLPSKEHVDV 1763
Query: 463 WKRFTQFEQMYGD 475
+F Q E GD
Sbjct: 1764 IAKFAQLEFQLGD 1776
Score = 39.3 bits (90), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 92/220 (41%), Gaps = 10/220 (4%)
Query: 287 DIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESL 346
++W GS ++ KVF+RA++ ++ + A++ A +LY +
Sbjct: 1659 NVWVALLNLENMYGSQESLTKVFERAVQYNEPLKVFLH-LADIYAKSEKFQEAGELYNRM 1717
Query: 347 LTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYV--AYALMAFCQDK 404
L A+ I++ FL R A+ + A + HV V +A + F Q
Sbjct: 1718 LKRFRQEKAVW-IKYGAFLLRRSQAAASHRVLQRALECLPSKEHVDVIAKFAQLEF-QLG 1775
Query: 405 DPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALS-SLPPEESIEVW 463
D + A +FE L + Y D + +++R +FER + SL P+ +
Sbjct: 1776 DAERAKAIFENTLSTYPKRTDVWSVYIDMTIKHGSQKDVRDIFERVIHLSLAPKRMKFFF 1835
Query: 464 KRFTQFEQMYGDLDSTLKVEQRRKEALSRTGEEGASALED 503
KR+ +E+ +G V+ + E + E +S LED
Sbjct: 1836 KRYLDYEKQHGTEKDVQAVKAKALEYV----EAKSSVLED 1871
Score = 37.0 bits (84), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 2/76 (2%)
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRY--AFAELEES 332
Y + L +W Y + + A+ +V QRAL+ LP E + FA+LE
Sbjct: 1714 YNRMLKRFRQEKAVWIKYGAFLLRRSQAAASHRVLQRALECLPSKEHVDVIAKFAQLEFQ 1773
Query: 333 RGAIAAAKKLYESLLT 348
G AK ++E+ L+
Sbjct: 1774 LGDAERAKAIFENTLS 1789
Score = 33.9 bits (76), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 30/61 (49%), Gaps = 3/61 (4%)
Query: 428 LEYADFLSRLNDDRNIRALFERALSSLP---PEESIEVWKRFTQFEQMYGDLDSTLKVEQ 484
L+Y F + + RA+ ERAL ++ +E + VW E MYG +S KV +
Sbjct: 1623 LQYMAFHLQATEIEKARAVAERALKTISFREEQEKLNVWVALLNLENMYGSQESLTKVFE 1682
Query: 485 R 485
R
Sbjct: 1683 R 1683
>sp|A7MB10|RRP5_BOVIN Protein RRP5 homolog OS=Bos taurus GN=PDCD11 PE=2 SV=1
Length = 1874
Score = 43.9 bits (102), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 7/90 (7%)
Query: 409 AHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIEVWKRFTQ 468
A ++ LKRF E A ++Y FL R + +RAL LP +E ++V +F Q
Sbjct: 1713 AGELYNRMLKRFRQEKAVWVKYGAFLLRRGKAEASHRVMQRALECLPKKEHVDVIAKFAQ 1772
Query: 469 FEQMYGD-------LDSTLKVEQRRKEALS 491
E GD +STL + +R + S
Sbjct: 1773 LEFQLGDAERARAIFESTLSIYPKRTDVWS 1802
Score = 42.0 bits (97), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 45/191 (23%), Positives = 83/191 (43%), Gaps = 6/191 (3%)
Query: 287 DIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESL 346
++W GS ++ KVF+RA++ ++ + A++ A +LY +
Sbjct: 1662 NVWVALLNLENMYGSQESLTKVFERAVQYNEPLKVFLH-LADIYTKSEKFQEAGELYNRM 1720
Query: 347 LTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYV--AYALMAFCQDK 404
L A+ +++ FL R EA+ + A + HV V +A + F Q
Sbjct: 1721 LKRFRQEKAVW-VKYGAFLLRRGKAEASHRVMQRALECLPKKEHVDVIAKFAQLEF-QLG 1778
Query: 405 DPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALS-SLPPEESIEVW 463
D + A +FE+ L + Y D + + + RA+FER + SL P+ +
Sbjct: 1779 DAERARAIFESTLSIYPKRTDVWSVYIDMIIKHGSQKEARAIFERVIHLSLAPKRMKFFF 1838
Query: 464 KRFTQFEQMYG 474
KR+ +E+ +G
Sbjct: 1839 KRYLDYEKQHG 1849
Score = 41.2 bits (95), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 2/76 (2%)
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRY--AFAELEES 332
Y + L +W Y + + G +A+ +V QRAL+ LP E + FA+LE
Sbjct: 1717 YNRMLKRFRQEKAVWVKYGAFLLRRGKAEASHRVMQRALECLPKKEHVDVIAKFAQLEFQ 1776
Query: 333 RGAIAAAKKLYESLLT 348
G A+ ++ES L+
Sbjct: 1777 LGDAERARAIFESTLS 1792
>sp|Q7SGD2|CLF1_NEUCR Pre-mRNA-splicing factor clf-1 OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=clf-1 PE=3 SV=1
Length = 695
Score = 43.5 bits (101), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 99/446 (22%), Positives = 174/446 (39%), Gaps = 78/446 (17%)
Query: 48 LLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVPLWRCYIRFIRKVY 107
LL T + + W QY+ + + T+Q+F R + W YIR ++
Sbjct: 128 LLDRAVTRLPRVTSLWYQYLYVMEMLGDIPGTRQVFDRWMKWQPDEQAWSAYIRLEKRYG 187
Query: 108 EKKGTEGQEETRKAFDFMLSHVGSDISSGP-IWLEYITFLKSLPALNAQEESQRMIAIRK 166
E + E +AF + + P WL++ F +EE +R+
Sbjct: 188 E---FDRAREIFRAF--------TAVHPEPRTWLKWAKF---------EEEYGTSDTVRE 227
Query: 167 AYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERKKYCEEI 226
+Q A+ T E L D + ++ + L EY+ ARA+Y+
Sbjct: 228 VFQTAIQTI---AETLGDDAVDERIFIAFARYEARLREYER----ARAIYK--------- 271
Query: 227 DWNMLAVPPTGSYKEEQQWIAWKRLLTFEK--GNPQRIDTASSNKRIIFTYEQCLMYLYH 284
+ + +P + S + TFEK G+ + ++ KR EQ +
Sbjct: 272 -FGLDNLPRSKSMTLHAHYT------TFEKQFGDKEGVEDVILTKRRRLYEEQVKENAKN 324
Query: 285 YPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSE-----------MLRYAFAELEESR 333
Y D+W+D+A G +D +V++RA+ +P ++ L YA E E++
Sbjct: 325 Y-DVWFDFARLEESGGDVDRTREVYERAIAQVPPTQEKRHWRRYIFLFLFYAIWEERETK 383
Query: 334 GAIAAAKKLYES---LLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDAR----KSPN 386
I A+++Y++ L+ T A + F R + ARK A K
Sbjct: 384 D-IGRARQIYDTCLNLIPHKKFTFAKVWVAKAHFEIRQGQLTTARKTLGRAIGMCPKDKI 442
Query: 387 FTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYI---LEYADFLSRLNDDRNI 443
F ++ + L F + ++E K M+ PA +++A+ L+D
Sbjct: 443 FKEYILLEQKLYEF------ERCRTLYE---KHVMYNPANCQTWIKWAELERGLDDLERT 493
Query: 444 RALFERALSSLPPEESIEVWKRFTQF 469
RA+FE A+S + VWK + F
Sbjct: 494 RAIFELAVSQPILDMPEVVWKAYIDF 519
Score = 37.4 bits (85), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 40/197 (20%), Positives = 82/197 (41%), Gaps = 15/197 (7%)
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL 347
+W Y K+ +I+ A + RA+ LP L Y + + E G I +++++ +
Sbjct: 108 LWIRYVQAEIKNRNINHARNLLDRAVTRLPRVTSLWYQYLYVMEMLGDIPGTRQVFDRWM 167
Query: 348 TDSVNTTALAHIQFIRFLRRTEGVEAARKYF--LDARKSPNFTYHVYVAYALMAFCQDKD 405
+ A + +IR +R + AR+ F A T+ + + D
Sbjct: 168 KWQPDEQAWS--AYIRLEKRYGEFDRAREIFRAFTAVHPEPRTWLKWAKFEEEYGTSD-- 223
Query: 406 PKLAHNVFEAGLKRFMH-------EPAYILEYADFLSRLNDDRNIRALFERALSSLPPEE 458
VF+ ++ + + +A + +RL + RA+++ L +LP +
Sbjct: 224 --TVREVFQTAIQTIAETLGDDAVDERIFIAFARYEARLREYERARAIYKFGLDNLPRSK 281
Query: 459 SIEVWKRFTQFEQMYGD 475
S+ + +T FE+ +GD
Sbjct: 282 SMTLHAHYTTFEKQFGD 298
Score = 33.9 bits (76), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 44/215 (20%), Positives = 89/215 (41%), Gaps = 23/215 (10%)
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLYESLL 347
+WY Y G I +VF R +K PD + A+ LE+ G A++++ +
Sbjct: 142 LWYQYLYVMEMLGDIPGTRQVFDRWMKWQPDEQAWS-AYIRLEKRYGEFDRAREIFRAFT 200
Query: 348 TDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTY-------HVYVAYALMAF 400
+V+ +++ +F + R+ F A ++ T +++A+A
Sbjct: 201 --AVHPEPRTWLKWAKFEEEYGTSDTVREVFQTAIQTIAETLGDDAVDERIFIAFARYE- 257
Query: 401 CQDKDPKLAHNVFEAGLKRFMHEPAYILE--YADFLSRLNDDRNI--------RALFERA 450
+ ++ + A +++ GL + L Y F + D + R L+E
Sbjct: 258 ARLREYERARAIYKFGLDNLPRSKSMTLHAHYTTFEKQFGDKEGVEDVILTKRRRLYEEQ 317
Query: 451 LSSLPPEESIEVWKRFTQFEQMYGDLDSTLKVEQR 485
+ ++ +VW F + E+ GD+D T +V +R
Sbjct: 318 VKE--NAKNYDVWFDFARLEESGGDVDRTREVYER 350
Score = 33.1 bits (74), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 36/161 (22%), Positives = 73/161 (45%), Gaps = 13/161 (8%)
Query: 326 FAELEESRGAIAAAKKLYESLLTDSVNTTALAH-IQFIRFLRRTEGVEAARKYF---LDA 381
FA+LEE + +K +E + N L++ +Q+ ++ + AR F LD
Sbjct: 44 FADLEELKEYQGRKRKEFEDYV--RRNRVRLSNWLQYAQWELEQKEFARARSVFERALDV 101
Query: 382 RKSPNFTYHVYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDR 441
PN T +++ Y + A ++++ A N+ + + R + +Y + L D
Sbjct: 102 --HPNNT-QLWIRY-VQAEIKNRNINHARNLLDRAVTRLPRVTSLWYQYLYVMEMLGDIP 157
Query: 442 NIRALFERALSSLPPEESIEVWKRFTQFEQMYGDLDSTLKV 482
R +F+R + P E++ W + + E+ YG+ D ++
Sbjct: 158 GTRQVFDRWMKWQPDEQA---WSAYIRLEKRYGEFDRAREI 195
>sp|Q54Z08|SYF1_DICDI Pre-mRNA-splicing factor SYF1 OS=Dictyostelium discoideum GN=xab2
PE=3 SV=1
Length = 850
Score = 43.5 bits (101), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/223 (24%), Positives = 89/223 (39%), Gaps = 50/223 (22%)
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRAL-----------------KALPDSEMLRYAFAELE 330
++ DYA K + + AI++ +R K L S + + +LE
Sbjct: 448 LYCDYAEMELKHRNYEKAIEILKRGTVSPKKQNTIIEENEPVQKRLFKSIKIWTFYVDLE 507
Query: 331 ESRGAIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYH 390
ES G K +YE ++ V T + + F ++L KYF D K+ Y
Sbjct: 508 ESFGTFHNTKSIYEKMIQLKVVTPQII-LNFAKYLEEN-------KYFEDMFKA----YE 555
Query: 391 VYVAYALMAFCQDKDPKLAHNVFEAGLKRFMHEPAYILEYADF-LSRLNDDRNIRALFER 449
V L QD ++ L +F+ YA L R D LFE+
Sbjct: 556 HGVQLFLFPHVQD--------IWITYLTKFIQR------YAGMKLERTRD------LFEQ 595
Query: 450 ALSSLPPEESIEVWKRFTQFEQMYGDLDSTLKVEQRRKEALSR 492
LS +PP+ESI + + FE+ YG ++ V R +++ +
Sbjct: 596 VLSKVPPKESIIFYLMYANFEEQYGLARHSMAVYDRAAKSVDK 638
Score = 33.9 bits (76), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 28/61 (45%)
Query: 409 AHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRALFERALSSLPPEESIEVWKRFTQ 468
+ +FE L P +EY +FL R F+RAL +LP + +W +T+
Sbjct: 105 VNTLFERSLVFLDKMPRIWIEYCEFLMIQEKITLTRKTFDRALIALPVTQHYRIWNEYTK 164
Query: 469 F 469
F
Sbjct: 165 F 165
>sp|Q05022|RRP5_YEAST rRNA biogenesis protein RRP5 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=RRP5 PE=1 SV=1
Length = 1729
Score = 43.1 bits (100), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 409 AHNVFEAGLKRFMHEPAYI-LEYADFLSRLNDDRNIRALFERALSSLPPEESIEVWKRFT 467
A +F+A K+F E I + + DFL N+++ R + AL +LP IEV ++F
Sbjct: 1566 AAELFKATAKKFGGEKVSIWVSWGDFLISHNEEQEARTILGNALKALPKRNHIEVVRKFA 1625
Query: 468 QFEQMYGD 475
Q E GD
Sbjct: 1626 QLEFAKGD 1633
>sp|O74835|RRP5_SCHPO rRNA biogenesis protein rrp5 OS=Schizosaccharomyces pombe (strain 972
/ ATCC 24843) GN=rrp5 PE=1 SV=1
Length = 1690
Score = 42.4 bits (98), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 38/70 (54%), Gaps = 4/70 (5%)
Query: 408 LAHNVFEAGLKRFMHEPAYILEYADFLSRLNDDRNIRA--LFERALSSLPPEESIEVWKR 465
LA + LK F P+ ++YA FL LN+D+ +A L ER+L SLP E + + ++
Sbjct: 1530 LADEYMQLMLKNFKQVPSVWIQYATFL--LNNDKAEKAHGLLERSLQSLPKSEHVGIIEK 1587
Query: 466 FTQFEQMYGD 475
F E GD
Sbjct: 1588 FAILEFKNGD 1597
Score = 36.6 bits (83), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 3/77 (3%)
Query: 275 YEQCLMYLY-HYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRY--AFAELEE 331
Y Q ++ + P +W YAT+ + + A + +R+L++LP SE + FA LE
Sbjct: 1534 YMQLMLKNFKQVPSVWIQYATFLLNNDKAEKAHGLLERSLQSLPKSEHVGIIEKFAILEF 1593
Query: 332 SRGAIAAAKKLYESLLT 348
G + ++E LL+
Sbjct: 1594 KNGDPERGRTIFEGLLS 1610
>sp|Q4PB37|CLF1_USTMA Pre-mRNA-splicing factor CLF1 OS=Ustilago maydis (strain 521 / FGSC
9021) GN=CLF1 PE=3 SV=1
Length = 781
Score = 42.4 bits (98), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 49/218 (22%), Positives = 95/218 (43%), Gaps = 23/218 (10%)
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRG 334
YE+ L H+ +W Y K ++ A ++ RA+ LP + L Y + LEE G
Sbjct: 93 YERALDVEPHHLPLWLRYTEQELKMRNVQHARNLYDRAVSILPRIDQLWYKYVHLEELLG 152
Query: 335 AIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYH---- 390
I ++++E + A +I R + ++ A + T H
Sbjct: 153 NIPGTRQVFERWMKWEPEEKAWH--AYINLEVRYDELDRASAIW-----ERCVTCHPVPK 205
Query: 391 VYVAYALMAFCQDK-DPKLAHNVFEAGLKRFMHEPAYILE--------YADFLSRLNDDR 441
++ +A F +D+ + + A VF+ L ++ E +E +A +RL +
Sbjct: 206 QWIRWA--KFEEDRGNLEKARIVFQMALD-YIGEDEDAMEKAQSVFTAFAKMETRLKEYE 262
Query: 442 NIRALFERALSSLPPEESIEVWKRFTQFEQMYGDLDST 479
R +++ AL LP +S ++ +T+FE+ +G ++S
Sbjct: 263 RARVIYKYALERLPRSKSEGIYSSYTRFEKQFGTMNSV 300
Score = 37.7 bits (86), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 92/407 (22%), Positives = 151/407 (37%), Gaps = 108/407 (26%)
Query: 38 VAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCL---------- 87
V A +Y++ +S+ P I + W +YV + N T+Q+F R +
Sbjct: 120 VQHARNLYDRAVSILPR----IDQLWYKYVHLEELLGNIPGTRQVFERWMKWEPEEKAWH 175
Query: 88 -LICLQV---------PLW-RC---------YIRFIRKVYEKKGTEGQEETRKAFDFMLS 127
I L+V +W RC +IR+ K E +G E+ R F L
Sbjct: 176 AYINLEVRYDELDRASAIWERCVTCHPVPKQWIRWA-KFEEDRGN--LEKARIVFQMALD 232
Query: 128 HVGSD----------------------------------------ISSGPIWLEYITFLK 147
++G D S I+ Y F K
Sbjct: 233 YIGEDEDAMEKAQSVFTAFAKMETRLKEYERARVIYKYALERLPRSKSEGIYSSYTRFEK 292
Query: 148 SLPALNAQEES---QRMIAIRK--AYQRAVVTPTHHVEQLWKDYENFENSVSRQLAKGLL 202
+N+ E++ +R I + A Q A P + W DY E R LL
Sbjct: 293 QFGTMNSVEDTVIGKRRIQYEEELAAQEAGGAPADY--DTWFDYSRLEEDAYR----ALL 346
Query: 203 SEYQSKYTSARAVYRERKKYCEEIDWNMLAVPPTGSYKEEQQWI-AWKRLLTFEKGNPQR 261
+ S+ +AV R R+ Y I VP + ++ +++I W R FE+
Sbjct: 347 ATGGSQDQLQQAVKRVREVYERAI----AQVPSSQEKRDWRRYIFLWLRYALFEE----- 397
Query: 262 IDTASSNK-RIIFTYEQCLM--YLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPD 318
IDT ++ R I+ L+ + + +W YA + + + AA K+ A+ P
Sbjct: 398 IDTRDYDRTREIYKAAIALVPHRRFTFAKLWVQYARFEVRRLELTAARKILGAAIGMAPK 457
Query: 319 SEMLRYAFAELEESRGAIAAAKKLYESLLT-DSVNTTALAHIQFIRF 364
++ ++ ELE S A+K+YE L D N+ ++RF
Sbjct: 458 LKLFS-SYIELEVSLKEFDRARKIYEKALEWDPTNSQT-----WVRF 498
>sp|Q6NS46|RRP5_MOUSE Protein RRP5 homolog OS=Mus musculus GN=Pdcd11 PE=2 SV=2
Length = 1862
Score = 40.4 bits (93), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 45/196 (22%), Positives = 76/196 (38%), Gaps = 44/196 (22%)
Query: 288 IWYDYATWNAKSGSIDAAIKVFQRALKALP---DSEMLRY--AFAELEESRGAIAAAKKL 342
+W Y ++ ++ I+ A V +RALK + + E L A LE G+ + K+
Sbjct: 1612 LWLQYMAFHLQATEIEKARAVAERALKTISFREEQEKLNVWVALLNLENMYGSQESLTKV 1671
Query: 343 YESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARKSPNFTYHVYVAYALMAFCQ 402
+E + + H+ I + +
Sbjct: 1672 FERAVQYNEPLKVFLHLADI-------------------------------------YTK 1694
Query: 403 DKDPKLAHNVFEAGLKRFMHEPAYILEYADF-LSRLNDDRNIRALFERALSSLPPEESIE 461
+ K A ++ LKRF E A ++Y F L R + R L +RAL LP +E ++
Sbjct: 1695 SEKYKEAGELYNRMLKRFRQEKAVWIKYGAFVLGRSQAGASHRVL-QRALECLPAKEHVD 1753
Query: 462 VWKRFTQFEQMYGDLD 477
V +F Q E GD++
Sbjct: 1754 VIVKFAQLEFQLGDVE 1769
Score = 37.4 bits (85), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 2/76 (2%)
Query: 275 YEQCLMYLYHYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLR--YAFAELEES 332
Y + L +W Y + A+ +V QRAL+ LP E + FA+LE
Sbjct: 1705 YNRMLKRFRQEKAVWIKYGAFVLGRSQAGASHRVLQRALECLPAKEHVDVIVKFAQLEFQ 1764
Query: 333 RGAIAAAKKLYESLLT 348
G + AK ++E+ L+
Sbjct: 1765 LGDVERAKAIFENTLS 1780
Score = 34.3 bits (77), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 20/61 (32%), Positives = 30/61 (49%), Gaps = 3/61 (4%)
Query: 428 LEYADFLSRLNDDRNIRALFERALSSLP---PEESIEVWKRFTQFEQMYGDLDSTLKVEQ 484
L+Y F + + RA+ ERAL ++ +E + VW E MYG +S KV +
Sbjct: 1614 LQYMAFHLQATEIEKARAVAERALKTISFREEQEKLNVWVALLNLENMYGSQESLTKVFE 1673
Query: 485 R 485
R
Sbjct: 1674 R 1674
>sp|P39682|PRP39_YEAST Pre-mRNA-processing factor 39 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=PRP39 PE=1 SV=1
Length = 629
Score = 39.3 bits (90), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 42/235 (17%), Positives = 97/235 (41%), Gaps = 25/235 (10%)
Query: 119 RKAFDFMLSHVGSDISSGPIWLEYITFLKSLPALNAQEESQRMIAIRKAYQRAVVTPTHH 178
R F+ +G S P W ++I F + + +++ Y+ + P H
Sbjct: 144 RNNFEIAKDLIGKQFLSHPFWDKFIEF---------EVGQKNWHNVQRIYEYIIEVPLHQ 194
Query: 179 VEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERKKYCEEIDWNMLAVPPTGS 238
+ + Y+ F N + + + + + T+ +++ K + +N+ V
Sbjct: 195 YARFFTSYKKFLNEKNLKTTRNIDIVLRKTQTTVNEIWQFESKIKQPF-FNLGQV----- 248
Query: 239 YKEEQQWIAWKRLLTFEKGNPQRIDTASSNKRIIFTYEQCLMYLYHYPDIWYDYATW-NA 297
++ + W R L F + +D + ++ +++CL+ ++ + W Y W
Sbjct: 249 LNDDLE--NWSRYLKFVTDPSKSLD----KEFVMSVFDRCLIPCLYHENTWMMYIKWLTK 302
Query: 298 KSGSIDAAIKVFQRALKALP-DSEMLRYAFAELEESRGAIAAAKKLYESLLTDSV 351
K+ S + + ++Q+A LP D + LRY F L + + L+ ++ ++V
Sbjct: 303 KNISDEVVVDIYQKANTFLPLDFKTLRYDF--LRFLKRKYRSNNTLFNNIFNETV 355
>sp|Q6BSP7|CLF1_DEBHA Pre-mRNA-splicing factor CLF1 OS=Debaryomyces hansenii (strain ATCC
36239 / CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968)
GN=CLF1 PE=3 SV=2
Length = 714
Score = 39.3 bits (90), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 44/213 (20%), Positives = 93/213 (43%), Gaps = 21/213 (9%)
Query: 284 HYPDIWYDYATWNAKSGSIDAAIKVFQRALKALPDSEMLRYAFAELEESRGAIAAAKKLY 343
H P W Y + +I A + RA+ LP + L + + + EE+ + ++
Sbjct: 102 HIP-FWTHYIQFELSHKNITHARNLLDRAVTTLPRVDKLWFLYVQTEETLKNYQMVRIIF 160
Query: 344 ESLLTDSVNTTALAHIQFIRFLRRTEGVEAARKYFLDARK--SPNFTYHVYVAYALMAFC 401
E L+ + N +A +I + +R + + AR+ ++ + S + ++ + +
Sbjct: 161 ERWLSWNPNPSAWD--AYINYEKRYDEYDNAREIYIRYVQIHSSGEIWLKWIDFEMNDVP 218
Query: 402 QD-KDPKLAHNVFEAGLKRFMHEPAY---------ILEYADFLSRLNDDRNIRALFERAL 451
D + K NVFE + + A I +++ + + + RA+F+ L
Sbjct: 219 IDPEQVKRIRNVFELSVDSMLASEALRGDISLAEIINKWSLWEISVKEYERARAIFQLML 278
Query: 452 SS------LPPEESIEVWKRFTQFEQMYGDLDS 478
S + PE+ +++ +T+FE+ YGD D+
Sbjct: 279 KSDTIQEIITPEQRNQIYSSYTEFEKSYGDKDT 311
Score = 38.9 bits (89), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 77/346 (22%), Positives = 127/346 (36%), Gaps = 92/346 (26%)
Query: 35 HLPVAQAAPIYEQLLSVFPTAVSFIAKFWKQYVEAYMAVNNDDATKQLFSRCLLICLQVP 94
H + A + ++ ++ P + K W YV+ + N + +F R L
Sbjct: 116 HKNITHARNLLDRAVTTLPR----VDKLWFLYVQTEETLKNYQMVRIIFERWLSWNPNPS 171
Query: 95 LWRCYIRFIRKVYEKKGTEGQEETRKAFDFMLSHVGSDISSGPIWLEYITF-LKSLPALN 153
W YI YEK+ E ++ H SSG IWL++I F + +P
Sbjct: 172 AWDAYIN-----YEKRYDEYDNAREIYIRYVQIH-----SSGEIWLKWIDFEMNDVPIDP 221
Query: 154 AQEESQRMI------------AIR-------------------KAYQRA----------- 171
Q + R + A+R K Y+RA
Sbjct: 222 EQVKRIRNVFELSVDSMLASEALRGDISLAEIINKWSLWEISVKEYERARAIFQLMLKSD 281
Query: 172 ----VVTPTHHVEQLWKDYENFENSVSRQLAKGLLSEYQSKYTSARAVYRERK-KYCEEI 226
++TP Q++ Y FE S Y K T ++ +RK KY EE+
Sbjct: 282 TIQEIITPEQR-NQIYSSYTEFEKS------------YGDKDTIESSIMIKRKLKYEEEV 328
Query: 227 DWNMLAVPPTGSYKEEQQWIAWKRLLTFEKGNPQRIDTASSNKRIIFT--YEQCLMYLYH 284
+ S + W ++ +L E N +T ++I T ++ + Y
Sbjct: 329 N---------KSPSDYDSWWSYISILQQEDNNEVTRETFERAIKVIPTDAFKSTVWRRYI 379
Query: 285 YPDIWYDYATWNAKS-GSIDAAIKVFQRALKALPDSEMLRYAFAEL 329
Y IW YA W + GSI+ ++ +ALK +P R+ FA++
Sbjct: 380 Y--IWVKYAFWEEFTMGSIENGRNIWNKALKVIPHK---RFTFAKI 420
>sp|Q6K8J4|ISPG_ORYSJ 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase, chloroplastic
OS=Oryza sativa subsp. japonica GN=ISPG PE=2 SV=1
Length = 744
Score = 39.3 bits (90), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 35/62 (56%)
Query: 313 LKALPDSEMLRYAFAELEESRGAIAAAKKLYESLLTDSVNTTALAHIQFIRFLRRTEGVE 372
LK + D ML ++ EE G + AA++L+E L T+ +N + HI+F + + R + V
Sbjct: 536 LKGVDDITMLLHSVPYGEEKTGRVHAARRLFEYLETNGLNFPVIHHIEFPKSVNRDDLVI 595
Query: 373 AA 374
A
Sbjct: 596 GA 597
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.130 0.379
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 281,310,836
Number of Sequences: 539616
Number of extensions: 11874433
Number of successful extensions: 38165
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 70
Number of HSP's successfully gapped in prelim test: 101
Number of HSP's that attempted gapping in prelim test: 37245
Number of HSP's gapped (non-prelim): 692
length of query: 774
length of database: 191,569,459
effective HSP length: 125
effective length of query: 649
effective length of database: 124,117,459
effective search space: 80552230891
effective search space used: 80552230891
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 65 (29.6 bits)