BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001853
(1004 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
Length = 1461
Score = 1628 bits (4216), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 792/1028 (77%), Positives = 886/1028 (86%), Gaps = 30/1028 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELP-SKRGIGPVPNLV 59
MS+AAYKM+HWPTGI +C SG+ITHSRAD+VPQIP IQT+ LDSE P SKRGIGP+PNL+
Sbjct: 1 MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60
Query: 60 VTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119
VTA +V+E+YVVRVQE+GS+ES++S ETKR LMDG+S ASLELVCHYRLHGNVES+ +L
Sbjct: 61 VTAGSVLEVYVVRVQEDGSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVVL 120
Query: 120 SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179
G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEWLHLKRGRESFAR
Sbjct: 121 PTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAR 180
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GPL+KVDPQGRCGG+LVY +QMIIL+A+Q SGLVGD+D SGG SAR++SS+VINLR
Sbjct: 181 GPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLR 240
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
D+DMKHVKDFIF+H YIEPV+VILHERELTWAGRVSWKHHTCMISALSISTTLKQ LIW
Sbjct: 241 DMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIW 300
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
S +NLPHDAYKLLAVP PIGGVLV+ ANTIHYHS+SA+ ALALNNYAVS+DSSQELPR+S
Sbjct: 301 SVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRAS 360
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419
FSVELDA A WL NDVALLS K G+L+LL++VYDGRVVQRLDLSK+ SVLTSDITTIG
Sbjct: 361 FSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIG 420
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
NSLFFLGSRLGDSLLVQFT G G S++SSGLKEE G+IE D PS KRL+RS+SD LQDMV
Sbjct: 421 NSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDMV 480
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
+GEELSLYGS +NNTESAQK+FSFAVRDSL+N+GPLKDFSYGLR N DASATGI+KQSNY
Sbjct: 481 SGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNY 540
Query: 540 EL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 573
+L V+LPGC+GIWTVYHK++RGHN D S+MAA D
Sbjct: 541 DLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAAD 600
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
EYHAYLIIS+EARTMVLETADLL+EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI
Sbjct: 601 EYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 660
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
LDGS+MTQDLS G SNSES GSE++TV SVSIADPYVL+ M+DGSIRLL+GD STC VS
Sbjct: 661 LDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMVS 720
Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG---ADGGPLDQG 750
+ TP+A E+S++ VS+CTLYHDKGPEPWLRK STDAWLSTGV EAIDG ADGGP DQG
Sbjct: 721 INTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQG 780
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
DIY +VCYESGALEIFDVPNFN VF+VDKFVSG+TH+ D Y+RE KDS+ + N SEE
Sbjct: 781 DIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEEV 840
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
G GRKEN H+MK VELAMQRWS HHSRPFLF +LTDGTILCY AYLFE P+ TSK++D
Sbjct: 841 AGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTEDS 900
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
VS + + ++SASRLRNLRF R PLD+Y +EET CQRITIF NISGHQGFFL G
Sbjct: 901 VSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLLG 960
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
SRP W MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQLPS S YD
Sbjct: 961 SRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNYD 1020
Query: 991 NYWPVQKV 998
NYWPVQK+
Sbjct: 1021 NYWPVQKI 1028
>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Vitis vinifera]
Length = 1442
Score = 1627 bits (4214), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 778/1024 (75%), Positives = 871/1024 (85%), Gaps = 41/1024 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TAAN++E+Y+VRVQE+ S+ES+ S ETKR +M GIS A+LELVC YRLHGNVE++ +L
Sbjct: 61 TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121 SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+ SG SAR+ESS+VI+LRD
Sbjct: 181 PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241 LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301 AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+ SVLTS I IGN
Sbjct: 361 SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
SLFFLGSRLGDSLLVQF TS+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421 SLFFLGSRLGDSLLVQF-----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475
Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
GEELSLYGSA N+TE++QKTFSF+VRDS +N+GPLKDF+YGLRINAD ATGI+KQSNYE
Sbjct: 476 GEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYE 535
Query: 541 L--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
L VELPGCKGIWTVYHK++RGHNADS++MA DDE
Sbjct: 536 LVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDE 595
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
YHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ RGARIL
Sbjct: 596 YHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARIL 655
Query: 635 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 694
DG++MTQDL SE+STVLSVSIADPYVLL MSDG+I+LLVGDPSTCTVS+
Sbjct: 656 DGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSI 705
Query: 695 QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 754
PA ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG DQGDIY
Sbjct: 706 NIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYC 765
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
VV YESG LEIFDVPNFNCVF+VDKF+SG H+VDT + E +D++ ++ +SEE QG
Sbjct: 766 VVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQG 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
RKEN H++KVVELAMQRWS HSRPFLF ILTDGTILCY AYL+EGPE+T K+++ VS
Sbjct: 826 RKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQ 885
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
SLS+SNVSASRLRNLRF R PLD YTREE G R+T+FKNI G QG FLSGSRP
Sbjct: 886 NSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPL 945
Query: 935 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+YDNYWP
Sbjct: 946 WFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWP 1005
Query: 995 VQKV 998
VQK+
Sbjct: 1006 VQKI 1009
>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
Length = 1448
Score = 1621 bits (4198), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 778/1030 (75%), Positives = 871/1030 (84%), Gaps = 47/1030 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TAAN++E+Y+VRVQE+ S+ES+ S ETKR +M GIS A+LELVC YRLHGNVE++ +L
Sbjct: 61 TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121 SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+ SG SAR+ESS+VI+LRD
Sbjct: 181 PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241 LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301 AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+ SVLTS I IGN
Sbjct: 361 SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
SLFFLGSRLGDSLLVQF TS+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421 SLFFLGSRLGDSLLVQF-----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475
Query: 481 GEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
GEELSLYGSA N+TE++Q KTFSF+VRDS +N+GPLKDF+YGLRINAD ATGI+
Sbjct: 476 GEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIA 535
Query: 535 KQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRM 568
KQSNYEL VELPGCKGIWTVYHK++RGHNADS++M
Sbjct: 536 KQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKM 595
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
A DDEYHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+
Sbjct: 596 ATKDDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYA 655
Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
RGARILDG++MTQDL SE+STVLSVSIADPYVLL MSDG+I+LLVGDPS
Sbjct: 656 RGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPS 705
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
TCTVS+ PA ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG D
Sbjct: 706 TCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQD 765
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
QGDIY VV YESG LEIFDVPNFNCVF+VDKF+SG H+VDT + E +D++ ++ +SE
Sbjct: 766 QGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSE 825
Query: 809 EGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSD 868
E QGRKEN H++KVVELAMQRWS HSRPFLF ILTDGTILCY AYL+EGPE+T K++
Sbjct: 826 EEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTE 885
Query: 869 DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFL 928
+ VS SLS+SNVSASRLRNLRF R PLD YTREE G R+T+FKNI G QG FL
Sbjct: 886 EAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFL 945
Query: 929 SGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
SGSRP W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+
Sbjct: 946 SGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSS 1005
Query: 989 YDNYWPVQKV 998
YDNYWPVQK+
Sbjct: 1006 YDNYWPVQKI 1015
>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1447
Score = 1609 bits (4167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 775/1026 (75%), Positives = 880/1026 (85%), Gaps = 38/1026 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
MSFAAYKMM PTGI NC +GF+THSR+D+VP +Q ++LD+E PS+ +G +PNL
Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAEWPSRPRHHVGSLPNL 56
Query: 59 VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAI 118
VVTAANV+E+Y VR+QE+ + K + +++R L+DGI+ ASLELVCHYRLHGNVE++A+
Sbjct: 57 VVTAANVLEVYAVRLQED--QPPKAAADSRRGALLDGIAGASLELVCHYRLHGNVETMAV 114
Query: 119 LSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA 178
LS GG D SRRRDSI+L F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE FA
Sbjct: 115 LSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 174
Query: 179 RGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
RGP+VKVDPQGRCGGVL+Y LQMIILKA+Q GSGLVG++D GS G +ARIESS++INL
Sbjct: 175 RGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINL 234
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
RDLDM+HVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI
Sbjct: 235 RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 294
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
WSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALN+YAV+LDSSQE+PRS
Sbjct: 295 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRS 354
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
SF+VELDAA+ATWL +DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+ SVL+S ITTI
Sbjct: 355 SFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 414
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
GNSLFFL SRLGDS+LVQF+CGSG SMLSS LKEE GDIEADAPS KRLRRS SDALQDM
Sbjct: 415 GNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQDM 473
Query: 479 VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN 538
V+GEELSLYGSA N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQSN
Sbjct: 474 VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 533
Query: 539 YEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
YEL VELPGCKGIWTVYHKS+R HNADSS+MA D
Sbjct: 534 YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDD 593
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
DEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGR RVIQV+ERGAR
Sbjct: 594 DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653
Query: 633 ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
ILDGS+MTQD+SFG SN ESGS S+++ LSVSIADP+VLL MSDGSIRLL+GDPSTCT+
Sbjct: 654 ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCTI 713
Query: 693 SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
SV +PA+ ESSK VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE IDG DG D GDI
Sbjct: 714 SVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGDI 773
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
Y VVC+++G LEIFDVPNFNCVF+V+ F+SG++H+VD M+E LKDS+ +
Sbjct: 774 YCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVIN 830
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QGRKENI MKVVELAMQRWS HSRPFLF IL+DGTILCY AYL+E P++TSK +D S
Sbjct: 831 QGRKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSAS 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
S+ +S+ + SRLRNLRF R PLDAY RE+T +G PCQ+ITIFKNI ++GFFLSGSR
Sbjct: 891 AGGSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSR 950
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W MV RERLRVHPQLCDGSIVAFTVLHNVNCN G IYVTSQG+LKICQLPSGS YD+Y
Sbjct: 951 PAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSY 1010
Query: 993 WPVQKV 998
WPVQK+
Sbjct: 1011 WPVQKI 1016
>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1449
Score = 1593 bits (4125), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 777/1027 (75%), Positives = 884/1027 (86%), Gaps = 38/1027 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDS-ELPSK--RGIGPVPN 57
MSFAAYKMM PTGI NC +GF+THSR+D+VP +Q ++LD+ E PS+ +GP+PN
Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAAEWPSRPRHHVGPLPN 56
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVVTAANV+E+Y VR+QE+ + S +++R L+DGI+ ASLEL CHYRLHGNVE++A
Sbjct: 57 LVVTAANVLEVYAVRLQED-QQPKDASDDSRRGTLLDGIAGASLELECHYRLHGNVETMA 115
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+LS GG D SR+RDSIIL F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE F
Sbjct: 116 VLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 175
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
ARGP+VK+DPQGRCGGVL+Y LQMIILKA+Q GSGLVGD+D FGS G +ARIESS++IN
Sbjct: 176 ARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMIN 235
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDLDM+HVKDF FV+GYIEPVMVILHERELTWAGRVSW HHTCMISALSISTTLKQHPL
Sbjct: 236 LRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPL 295
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+GANTIHYHSQSASCALALNNYAV+LDSSQE+PR
Sbjct: 296 IWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPR 355
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
SSF+VELDAA+ATWL +DVALLSTKTG+L+LL +VYDGRVVQRLDLSK+ SVL+S ITT
Sbjct: 356 SSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITT 415
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
IGNSLFFL SRLGDS+LVQF+CGSG SM+SS LKEE GDIE DAPS KRLRRS SDALQD
Sbjct: 416 IGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQD 474
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
MV+GEELSLYGSA+N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQS
Sbjct: 475 MVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 534
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKS+R HNADSS+MA
Sbjct: 535 NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 594
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
DDEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGRRRVIQV+ERGA
Sbjct: 595 DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGA 654
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+MTQD+SFG SNSESGS SE++ LSVSIADP+VLL MSDGSIRLL+GDPSTCT
Sbjct: 655 RILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 714
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
+SV +PA+ ESSK VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG DG D GD
Sbjct: 715 ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGD 774
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVC+++G LEIFD+PNFNCVF+V+ F+SG++H+VD M+E LKDS+ +
Sbjct: 775 IYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVV 831
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
QGRK+NI +MKVVELAMQRWS HSRPFLF IL+DGTILCY AYL+E P+ TSK +D
Sbjct: 832 NQGRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSA 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S S+ +S+ + SRLRNLRF R PLDAY RE+T +G+PCQ+ITIFKNI +QGFFLSGS
Sbjct: 892 SAGGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP W MV RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG+LKICQLPSGS YD+
Sbjct: 952 RPAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDS 1011
Query: 992 YWPVQKV 998
YWPVQK+
Sbjct: 1012 YWPVQKI 1018
>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
Length = 1455
Score = 1581 bits (4093), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 777/1033 (75%), Positives = 862/1033 (83%), Gaps = 44/1033 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKR----GIGPVP 56
MS+AAYKMMHWPT I C SGF+THSR++ +P + T++LDS+ PS+R GIGP P
Sbjct: 1 MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + NV+E+YVVRVQEEG++ +SGE KR +MDG++ ASLELVCHYRLHGNVES+
Sbjct: 60 NLIVASGNVLELYVVRVQEEGAR---SSGELKRGGVMDGVAGASLELVCHYRLHGNVESM 116
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+LS G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE P+W HLKRGRES
Sbjct: 117 GVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRES 176
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
FARGPLVKVDPQGRCGGVLVY LQMIILKA+Q GS LV DED FGSG SA I SS++I
Sbjct: 177 FARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYII 236
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
NLRDLDMKHVKDFIFVH YIEPV+V+LHERELTWAGRV WKHHTCMISALSISTTLKQ
Sbjct: 237 NLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPT 296
Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
LIWS NLPHDAYKLLAVPSPIGGVLV+G NTIHYHS+SASCALALN+YA S+DSSQELP
Sbjct: 297 LIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELP 356
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
R++FSVELDAA+ATWL DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+ SVLTSDIT
Sbjct: 357 RATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDIT 416
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
T+GNS FFLGSRLGDSLLVQFT G G+SMLS GLKEE GDIE D PS KRL+ SSSDALQ
Sbjct: 417 TLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDALQ 476
Query: 477 DMVNGEELSLYGSASNNTESAQ-----KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
DMV+GEELSLY SA NN ES+Q KTFSF VRDSL+N+GPLKDF+YGLRINADA+AT
Sbjct: 477 DMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANAT 536
Query: 532 GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
GISKQSNYEL VELPGCKGIWTVYHK++R H+ DS
Sbjct: 537 GISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARIHSVDS 596
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
+MA+ DDEYHAYLIIS+EARTMVLETAD LTEVTESVDYFVQGRTIAAGNLFGRRRV+Q
Sbjct: 597 LKMAS-DDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVVQ 655
Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
VFERGARILDGS+MTQDLSFG SNSE+G SE+STV+ VSI DPYVL+ M+DGSI++LVG
Sbjct: 656 VFERGARILDGSFMTQDLSFGGSNSETGR-SESSTVMHVSIVDPYVLVRMADGSIQILVG 714
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
DPS CTVSV TP+A +SS K VS+CTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGAD G
Sbjct: 715 DPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADSG 774
Query: 746 PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
+QGDIY VVCYE+GALEIFDVPNFN VF VDKFVSG+TH++DT E KD +
Sbjct: 775 AHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDM---MKG 831
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
EE G GRKE+ +MKVVEL M RWS HSRPFLF ILTDGTILCY AYLFEGP+ TS
Sbjct: 832 VKEEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTS 891
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
K +D VS S+ S +SASRLRNLRF R PLD YTREET CQRIT FKNISG+QG
Sbjct: 892 KLEDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQG 951
Query: 926 FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
FFLSGSRP W MVFRERLRVHPQLCDGSIVAFTVLH VNCNHG IYVTSQG LKIC L S
Sbjct: 952 FFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSS 1011
Query: 986 GSTYDNYWPVQKV 998
S+YDNYWPVQK+
Sbjct: 1012 VSSYDNYWPVQKI 1024
>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
kDa subunit
gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
Length = 1442
Score = 1514 bits (3920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 742/1027 (72%), Positives = 861/1027 (83%), Gaps = 38/1027 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +SD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGGP DQGD
Sbjct: 720 VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N +SE+ T
Sbjct: 780 IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836 S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S+ ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 892 SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011
Query: 992 YWPVQKV 998
YWPVQK+
Sbjct: 1012 YWPVQKI 1018
>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
subunit [Arabidopsis thaliana]
Length = 1442
Score = 1512 bits (3914), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 741/1027 (72%), Positives = 860/1027 (83%), Gaps = 38/1027 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +SD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS +N++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPDNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGGP DQGD
Sbjct: 720 VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N +SE+ T
Sbjct: 780 IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836 S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 892 SPENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011
Query: 992 YWPVQKV 998
YWPVQK+
Sbjct: 1012 YWPVQKI 1018
>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
thaliana]
Length = 1448
Score = 1507 bits (3902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 742/1033 (71%), Positives = 861/1033 (83%), Gaps = 44/1033 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +SD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
+ EELSL+GS NN++SAQ K+FSFAVRDSLVN+GP+KDF+YGLRINADA+AT
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQVTSSVLKSFSFAVRDSLVNVGPVKDFAYGLRINADANAT 539
Query: 532 GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
G+SKQSNYEL VELPGCKGIWTVYHKSSRGHNADS
Sbjct: 540 GVSKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADS 599
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
S+MAA +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQ
Sbjct: 600 SKMAADEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQ 659
Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
VFE GARILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVG
Sbjct: 660 VFEHGARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVG 719
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
DPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGG
Sbjct: 720 DPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGG 779
Query: 746 PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
P DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N
Sbjct: 780 PQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNK 835
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
+SE+ T + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T
Sbjct: 836 NSEDNTS---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST- 891
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
K+++ +S+ ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQG
Sbjct: 892 KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQG 951
Query: 926 FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
FFLSGSRP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS
Sbjct: 952 FFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPS 1011
Query: 986 GSTYDNYWPVQKV 998
S YDNYWPVQK+
Sbjct: 1012 ASIYDNYWPVQKI 1024
>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
Length = 1444
Score = 1507 bits (3902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 741/1027 (72%), Positives = 857/1027 (83%), Gaps = 36/1027 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQ-TEELDSELPS-KRGIGPVPNL 58
MSFAA+KMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAFKMMHWPTGVENCASGYITHSLSDSTLQIPIVSGDDDMEAEWPNHKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TA N++E+Y+VR QEEG ++E + KR +MDG+S SLELVCHYRLHGNVES+A
Sbjct: 61 VITAGNILEVYIVRAQEEGNTQELRIPKLVKRGGVMDGVSGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILKASQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKASQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI+TTLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINTTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ +DVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRL R SSD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRL-RISSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIIS+EARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISVEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG NSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGAPNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SKK +S+CTL+HDKGPEPWLRK STDAWLS+GVGEA+D ADGGP DQGD
Sbjct: 720 VSISSPSVLEGSKKKISACTLFHDKGPEPWLRKASTDAWLSSGVGEAVDSADGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY V+CYESGALEIFDVP FNCVF+VDKF SGR H+ D + E E E+N +SE+
Sbjct: 780 IYCVLCYESGALEIFDVPGFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSED-N 834
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
R E I + KVVEL+MQRWS H+RPFLFA+L DGTILCY AYLFEG ++T K+++ V
Sbjct: 835 ASSRNEEIKNTKVVELSMQRWSGPHTRPFLFAVLADGTILCYHAYLFEGVDST-KAENSV 893
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S+ ++++ +S+LRNL+F R P D TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 894 SSENPAALNSSGSSKLRNLKFLRIPFDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 953
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVTSQ +LKICQLPS S YDN
Sbjct: 954 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIYDN 1013
Query: 992 YWPVQKV 998
YWPVQK+
Sbjct: 1014 YWPVQKI 1020
>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 1504
Score = 1477 bits (3825), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 747/1076 (69%), Positives = 844/1076 (78%), Gaps = 81/1076 (7%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFAAY+MMHWPTGI NC S +ITHSRAD+VP + +++LDS+ +R IGPVPNLVV
Sbjct: 1 MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVT-SHSDDLDSDWHPRRDIGPVPNLVV 59
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TA NV+E+YVVRV EEG +ESK+SGE +R +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60 TAGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILS 119
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LD+KHVKDF+FVHGYIEPVMVILHE+ELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWS
Sbjct: 240 LDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWS 299
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A NLPHDAYKLLAVPSPIGGVLV+ AN+IHY+SQSASC LALNNYAVS DSSQ++PRS+F
Sbjct: 300 ASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNF 359
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
+VELDAA+ATWL NDVALLSTKTG+L+LL +VYDGRVVQRLDLSK+ SVLTS I +IGN
Sbjct: 360 NVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGN 419
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-------------------------- 454
SLFFLGSRLGDSLLVQF+CG G+S L+S LK+E
Sbjct: 420 SLFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEITYYTQNLQKEMVPPTLPSALVHESKP 479
Query: 455 ----GDIEADAPS----------------------TKRLRRSSSDALQDMVNGEELSLYG 488
G IE + + R+ R V G+ELSLYG
Sbjct: 480 TQAKGTIELNNNNLCVENDIVDVVEVDITNMTILGENRIARRDETLTDTQVGGDELSLYG 539
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------ 542
SA+NNTESAQK FSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNYELV
Sbjct: 540 SAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHG 599
Query: 543 --------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
ELPGCKGIWTVYHK++RG ADSSRM DDEYHAYLIIS
Sbjct: 600 KNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIIS 659
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
LEARTMVL T +LLTEVTESVDYFV GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQD
Sbjct: 660 LEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQD 719
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
L+ + +ESG+ SE TVLS SI+DPYVLL M+DGSIRLLVGD S+C+VSV PAA S
Sbjct: 720 LNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVSAPAAFGS 779
Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
SKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG DQGDIY V CY++G
Sbjct: 780 SKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDNGD 839
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
LEIFDVPNF VF VDKFVSG++H+VD + + K SE + N S+E GR E+ +M
Sbjct: 840 LEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQN--SQELISHGRNESSQNM 897
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
KV+E+AMQRWS HSRPFLF ILTDGTILCY AYLFE ++ SK DD VS S+S SN+
Sbjct: 898 KVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNSVSSSNM 957
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
S+SRLRNLRF R PLD RE+ P+G R++IFKNISG+QG FL GSRP W MVFRER
Sbjct: 958 SSSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWFMVFRER 1017
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
LRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQKV
Sbjct: 1018 LRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKV 1073
>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
Length = 1503
Score = 1199 bits (3101), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 612/1039 (58%), Positives = 746/1039 (71%), Gaps = 64/1039 (6%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
RIESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VRIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATW ND+ + S+K G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWFSNDIVMFSSKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ G+ S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYGASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWLRKT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLRKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKV 998
ICQLPS YDNYWPVQKV
Sbjct: 998 ICQLPSAYNYDNYWPVQKV 1016
>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
Length = 1441
Score = 1191 bits (3081), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 610/1039 (58%), Positives = 744/1039 (71%), Gaps = 64/1039 (6%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ + S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKV 998
ICQLPS YD+YWPVQKV
Sbjct: 998 ICQLPSAYNYDSYWPVQKV 1016
>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
Length = 1441
Score = 1191 bits (3080), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 610/1039 (58%), Positives = 744/1039 (71%), Gaps = 64/1039 (6%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ + S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKV 998
ICQLPS YD+YWPVQKV
Sbjct: 998 ICQLPSAYNYDSYWPVQKV 1016
>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 1-like [Brachypodium distachyon]
Length = 1442
Score = 1181 bits (3055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 611/1038 (58%), Positives = 744/1038 (71%), Gaps = 61/1038 (5%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--ELDSELPSK----RGIGP 54
MS+AAYKMMHWPTGI +C +GFITH +D E D L + + +GP
Sbjct: 1 MSYAAYKMMHWPTGIDHCAAGFITHCPSDAAAFCSAAAASGPEGDVGLVAAARHPKRLGP 60
Query: 55 VPNLVVTAANVIEIYVVRVQEEGS------KESKNSGETKRRVLMDGISAASLELVCHYR 108
PNLVV AANV+E+Y VR + + S +SG + DGIS A LELVCHYR
Sbjct: 61 TPNLVVAAANVLEVYAVRADAAAADGAGGAQPSSSSG-----AVFDGISGARLELVCHYR 115
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
LHGN+ES+AILS G A+N RRDSI LAF DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 LHGNIESMAILSDG-AEN--RRDSIALAFRDAKITCLEFDDAIHGLRTSSMHCFEGPEWQ 172
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
HLKRGRESFA GP++K DP GRCG LVYGLQMIILK++Q G LVG+++ + +
Sbjct: 173 HLKRGRESFAWGPVIKSDPLGRCGAALVYGLQMIILKSAQVGQSLVGEDEPTRALSSAAV 232
Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
RIESS++I+LR LD HVKDF FVHGYIEPV+VILHERE TWAGR+S KHHTCMISA SI
Sbjct: 233 RIESSYLIDLRALDTNHVKDFTFVHGYIEPVLVILHEREPTWAGRISSKHHTCMISAFSI 292
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
S TLKQHP+IWSA N+PHDAY++L+VP PI GVLV+ AN+IHYHSQS SC+LALNN+A
Sbjct: 293 SMTLKQHPMIWSAANIPHDAYQILSVPPPISGVLVICANSIHYHSQSTSCSLALNNFASQ 352
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
D S E+ + +F VELDAA ATWL ND+ + S KTG+++LLTVVYDGR VQ+LDL K+
Sbjct: 353 PDGSPEIHKVNFHVELDAAKATWLSNDIVMFSAKTGEMLLLTVVYDGRTVQKLDLMKSKA 412
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SV++S +TTIG+S FFLGSR+GDSLLVQF+CG TS++ E DIE D P +KRL+
Sbjct: 413 SVISSGVTTIGSSFFFLGSRVGDSLLVQFSCGVPTSVIPDIADERSADIEGDLPFSKRLK 472
Query: 469 RSSSDALQDMVNGEELSLYGSA-SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
R SD LQD+ + EELS + N+ ESAQK S+ VRD+LVN+GPLKDFSYGLR+NAD
Sbjct: 473 RVPSDILQDVTSVEELSFQNNMLPNSLESAQK-ISYVVRDALVNVGPLKDFSYGLRVNAD 531
Query: 528 ASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGH 561
+ATG +KQSNYEL VELP C+GIWTVY+KSSRGH
Sbjct: 532 PNATGNAKQSNYELVCCSGHGKNGALSVLQQSIRPDLITEVELPSCRGIWTVYYKSSRGH 591
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
+ D+EYHAYLIISLE+RTMVLET D L EVTE+VDY+VQG TI AGNLFGRR
Sbjct: 592 TTE-------DNEYHAYLIISLESRTMVLETGDDLGEVTETVDYYVQGATITAGNLFGRR 644
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST-VLSVSIADPYVLLGMSDGSI 680
RVIQV+ GAR+LDGS+MTQ+L+F +SES S V S SIADPYVLL M DG+I
Sbjct: 645 RVIQVYATGARVLDGSFMTQELNFTALSSESSSSGSEPLGVASASIADPYVLLKMVDGTI 704
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LLVGD STC +S+ P+ + S + +S+CTLYHD+GPEPWLRKT DAWLS+GV A+D
Sbjct: 705 QLLVGDHSTCALSINAPSTLTSRGERISACTLYHDRGPEPWLRKTRGDAWLSSGVTVAVD 764
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
+ DQ DIY ++CYESG LEIF+VP+F VF+V F SG + +VD + + +DS
Sbjct: 765 VSGSSSQDQSDIYCIICYESGKLEIFEVPSFRQVFSVGSFFSGESLLVDAFAQGFTEDSA 824
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
+E +KE +++++VELAM RWS SRPFLF +L DGT+LCYQAY +EG
Sbjct: 825 ---EGRQDETKVSLKKEVANNIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYQAYCYEG 881
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
E+ K +S S+ + N S SRL+NLRF R +D +RE+ A RITIF N+
Sbjct: 882 LESNIKGTS-LSPDGSVDLGNASDSRLKNLRFHRVSVDITSREDISSLAR-PRITIFNNV 939
Query: 921 SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
G++G FLSG+RP W MV R+R RVHPQLCDG I AFTVLHNVNC+HG IYVTSQG LKI
Sbjct: 940 GGYEGLFLSGTRPVWVMVCRQRFRVHPQLCDGPIEAFTVLHNVNCSHGLIYVTSQGFLKI 999
Query: 981 CQLPSGSTYDNYWPVQKV 998
CQLPS YDNYWPVQK+
Sbjct: 1000 CQLPSAYNYDNYWPVQKI 1017
>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1452
Score = 981 bits (2536), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 528/1061 (49%), Positives = 685/1061 (64%), Gaps = 93/1061 (8%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADY-VPQIPLIQTEELDSELPSKRGIGPVPNLV 59
MS+AA+KM+H PTG+ NC + ++THS + IPL ++L + G G PNLV
Sbjct: 1 MSYAAFKMVHCPTGVDNCVAAYVTHSAGETDSDSIPLP-----GADLIASGGSGFPPNLV 55
Query: 60 VTAANVIEIYVVRVQE------EGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
+T ANV+E++ VR+ E GS N T R LM G+S LEL CHYRLHGNV
Sbjct: 56 ITKANVLEVFHVRLLEGDDSAANGSNGVGNPETTPRGGLMAGLSYVKLELACHYRLHGNV 115
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
ESL +LS A+ + RD+IIL F DAKISVLEFDDS HGLRI S+H FE PEW +LKRG
Sbjct: 116 ESLGVLSYRHAEGRKGRDAIILTFRDAKISVLEFDDSTHGLRIGSLHYFEGPEWQYLKRG 175
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RE FA GP V+ DP GRC GVL+Y Q+++LKA+Q G GL ++++ GG A + +S
Sbjct: 176 REQFASGPSVRADPVGRCAGVLIYNSQLVLLKAAQVGYGLGDEDESLIMGGKLCAHVATS 235
Query: 234 HVINLRDLDMKHVKDFIFVHG--------------YIEPVMVILHERELTWAGRVSWKHH 279
++++LRDLDMKH+KDF+F+HG YIEPV+V+LHE++ TWAGRV+ + H
Sbjct: 236 YIVSLRDLDMKHIKDFVFLHGKLLFLIQYIFAFSSYIEPVLVVLHEKDPTWAGRVAVRRH 295
Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCA 339
TC I+ALSI+TTLKQHP IWSA NLP+DAYKLLAVP+PIGGVLV AN++HYHSQS SCA
Sbjct: 296 TCAITALSINTTLKQHPHIWSATNLPYDAYKLLAVPAPIGGVLVFCANSLHYHSQSGSCA 355
Query: 340 LALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
L LN +AV+ + S E PRS SVELD AHATW+ N+VAL+STK G L+ L +VY+GR VQ
Sbjct: 356 LGLNEFAVAPEGSAEYPRSKMSVELDCAHATWVANEVALISTKNGMLLFLNLVYEGRSVQ 415
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
RL+L+K+ SVLTS + TIG + FFLGSRL DSLLVQ T GS + SS + GDIEA
Sbjct: 416 RLELTKSKASVLTSCMCTIGENFFFLGSRLADSLLVQHTLGSASGRTSSLM----GDIEA 471
Query: 460 D--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVRDSLVNIGPLK 516
D AP+ KRL+R S+ + + EE+SLY S ++ S +KTF+F VRDSLVNI PL+
Sbjct: 472 DLSAPAAKRLKREPSEEEEGVSA-EEMSLYYSTPTASDISQKKTFTFTVRDSLVNICPLR 530
Query: 517 DFSYGLRINADASATGISKQSNYEL--------------------------VELPGCKGI 550
DF+YGLR NAD SATG+ KQSNYEL V LPGC GI
Sbjct: 531 DFAYGLRSNADQSATGLGKQSNYELVACSGHGKNGSLSVLHQSIRPDLINKVALPGCSGI 590
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTVYHK+ R + + + DDE+HAYLIISLE+RTMVLET D L EVTE+V+Y+ +G
Sbjct: 591 WTVYHKTDRDDSNEFDFGTSEDDEFHAYLIISLESRTMVLETGDTLGEVTENVEYYTEGN 650
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS-ENSTVLSVSIADP 669
TIAAGNLFGRR V+QV++ G R+LDG+ M Q+L S E+ S N+ V+ IADP
Sbjct: 651 TIAAGNLFGRRFVVQVYQNGLRLLDGAKMLQELLITNSELENNSSEVANNLVIEAVIADP 710
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD- 728
Y+LL M+DGS++L+VGD +S+ P + +++ TLY DKGP WLR+T ++
Sbjct: 711 YMLLKMTDGSLQLVVGDVENTKLSIPQPQGFGITTDAITAFTLYQDKGPHQWLRRTCSEM 770
Query: 729 -----AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSG 783
W ST DQG +Y +VC SG EI+++P CV+ VD F G
Sbjct: 771 NSDRSQWSSTS-------------DQGYVYCIVCRISGRFEIYELPRMVCVYAVDNFNHG 817
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQGR---KENIHSMKVVELAMQRWSAHHSRPF 840
+ + D + E +S + +EE G ++ S+ V ++ + W RPF
Sbjct: 818 MSVLWDQKVLERRANSNAALKEGAEEDKAPGDALLRDAGLSLHVSQICFESWGEKFGRPF 877
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
L A L+DGT+LCY A+ ++ E++ + R + S SRL +LRF+R P+D
Sbjct: 878 LLATLSDGTMLCYHAFSYDANESSDALE-----FRETATSLKDLSRLTHLRFARIPIDWV 932
Query: 901 TREETPHGAPC---QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
+ +E GA + FKN+ G F++G RP W MV R RLR HPQ CDG+I+ F
Sbjct: 933 SGQED--GAKVLYETKFCSFKNVGSFPGVFVTGLRPTWLMVCRGRLRPHPQFCDGAILGF 990
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
T LHNVNC HGFIY+T+QG LKICQLPS YDN WPVQK+
Sbjct: 991 TPLHNVNCAHGFIYITAQGQLKICQLPSLLFYDNDWPVQKI 1031
>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
Length = 1413
Score = 902 bits (2330), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/1051 (47%), Positives = 656/1051 (62%), Gaps = 116/1051 (11%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AA K++H PTG++ C S FITHS + S S +PNLV+
Sbjct: 1 MSYAAIKLVHGPTGVSACASAFITHSPVNPASS----------SGWKSGNAKDSLPNLVL 50
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
ANV+EIY VR QE G ++S GE KR M GI+AA LELVC Y
Sbjct: 51 VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RL G V+S+AIL +G D R RD+IILAF AK SVL FDD+ L+ +SMH FE PEW
Sbjct: 110 RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
+HLKRGRE F GPLV+ D QGRC GVL+Y Q++++KA+Q GLV ++D SG S
Sbjct: 169 IHLKRGREKFPGGPLVRADSQGRCAGVLIYKSQLVMMKAAQEAYGLVEEDDP--SGNIVS 226
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++ TC ++ALS
Sbjct: 227 ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
I+T K+HP +W LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N A
Sbjct: 287 INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
S E+PRS FS+ELDAAHATWL D ALLSTKTG LV L +++DGR VQRL+LSK+
Sbjct: 347 PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S + TIG+ FF+GSRLGDSLLVQF S ++ LS G+ + +KR+
Sbjct: 407 GSVLSSSLCTIGDMFFFVGSRLGDSLLVQFGSASTSNSLSQSYD---GEDDIMVRPSKRM 463
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFS-------- 519
R L D N + L Y SA ++++ F F+VRDSL NIGP++D +
Sbjct: 464 R------LDDDANEQSLYQYKSAVSDSQK-NMNFLFSVRDSLCNIGPIRDITGRSQNPSE 516
Query: 520 -------------YG----LRINADASATGISKQSNYEL------------VELPGCKGI 550
+G L I + + Q+N L V+LPGC G+
Sbjct: 517 QPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGV 576
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTVYH+ S ++ A DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +G
Sbjct: 577 WTVYHR--------SGQIPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGP 628
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
+I+AGNLFGRRR+ QV+++G RILDG+ TQDL G E G+ E S S ADPY
Sbjct: 629 SISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFADPY 679
Query: 671 VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
VLL M DGS +L+VGD T TVSV TP + S P+S+CTLY+D+GP PWLR+ + D W
Sbjct: 680 VLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGDVW 739
Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT 790
+ GV +A DQGD+Y +VC SG +E ++P+ C++ V++ G + D
Sbjct: 740 QTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLADN 794
Query: 791 YMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTI 850
R A K ++ EEG + R E + +KVV++ + W + RPF+F +L+DGT+
Sbjct: 795 --RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDGTL 851
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG-- 908
L Y+A+++EG ++ + + D S RNLRF R LD EE +
Sbjct: 852 LSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNADE 897
Query: 909 -APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
Q+I FK++ G QG FL+G +P W M+FRE++R+HPQ DG IVAFT LHNVNC H
Sbjct: 898 VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNCQH 957
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
G IYVT++ LKIC+L + YDN WPVQK+
Sbjct: 958 GLIYVTNEASLKICRLSNILNYDNDWPVQKI 988
>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
Length = 1413
Score = 899 bits (2324), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/1053 (47%), Positives = 659/1053 (62%), Gaps = 120/1053 (11%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AA K++H PTG++ C S FITHS P P S S +PNLV+
Sbjct: 1 MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
ANV+EIY VR QE G ++S GE KR M GI+AA LELVC Y
Sbjct: 51 VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RL G V+S+AIL +G D R RD+IILAF AK SVL FDD+ L+ +SMH FE PEW
Sbjct: 110 RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
+HLKRGRE F GPLV+ D QGRC GVL+Y Q++++KA+Q GLV ++D SG S
Sbjct: 169 IHLKRGREKFPGGPLVRADSQGRCAGVLIYKCQLVMMKAAQEAYGLVEEDDP--SGNIVS 226
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++ TC ++ALS
Sbjct: 227 ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
I+T K+HP +W LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N A
Sbjct: 287 INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
S E+PRS FS+ELDAAHATWL D ALLSTKTG LV L +++DGR VQRL+LSK+
Sbjct: 347 PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-GDIEADAPSTKR 466
SVL+S + TIG+ FF+GSRLGDSLLVQF G++ S+ L+ + G+ + +KR
Sbjct: 407 GSVLSSSLCTIGDKFFFVGSRLGDSLLVQF----GSASTSNSLEHSYDGEDDIMVRPSKR 462
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSY----- 520
+R L D + E SLY S ++S + F F+VRDSL NIGP++D +
Sbjct: 463 MR------LDD--DASEQSLYQYKSGVSDSQKNMNFLFSVRDSLCNIGPIRDITCRSQNP 514
Query: 521 --------------------GLRINADASATGISKQSNYEL------------VELPGCK 548
L I + + Q+N L V+LPGC
Sbjct: 515 SEQPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCV 574
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
G+WTVYH+ S ++ A DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +
Sbjct: 575 GVWTVYHR--------SGQIPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTE 626
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
G +I+AGNLFGRRR+ QV+++G RILDG+ TQDL G E G+ E S S AD
Sbjct: 627 GPSISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFAD 677
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
PYVLL M DGS +L+VGD T TVSV TP + S P+S+CTLY+D+GP PWLR+ + D
Sbjct: 678 PYVLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGD 737
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
W + GV +A DQGD+Y +VC SG +E ++P+ C++ V++ G +
Sbjct: 738 VWQTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLA 792
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D+ R A K ++ EEG + R E + +KVV++ + W + RPF+F +L+DG
Sbjct: 793 DS--RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDG 849
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
T+L Y+A+++EG ++ + + D S RNLRF R LD EE +
Sbjct: 850 TLLSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNA 895
Query: 909 ---APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
Q+I FK++ G QG FL+G +P W M+FRE++R+HPQ DG IVAFT LHNVNC
Sbjct: 896 DEVRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNC 955
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
HG IYVT++ LKIC+L + YDN WPVQK+
Sbjct: 956 QHGLIYVTNEASLKICRLSNILNYDNDWPVQKI 988
>gi|414587801|tpg|DAA38372.1| TPA: hypothetical protein ZEAMMB73_993613 [Zea mays]
Length = 573
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/569 (58%), Positives = 411/569 (72%), Gaps = 34/569 (5%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--------ELDSELP-SKRG 51
MS+AAYKMMH PTGI +C +GFITHS AD ++DS + R
Sbjct: 1 MSYAAYKMMHLPTGIDHCAAGFITHSPADAAAFSTPAPAPTAAAGPDGDIDSTAARAPRR 60
Query: 52 IGPVPNLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYR-- 108
+GP PNLVV+AANV+E+Y VR + G++++ NS T ++DGIS A LELVCHYR
Sbjct: 61 VGPTPNLVVSAANVLEVYAVRAEVATGAEDAGNSSSTG--TILDGISGARLELVCHYRCK 118
Query: 109 ----------------LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
LHGN+ES+A+LS G RRDSI + F DAKI+ LEFDDSI+
Sbjct: 119 QMALASLHSLLAVNFRLHGNIESMAVLSDG---TENRRDSIAVTFNDAKITCLEFDDSIN 175
Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
GLR +SMHCFE PEW HLKRGRESFA GP++K DPQGRCG VLVYGLQ+IILKA+Q G
Sbjct: 176 GLRTSSMHCFEGPEWFHLKRGRESFAWGPIIKGDPQGRCGAVLVYGLQIIILKAAQVGQS 235
Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
LVG+++ + RIESS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAG
Sbjct: 236 LVGEDEPTRVLSSTAVRIESSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAG 295
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
R+S K TCM+SA SIS LKQHP+IWSA LPHDAY+LLAVP PI G+LV+ AN+IHYH
Sbjct: 296 RISSKSQTCMLSAFSISMGLKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYH 355
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
SQS SC+LALN+++ D S E+ ++SF VELD A ATWL +D+ + S+K G+++LLTVV
Sbjct: 356 SQSTSCSLALNSFSSQPDGSPEILKTSFHVELDVAKATWLSHDIVMFSSKNGEILLLTVV 415
Query: 393 YDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
YDGR VQRLDL K+ SVL+S TT+G+S FLGSRL DSLLVQF+CG TS+L L +
Sbjct: 416 YDGRAVQRLDLMKSKASVLSSGATTLGSSFIFLGSRLADSLLVQFSCGMPTSVLPD-LTD 474
Query: 453 EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E DIE+D P +KRL+R SD LQD+ + EELS + A N + + SF VRD+L+N+
Sbjct: 475 EPADIESDLPFSKRLKRIPSDVLQDVTSVEELSFHNKAVPNIVDSAEKISFVVRDALINV 534
Query: 513 GPLKDFSYGLRINADASATGISKQSNYEL 541
GPLKDF+YGLR N+D +A GI+KQSNYEL
Sbjct: 535 GPLKDFAYGLRTNSDPNAAGIAKQSNYEL 563
>gi|242075248|ref|XP_002447560.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
gi|241938743|gb|EES11888.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
Length = 374
Score = 444 bits (1141), Expect = e-121, Method: Compositional matrix adjust.
Identities = 220/359 (61%), Positives = 267/359 (74%), Gaps = 14/359 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-------ELDSELPSK-RGI 52
MS+AAYKMMHWPT I +C +GFITHS AD ++DS S R +
Sbjct: 1 MSYAAYKMMHWPTSIDHCAAGFITHSPADAAAFSSAAPAAAASGPDGDIDSAAASAPRRV 60
Query: 53 GPVPNLVVTAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHG 111
GP PNLVV+AANV+E+Y VR G+++ NS T ++DGIS A LELVCHYRLHG
Sbjct: 61 GPTPNLVVSAANVLEVYAVRADSATGAEDVGNSSSTG--AILDGISGARLELVCHYRLHG 118
Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
N+ES+A+LS G RRDSI + F+DAKI+ +EFDDS +GLR +SMHCFE PEW HLK
Sbjct: 119 NIESMAVLSDG---TENRRDSIAVTFKDAKIACMEFDDSTNGLRTSSMHCFEGPEWFHLK 175
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
RGRESFA GP++K DPQGRCG VLVYGLQMIILKA++ G LVG+++ + RIE
Sbjct: 176 RGRESFAWGPIIKADPQGRCGAVLVYGLQMIILKAAEVGQSLVGEDEPTRMLSSTAVRIE 235
Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
SS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAGR+S K TCM+SA SIS
Sbjct: 236 SSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAGRISSKSQTCMLSAFSISMG 295
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
LKQHP+IWSA LPHDAY+LLAVP PI G+LV+ AN+IHYHSQS SC+LALN+++ D
Sbjct: 296 LKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYHSQSTSCSLALNSFSSQPD 354
>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Cucumis sativus]
Length = 741
Score = 434 bits (1116), Expect = e-118, Method: Compositional matrix adjust.
Identities = 212/302 (70%), Positives = 237/302 (78%), Gaps = 2/302 (0%)
Query: 697 PAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVV 756
PAA SSKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG DQGDIY V
Sbjct: 11 PAAFGSSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVA 70
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
CY++G LEIFDVPNF VF VDKFVSG++H+VD + + K SE + NS +E GR
Sbjct: 71 CYDNGDLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRN 128
Query: 817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
E+ +MKV+E+AMQRWS HSRPFLF ILTDGTILCY AYLFE ++ SK DD VS S
Sbjct: 129 ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNS 188
Query: 877 LSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC 936
+S SN+S+SRLRNLRF R PLD RE+ P+G +R++IFKNISG+QG FL GSRP W
Sbjct: 189 VSSSNMSSSRLRNLRFLRVPLDIQGREDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 248
Query: 937 MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
MVFRERLRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQ
Sbjct: 249 MVFRERLRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 308
Query: 997 KV 998
KV
Sbjct: 309 KV 310
>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
Length = 1423
Score = 418 bits (1074), Expect = e-113, Method: Compositional matrix adjust.
Identities = 331/1073 (30%), Positives = 514/1073 (47%), Gaps = 156/1073 (14%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFA +K +H PTG+ + + + TH D P PNLVV
Sbjct: 1 MSFAIHKQVHPPTGVDHAVAAYFTHPIGDGGP-----------------------PNLVV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
AN + ++ +R + SG+ G A SLE+V + L+G V S+A++
Sbjct: 38 MQANHLTVFAIRRD----PSADASGDAAL-----GAKAMSLEVVAEFDLNGTVGSIAVMR 88
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR- 179
+ +RD++++A ++K+SV+E+D S + +S+H +E+P G S R
Sbjct: 89 RRSGAPRNQRDALLIAVRESKLSVIEWDPSEMTVVPSSLHSWETPVG---TGGVPSALRV 145
Query: 180 ---GPLVKVDPQGRCGGVLVY--GLQMIIL----KASQGGSGLVGDEDTFGSGGGFSARI 230
PL DP+GRC VL+ G + L A G G +D G G +A +
Sbjct: 146 APLPPLAIADPEGRCAAVLLRAEGRSRLALCPAVDADADADGDGGGDDGDRRGQGPAASV 205
Query: 231 ESSHVINLR-DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S V++L DL + V+D F+HGY EPV++ILHERE TWA R+ + TC+++A+SI+
Sbjct: 206 RKSFVVDLTADLALSGVRDAAFLHGYGEPVVLILHEREPTWAARMPLVNDTCVLTAVSIN 265
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K+ +IW LP Y+L A+P P+GG +V+ N + + SQ +S ALALN A
Sbjct: 266 LDTKRCTVIWQREKLPCTCYRLCAMPDPLGGAIVLSNNFLLHESQESSKALALNPLAGGG 325
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ---RLDLSKT 406
S SVELD+AHA L L++TK G L+LL++ +GR + + L +
Sbjct: 326 TESA----LGVSVELDSAHAAVLSERQVLVTTKQGALMLLSLRVEGRRLAAHGAMHLRRA 381
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--KEEFGDIEADA 461
+VL+S + I L FLGSR+GDSLLV ML + K + G+ E
Sbjct: 382 GGAVLSSGMCLITKRLLFLGSRVGDSLLVSLKKKEAAGAAQMLPAAAPKKRKAGEAEPPK 441
Query: 462 PSTKRLRRSSS----DALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLK 516
P + +S D L+ M+ GE E + + + E ++F VRDS++ I P+
Sbjct: 442 PPPPPQKVGTSQDDEDELEAMLYGEGEAAAKAANAGRKE--DPGYTFTVRDSVLGISPII 499
Query: 517 DFSYGLRINADA---------SATGISKQSNYELVE---------------LPGCKGIWT 552
D + G + +A G K +++ LPG G WT
Sbjct: 500 DLTAGASASVQGDTEERAELVAACGHGKNGALAILQRGIQPELVTEVEAGTLPGLMGTWT 559
Query: 553 VYHKS---SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
VYH+S R + ++ AA D +H+YL+ISLE+ TMVLET + L EV+E+V+
Sbjct: 560 VYHESRDNERLRESGAAAAAANVDPFHSYLVISLESTTMVLETGEELREVSEAVELVTDA 619
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+AAGN+ GR+R+ QV + G RI +G QDLS +G S + +++ + DP
Sbjct: 620 ATLAAGNMHGRKRIAQVHKGGVRICEGPVKIQDLSAA-EMPAAGDVSPDLEIIAAQVLDP 678
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES--SKKPVSSCTLYHDKGP--------- 718
YVL MSDGS+R+L GD +V +P++ + + + ++S L D P
Sbjct: 679 YVLCRMSDGSLRVLKGDEEKGSVEAMSPSSYANLPTGESIASAALVDDSVPAAERPGLTT 738
Query: 719 -EP-WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT 776
EP +LR+T+T STGV P D+ V G LE++ +P+ +++
Sbjct: 739 REPGFLRRTAT----STGV---------LPEDEEGTVLAVTRVGGTLELYALPSCERIWS 785
Query: 777 VDKFVSGRTHIVDTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
D G + + + D + E+ + ++ + ++VE + +
Sbjct: 786 ADGLSEGLNVLAPGGAGDDVNVDGDGEVEPT----------DDYPAPEIVEFRLDAFPRA 835
Query: 836 HSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
H RP L A+ DG++L Y+A+L G N P LRF R
Sbjct: 836 HERPMLTALRGDGSVLVYRAFLCPPGAGNVGHEAKP------------------QLRFCR 877
Query: 895 TPLD------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
P++ + G+ R + G +G F+SG RP W +V R R+ P
Sbjct: 878 VPIELEGGGGGMVDTKALSGSRLTRFERVGDRGGIRGVFVSGPRPLWLLVRRSRVLALPI 937
Query: 949 LCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVF 1000
+ V+FT HNVNC +GF+ T+ G ++ICQ+P Y+ WPV+K+
Sbjct: 938 RGEAQRTVSFTPFHNVNCLNGFMLGTAAGGVRICQIPGRMHYEAAWPVRKLAL 990
>gi|449477808|ref|XP_004155129.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 643
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 188/255 (73%), Positives = 220/255 (86%), Gaps = 1/255 (0%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFAAY+MMHWPTGI NC S +ITHSRAD+VP + +++LDS+ +R IGPVPNLVV
Sbjct: 1 MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTS-HSDDLDSDWHPRRDIGPVPNLVV 59
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TA NV+E+YVVRV EEG +ESK+SGE KR +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60 TAGNVLEVYVVRVLEEGGRESKSSGEVKRGGIMDGVSWASLELVCHYRLHGNVESMAILS 119
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239
Query: 241 LDMKHVKDFIFVHGY 255
LD+KHVKDF+FVH Y
Sbjct: 240 LDVKHVKDFVFVHVY 254
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 189/258 (73%), Positives = 208/258 (80%), Gaps = 26/258 (10%)
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
GDIE DA + KR+RRSSSDALQDMV G+ELSLYGSA+NNTESAQK FSFAVRDSL+NIGP
Sbjct: 342 GDIEVDAHTAKRMRRSSSDALQDMVGGDELSLYGSAANNTESAQKIFSFAVRDSLINIGP 401
Query: 515 LKDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCK 548
LKDFSYGLRINAD +ATGI+KQSNYELV ELPGCK
Sbjct: 402 LKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEVELPGCK 461
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
GIWTVYHK++RG ADSSRM DDEYHAYLIISLEARTMVL T +LLTEVTESVDYFV
Sbjct: 462 GIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIISLEARTMVLVTGELLTEVTESVDYFVH 521
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQDL+ + +ESG+ SE TVLS SI+D
Sbjct: 522 GRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNLVVNGNESGNASEGCTVLSASISD 581
Query: 669 PYVLLGMSDGSIRLLVGD 686
PYVLL M+DGSIRLLVG+
Sbjct: 582 PYVLLTMTDGSIRLLVGE 599
>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1386
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 304/1076 (28%), Positives = 496/1076 (46%), Gaps = 196/1076 (18%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS A ++ +H PTG+ + + + T D G PNL+V
Sbjct: 1 MSHAVHREVHPPTGVDHAVTAYFTRPVGD-----------------------GGDPNLIV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
+AN I +Y V G +ES L++ + G + S+++L
Sbjct: 38 ASANRITVYAV--NRRGDEES-------------------LDVCAEFDAQGAIGSMSVLR 76
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES-----PEWLHLKRGRE 175
+ +RD++++A + K+SV+E+D + + +SMH FES P L+ RE
Sbjct: 77 RRFGAPRNQRDALLIAIRERKLSVVEYDAATGDVCCSSMHSFESALGCNPLGTTLRMSRE 136
Query: 176 SFARGPLVKVDPQGRCGGVLV----YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
+ PLV DP+GRC V++ ++ +L + GG GLV ++D G G +A +
Sbjct: 137 A----PLVVSDPEGRCAAVVLREDGVAGKVRVLPSVDGGLGLVANDDE-GRVRGPAASVR 191
Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
S ++L + + ++D F+HGY EP + +L+E+ TWAGR + TC I ALS+
Sbjct: 192 ESFPLHLPGVRL--IRDACFLHGYGEPALAVLYEKTPTWAGRYNLSKDTCEIVALSVDVD 249
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
++ +IW NLP +YKL A+ P+GG LV + + + SQ +S L LN +
Sbjct: 250 KQKGTVIWRRQNLPSSSYKLTALLPPLGGALVFSQDFLLHESQESSSVLGLNTFGHG--G 307
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
QE + + LD A A+ + D L++TKTG L+LL + DGR ++R+ L + +VL
Sbjct: 308 PQE--GNDAEITLDGAQASVVSEDRVLVTTKTGALLLLALHTDGRSLRRMMLQRAGGAVL 365
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCG---SGTSML-----------SSGLKEEFGDI 457
+S + + L FLGSR+GDSLLV+FT + ML + K++ ++
Sbjct: 366 SSGMCLLSRDLLFLGSRIGDSLLVKFTPKEEPTAPLMLPDAEDESEDEATEKSKDDDDEL 425
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
EA T + +DA+Q E G A + V+DSL+ + P+ D
Sbjct: 426 EALLYGTTKTETVQTDAVQ-----TEKKREGLAGIIPGLKVAGYDLKVKDSLLGVAPVVD 480
Query: 518 FSYGLRINADASATGISKQSNYELV-----------------------------ELPGCK 548
+ G ++ G +K EL+ LP +
Sbjct: 481 IAVGA-----SAPMGSNKNERTELITACGQGKNGALAILTRGVQPELVTEVESGTLPNLQ 535
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
G+WT+++ R + R + +H +L++S+++ TM++ET + L EV+ S+++
Sbjct: 536 GLWTLHY---RKEGSKEER-----EPFHHHLLLSMKSSTMIMETGEELQEVSASLEFITN 587
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
T+AA N+FG +QV G R+L G QD+ ++ G + + S I D
Sbjct: 588 QATLAASNIFGHYCSVQVTGTGIRVLKGGVKVQDVGLQDMDAPKG-----AAIASAQILD 642
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK--------GPEP 720
PY+++ +SDGSIRLL GD +VS+ AI +S V++ L D G E
Sbjct: 643 PYIIVRLSDGSIRLLSGDEKQMSVSLMETGAIPTSS--VTAFALVDDSVEAADAAGGGER 700
Query: 721 ---WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
W+ + +T+ ++ G GA + + + E G+LE+F +P+ ++
Sbjct: 701 KSGWIHRAATNGTITGLEGNKKSGA----CNNSEAIVALTREGGSLELFSLPSCTRIWCA 756
Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
D G MR + +T +N+ S ++V++ + + H
Sbjct: 757 DGLSEG--------MR--VLSPQTPVNAESS------------VPEIVDIRIDSFQDAHE 794
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
RP L A+ DGT+L Y+ ++ D+P+ + LRFSR +
Sbjct: 795 RPLLTAVRGDGTLLLYKGFIVPAGTTYEGQDEPLEKN--------------ELRFSRVNV 840
Query: 898 D-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLR 944
D A ++ GA RI G QG F++G P W +V R R+
Sbjct: 841 DVEGSGLNVAGIGAAGQLRDSLAGARLTRIGNVGEGQGVQGIFVAGPNPLWLIVRRSRVL 900
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVF 1000
P +G +VAFTV HNVNC HGFI T+ G ++ICQ+PS Y+ WPV+KV
Sbjct: 901 ALPTRGEGEVVAFTVFHNVNCPHGFILGTALGGVRICQMPSKMHYEAAWPVRKVAL 956
>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Takifugu rubripes]
Length = 1444
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 310/1052 (29%), Positives = 501/1052 (47%), Gaps = 173/1052 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ GA+ RD+++L+F+DAK+SV+E+D H L+ S+H FE L L+ G
Sbjct: 82 ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE---LELRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +L+YG ++++L + + DE G G G + +++I
Sbjct: 135 NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDT---LTDEQEVGVGEGPKSSFLPTYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD + A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA-----PSTKRL 467
+ + T+ FLGSRLG+SLL+++T L G ++ + E D P +K+
Sbjct: 372 TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQDMPLEEGKDQQDKEKEKDMDKQEEPPSKKK 431
Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
R SS D V+ E+ +YGS A + T+ A T+SF V DS++NIGP + S G
Sbjct: 432 RVESSSNWTDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAFL 487
Query: 522 ---LRINADAS-----ATGISKQSNYELV------------ELPGCKGIWTVY------- 554
+ N + +G K ++ ELPGC +WTV
Sbjct: 488 SEEFQSNPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEPVQK 547
Query: 555 --HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
++ R + A D + H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 548 EQEETEREGKEKTEPPAEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 606
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
AGN+ + +IQV G R+L+G +TQ L F P + S ++ S+ADPYV+
Sbjct: 607 FAGNIGDNKYIIQVSPMGIRLLEG--VTQ-LHFIPVDL-------GSPIVHCSLADPYVV 656
Query: 673 LGMSDGSIRLLVGD-----PSTCTVSVQTPAAIESSK-------KPVS---------SCT 711
+ ++G + + V T +++Q P S+ + VS SC+
Sbjct: 657 IMTAEGVVTMFVLKIDSYMGKTHRLALQKPQISTQSRVIALCAYRDVSGMFTTENKVSCS 716
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGV---------GEAIDGADGGPLDQGDI---------- 752
+ D + LST + G++ G +++
Sbjct: 717 ITEDISIRSQSEAETIIQDLSTNIVDDEEEMLYGDSNTGPSKEEMNRSSFAGPSEGSYSK 776
Query: 753 -----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
+ ++ +SG +EI+ +P++ VF V F G+ +VD+ ++ E E
Sbjct: 777 AEPSHWCLITRDSGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSATQGEKE--GKK 834
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF--EGPENTS 865
EE T QG + + +V L +HSRP+L + D +L Y+A+ + + P+N
Sbjct: 835 EEVTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVDQELLIYEAFPYDQQQPQNNL 888
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--------ETPHGAPCQ----- 912
K +RF + P + RE + G +
Sbjct: 889 K-----------------------VRFKKVPHNINFREKKSKLRKDKKAEGTAAEDSVAA 925
Query: 913 -----RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCN 966
R F++ISG+ G F+ G P W +V R LR+HP DG I +F+ HN+NC
Sbjct: 926 RGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMSIDGPIESFSPFHNINCP 985
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
GF+Y QG L+I LP+ +YD WPV+K+
Sbjct: 986 KGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1017
>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1469
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 318/1100 (28%), Positives = 521/1100 (47%), Gaps = 165/1100 (15%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFA +K +H PTG+ + + + TH P+ G G PNLVV
Sbjct: 1 MSFAIHKQVHPPTGVDHACAAYFTH---------PI--------------GSGAPPNLVV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRR---------VLMDGISAA------------ 99
AN + IY +R +G SG + ++ D IS A
Sbjct: 38 LQANRLTIYAIR--RDGDARDNPSGNATKEADDAAIAASLVADAISGAGATASATIDADD 95
Query: 100 ---SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
SLE+V + L+G V S+A L + +RD+++LA ++K+SV+EFD S L
Sbjct: 96 AEVSLEVVAEFDLNGTVGSIATLRRRFGAPREQRDALLLAVRESKLSVVEFDPSTLSLVC 155
Query: 157 TSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY---GLQMIILKASQG 209
+S+H +E+P G S R P+V DP+GRC VL+ G ++ +L
Sbjct: 156 SSLHSWETPPG---AGGVPSALRLAPTPPVVVADPEGRCAAVLLRAEGGTRLALLPTDND 212
Query: 210 GSGLVGDEDTFGSGG----GFSARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILH 264
+ G + + G G G +A ++ S+V++L R++ +++V+D F+HGY EPV+++LH
Sbjct: 213 AMDVDGGDGSEGKGRRTLRGTAAAVKKSYVVDLVREMGVRYVRDVCFLHGYGEPVLLVLH 272
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
E LTWA R + T +SA+S++ ++H +IW LPH Y+L A+P+P+GG +V+
Sbjct: 273 EERLTWAARATLVKDTMRLSAISLNVDARKHTVIWRRSALPHSCYRLTAMPAPLGGAIVL 332
Query: 325 GANTIHYHSQSASCALALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
N + + SQ +S ALALN A D + + ++ + LD A+A + AL++T
Sbjct: 333 SQNFLLHESQESSAALALNPLAGGGRGDDPAAKAAAAASAAALDGAYAAVISEKQALVTT 392
Query: 382 KTGDLVLLTVVYDGRVVQR---LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
K G L LL++ +GR + + L + +VL+S + + L FLGSR+GDSLLV
Sbjct: 393 KAGALYLLSLRIEGRRLATRGGMHLKRAGGAVLSSGMCLVTRRLLFLGSRVGDSLLVS-R 451
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA- 497
C + + ++ + A + +R D V G + +A+ +
Sbjct: 452 CSTARASTAAPGRRPRAAAAAATTAAAEVRLLPIRPQIDGVGGVSAASLRAAAAAHRAPD 511
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG----------------------ISK 535
++F VRDS++ I P+ D + G A AS +G + +
Sbjct: 512 HPGYTFTVRDSVLGISPVIDLTVG----ASASVSGDTIERTELIAACGHGKNGALAVLQR 567
Query: 536 QSNYELVE------LPGCKGIWTVYHKSS---RGHNADSSRMAAYDDEYHAYLIISLEAR 586
ELV LPG KG WTV+H S+ R + ++ A D YHAYL+ISL +
Sbjct: 568 GIQPELVTEVESGTLPGLKGTWTVHHDSADNERLRGSAAAAAAQAVDPYHAYLVISLASS 627
Query: 587 TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
TM+LET + L EV+E V+ T+ AGN FGR R++QV+++G R+ G QD++
Sbjct: 628 TMILETGEELKEVSEHVELVTDAATLCAGNAFGRERIVQVYDKGVRVAAGPVKVQDIAST 687
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-------VQTP-- 697
+++G G E +++ I+ PYVL +SDGS+ +L GD + T+ + P
Sbjct: 688 ELVADAGDG-EGIEIVAAEISFPYVLCRLSDGSLAVLKGDEESKTLVKLDVDALARLPPG 746
Query: 698 -----AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGADGGPLDQG 750
A + P ++ HD+ P +L++ +T +T + + D +
Sbjct: 747 GGIACATLVDDSTPAAAHGGLHDRSPG-FLKRATTATATTTTTTASASREDGDDDDDSRR 805
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
++ V GALE++ +P+ + +T + G + +
Sbjct: 806 PMFLAVTRTGGALELYSLPSCDKAWTANGLSEGVAVLSPA--------GSASAALVDRDA 857
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
+ ++VEL + ++ H RP L A+ DG +L Y+A+
Sbjct: 858 AAAADAGADRAPEIVELRVDAFARAHERPLLTALRADGAVLVYRAF-------------- 903
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA------PCQRITIFKNI---S 921
+ +V+ L LRF+R P++ E GA P R+T F+ +
Sbjct: 904 -----TCAVAGPGGRALTQLRFARVPVEL---EGGGGGAVDLSALPGSRLTRFERVGDRG 955
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKI 980
G +G F+SG +P W + R R+ P + +V+FT HNVNC+ GFI T+ G ++I
Sbjct: 956 GIRGVFVSGPQPLWLLARRSRVLALPVRGEAQRVVSFTAFHNVNCHAGFILGTAAGGVRI 1015
Query: 981 CQLPSGSTYDNYWPVQKVVF 1000
CQ+P Y+ WPV+K+
Sbjct: 1016 CQIPGRMHYEAAWPVRKLAL 1035
>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oreochromis niloticus]
Length = 1456
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 315/1080 (29%), Positives = 504/1080 (46%), Gaps = 217/1080 (20%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GN+ S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKADKSSDSKSR-------KEKLEQVASFSLFGNIMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLVGAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +LVYG ++++L + + DE G G G + S++I
Sbjct: 135 NVHIPVVRVDPENRCAVMLVYGTKLVVLPFRKDT---LTDEQESGVGEGPKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD + ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTAFPLRVQDEVKLTLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA---PS 463
SVLT+ + T+ FLGSRLG+SLL+++T + G + + + + D PS
Sbjct: 366 AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQETPAEEGKERQDKEKDKDKQEPPS 425
Query: 464 TKRLRRSSSD----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
K+ SS++ L D V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 426 KKKRVESSTNWTVCVILDFFVLSDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 481
Query: 513 GPLKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGC 547
GP + S G + N + +G K ++ ELPGC
Sbjct: 482 GPCANASMGEPAFLSEEFQSNPEPDLEVVVCSGYGKNGALSVLQRSIRPQVVTTFELPGC 541
Query: 548 KGIWTVYHKSSRGHNADSSRMAAY------------DDEYHAYLIISLEARTMVLETADL 595
+WTV + D + D + H +LI+S E TM+L+T
Sbjct: 542 HDMWTVISSDVKEDKTDKEEVEKEEEEKKTEPPLEDDAKKHGFLILSREDSTMILQTGQE 601
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ S + QG T+ AGN+ + +IQV G R+L+G + L F P +
Sbjct: 602 IMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGLRLLEG---VRQLHFIPVDL----- 652
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDP-----STCTVSVQTPAAIESSKKPVSSC 710
S ++ S+ADPYV++ ++G + + V T +++Q P I S + ++ C
Sbjct: 653 --GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDSYMGKTHRLALQKP-QIPSQSRVITLC 709
Query: 711 -------------------------------TLYHDKG-----PEPWLRKTSTDAWLSTG 734
T+ HD E L S + +T
Sbjct: 710 AYRDVSGMFTTENKVSCSIKEDTIRSQSEAETIIHDMSNTVDDEEEMLYGDSNAS--ATP 767
Query: 735 VGEAIDGADGGPLDQGDI----------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGR 784
E I+ + P G + ++ E+G +EI+ +P++ VF V F G+
Sbjct: 768 AKEDINRSFVAPTTSGSEATSSKAEPTHWCMIIRENGVMEIYQLPDWRLVFLVKNFPVGQ 827
Query: 785 THIVDTYMREALKDSETEINSSSEEGT-GQGRKENIHSMK----VVELAMQRWSAHHSRP 839
+VD+ SS + T G+G+KE + V E+A+ +HS+P
Sbjct: 828 RVLVDS--------------SSGQSATQGEGKKEEVTRQGEIPLVKEVALVSLGNNHSKP 873
Query: 840 FLFAILTDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
+L + + +L Y+A+ + + P+N K +RF + P
Sbjct: 874 YLL-VHVEQELLIYEAFQYDQQQPQNNLK-----------------------VRFKKVPH 909
Query: 898 DAYTRE----------------ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCMVF 939
+ RE E G + R F++ISG+ G F+ G P W +V
Sbjct: 910 NINFREKKSKLKKDKKAESSATEESSGVKGRIARFRFFEDISGYSGVFICGPSPHWMLVT 969
Query: 940 -RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
R LR+HP DGSI +F+ HN+NC GF+Y QG L+I LP+ +YD WPV+K+
Sbjct: 970 SRGALRLHPMTIDGSIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1029
>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oryzias latipes]
Length = 1456
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 320/1074 (29%), Positives = 502/1074 (46%), Gaps = 205/1074 (19%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + S S + K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSSSDKSSDAKTR-------KEKLEQVASFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +D+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLTGAS----KDALLLSFKDAKLSVIEYDPGTHDLKTLSLHYFEEPE---LRDGFFQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +L+YG ++++L + + DE G G G + S++I
Sbjct: 135 NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDT---LSDEQEGGVGEGPKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD + ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTSFPLRVQEEVKITLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT+ + T+ FLGSRLG+SLL+++T + G ++ + E P K+
Sbjct: 366 AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPAEDGNDKQ--EKEKQEPPNKK 423
Query: 467 LRRSSSD-----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGP 514
R SS L D V+ E+ +YGS A + T+ A TFSF V DS++NIGP
Sbjct: 424 KRVESSSNWTGCSASYFFVLSDEVD--EIEVYGSEAQSGTQLA--TFSFEVCDSILNIGP 479
Query: 515 LKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGCKG 549
+ S G + N + +G K ++ ELPGC
Sbjct: 480 CANASMGEPAFLSEEFQSNPEPDLEIVVCSGYGKNGALSVLQRSIRPQVVTTFELPGCHD 539
Query: 550 IWTVY----HKSSRG--HNADSSRMAAYDD---------EYHAYLIISLEARTMVLETAD 594
+WTV K S G AD+ + D + H +LI+S E TM+L+T
Sbjct: 540 MWTVISGEDKKESEGGEKEADAEKKEEQDKTEPPLEDDAKKHGFLILSREDSTMILQTGQ 599
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
+ E+ S + QG T+ AGN+ + +IQV G R+L+G + L F P +
Sbjct: 600 EIMELDTS-GFATQGPTVFAGNIGDNQYIIQVSPMGLRLLEG---VKQLHFIPVDL---- 651
Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT-----VSVQTPAAIESSK----- 704
S ++ S+ADPYV++ ++G + + V T +++Q P S+
Sbjct: 652 ---GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDTYMGKTHRLALQKPQISTLSRVIALC 708
Query: 705 -----------KPVSSC---------------TLYHDKG----PEPWLRKTSTDAWLSTG 734
+ SSC T+Y D E + + A ++ G
Sbjct: 709 AYRDVSGMFTTENKSSCSSKEDLILRSNSETETVYQDLSNTVDDEEEMLYGESGASMAAG 768
Query: 735 V-----GEAIDGADGGPLDQGDI----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
G A GG G + V+ E+G +EI+ +P++ VF V F G+
Sbjct: 769 KEEMSRGSAATAPPGGEGSAGKAEPSHWCVLIRENGVMEIYQLPDWRLVFLVKNFPVGQR 828
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+VD+ + S T+ + EE T QG + + +V L R SRP+L +
Sbjct: 829 VLVDS----SSGQSATQGDGKKEEVTRQGEIPLVKEVALVALGNNR-----SRPYLL-VH 878
Query: 846 TDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
+ +L Y+A+ + + P+N K +RF + P RE
Sbjct: 879 VENELLVYEAFPYDQQQPQNNLK-----------------------VRFKKVPHSINFRE 915
Query: 904 ETPH---------GAPCQRITI---------FKNISGHQGFFLSGSRPCWCMVF-RERLR 944
+ P G P + + + F++ISG+ G F+ G P W ++ R LR
Sbjct: 916 KKPKLKKDKKAEGGGPEENVAVKSRISRFRYFEDISGYSGVFICGPSPHWMLITSRGGLR 975
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+HP DG I +F+ HN+NC GF+Y QG L+I LP+ +YD WPV+K+
Sbjct: 976 LHPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1029
>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
chinensis]
Length = 1469
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 319/1086 (29%), Positives = 498/1086 (45%), Gaps = 216/1086 (19%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T + HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQRVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTAG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD AHA ++ D ++S K G++ +LT+V DG R V+ K
Sbjct: 307 TTAFPLRTQDGVRLTLDCAHAAFISYDKMVISLKGGEIYVLTLVTDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFS-- 519
+ SS QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + +
Sbjct: 425 DPTGGWSGSSTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480
Query: 520 ---------------YGLRINAD--ASATGI---------SKQSNYELV----------- 542
YGL A+ TG+ S + + E+V
Sbjct: 481 EPAFLSEEVGTGVAEYGLIGQAEGWGRRTGLTPAPVQFQNSPEPDLEIVVCSGYGKNGAL 540
Query: 543 ---------------ELPGCKGIWTVY-------HKSSRGHNADSSRMAAYDD--EYHAY 578
ELPGC +WTV ++ + + R A +D H +
Sbjct: 541 SVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKDEEETPKAEGTEQPRAAEAEDGVRRHGF 600
Query: 579 LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 601 LILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-- 657
Query: 639 MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-----TVS 693
L F P + + ++ ++ADPYV++ ++G + + + T ++
Sbjct: 658 -VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFLLKSDTYGGRHHRLA 709
Query: 694 VQTPAAIESSKKPVSSCTLYHD-------------KGPEPWLRKTSTDAWLSTGVGEAID 740
+ P SK V + LY D EP R + L +D
Sbjct: 710 LHKPPLHHQSK--VITLCLYRDVSGMFTTESRLGGARDEPGARGSCEVEGLGAETSPTVD 767
Query: 741 G------ADGG----------------PLDQGDI--------YSVVCYESGALEIFDVPN 770
D G P D+ + ++ E+G +E++ +P+
Sbjct: 768 DEEEMLYGDSGSLFSPSKEETRRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEMYQLPD 827
Query: 771 FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
+ VF V F G+ +VD+ + T+ + EE T QG + + +V L
Sbjct: 828 WRLVFLVKNFPVGQRVLVDS----SFGQPATQAEARKEEATRQGELPLVKEVLLVALG-- 881
Query: 831 RWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL 890
+ SRP+L + D +L Y+A+ P ++ L N+ +
Sbjct: 882 ---SRQSRPYLL-VHVDQELLLYEAF----PHDS-----------QLGQGNL------KV 916
Query: 891 RFSRTPLDAYTRE-------------ETPHGAPCQ----RITIFKNISGHQGFFLSGSRP 933
RF + P + RE T GA + R F++I G+ G F+ G P
Sbjct: 917 RFKKVPHNINFREKKLKPSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPSP 976
Query: 934 CWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 977 HWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAP 1036
Query: 993 WPVQKV 998
WPV+K+
Sbjct: 1037 WPVRKI 1042
>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
[Crassostrea gigas]
Length = 1369
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 303/993 (30%), Positives = 468/993 (47%), Gaps = 151/993 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+E + + L GN+ S+ + GA RDS++L+F +AK+SV+E+D H L+ TS+H
Sbjct: 5 MECLATFTLFGNIMSMKYVKLPGA----LRDSLLLSFSEAKLSVVEYDPGTHDLQTTSLH 60
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE P +K G + P V+VDP GRC +LVYG M+IL + GD
Sbjct: 61 FFEEPS---MKGGFFTNYCIPEVRVDPDGRCAAMLVYGTHMVILPFRRDVMVEEGD---- 113
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G + I SS++I+LR+ D K +VKDF F+HGY EP + IL E TWAGR + +
Sbjct: 114 NLAGTSKSPILSSYIIDLRNFDEKIINVKDFQFLHGYYEPTVFILFEPLQTWAGRTAVRA 173
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC I A+S++ K HP+IWS +LP D ++LAVP PIGGV+++ N++ Y +QS
Sbjct: 174 DTCSIVAISLNLQEKVHPVIWSLGSLPFDCCQVLAVPRPIGGVIIIAVNSLLYLNQSVP- 232
Query: 339 ALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
Y VSL+S S P + + LD A ++ D +LS K G+L +LT+
Sbjct: 233 -----PYGVSLNSISAQSTLFPLRVQEGVRIALDCCQAAFMSYDKIVLSLKGGELYVLTL 287
Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
V DG R V+ + K+ SVLTS + + FLGSRLG+SLL+++T + + + L
Sbjct: 288 VVDGMRSVRSFNFDKSAASVLTSCMCICEDGFLFLGSRLGNSLLLKYTEKASECLENGDL 347
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVR 506
++ + D P+ K+ + S + V N +L +YGSA N T + +++F V
Sbjct: 348 DKK----KEDEPAAKKKKVEGSTEIASDVSQIENLYDLEVYGSAENPTSTTITSYTFEVC 403
Query: 507 DSLVNIGPL------------KDFSYGLRINADASAT-GISKQSNYELV----------- 542
D++ NIGP ++FS + + T G K ++
Sbjct: 404 DNIWNIGPCGNIVMGEPAFLSEEFSSCEDPDIEMVMTSGYGKNGALSVLQRSIRPQVVTT 463
Query: 543 -ELPGCKGIWTVYH--KSSRGHNADSSRMAAYDDEY---HAYLIISLEARTMVLETADLL 596
ELPGC +WTV + + ++S DD H++LI+S +M+LET +
Sbjct: 464 FELPGCLDMWTVKSLVPKEKSEDKENSMEDDSDDNIEGGHSFLILSRSDSSMILETGQEM 523
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
E+ S + Q TI AGN+ G R ++QV + R+L+G Q + ++G
Sbjct: 524 NELDHS-GFSTQTTTIFAGNIGGDRYIVQVSDTSLRLLEGVRQIQHIPL-----DTG--- 574
Query: 657 ENSTVLSVSIADPYVLLGMSDGSI--------------RLLVGDPSTCTVSVQTPAAIES 702
S V+ S+ADPY++L +G I RL+VG PS +S + + S
Sbjct: 575 --SPVVQCSLADPYIVLLTQEGQILMFTLRTESVGLGVRLVVGKPS---ISQHSKVEVIS 629
Query: 703 SKKPVSSCTLYHDK------GPEPWLRKTSTDAWLSTGV-----------GEAIDGADGG 745
+ K VS ++ P+ KT T+ S GE
Sbjct: 630 AYKDVSGLFTCMNQMEDVQVTPDTKATKTVTERSFSIDAKTADEEDELLYGETESNVFNS 689
Query: 746 PLDQGDI-------------------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH 786
+ G + ++C E+G LEI+ +P++ V+ V F G+
Sbjct: 690 SFNMGQTAEMESPTKEKKQTEAKPTYWLLLCRENGVLEIYSIPDYKKVYYVKNFPMGQKL 749
Query: 787 IVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILT 846
+VD+ ++ G Q K N + EL M SRP L A +
Sbjct: 750 LVDS----------VQVTDKLSSGERQ-EKVNAECPALKELLMVGLGYKDSRPHLLARVE 798
Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
D Y++E S D R + + R + + + + + +EE
Sbjct: 799 D------DLYIYEAFSYPQSSIDNHLKLRFKKIQHDLILREKRSKSKKKDPEEFQKEEKK 852
Query: 907 HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
G ++ FK+++G+ G F+ G+ P W V R LR+HP DG + F+ HN+NC
Sbjct: 853 VG----KMRYFKDVAGYSGVFVCGAYPHWIFVTSRGSLRIHPMGIDGPVWCFSEFHNINC 908
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
HGF+Y G L+I LP+ TYD WPV+KV
Sbjct: 909 PHGFLYFNKMGELRISVLPTHLTYDAPWPVRKV 941
>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
rerio]
Length = 1449
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 299/1024 (29%), Positives = 488/1024 (47%), Gaps = 178/1024 (17%)
Query: 94 DGIS-AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
DG S LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H
Sbjct: 58 DGKSRKEKLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTH 113
Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
L+ S+H FE PE L+ G P+V+VDP+ RC +LVYG +++L +
Sbjct: 114 DLKTLSLHYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT-- 168
Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTW 270
+ DE G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW
Sbjct: 169 -LADEQEGIVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTW 227
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
GRV+ + TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++
Sbjct: 228 PGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLL 287
Query: 331 YHSQSAS-CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
Y +QS ++LN+ + P+ + LD + A+++ +D ++S K G++ +L
Sbjct: 288 YLNQSVPPFGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVL 347
Query: 390 TVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
T++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T + +
Sbjct: 348 TLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEE 407
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEELSLYGS-ASNNTESAQKT 500
G + E + + P K+ R S+ A L D ++ E+ +YGS A + T+ A T
Sbjct: 408 GKENEEKEKQ---PPNKKKRVDSNWAGCPGKGNLPDELD--EIEVYGSEAQSGTQLA--T 460
Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV----- 542
+SF V DS++NIGP S G + N + +G K ++
Sbjct: 461 YSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSIR 520
Query: 543 -------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIISL 583
ELPGC +WTV + + G + + + D + H +LI+S
Sbjct: 521 PQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSR 580
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
E TM+L+T + E+ S + QG T+ AGN+ + +IQV G R+L+G L
Sbjct: 581 EDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQL 636
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------------RLLVGDPS 688
F P + S ++ S+ADPYV++ ++G + RL + P
Sbjct: 637 HFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKPQ 689
Query: 689 TCT------------------------------VSVQTPAAIESSKKPVSSCT------L 712
T ++++T + E+ + +S+ L
Sbjct: 690 IHTQSRVITLCAYRDVSGMFTTENKVSFLAKEEIAIRTNSETETIIQDISNTVDDEEEML 749
Query: 713 YHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN 772
Y + P K + + G + + ++ E+G +EI+ +P++
Sbjct: 750 YGESNPLTSPNKEESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPDWR 809
Query: 773 CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
VF V F G+ +VD+ + S T+ EE T QG +I +K E+A+
Sbjct: 810 LVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVALVSL 860
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+HSRP+L A + + +L Y+A+ ++ + + S L+ +RF
Sbjct: 861 GYNHSRPYLLAHV-EQELLIYEAFPYDQQQ--------------------AQSNLK-VRF 898
Query: 893 SRTPLDAYTREET--------PHG---------APCQRITIFKNISGHQGFFLSGSRPCW 935
+ P + RE+ P G R F++ISG+ G F+ G P W
Sbjct: 899 KKMPHNINYREKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHW 958
Query: 936 CMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
+V R +R+HP DG+I +F+ HN+NC GF+Y QG L+I LP+ +YD WP
Sbjct: 959 MLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWP 1018
Query: 995 VQKV 998
V+K+
Sbjct: 1019 VRKI 1022
>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
Length = 1420
Score = 374 bits (961), Expect = e-100, Method: Compositional matrix adjust.
Identities = 310/1100 (28%), Positives = 498/1100 (45%), Gaps = 210/1100 (19%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A YK H PTG+ C + +R NLVV
Sbjct: 2 YAIYKETHPPTGVEFCVNCHFYSARES---------------------------NLVVAG 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+ ++ + Q+EGS +++ G + +R LELV + L GN+ESL +
Sbjct: 35 TTEVRVFRLCYQQEGSSSAESGGSSLKR---------KLELVGQHSLFGNIESLHAIRLA 85
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
G RDS++++F+DAK+S++++D H ++ S+H FE + +K + R P+
Sbjct: 86 G----NTRDSLLMSFKDAKLSIVDYDPGKHDIKTRSLHFFEDEK---IKSHCLAQDRAPV 138
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
V++DP+ RC +L YG +++L Q G +D+ S + S++I+++++D
Sbjct: 139 VRIDPERRCAVMLAYGTHLVVLPFRQEGGIDDTAQDSIISSSD-RPPVLPSYIIDVKEID 197
Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
K ++ D F+HGY EP ++IL+E TWAGR++ ++ TC + A+S++ + K HP++W
Sbjct: 198 EKTCNILDIQFLHGYYEPTLLILYEPLKTWAGRLAMRNDTCALVAVSLNMSQKAHPVVWQ 257
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE------ 354
LP D ++ VP PIGGVLV N + Y +QS Y VS++S E
Sbjct: 258 LSCLPFDCIYVMPVPKPIGGVLVCCMNALLYLNQSVP------PYGVSVNSIGENSTVFP 311
Query: 355 -LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
P+ ++ L+ ++A ++ ND + S K G++ ++T++ DG R V+ KT SVLT
Sbjct: 312 LKPQKGVTITLEGSNAIFIANDKLVFSLKGGEIYVVTLIADGVRSVRNFVFDKTAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
S + G+ FLGSRLG+SLLV++T + G + ++ D + +R + +
Sbjct: 372 SCVCECGDGYLFLGSRLGNSLLVKYT--EKPQDIVYGTENNAQSMQCD--NIERWQILNG 427
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----LRIN--- 525
L + + +EL +YG A +++F V DSL+NIGP G L ++
Sbjct: 428 SLLLIVDDLDELEVYG-AQQEAGVELTSYTFEVCDSLLNIGPCSCMDIGEPAFLSVSSYF 486
Query: 526 ADA--------SATGISKQSNYELV------------ELPGCKGIWTVYHKSSRG----- 560
ADA S +G K ++ ELPGC +WTV+ K +
Sbjct: 487 ADAQELDLEVVSCSGYGKNGALTVLQRSIRPQVVTTFELPGCTDMWTVFSKDQKKGAQTN 546
Query: 561 --HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
H S +++YH++LI+S E +M+L+T + EV +S + Q TI AGN
Sbjct: 547 AIHRYPSQPCTQGNEKYHSFLILSREDSSMILKTEQEIMEVDQS-GFSTQCATIYAGNFG 605
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
++QV G R+L+G Q + +SG S ++ S+ DPY +L M+DG
Sbjct: 606 NGSYILQVTPLGVRLLEGVNQLQHIPM-----DSGL----SNIVWCSVCDPYAVLLMADG 656
Query: 679 SIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------------KGPEP 720
S+ L+ + S ++V P+ +SSK V +C Y D K P P
Sbjct: 657 SVILIEFIKSASGPKLTVSRPSLSQSSK--VCACCTYKDMSGLFTTENSNLEEVSKVPSP 714
Query: 721 WLRKTS----------------------TDAWLSTGVGEAIDGADGGPLD------QGDI 752
T+ T L+ E P++ Q
Sbjct: 715 KPEMTAPPRQEKESLTIDEEDELLYGGDTSLTLTFEPPEPSKAESAAPVEVFEEPLQPSY 774
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ +VC E+G +EI+ +P F VF V F IVD+ DS SS E
Sbjct: 775 WCLVCRENGVMEIYSLPGFTRVFFVKNFSKAPRVIVDS------GDSGASTQSSVSEE-- 826
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
S+ V E+ + + R L A++ D +L Y+A+ + E
Sbjct: 827 -------ESLNVREVLLTGLGYKNRRATLVAVM-DQDLLIYEAFSYPTVEGH-------- 870
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-------------CQRITIFKN 919
NLRF + + RE+ P P + +F +
Sbjct: 871 ---------------LNLRFKKLQHNIQIREKKPKQEPKNDSETKSGLDPKVAMLRVFND 915
Query: 920 ISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
IS + G F+ GS P W V R HP DG + F HNVNC GF+Y ++G L
Sbjct: 916 ISSYSGIFVCGSYPFWIFVTNRGAFHWHPMSIDGPVTCFAAFHNVNCPKGFLYFNTRGEL 975
Query: 979 KICQLPSGSTYDNYWPVQKV 998
+I LP+ +YD+ WPV+KV
Sbjct: 976 RISVLPTHLSYDSPWPVRKV 995
>gi|414587798|tpg|DAA38369.1| TPA: hypothetical protein ZEAMMB73_163106, partial [Zea mays]
Length = 483
Score = 363 bits (933), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 178/317 (56%), Positives = 212/317 (66%), Gaps = 10/317 (3%)
Query: 685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
DPSTCT+S+ PA SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID D
Sbjct: 33 ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
D DIY ++CYESG LEIF+VP+F VF+VD FVSG + D + R + KDS
Sbjct: 93 SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+S+ +KE ++K+VELAM RWS SRPFLF +L DGT+LCY AY FEG E+
Sbjct: 153 DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
+ S + N + SRLRNLRF R +D +R++ C RITIF N+
Sbjct: 210 VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDI----SCLVRPRITIFNNVG 265
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G++G FL G RP W V R+R RVHPQLCDG IVAFTVLHNVNC G IYVTSQG LKIC
Sbjct: 266 GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325
Query: 982 QLPSGSTYDNYWPVQKV 998
QLPS YDNYWPVQKV
Sbjct: 326 QLPSAYNYDNYWPVQKV 342
>gi|414587799|tpg|DAA38370.1| TPA: hypothetical protein ZEAMMB73_163106 [Zea mays]
Length = 461
Score = 363 bits (931), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 178/317 (56%), Positives = 212/317 (66%), Gaps = 10/317 (3%)
Query: 685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
DPSTCT+S+ PA SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID D
Sbjct: 33 ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
D DIY ++CYESG LEIF+VP+F VF+VD FVSG + D + R + KDS
Sbjct: 93 SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+S+ +KE ++K+VELAM RWS SRPFLF +L DGT+LCY AY FEG E+
Sbjct: 153 DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
+ S + N + SRLRNLRF R +D +R++ C RITIF N+
Sbjct: 210 VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDI----SCLVRPRITIFNNVG 265
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G++G FL G RP W V R+R RVHPQLCDG IVAFTVLHNVNC G IYVTSQG LKIC
Sbjct: 266 GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325
Query: 982 QLPSGSTYDNYWPVQKV 998
QLPS YDNYWPVQKV
Sbjct: 326 QLPSAYNYDNYWPVQKV 342
>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus terrestris]
Length = 1417
Score = 362 bits (929), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 292/1030 (28%), Positives = 472/1030 (45%), Gaps = 159/1030 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L V AN+I I+ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
P+V+VDP+GRC +L+YG ++++L + S + D D + S + I SS++
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
I L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306
Query: 354 EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
E P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 307 ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ + + E +
Sbjct: 367 AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426
Query: 465 KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
K++++ +SD L D+ + EEL +YGS S Q T + F V DSL+NIGP +
Sbjct: 427 KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485
Query: 518 FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
S G + S+ + ELV ELPGC+ +W
Sbjct: 486 ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV G + ++ + HA+LI+S E TM+L+T + EV +S + QG T
Sbjct: 545 TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I AGNL R ++QV + G R+L G Q + ++ S ADPYV
Sbjct: 599 IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
L DG + LL T + AA + + + Y D+
Sbjct: 649 TLLSEDGQVMLLTLREGRGTAKLHAQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708
Query: 717 GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
PE + T + S G+ + +
Sbjct: 709 APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGISKRTPWWQKHLQEIK 768
Query: 751 DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
Y ++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 769 PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 828 ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
+ L + LR P+ TR C + F NI+G+ G F+
Sbjct: 870 --LRFKKLDHGIIPGQLKPKLRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 920 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979
Query: 989 YDNYWPVQKV 998
YD WPV+KV
Sbjct: 980 YDAPWPVRKV 989
>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1
[Camponotus floridanus]
Length = 1418
Score = 361 bits (927), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 299/1029 (29%), Positives = 478/1029 (46%), Gaps = 157/1029 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV ANVI ++ + + ++ K + ++ LE + Y LHGN+ S+
Sbjct: 30 LVVAGANVIRVFRLIPDIDMTRREKYTENRPPKM--------KLECLAQYTLHGNIMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 82 AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
P+V+VDP+GRC +L+YG ++++L + S + D D S S+ I SS+
Sbjct: 135 HHIPIVRVDPEGRCAIMLIYGRKLVVLPFRKDPS--LDDGDLLDSAKLTSSNKTPILSSY 192
Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+I L+ L+ M +V D F++GY EP ++IL+E T++GR++ + TC + A+S++
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLYGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
+ HP+IWS NLP D Y+++ V P+GG L++ N++ Y +QS ++LN+ A + +
Sbjct: 253 RVHPIIWSVSNLPFDCYQVVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTN 312
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + L+ + ++ D ++S K+G+L +L++ D R V+ K SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISGDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRR 469
LTS + ++ FLGSRLG+SLL++FT ++ + E + E++ K+ ++
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENESEETPAKKAKQ 432
Query: 470 S------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
+SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S G
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490
Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
N D + +G K ++ ELPGC+ +WTV
Sbjct: 491 AFLSEEFLHNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI---- 546
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
G + ++ A + HA+LI+S E TM+L+T + EV +S + QG T+ AGNL
Sbjct: 547 -GTLNNDEQVKAEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFAGNLG 604
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
R ++QV + G R+L G Q + ++ S ADPYV L DG
Sbjct: 605 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLSEDG 654
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP--EPWLRKTSTDAWLST--G 734
+ LL T + A + + + Y D L +T+ D +
Sbjct: 655 QVVLLTLREVRGTARLHAQPANLLFRPQIEALCTYRDVSGIFTTQLSETTDDEQVEEEHN 714
Query: 735 VGEA-----IDGADG-----------------GPLD----------------QGDIYSVV 756
V E ID D PLD + + +V
Sbjct: 715 VEEPSLLSNIDNEDDLLYGDAPAFQMPAPSYQKPLDGVSKKAPWWQRHLQEIKPTYWLLV 774
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
+SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 775 YRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQSAPVNEIPNPE-------- 826
Query: 817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 827 -----MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGYLK---------- 868
Query: 877 LSVSNVSASRLRNLRFSRTP--LDAYTREE-TPHGAPCQRITI---FKNISGHQGFFLSG 930
R + L P L +EE P A RI + F NI+G+ G F+
Sbjct: 869 --------LRFKKLEHGIIPGRLSPKPKEEDMPMNASETRICMMRYFSNIAGYNGVFICC 920
Query: 931 SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
P W + R LR HP DG I +F +NVNC GF+Y + L+IC LP+ +Y
Sbjct: 921 DYPHWIFLTGRGELRTHPMGIDGPITSFAAFNNVNCPQGFLYFNRKEELRICVLPTHLSY 980
Query: 990 DNYWPVQKV 998
D WPV+KV
Sbjct: 981 DAPWPVRKV 989
>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
Length = 1409
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 301/1025 (29%), Positives = 471/1025 (45%), Gaps = 155/1025 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LVV N++ ++ + + +K T+RR LE + + L NV S+
Sbjct: 29 SLVVAGKNILRVFQLIPDID---PTKRDAYTERRP-----PKMKLECLSSFSLFANVMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ RD+++L+F +AK+ V+E+D H LR S+H FE + +K G +
Sbjct: 81 QAVSLAGSS----RDALLLSFREAKLCVVEYDPDSHDLRTLSLHYFEEED---MKGGWTN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSG-GGFSARIES 232
P V+VDP+GRC +LVYG +++IL + S+ + D S A + S
Sbjct: 134 HYDIPYVRVDPEGRCAAMLVYGRKLVILPFRRESKLDDPDIALLDPHSSSVATAKAPVLS 193
Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S+ I LR++D +++V D F++GY EP ++IL+E T+AGR++ + TC + A+S++
Sbjct: 194 SYTITLREIDEKLENVIDIQFLYGYYEPTLLILYEPLKTFAGRIAVRSDTCAMIAVSLNI 253
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSL 349
+ HP IWS NLP + + + VP P+GG L+ N + Y +QS +++N+ A +
Sbjct: 254 QQRVHPAIWSVGNLPFNCTQAIPVPKPLGGTLIFSVNALIYLNQSIPPFGVSVNSIAENS 313
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
+ Q + + L+ + AT++ +D +LS KTG+L +L+++ D R V+ K
Sbjct: 314 TNFQLKIQEGVKITLEGSQATFISHDRLVLSLKTGELYVLSLLADNIRSVRGFHFDKAAA 373
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SVLT+ + + FLGSRLG+SLL++FT + L E E PS +R +
Sbjct: 374 SVLTTCLCVCEDKYLFLGSRLGNSLLLRFTEKESSEAPIITLDESIR--EVPVPSKRRRQ 431
Query: 469 RSSSDAL----QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
+ D + D+ + +EL +YG+ ++ +F F V DSL+NIGP + S G
Sbjct: 432 DALGDWMASDVADIRDLDELEVYGTQEASSSVQITSFMFEVCDSLLNIGPCGNVSMGEPA 491
Query: 525 NADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSS 558
+ ++ + ELV ELPGC +WTV
Sbjct: 492 FLSEEFSN-NRDPDLELVTTSGHGKNGAICVLQRTIRPQVVTTFELPGCLDMWTVI---- 546
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
G +DS A DD HA+LI+S + TM+L+T + EV S + QG TI AGNL
Sbjct: 547 -GPQSDSGPTQAEDDISHAFLILSQKDSTMILQTGQEINEVDHS-GFNTQGPTIFAGNLA 604
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
+ ++QV + G R+L G Q + S+V+ S ADPYV L DG
Sbjct: 605 SNKYIVQVSKAGVRLLRGLEQIQHIPL----------DLGSSVVHASTADPYVALLTEDG 654
Query: 679 SIRLLVGDPS--TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL----- 731
+ LL S +SV P I ++ + CT G L +T+ L
Sbjct: 655 QVVLLTLRESRGQGRLSVFKP-TIPTNPRVSKICTYRDVSG----LFTLTTEEELQNATF 709
Query: 732 ---STGVGEAIDGAD----GG--------------------PLDQGDIYS---------V 755
S + + D D GG P + YS
Sbjct: 710 KSDSKNMKKEADDEDEMLYGGSEVKFQLLPITNTNEPSPPRPFVRWKKYSQEIKPNYWMF 769
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
V E+G L+I+ +P+F F + + G + D ++ + G
Sbjct: 770 VLRETGTLDIYSLPDFRPSFQIRRIGQGHRVLYDV------------LDMAQTSGMDGSD 817
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTS 874
IH + VV L H R + + T+ ++ YQA+ F +GP +
Sbjct: 818 DPEIHELLVVSL------GHLGRRPILLLRTENDLMIYQAFKFAKGPNLKIR-------F 864
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
R L + + R + Y E A R+ F NISG+ G F+ G P
Sbjct: 865 RRLPQTLILKERKAKFKVK------YENEVESERA--TRLRYFSNISGYNGVFVCGPNPH 916
Query: 935 WC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
W + R LR HP L DG + +F HNVNC GF+Y TS+ L+IC LP+ +YD W
Sbjct: 917 WLFLTARGELRSHPMLIDGRVTSFASFHNVNCPLGFLYFTSKCELRICILPTHLSYDAPW 976
Query: 994 PVQKV 998
PV+KV
Sbjct: 977 PVRKV 981
>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus impatiens]
Length = 1417
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 292/1030 (28%), Positives = 471/1030 (45%), Gaps = 159/1030 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L V AN+I I+ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
P+V+VDP+GRC +L+YG ++++L + S + D D + S + I SS++
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
I L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306
Query: 354 EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
E P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 307 ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ + + E +
Sbjct: 367 AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426
Query: 465 KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
K++++ +SD L D+ + EEL +YGS S Q T + F V DSL+NIGP +
Sbjct: 427 KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485
Query: 518 FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
S G + S+ + ELV ELPGC+ +W
Sbjct: 486 ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV G + ++ + HA+LI+S E TM+L+T + EV +S + QG T
Sbjct: 545 TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I AGNL R ++QV + G R+L G Q + ++ S ADPYV
Sbjct: 599 IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
L DG + LL T + AA + + + Y D+
Sbjct: 649 TLLSEDGQVMLLTLREGRGTAKLHVQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708
Query: 717 GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
PE + T + S GV + +
Sbjct: 709 APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGVSKRTPWWQKHLQEIK 768
Query: 751 DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
Y ++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 769 PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 828 ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
+ L + R P+ TR C + F NI+G+ G F+
Sbjct: 870 --LRFKKLDHGIIPGQLRPKPRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 920 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979
Query: 989 YDNYWPVQKV 998
YD WPV+KV
Sbjct: 980 YDAPWPVRKV 989
>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity
factor cpsf [Tribolium castaneum]
Length = 1413
Score = 353 bits (907), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 292/1038 (28%), Positives = 470/1038 (45%), Gaps = 179/1038 (17%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LV + ANVI+++ + + ET + LE V Y L GN+ S+
Sbjct: 30 LVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSMQ 82
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ + RD+++LAF+DAK+SV+E+D H L+ S+H FE + +K G
Sbjct: 83 SVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTHH 135
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESSH 234
P+V+ DP+ RC + V+G ++++L + + D D G G A I +S+
Sbjct: 136 YHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILASY 195
Query: 235 VINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+I L+D + ++ D F+HGY EP ++IL E T+AGRV+ + TC ++A+S++
Sbjct: 196 MIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQQ 255
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS 352
K HP+IWS NLP D K + + P+GG L+ N + Y +QS + Y VSL+S
Sbjct: 256 KVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNSI 309
Query: 353 QE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
E P+ + LD A AT+L++D +LS K G+L +LT++ D R V+
Sbjct: 310 AENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHFE 369
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADAP 462
K SVLT+ I+ N+ FLGSRLG+SLL++FT C ++ E P
Sbjct: 370 KAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIEP 418
Query: 463 STKRLRRSSS------DALQDMVNG------------EELSLYGSASNNTESAQKTFSFA 504
S KRL+ S+S D + D +N EEL +YG+ + ++ F
Sbjct: 419 SAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVFE 477
Query: 505 VRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYELV 542
V DSL+NIGP ++FS L ++ + T + K ++V
Sbjct: 478 VCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQIV 537
Query: 543 ---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
LPGC +WTV+ + HA+LI+S E TM+L+T D + E+
Sbjct: 538 TTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINEI 581
Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENS 659
++ + T+ AGNL + ++QV R+L G Q + E G S
Sbjct: 582 -DNTGFATHIPTVYAGNLGNLKYIVQVTSSAVRLLQGINQLQHIPL-----ELG-----S 630
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG-- 717
++ V+ DPY+ L +DG + L+ + + + S+ PV++ +Y D
Sbjct: 631 PIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSGL 690
Query: 718 ------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS--- 754
PE ++ ++ T + + + G D P + +Y
Sbjct: 691 FTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNWW 749
Query: 755 -------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
V E+ LEI+ +P+F + + G +VD E++ S +
Sbjct: 750 KKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISAS 807
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
S++ E Q R+ ++ + VV L H SRP L L + + Y+ + F P
Sbjct: 808 TPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF--P 858
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
K + NVS R D + +E ++ F NI+
Sbjct: 859 RGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNIA 907
Query: 922 GHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
G+ G F+ G+ P W M R LR HP DG +++F +NVNC GF+Y + L+I
Sbjct: 908 GYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELRI 967
Query: 981 CQLPSGSTYDNYWPVQKV 998
LP+ +YD WPV+KV
Sbjct: 968 GVLPTHLSYDAAWPVRKV 985
>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Megachile rotundata]
Length = 1415
Score = 353 bits (906), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 295/1028 (28%), Positives = 471/1028 (45%), Gaps = 157/1028 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV N+I ++ + + +K K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGGNIIRVFRLIPDVDITKREKYTESRPPKM--------KLECLAQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++F S S + + + E + K+
Sbjct: 369 ASVLTSCVCMCDDNYLFLGSRLGNSLLLRFIEKESENSQNMNENEITIEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S
Sbjct: 429 VKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 GLRINADASATGISKQSNYELV--------------------------ELPGCKGIWTVY 554
G + S+ + ELV ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI 545
Query: 555 HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAA 614
G + ++ + HA+LI+S E TM+L+T + EV +S + QG T+ A
Sbjct: 546 -----GALNNDEQVRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFA 599
Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
GNL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 600 GNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVSLL 649
Query: 675 MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-------KGPE-------- 719
DG + LL T + A + + + Y D + PE
Sbjct: 650 SEDGQVMLLTLREGRGTAKLHAQTANLLFRPQIEALCAYRDVSGIFTTQLPENVEDEVPE 709
Query: 720 --------PWLRKTSTDAWLSTGVGEAID--------GADG----GPLDQGDI------Y 753
P + + L G G A ++G P Q + Y
Sbjct: 710 EEHNTEEPPIVGNIDNEDDLLYGDGPAFQMPAPSQTKSSEGTSKRAPWWQKHLQEIKPTY 769
Query: 754 SVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 770 WLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTAPVNEIPNPE---- 825
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
M+V E+ M H +RP L L D + YQ Y + P+ K
Sbjct: 826 ---------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQTYRY--PKGHLK------ 867
Query: 873 TSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
L + + NLR D ET H C + F NI+G+ G F+
Sbjct: 868 ----LRFKKLDHGIIPGNLRPKPKEEDMSAMNETRH---CM-MRYFSNIAGYNGVFICSD 919
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DG I +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 920 YPHWIFLTGRGELRTHPMGIDGPITSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979
Query: 991 NYWPVQKV 998
WPV+KV
Sbjct: 980 APWPVRKV 987
>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
Length = 1410
Score = 350 bits (899), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 307/1042 (29%), Positives = 465/1042 (44%), Gaps = 188/1042 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV N I +Y + + + ++ ++ ETK + LE V Y L GNV S+
Sbjct: 29 NLVTAGVNQIRVYRLVAESKPVEKESHTTETKS-------AKQKLECVADYELCGNVSSI 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S GA RD+++L FE+AK+S+ ++D L+ S+H FE + L+ G
Sbjct: 82 ESISLVGA----ARDALLLCFEEAKLSLCDYDPDTDDLKTISLHYFEDAD---LENG--C 132
Query: 177 FARG---PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RG V+VDP+GRC +L+YG +I+L + D + S + I S+
Sbjct: 133 CQRGLHHSEVRVDPEGRCAVMLIYGTHLIVLPFRKESPSDEIDATSCAS----KSPIMST 188
Query: 234 HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I+LR LD + +V D F+HGY EP ++IL+E TW RV+ + TC I A+S++
Sbjct: 189 YIIDLRTLDERVTNVVDIQFLHGYYEPTVLILYEPLPTWTCRVAVRKDTCSIVAISLNLQ 248
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP+D + VP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 249 DKTHPIIWSHSNLPYDCLRTFPVPKPIGGVIVFAVNSLLYLNQS------FPPYGVSLNS 302
Query: 352 SQEL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
P+ + LD A A ++ ND ++S K G+L +LT+V D R V+ L
Sbjct: 303 LTSFNTEFLLKPQEGVRMSLDCAQAEFIDNDKLVISLKGGELYVLTLVIDSMRAVRSFHL 362
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
K SVLT+ + G++ FLGSRLG+SLL+++ SS G+ +
Sbjct: 363 DKAAASVLTTCMCMCGDNYLFLGSRLGNSLLLRYQ--EKKPEASSSSDASPGEEQRKEKM 420
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPL------- 515
T + S + + + +EL +YG S ES T F F V DS++NIGP
Sbjct: 421 TLAIGLVGSSDVSKLDDLDELEVYGRDSQAVESEDITQFMFEVCDSIINIGPCGQVEMGE 480
Query: 516 -----KDFSYGLRINADASATG----------ISKQSNYELV---ELPGCKGIWTVYHKS 557
++FS+ + + T + +Q ++V ELPGC +WTV
Sbjct: 481 PAFLSEEFSHQEDPDLELVTTSGYGKNGAISILQRQIRPQVVTTFELPGCTDVWTVLGSP 540
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+D + HA+L++S +MVLET + E+ S + T+ A N+
Sbjct: 541 DEQQGSDEKLAGS-----HAFLLLSRADSSMVLETGQEIMELDHS-GFCTDAPTVHAANI 594
Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
R ++QV +L G Q L+ S S V+S S+ADP+VLL D
Sbjct: 595 GNGRYIVQVGPNAIWLLKGVERIQHLALDVS----------SPVVSCSLADPHVLLLCED 644
Query: 678 GSIRLLV-----GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG---------PEP--- 720
G + LV DP T+S+ T + SK V + LY D EP
Sbjct: 645 GQLLHLVLSVQGDDP---TLSLLTTKLHQKSK--VIAINLYRDTSGLFVVASSESEPSAT 699
Query: 721 --------------------------------------WLRKTSTDAWLSTGVGEAIDGA 742
W ++ S E +GA
Sbjct: 700 TTTEATETTTPQQQTEEGVDDEDDLLYGDSDISAITSTWQKQESEKEEKKEEEEEEAEGA 759
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
D P ++V+ +G LE++ +P++ F V F +G ++D+ L S
Sbjct: 760 DIQP----TYWAVIIRATGNLELYSLPDWQLCFLVKNFATGNKLLIDSMQAADLSASFVA 815
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
S++E V E+ + + + S+P L A + D + Y+ + G +
Sbjct: 816 PERSTQEVPF-----------VHEVMLHGFGVNGSQPLLMARVHD-ELYIYKVFSHVGSK 863
Query: 863 NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTR-----EETPHGAPCQRITIF 917
+ RL+ +RF R R E+ P R F
Sbjct: 864 --------------------AKGRLQ-VRFKRRSHGLIIRPRDREEKIPENKKWLR--PF 900
Query: 918 KNISGHQGFFLSGSRPCW-CMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ISG+ G F+ GS P W M R LR HP DG+I FT HNVNC GF+Y +S
Sbjct: 901 TDISGYSGVFICGSYPHWLIMTQRGTLRGHPMAIDGTIPCFTAFHNVNCPKGFLYFSSNE 960
Query: 977 ILKICQLPSGSTYDNYWPVQKV 998
L+IC LP+ +YD WPV+KV
Sbjct: 961 ELRICVLPTHLSYDAPWPVRKV 982
>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
Length = 1477
Score = 348 bits (893), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 256/898 (28%), Positives = 429/898 (47%), Gaps = 169/898 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGI----------------SAAS 100
NL+V NV+E+Y + E+ + T+ DG+ + S
Sbjct: 30 NLIVAKTNVLEVYALHRHEDSKARPIDRQSTRP---TDGVISLRGEEPKDAPPYAGTQHS 86
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+ LV L GN+ES+A + G +D+++L+F DAKISVLEFD + + LR S+H
Sbjct: 87 MRLVLSSSLFGNIESMAAVRFPGTS----KDALLLSFRDAKISVLEFDIATNDLRTISLH 142
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE +K G + + P ++VDPQ RC +L + ++++L Q S +
Sbjct: 143 YFED---YKVKEGHDHYIHVPELRVDPQQRCAAMLAFDRKLVVLPFRQHASLM-----EI 194
Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
+GG ++ S +++LR + + +VKDF+F+ GY EP ++IL+E TW+GRV+ +T
Sbjct: 195 ENGGQEDQPVKPSFLLDLRAMGIINVKDFVFLQGYYEPTLLILYEPTQTWSGRVAVNRNT 254
Query: 281 CMISALSIST-----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI------ 329
C+ +A+S++ HP++WSA LP+D +L+AVP PIGG L + N++
Sbjct: 255 CVAAAVSLNLWQHRGQTSAHPVVWSAEFLPYDTQRLIAVPGPIGGALALSTNSLLYLNQV 314
Query: 330 -------------------HYHSQSASCALALNNYA-VSLDSSQELP---RSSFSVELDA 366
H+ +Q+++ L LN +A + L P ++ + LDA
Sbjct: 315 SFPYRLILPAHGADVSITSHHDTQASASCLPLNVFADLYLSPQTPFPSAGKNRVGIALDA 374
Query: 367 AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNS----- 421
A +L +D L+S K G+L + ++ DGR V + L+K SV+TS + T+
Sbjct: 375 ARDVFLADDQLLVSLKGGELYIFHLLSDGRTVNDIQLTKAGSSVITSCMATLSGEGADER 434
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSSDA----- 474
FLGSR+GDSLL+Q+T ++ +G + F DI+ + + +A
Sbjct: 435 FLFLGSRVGDSLLLQYTTADASAPKQNGATKGSLFDDIKKEEDNDDDDEDEEEEASGEGE 494
Query: 475 LQDMVNGE-ELSLYGSASNNTESAQK-----TFSFAVRDSLVNIGPLKDFSYGLRIN-AD 527
+++ +GE E+ +G + +K T+ F V DSLVN+GP+ DF+ G + A
Sbjct: 495 VKEEPDGEGEVDEFGRRIREEDRRKKKGLLTTYKFKVCDSLVNVGPITDFAIGESFDPAS 554
Query: 528 ASATGISKQSNYELV---------------------------ELPGCKGIWTVYHKSSRG 560
S Q + E+V +L GCK WT+YH+S
Sbjct: 555 VSMAEQEGQRSVEIVTCSGQGKNGSLCVLQHGVRPELVHASADLAGCKAFWTLYHRSEER 614
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT-ESVDYFVQGRTIAAGNLFG 619
++ EYHAYL++S E +T V+ D L E++ E D+ V T+ AGNLF
Sbjct: 615 QGEEA--------EYHAYLLLSEEEQTRVI-AGDGLDELSNEETDFNVAAPTVDAGNLFE 665
Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
+ R++QV + G +LDG TQ + S + + SIADPYVL+ M+DG+
Sbjct: 666 QTRIVQVHQHGLILLDGVKATQRI------------STPGQIAAASIADPYVLVLMADGA 713
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
+RL DP++ + VQT + + + L++ G A+
Sbjct: 714 LRLYFADPTSSKL-VQTSLQNIHEVRDIMAMHLFY---------------------GGAM 751
Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
G D+ I++ + ++G L+I+ VP F+ VF+ ++ +G I + MR + +
Sbjct: 752 RGKKARTNDE--IFAAIAKDNGRLDIYSVPEFDLVFSAERAANGPRLINNVLMRPPPQSA 809
Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
+ ++ + S ++ E+A+ S P LF L +G +L Y+ +L
Sbjct: 810 AAQQSADTT------------SARIAEIALHSIGNIPSLPHLFLYLDNGELLLYRGFL 855
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 41/75 (54%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
+RI F + G F+SGS P W R R++P D + AF HN NC HGFIY
Sbjct: 967 RRIHYFGTVGKSNGVFISGSAPAWVFAQRGYARLYPMKLDTFVRAFAEFHNANCPHGFIY 1026
Query: 972 VTSQGILKICQLPSG 986
+G LKICQLP+
Sbjct: 1027 FNHEGTLKICQLPAA 1041
>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Nasonia vitripennis]
Length = 1415
Score = 347 bits (891), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 293/1025 (28%), Positives = 461/1025 (44%), Gaps = 151/1025 (14%)
Query: 58 LVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
LVV AN+I ++ ++ + G KE + LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDPGKKEKFTESRPPK---------MRLECLAQYTLHGNVMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G+ RDS++L+F +AK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 81 QAVQLIGSP----RDSLLLSFREAKLSVVEYDPEIHSLRTVSLHYFEEEE---IKDGWTN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+GRC +L+YG ++++L + GD I SS++I
Sbjct: 134 HHHVPIVRVDPEGRCAVMLIYGRKLVVLPFRKDPILDEGDLIENPKSSSHKTPILSSYMI 193
Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L+ L+ M ++ D F+HGY EP ++IL+E T+AGR++ + TC + A+S++ K
Sbjct: 194 VLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQKV 253
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ +AV P+GG L++ N++ Y +QS ++LN+ + +
Sbjct: 254 HPIIWSVSNLPFDCYQAVAVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLTDNCTNFP 313
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
P+ + L+++ ++ D ++S KTG+L +L++ D R V+ K SVLT
Sbjct: 314 LKPQEGVKISLESSQVAFISPDRLVISLKTGELYVLSLFADSMRSVRGFHFDKAAASVLT 373
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEEFGDIEADAPSTKRLRRS- 470
S + ++ FLGSRLG+SLL++FT + S L+ + TK+++
Sbjct: 374 SCVCLCDDNYLFLGSRLGNSLLLRFTEKESEKINDISMLEMSLNSSNSQEQPTKKIKLDY 433
Query: 471 -----SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
+SD L D+ + EEL +YGS + T ++ F V DSL+NIGP + S G
Sbjct: 434 LEDWMASDVL-DIKDPEELEVYGSET-QTSIQITSYIFEVCDSLLNIGPCGNISMGEPAF 491
Query: 526 ADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSR 559
+ S + + ELV +LPG + IWTV +
Sbjct: 492 LSEEFSNNS-EPDVELVTTSGYGKNGALCVLQRSIRPQVITTFDLPGYENIWTVIDSTVS 550
Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFG 619
+ A + H +LI++ + TMVL+T + EV + + QG TI AGNL
Sbjct: 551 DNRAKTETEGT-----HGFLILTQDDSTMVLQTGQEINEVVDQSGFSTQGTTIFAGNLGS 605
Query: 620 RRRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
R +IQV + G R+L G +M DL ++ S ADPYV L
Sbjct: 606 NRYIIQVTQMGVRLLQGLEQIQHMPMDLG--------------CPIVHASCADPYVSLLS 651
Query: 676 SDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGV 735
DG + LL T + A + + + Y D +T +
Sbjct: 652 EDGQVVLLTLREGRGTARLHAQAVNLMFRPQIEAVCAYRD-----------VSGLFTTIL 700
Query: 736 GEAID--GADGGPLDQGDIYSVVCYES----GALEIFDVPNFNCVFTVDK---------- 779
E +D D D+ I E G + F +P V +
Sbjct: 701 PEDVDEEAFDNDSSDEPQIIENPDNEDDLLYGDTQTFQMPAIPVVKPQETPTKKPPWWQQ 760
Query: 780 -----------FVSGRTHIVDTYMREALKDS---------ETEINSSSEEGTGQGRKENI 819
FV ++ Y L+ S + ++ S E T QG ++N
Sbjct: 761 YLQEIKPTYWLFVYRDNGTLEVYSLPELRLSYLIKNFGFGQNILHDSMEFTTIQGSQQNE 820
Query: 820 ---HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
++V E+A+ H +RP L L D + YQ Y + P+ K
Sbjct: 821 PVNPEVQVREIAVVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGHLK---------- 867
Query: 877 LSVSNVSASRLRNLRFSRTPLDAYTREETP--HGAPCQRITIFKNISGHQGFFLSGSRPC 934
L + + + + FSR E+ P + + F NI+G+ G F+ G P
Sbjct: 868 LRFKKIDHNFI--VGFSRI---GPKEEDMPSMNDTRLCMMRYFSNIAGYNGVFIGGDYPH 922
Query: 935 WCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
W + R LR HP DG + +F +NVNC GF+Y + L+IC LP+ +YD W
Sbjct: 923 WIFLTGRGELRAHPMNIDGPVKSFAPFNNVNCPQGFLYFNRKDELRICVLPTHLSYDAPW 982
Query: 994 PVQKV 998
PV+KV
Sbjct: 983 PVRKV 987
>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1446
Score = 347 bits (890), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 294/1060 (27%), Positives = 474/1060 (44%), Gaps = 219/1060 (20%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ GA+ RD+++L+F+DAK+SV+E+D H L+ S+H FE PE R++
Sbjct: 82 ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPEL------RDT 131
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
DE G G G + +++I
Sbjct: 132 LT-------------------------------------DEQELGVGEGPKSSFLPTYII 154
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + C I A+S++ K
Sbjct: 155 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQAQCSIVAISLNIMQKV 214
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 215 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 274
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD + A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 275 LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 334
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL----KEEFGDIEADAPSTKRLR 468
+ + T+ FLGSRLG+SLL+++T L G KE+ D++ L
Sbjct: 335 TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEMPLEEGKDKQEKEKDNDMDKQV-YVHTLN 393
Query: 469 RSSSDALQDMVNGE--ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
S+ + D E E+ +YGS A + T+ A T+SF V DS++NIGP + S G
Sbjct: 394 SFSAHSQHDFFVDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAF 451
Query: 526 ADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSR 559
G + + + E+V ELPGC +WTV +
Sbjct: 452 LSEEFQG-NPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEVK 510
Query: 560 GHNADSSRMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
++ D + H +LI+S E TM+L+T + E+ S + QG
Sbjct: 511 EDKKVPQSPGSFTATHYSLEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGP 569
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
T+ AGN+ + +IQV G R+L+G + L F P + S ++ S+ADPY
Sbjct: 570 TVFAGNIGDNKYIIQVSPMGIRLLEG---VKQLHFIPVDL-------GSPIVHCSVADPY 619
Query: 671 VLLGMSDGSIRLLVGDP-----STCTVSVQTPAAIESSKKPVSSC--------------- 710
V++ ++G + + V T +++Q P I + + ++ C
Sbjct: 620 VVIMTAEGVVTMFVLKVDSYMGKTHRLALQKP-QISTQSRVIALCAYRDVSGMFTTENKV 678
Query: 711 -----------------TLYHD------KGPEPWLRKTST-------DAWLSTGVGEAID 740
T+ HD E L S+ + + + V
Sbjct: 679 SCAIAEDFNIRSQSETETVIHDLSSNIVDDEEEMLYGDSSSNAGPSKEEMIRSFVAPGPS 738
Query: 741 GADGGPLD-QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
++GGP + + +V ESG +EI+ +P++ VF V F G+ +VD+ ++
Sbjct: 739 VSEGGPSKAEPSHWCLVTRESGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSATQG 798
Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF- 858
+ E EE T QG + + +V L +HSRP+L + + +L Y+A+ +
Sbjct: 799 DKE--GKKEEMTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVEQELLVYEAFPYD 850
Query: 859 -EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--------ETPHGA 909
+ P+N K +RF + P + RE + GA
Sbjct: 851 QQQPQNNLK-----------------------VRFKKVPHNINFREKKSKLRKDKKAEGA 887
Query: 910 PCQ----------RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFT 958
+ R F++ISG+ G F+ G P W +V R LR+HP DG I +F+
Sbjct: 888 AAEDGVAARGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMTIDGPIESFS 947
Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
HN+NC GF+Y QG L+I LP+ +YD WPV+K+
Sbjct: 948 PFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 987
>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
Length = 1450
Score = 344 bits (883), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 299/1053 (28%), Positives = 481/1053 (45%), Gaps = 163/1053 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E +E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQSLE-EILDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCLN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G T +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGATVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 829 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R L N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKLDXXNLLDQQPTHIELDEN--DEQEEIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSIWPVRKV 1029
>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
1, partial [Ciona intestinalis]
Length = 1370
Score = 343 bits (880), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 300/1099 (27%), Positives = 480/1099 (43%), Gaps = 197/1099 (17%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ C + NL+VTA
Sbjct: 2 YAWYRQIHAPTGVEQCVYCNFASEKEK---------------------------NLLVTA 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
A+ + +Y + E + +++N E + + L+ + ++L GNV + +
Sbjct: 35 ASQLTVYRLERNYEVTTKTENGEE-------NTVVKEKLQQIGSWQLFGNVVRMRSVRLA 87
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
GA + DS++L+F +AK+S++EFD + H ++ TS+H FE + K G P
Sbjct: 88 GA----KLDSVLLSFAEAKLSIIEFDQATHDIKTTSLHYFEDALY---KDGSYQRITLPK 140
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF--SARIESSHVINLRD 240
+ VDP+ RC + + + ++ + L D+ + R +S+ I+L
Sbjct: 141 IAVDPESRCVALQLTTKSVAVVPLRANTAALATDDGAAPQDNVSLQNKRSTTSYTIDLHA 200
Query: 241 LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
+D ++ + D F+HGY EP +++L E TWAGRV+ + TC I A+S++ + HP++
Sbjct: 201 VDARLQRIIDIQFLHGYNEPTLLVLFESLRTWAGRVAMRQDTCNIVAISLNMAEQLHPVV 260
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE---- 354
WS LP D VP PIGGVL+ N+I Y +QS Y SL+S+ E
Sbjct: 261 WSLNGLPFDCKYAYPVPKPIGGVLIFAVNSILYLNQSVP------PYGTSLNSTTENSTS 314
Query: 355 ---LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + LD +HA ++ + ++S K G+L +LT++ D R V+ K+ SV
Sbjct: 315 FPLKPQEDVCMTLDCSHAMFISPESLVISLKNGELYVLTLLVDSMRSVRNFHFDKSASSV 374
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
LTS +T + + FLGSRLG+SLL+++T + + E A KRL +
Sbjct: 375 LTSCLTVLDDGFLFLGSRLGNSLLLKYT-------EARPVFRNCYHTEEPAAKRKRLNTA 427
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ A D N +L +YG + +E ++ F V DSLVNIGP G A S
Sbjct: 428 ADWAASD-TNDIDLQMYGKDTVTSEPL-SSYKFEVCDSLVNIGPCGAAELGEP--AFLSE 483
Query: 531 TGIS-KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
+S ++S+ EL ELPGC +WTV +
Sbjct: 484 EFVSQRESDLELAILSGHGKNGAISVLQRSVKPQVVTTFELPGCIDMWTVKSVCEKTELP 543
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
++ + H+YLI+S E T++LET + EV E+ + + +++ GN+ G + +
Sbjct: 544 TKTQ-----QQQHSYLILSREESTLILETGKEIMEV-ENSGFNTREQSVFVGNIGGDKEL 597
Query: 624 I-QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
I QV G +L G + Q + E G S + SI DPY LL SDG + +
Sbjct: 598 ILQVCASGVWLLAGVKLLQHIPL-----ELG-----SPITQCSICDPYALLLTSDGDLIM 647
Query: 683 LV----------GDPSTCTVSVQTPAAIE--------------------------SSKK- 705
L C S+ IE SS K
Sbjct: 648 LTLTNDLDSENGVKLECCNPSINQVPQIEHVCLYKDTSGLFKTASGPSDVFLPEDSSNKG 707
Query: 706 -------------PVSSCTLYHDKGPEPWLRKTSTDAWL----------STGVGEAIDGA 742
P+SS T D+ E ++ D S E +DG
Sbjct: 708 VSDSEIPSSLPRTPLSSKTFTVDEEDELLYGESDPDVIFAPQFAPNVPKSPTQNEPLDGD 767
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
G ++ ++++ E+ LEI+ +P+ + V+T+ F G+ + ++ + S+ +
Sbjct: 768 KEGN-EEFTFWAIIARENRNLEIYSMPSLDLVYTIKNFSFGQKLLTNSGPVHSYSVSKDD 826
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
++S T K I + +V L + +S P L A + + IL Y+ + F PE
Sbjct: 827 KSTS----TRYSDKPRIFEILLVGLGYK-----NSSPHLIARIEE-EILIYEVFKFSAPE 876
Query: 863 NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG 922
K + S + V+ S + R P+ T+ + C R F NI G
Sbjct: 877 KFKKYN-----SLQIRFKKVNHS----MMIRRAPVTHETKTDQLEHRNCLR--TFSNIGG 925
Query: 923 HQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G FL G P W V R L HP DGS+ F HNVNC +GF+Y SQG L+IC
Sbjct: 926 YSGVFLCGPYPYWIFVTIRGALCCHPMSVDGSVSCFVPFHNVNCPNGFLYFNSQGELRIC 985
Query: 982 QLPSGSTYDNYWPVQKVVF 1000
LP YD WP++K+
Sbjct: 986 MLPPHMKYDTAWPMRKITL 1004
>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
Length = 1455
Score = 341 bits (875), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 296/1054 (28%), Positives = 484/1054 (45%), Gaps = 165/1054 (15%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YG+ + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGTGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGATLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGNAFKMNSMAD 768
Query: 741 GADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + +V +SG LEI+ +P+ V+ V+ +G +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLIVARQSGTLEIYSMPDMKLVYLVNDVGNGAMVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
D + + E +S+ G Q ++ +S +EL++ + RP L + T
Sbjct: 829 DAMEFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTR 884
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
+L YQ +F P+ K R L N+ + ++ DA E+
Sbjct: 885 VELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DAQEEIESYQ 935
Query: 908 GAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVN 964
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 936 MQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVN 995
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 996 IPNGFLYFDTTYELKISVLPSYLSYDSTWPVRKV 1029
>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
Length = 1455
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 301/1053 (28%), Positives = 481/1053 (45%), Gaps = 163/1053 (15%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E I + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDE-EQIMEEIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPVNYMCAGERVEFEEDGATLRPHAESLQDVKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++ E G
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-----EVG-- 651
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 652 ---SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ G IV
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV--GNGAIV 826
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
T E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 827 LTDAMEFVPISLTTQENSKAGIVQACMPQHANSPLPLELSLTGLGLNGERPLLM-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R L N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEDIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKV 1029
>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
Short=dCPSF 160
gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
Length = 1455
Score = 340 bits (872), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 297/1054 (28%), Positives = 483/1054 (45%), Gaps = 165/1054 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
D + + E +S+ G Q ++ +S +EL++ + RP L + T
Sbjct: 829 DAMEFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTR 884
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
+L YQ +F P+ K R + N+ + ++ D E+
Sbjct: 885 VELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQ 935
Query: 908 GAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVN 964
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 936 MQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVN 995
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 996 IPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKV 1029
>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
Length = 1455
Score = 340 bits (871), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 297/1059 (28%), Positives = 480/1059 (45%), Gaps = 175/1059 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
K SVLTS I + + FLGSRLG+SLL+ FT +++++ D+E
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVIT------LDDVEQ 423
Query: 460 DAPSTKRLRRSSSDALQDM-----------------VNGEELSLYGSASNNTESAQKTFS 502
+ +R + L+++ + EEL +YGS + + + F
Sbjct: 424 QSEQQQRNLQDEDQNLEEIFDVDQVEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFI 483
Query: 503 FAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS----- 537
F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 484 FEVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVF 543
Query: 538 ----NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
N +++ EL GC +WTV+ D+++ ++ +D+ H ++ +S T+VL
Sbjct: 544 VNCLNPQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMFLSQRNSTLVL 594
Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 595 QTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----- 648
Query: 651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSC 710
S V+ VSIADPYV L + +G + L + T + SS V +
Sbjct: 649 -----DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAI 703
Query: 711 TLYHD-------KG----------------------PEPWLRKTSTDAWLSTGVGEAI-- 739
+ Y D KG EP ++ + L G A
Sbjct: 704 SAYKDLSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKM 763
Query: 740 ----DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
D A D + VV +SG LEI+ +P+ V+ V+ +G
Sbjct: 764 NSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNG 823
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLF 842
+ D + + E +S+ G Q ++ +S +EL++ + RP L
Sbjct: 824 AMVLTDAMEFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL 880
Query: 843 AILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTR 902
+ T +L YQ +F P+ K R L N+ + ++ D
Sbjct: 881 -VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEE 930
Query: 903 EETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTV 959
E+ P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F
Sbjct: 931 IESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAA 990
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+NVN +GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 991 FNNVNIPNGFLYFDTTYELKISVLPSYLSYDSIWPVRKV 1029
>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
Length = 1463
Score = 332 bits (850), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 295/1065 (27%), Positives = 470/1065 (44%), Gaps = 179/1065 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSE------MRVAPKMRLECLATYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA--GAMRDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 137
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +L+YG ++++L + S L + + R
Sbjct: 138 GRYYVPEVRVDPDARCAVMLIYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVTRTP 197
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T AGR+ + TC++ A+S
Sbjct: 198 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDTCVLVAIS 257
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ NLP D +LL + PIGG LV+ N + Y +QS Y V
Sbjct: 258 LNIQQRVHPIIWTVNNLPFDCLRLLPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 311
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 312 SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVVSLRTGDLYVLTLCVDSMRTVR 371
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 372 NFHFHKAASSVLTSCICVCHMEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQQQQQA 431
Query: 448 ----SGLKEEFGDIEAD-------APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES 496
S E G ++ D APS + RR + EEL +YG+ + +
Sbjct: 432 AEEPSEEAEIEGILDMDQLEAATSAPSQAKSRR---------IEDEELEVYGTGAKASVL 482
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQ 536
+ F F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 483 QLRKFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLQDVKIELVAATGHSKN 542
Query: 537 S---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
N +++ EL GC +WTV+ D+++ + D+ H ++++S +
Sbjct: 543 GALSVFVNCINPQIITSFELEGCLDVWTVFD--------DATKKTSRQDQ-HDFMLLSQK 593
Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
T+VL+T + E+ E+ + V TI GNL R ++QV R R+L G+ + Q++
Sbjct: 594 NSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQNRFIVQVTTRHVRLLQGTRLVQNVP 652
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
S V+ V+IADPYV L + +G + L S T + SS
Sbjct: 653 I----------DVGSPVVQVAIADPYVCLRVFNGQVITLALRESRGTPRLAINKHTISSS 702
Query: 705 KPVSSCTLYHD-------------------------------KGPEPWLRKTSTDAWLST 733
V + Y D EP ++ + L
Sbjct: 703 PAVVAIAAYKDLSGLFTVKSDDILNLTGSGSNSAFGSTFGGYMKSEPHMKVEDEEDLLYG 762
Query: 734 GVGEAI------DGADGGPLDQGD-------------IYSVVCYESGALEIFDVPNFNCV 774
G A D A D + VV +SG LEI+ +P+ V
Sbjct: 763 DAGNAFKMNTMADLAKQSKQKNSDWWRRMLVQAAKPTYWLVVARQSGTLEIYSMPDMKLV 822
Query: 775 FTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSA 834
+ V+ +G + D E + S T +S ++ +S +EL++
Sbjct: 823 YLVNDVGNGAMVLTDAM--EFVPISLTSQENSKAGIVQSCMPQHANSPLPLELSLVGLGL 880
Query: 835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
+ RP L + T +L YQ +F P+ K R + N+ + ++
Sbjct: 881 NGERPLLL-VRTRLELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHVNLDD 932
Query: 895 TPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
+ Q++ F N+ G G + G PC+ + R LR+H L +G
Sbjct: 933 NEENEELESYNMQPKYVQKLRPFNNVGGMSGVMICGVNPCFLFLTSRGELRIHRLLGNGE 992
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ +F +N+N +GF++ + LKI LPS +YD+ WPV+KV
Sbjct: 993 VRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYDSTWPVRKV 1037
>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
Length = 1455
Score = 331 bits (849), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 293/1061 (27%), Positives = 473/1061 (44%), Gaps = 179/1061 (16%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPTE------MRVAPKMRLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + + L + + R
Sbjct: 136 GRYFVPVVRVDPDSRCAVMLVYGKRLVVLPFRKDNTLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCQQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK-------- 451
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVDQQADQQL 429
Query: 452 ----------EEFGDIEA--DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+E D++ AP+ + RR + EEL +YGS + + +
Sbjct: 430 QRQQSEDQTLDEILDVDQLELAPTQAKSRR---------IEDEELEVYGSGAKASVLQLR 480
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
F F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 481 KFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLNDLKIELVAATGHSKNGAL 540
Query: 538 -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
N +++ EL GC +WTV+ D+++ + D+ H ++++S T
Sbjct: 541 SVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLLSQRNST 591
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 592 LVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 648
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
S V+ VSIADPYV L + +G + L + T + SS V
Sbjct: 649 --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 700
Query: 708 SSCTLYHD-----------------------------KGPEPWLRKTSTDAWLSTGVGEA 738
+ + Y D EP ++ + L G A
Sbjct: 701 VAISAYKDLSGLFTVKADDVNLTGSSSSAFGHSFGGYMKAEPHMKVEDEEDLLYGDAGNA 760
Query: 739 I------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKF 780
D A D + VV +SG LEI+ +P+ V+ V+
Sbjct: 761 FKMNSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV 820
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+G + D E + S T +S ++ +S +EL + + RP
Sbjct: 821 GNGAMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELTVLGLGLNGERPL 878
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
L + T +L YQ +F P+ K R L N+ + ++ D
Sbjct: 879 LL-VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLEQLNLMDHQPSHIELDEN--DER 928
Query: 901 TREETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAF 957
E+ P Q++ F N+ G G + G PC+ + R LR+H L +G + +F
Sbjct: 929 EEMESYQMQPKYVQKLRPFANVGGLSGIMVCGVNPCFVFLTSRGELRIHRLLGNGDVRSF 988
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+NVN +GF+Y + LKI LPS +YD+ WP++KV
Sbjct: 989 AAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSTWPIRKV 1029
>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
Length = 1392
Score = 329 bits (844), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 284/1038 (27%), Positives = 460/1038 (44%), Gaps = 200/1038 (19%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LV + ANVI+++ + + ET + LE V Y L GN+ S+
Sbjct: 30 LVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSMQ 82
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ + RD+++LAF+DAK+SV+E+D H L+ S+H FE + +K G
Sbjct: 83 SVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTHH 135
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESSH 234
P+V+ DP+ RC + V+G ++++L + + D D G G A I +S+
Sbjct: 136 YHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILASY 195
Query: 235 VINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+I L+D + ++ D F+HGY EP ++IL E T+AGRV+ + TC ++A+S++
Sbjct: 196 MIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQQ 255
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS 352
K HP+IWS NLP D K + + P+GG L+ N + Y +QS + Y VSL+S
Sbjct: 256 KVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNSI 309
Query: 353 QE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
E P+ + LD A AT+L++D +LS K G+L +LT++ D R V+
Sbjct: 310 AENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHFE 369
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADAP 462
K SVLT+ I+ N+ FLGSRLG+SLL++FT C ++ E P
Sbjct: 370 KAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIEP 418
Query: 463 STKRLRRSSS------DALQDMVNG------------EELSLYGSASNNTESAQKTFSFA 504
S KRL+ S+S D + D +N EEL +YG+ + ++ F
Sbjct: 419 SAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVFE 477
Query: 505 VRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYELV 542
V DSL+NIGP ++FS L ++ + T + K ++V
Sbjct: 478 VCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQIV 537
Query: 543 ---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
LPGC +WTV+ + HA+LI+S E TM+L+T D + E+
Sbjct: 538 TTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINEI 581
Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENS 659
++ + T+ AG I ++ +L S
Sbjct: 582 -DNTGFATHIPTVYAG-----------------INQLQHIPLELG--------------S 609
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG-- 717
++ V+ DPY+ L +DG + L+ + + + S+ PV++ +Y D
Sbjct: 610 PIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSGL 669
Query: 718 ------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS--- 754
PE ++ ++ T + + + G D P + +Y
Sbjct: 670 FTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNWW 728
Query: 755 -------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
V E+ LEI+ +P+F + + G +VD E++ S +
Sbjct: 729 KKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISAS 786
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
S++ E Q R+ ++ + VV L H SRP L L + + Y+ + F P
Sbjct: 787 TPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF--P 837
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
K + NVS R D + +E ++ F NI+
Sbjct: 838 RGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNIA 886
Query: 922 GHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
G+ G F+ G+ P W M R LR HP DG +++F +NVNC GF+Y + L+I
Sbjct: 887 GYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELRI 946
Query: 981 CQLPSGSTYDNYWPVQKV 998
LP+ +YD WPV+KV
Sbjct: 947 GVLPTHLSYDAAWPVRKV 964
>gi|215701517|dbj|BAG92941.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 265
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/246 (65%), Positives = 191/246 (77%), Gaps = 1/246 (0%)
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
GYIEPV+VILHE+E TWAGR+ KHHTCMISA SIS TLKQHP+IWSA NLPHDAY+LLA
Sbjct: 19 GYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLKQHPVIWSAANLPHDAYQLLA 78
Query: 314 VPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
VP PI GVLV+ AN+IHYHSQS SC+L LNN++ D S E+ +S+F VELDAA ATWL
Sbjct: 79 VPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQVELDAAKATWLS 138
Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
ND+ + STK G+++LLTVVYDGRVVQRLDL K+ SVL+S +T+IGNS FFLGSRLGDSL
Sbjct: 139 NDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNSFFFLGSRLGDSL 198
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
LVQF+ + S+L E DIE D P +KRL+R SD LQD+ + EELS A N
Sbjct: 199 LVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSVEELSFQNIIAPN 258
Query: 493 NTESAQ 498
+ ESAQ
Sbjct: 259 SLESAQ 264
>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
Length = 1459
Score = 321 bits (823), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 291/1067 (27%), Positives = 475/1067 (44%), Gaps = 187/1067 (17%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV AN++++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA +D+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
P+V+VDP RC +LVYG ++++L + S DE F
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192
Query: 230 ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
I +S++I L++LD K +V D F+HGY EP ++IL+E T +GR+ + TC++
Sbjct: 193 RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCSGRIKVRSDTCVLV 252
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A+S++ + HP+IW+ +LP D +++ + PIGG LV+ N + Y +QS
Sbjct: 253 AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306
Query: 345 YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
Y VSL+SS + P+ + LD A+ ++ D ++S +TG+L +LT+ D R
Sbjct: 307 YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
V+ K SVLTS I + FLGSRLG+SLL+ FT +++++ +
Sbjct: 367 TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426
Query: 452 ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
EE D++ AP + RR + EEL +YGS +
Sbjct: 427 EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
+ + F F V DSL+N+ P+ G R+ + +ATG
Sbjct: 478 SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537
Query: 534 SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
SK N +++ EL GC +WTV+ D+++ + D+ H ++++
Sbjct: 538 SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
S T+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q
Sbjct: 589 SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
++ S V+ V+IADPYV L M +G ++R G P
Sbjct: 648 NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697
Query: 694 VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
+PA AI + K T+ D EP ++ + L
Sbjct: 698 TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGTGSGFGHSFGGYMKAEPNMKVEDEEDLL 757
Query: 732 STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
G A L Q + VV +SG LEI+ +P+
Sbjct: 758 YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
V+ ++ +G + D +L S E +S+ G Q ++ +S +EL++
Sbjct: 818 VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ RP L + T +L YQ +F P+ K R L N+ + ++
Sbjct: 875 GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
+ Q++ F N+ G G + G PC+ + R LR+H +
Sbjct: 927 EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
G + +F +NVN +GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 987 GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKV 1033
>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
Length = 1459
Score = 321 bits (822), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 291/1067 (27%), Positives = 474/1067 (44%), Gaps = 187/1067 (17%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV AN++++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA +D+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
P+V+VDP RC +LVYG ++++L + S DE F
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192
Query: 230 ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
I +S++I L++LD K +V D F+HGY EP ++IL+E T GR+ + TC++
Sbjct: 193 RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLV 252
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A+S++ + HP+IW+ +LP D +++ + PIGG LV+ N + Y +QS
Sbjct: 253 AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306
Query: 345 YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
Y VSL+SS + P+ + LD A+ ++ D ++S +TG+L +LT+ D R
Sbjct: 307 YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
V+ K SVLTS I + FLGSRLG+SLL+ FT +++++ +
Sbjct: 367 TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426
Query: 452 ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
EE D++ AP + RR + EEL +YGS +
Sbjct: 427 EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
+ + F F V DSL+N+ P+ G R+ + +ATG
Sbjct: 478 SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537
Query: 534 SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
SK N +++ EL GC +WTV+ D+++ + D+ H ++++
Sbjct: 538 SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
S T+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q
Sbjct: 589 SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
++ S V+ V+IADPYV L M +G ++R G P
Sbjct: 648 NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697
Query: 694 VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
+PA AI + K T+ D EP ++ + L
Sbjct: 698 TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGSGSGFGHSFGGYMKAEPNMKVEDEEDLL 757
Query: 732 STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
G A L Q + VV +SG LEI+ +P+
Sbjct: 758 YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
V+ ++ +G + D +L S E +S+ G Q ++ +S +EL++
Sbjct: 818 VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ RP L + T +L YQ +F P+ K R L N+ + ++
Sbjct: 875 GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
+ Q++ F N+ G G + G PC+ + R LR+H +
Sbjct: 927 EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
G + +F +NVN +GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 987 GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKV 1033
>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
Length = 1373
Score = 313 bits (803), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 268/1066 (25%), Positives = 456/1066 (42%), Gaps = 205/1066 (19%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
FA YK +H PT ++ C T + + NL++
Sbjct: 2 FACYKQLHPPTAVSFCLKARFTSANDE---------------------------NLIIVK 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL-AILSQ 121
N++E+Y+++ + +++ LV + L G ++S+ A+ Q
Sbjct: 35 NNIMEVYLIKP-----------------------NTSNIVLVKVFELFGVIDSIIAVCLQ 71
Query: 122 GGADNSRRRDSIILAFED-AKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G +++ +++ FED AK+SV+EFD+ L+ S+H E L+ G+ F
Sbjct: 72 G-----MKKEMLLINFEDEAKVSVVEFDEKRSDLKTLSLHYLEDD---FLREGKARFFHN 123
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------GLVGDEDTFGSGGGFSA 228
+ +DPQ R V++ +++IL Q G L GD++ G
Sbjct: 124 QPIILDPQNRFATVIICDSKLVILPFRQSGEDVSLSTEDNFLFALSGDQEEANENVGDQK 183
Query: 229 R-----IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
+ ++ +I+L DL +K+VKD+ F++GY EP ++ LHE E TW+GR++ K +T +
Sbjct: 184 KHHQPEVQRQVIIDLNDLGIKNVKDYCFLNGYNEPTILFLHENEQTWSGRLAAKSNTSTV 243
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-GGVLVVGANTIHYHSQSASCALAL 342
+A+S K +P IWS +LPHD KL+ + + GG LV+G N+I + +Q A+ L+
Sbjct: 244 TAVSFDLFRKYYPKIWSVGSLPHDCNKLIPLQEDVAGGALVIGMNSIIHINQCATYGLSF 303
Query: 343 NNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
N++AVS + + + ++F ++ D T++ D L+S K G+L + + G +
Sbjct: 304 NDFAVS-NPNLSINFNTFDGPALFFDTVAYTFIARDKLLVSLKDGELYTMYLESGGSRIN 362
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
+++ KT+ + S + T+ +L FLGS++GDS+L ++ S EE + A
Sbjct: 363 NINIKKTSNTTPASCMCTLKGNLIFLGSKIGDSVLYEYQEKVEVETSSLDTDEEMSSVFA 422
Query: 460 -----DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
+ KR D + EE ++ S S ++ ++ NIGP
Sbjct: 423 AGENFEPEKKKRKLADDDDFFAALEKDEEPTVIESFSKVSKKETTKVELKIKHVFTNIGP 482
Query: 515 LKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTVYHKSSRGHNADSSRMA 569
+ + + + D S G ++N + C GI TV ++S + + +
Sbjct: 483 ISHLTAAVTSSFDMS--GFKSKTNDNQLSAIACSGIGRHGCLTVLNRSLQPDIQSEATLP 540
Query: 570 ---------AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
+ E+ YLI+SLE +T V E+ L EVT + T+ G + R
Sbjct: 541 FLVKQVWTISQKTEHDLYLILSLEDKTKVFESKATLAEVTSKSMFVTNETTLNIGKI--R 598
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
++QV R + +L GS P S++ I DPYVLL DGS+
Sbjct: 599 ESIVQV-TRKSVMLIGS--------EPKQVHHSKKEIRSSI----ILDPYVLLHFYDGSL 645
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
LL D T IES+ +++ LY PE + G+ E
Sbjct: 646 VLLTHDNGRVT---SKQLDIESNHGKITAVCLYK-TNPE----------FEFFGINEK-- 689
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
+G V + GA EI VP+ CVF+ +F T + D
Sbjct: 690 --------EGKYLCCVYWTDGAFEILSVPDMTCVFSFSQFYQFHTTLFD----------- 730
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E + + + V E+A++ + P+L ++L+D T+ Y+++L
Sbjct: 731 -------EGQSSNTTQSEVKYPYVTEMALRGIGSDSEMPYLVSVLSDNTVHIYRSFL--- 780
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR------TPLDAYTREETPHGAPCQ-- 912
+ T+KS D +RL LRFS+ P+ ++ +
Sbjct: 781 -DRTTKSKD---------------NRLTRLRFSKFQHDDLLPISEIDKKSQTFTLNLKSK 824
Query: 913 -----------RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 961
++ FKNI G+ G F +G +P W LRVHP + FT H
Sbjct: 825 YLFPKSDLGRSQLIPFKNIGGYGGLFKTGEKPFWLFTEHSNLRVHPTQSRDPVTTFTPYH 884
Query: 962 NVNCNHGFIYVT-------SQGILKICQLPSGSTYDNYWPVQKVVF 1000
+ NC HGFIY+T Q L I L + ++ YWP +K++
Sbjct: 885 HENCPHGFIYLTDKEQDNKKQSKLHISSLNANVKFNAYWPQRKILL 930
>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes
aegypti]
gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
Length = 1417
Score = 311 bits (797), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 286/1021 (28%), Positives = 467/1021 (45%), Gaps = 137/1021 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y R+ + S++ T R M LE + Y L GN+ S+
Sbjct: 29 SLVTGGANVLKVY--RLIPDADATSRDKFTTTRPPNM------KLECMATYTLFGNIMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ +RD+++++F+DAK+SV++FD L+ S+H FE + +K G
Sbjct: 81 QSVSLAGS----QRDALLISFQDAKLSVVQFDPDNFELKTLSLHYFEEED---IKGGWTG 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGG---FSARI 230
P+V+VDP RC +LVYG ++++L K S V D I
Sbjct: 134 HYHTPIVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEVQDVKPMKKAPTQLIAKTPI 193
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S+VI L++ + + +V D F+HGY EP ++IL+E T+ GR++ + TCM+ ALS+
Sbjct: 194 LASYVIELKESEERIDNVIDIQFLHGYYEPTLLILYEPVKTFPGRIAVRSDTCMMVALSL 253
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAV 347
+ + HP+IW+ LP D + +A+ PIGG L++ N + Y +QS ++LN+ A
Sbjct: 254 NIQQRVHPVIWTVNCLPFDCLQAIAISKPIGGCLILSVNALIYLNQSVPPYGVSLNSIAD 313
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P+ + LDAA +++ + +LS K G+L +LT+ D R V+ SK
Sbjct: 314 HCTNFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRSFHFSKA 373
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT I + FLGSRLG+SLL++F + +++ EE + E KR
Sbjct: 374 ASSVLTCCICVVEEEYLFLGSRLGNSLLLRFKEKDESMVITIDDTEEVVEKEP-----KR 428
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI-- 524
LR + EEL +YGS T ++ F V DS++NIGP+ + G RI
Sbjct: 429 LR----------LEQEELEVYGSG-QKTSVQLTSYIFEVCDSILNIGPIGHMAVGERISE 477
Query: 525 ---NADASATGISKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAY--DDEYHA 577
+ + + + + E+V G G V S + S ++ D+ H+
Sbjct: 478 EEQDENKDVQFVPNKLDLEIVTSSGHGKNGALCVLQNSIKPQVITSFGLSGCLDVDDMHS 537
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
++I+S EA TMVL+T D + E+ E+ + TI GN+ G R ++QV + R+L G+
Sbjct: 538 FMILSQEAGTMVLQTGDEINEI-ENTGFATNVPTIHVGNIGGNRFIVQVTTKSIRLLQGT 596
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV-----GDPSTC-- 690
+ Q++ + +VSIADPYV + S+G + L G P
Sbjct: 597 RLLQNIPI----------DLGCPLAAVSIADPYVCVRSSEGRVITLALREGKGTPRLAVN 646
Query: 691 --TVSVQTPAAIE-SSKKPVSS--CTLYHD------------------KGPEPWLRKTST 727
T+S TPA + S K VS T Y D PEP ++
Sbjct: 647 KNTIST-TPAVVAISVYKDVSGMFTTKYEDFYDGSKAGSSAYSSGFGYMKPEPHMKIEDE 705
Query: 728 DAWLSTGVGEAI------DGADGGPLDQGD--------------IYSVVCYESGALEIFD 767
+ L G + D A D +Y+V ++G LEI+
Sbjct: 706 EDLLYGESGRSFKMTSMADMAIETKKKNTDFWRKFMQPVKPTFWLYAV--RDNGNLEIYS 763
Query: 768 VPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN---SSSEEGTGQGRKENIHSMKV 824
+P+ V+ + +G + D+ L+ +T + +S+ + G N+ ++
Sbjct: 764 MPDLKLVYLITNIGNGNKVLQDSMEFVPLQVGQTAADADVTSNAFTSPFGFNPNLLPKEI 823
Query: 825 VELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA 884
+ +A+ H +RP LF L + +L Y+ Y + SK + R S V+
Sbjct: 824 LMVAL---GHHGTRPMLFVRL-ENDLLVYRVYRY------SKGHLKLRFRR--VPSGVTG 871
Query: 885 SRLRNLRFSRTPLDAYTREETPHGAPCQ-----RITIFKNISGHQGFFLSGSRPCWCMVF 939
+ P D + H I F N++G+ G + G +P + M+
Sbjct: 872 PIFKIAPRQSAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKP-YIMLL 930
Query: 940 RER--LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
R LR H + F +NVNC +GF+Y Q LKI P +YD+ WPV+K
Sbjct: 931 TSRGELRAHRLYAKTIMKGFAPFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWPVRK 990
Query: 998 V 998
+
Sbjct: 991 I 991
>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
norvegicus]
gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 1386
Score = 309 bits (791), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 219/669 (32%), Positives = 345/669 (51%), Gaps = 75/669 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429
Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485
Query: 522 -------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH---------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 ENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+ AG
Sbjct: 546 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAG 604
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
N+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 NIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMS 654
Query: 676 SDGSIRLLV 684
++G + + +
Sbjct: 655 AEGHVTMFL 663
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 778 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 833
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 834 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 883
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 884 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 943
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 944 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1003
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1004 PWPVRKI 1010
>gi|74212803|dbj|BAE33365.1| unnamed protein product [Mus musculus]
Length = 741
Score = 308 bits (790), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
Length = 1439
Score = 308 bits (789), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 219/671 (32%), Positives = 345/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429
Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 EFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1006 PWPVRKI 1012
>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
musculus]
Length = 1388
Score = 308 bits (789), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 219/671 (32%), Positives = 344/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1006 PWPVRKI 1012
>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
[Mus musculus]
gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
subunit 1; AltName: Full=Cleavage and polyadenylation
specificity factor 160 kDa subunit; Short=CPSF 160 kDa
subunit
gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
Length = 1441
Score = 308 bits (788), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
Score = 109 bits (273), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1008 PWPVRKI 1014
>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
[Mus musculus]
Length = 1450
Score = 307 bits (787), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1008 PWPVRKI 1014
>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
musculus]
Length = 1417
Score = 307 bits (787), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 56 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 108
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 109 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 161
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 162 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 218
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 219 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 278
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 279 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 338
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 339 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 398
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 399 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 456
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 457 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 512
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 513 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 572
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 573 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 631
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 632 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 681
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 682 VIMSAEGHVTMFL 694
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 809 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 864
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 865 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 914
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 915 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 974
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 975 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1034
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1035 PWPVRKI 1041
>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex
subunit cft1 cpsf subunit [Rhipicephalus pulchellus]
Length = 1237
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 257/862 (29%), Positives = 387/862 (44%), Gaps = 170/862 (19%)
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+HGY EP ++IL+E TW GR++ + TC I ALS++ + HP+IWS NLP D +
Sbjct: 3 FLHGYYEPTLLILYEPLRTWPGRIAIRQDTCCILALSLNLQQRVHPVIWSYTNLPFDCLR 62
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL-------PRSSFSVE 363
LLAVP P+GGVL++ +++ Y +QS Y VSL+S + P+ +
Sbjct: 63 LLAVPRPLGGVLIMAVDSLLYLNQSVP------PYGVSLNSFTDFSTSFPLKPQEGLKIS 116
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSL 422
LD A A +L D +LS K G+L +LT+ DG R V+ K SVLT+ +T +
Sbjct: 117 LDCAQACFLSYDRLVLSLKGGELYVLTLFNDGMRSVRNFYFDKAAASVLTTSMTLCEDGY 176
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----- 477
FLGSRLG+SLL+ +T + ++E + +A+ P +K+ R DA+ D
Sbjct: 177 LFLGSRLGNSLLLHYT-EKAAEVDDIAKRDEKTESDANDPPSKKKRM---DAIGDWMASD 232
Query: 478 --MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINAD 527
+++ +EL +YGS + T+ +++F V DSL+NIGP G N D
Sbjct: 233 VALIDPDELEVYGSETMATKQL-TSYTFEVCDSLINIGPCGKICMGEPAFLSEEFVQNTD 291
Query: 528 -----ASATGISKQSNYELV------------ELPGCKGIWTVY---------HKSSRGH 561
+ G K ++ ELPGC +WTV K+
Sbjct: 292 PDLELVTTAGYGKNGALCVLQRSVRPQVVTTFELPGCVHMWTVMGPPAEKKPPEKTEESD 351
Query: 562 NADSSRMAAYDD--EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFG 619
+ S AA HA+LI+S +M+L+T + E+ S + Q T+ AGNL
Sbjct: 352 DPASEDKAAEQPLTNTHAFLILSRADSSMILQTDQEINELDHS-GFSTQNPTVFAGNLGD 410
Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
R V+QV G R+LDG+ Q + S++++ S+ADP+V++ ++G
Sbjct: 411 GRYVLQVCPMGVRLLDGTRQLQHIPL----------DVGSSIVAGSLADPHVIIRSAEGL 460
Query: 680 I--RLLVGDPST-CTVSVQTPAAIESSKKPVSSCT------LYHDKGPEPWLRKTSTDAW 730
+ L GDP+ C ++V P K +S C L+ + EP + +
Sbjct: 461 VIHLTLRGDPAAGCRLAVLRPQLTAVKAKILSICVYKDVSGLFTTQYREP--DEPAKPEK 518
Query: 731 LSTGVGEAIDGADGGPLDQGD--------------------------------------- 751
E+ID + G LD D
Sbjct: 519 PLPPPKESIDMSSNGLLDDEDELLYGESEENPIQKEPVRMTSEEAPSVAESMFEIKEVAP 578
Query: 752 -IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEE 809
+ V E+G LEI+ +P + F V F G+ +VD+ A +++E ++ S E
Sbjct: 579 TYWLFVARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHE 638
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
+H + VV L ++ HSRP L A + D +L Y+A+ F
Sbjct: 639 SM-----PVVHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF----------- 676
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIF 917
T R + LRF + D + RE + P ++ + F
Sbjct: 677 -YETQREGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPF 726
Query: 918 KNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ISG+ G FL G RP W M R LR HP DG I F HNVNC GF++ QG
Sbjct: 727 SDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQG 786
Query: 977 ILKICQLPSGSTYDNYWPVQKV 998
L+I LP+ TYD WPV+KV
Sbjct: 787 ELRISTLPTHLTYDAPWPVRKV 808
>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Acyrthosiphon pisum]
Length = 1335
Score = 306 bits (783), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 279/1005 (27%), Positives = 449/1005 (44%), Gaps = 192/1005 (19%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV N++ +Y + + + K E + Y L GN+ L
Sbjct: 30 LVVAGVNILRVYRLVPTDTTCQPPK----------------TKFECLAQYTLFGNIMCL- 72
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
Q D+++L+F +AK S++E+D +H LR S+H FE ++ K G
Sbjct: 73 ---QSVTLCPSSPDALLLSFSEAKFSLVEYDRDMHSLRTLSLHYFEDDKF---KNGHTQH 126
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
PL++VDP GRC LVYG ++L G D++ SA++ S+ I
Sbjct: 127 WSPPLIRVDPDGRCVVGLVYGSYFVVLPF-----GRTIDDN------AKSAQVMPSYTIP 175
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
+ +D M ++ DF F+HGY EP ++IL+E T+AGR++ + TC + A+S++ H
Sbjct: 176 ISKIDPKMNNIMDFDFLHGYYEPTLLILYEPVKTFAGRIAVRKDTCAMVAISLNIQQHVH 235
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQE 354
P+IWS +LP+D K++AV PIGGVL++ N++ Y +QS +ALN+ A +L +
Sbjct: 236 PVIWSLDSLPYDCQKVIAVSRPIGGVLIMAVNSLIYLNQSVPPFGVALNSIAKTLTNFPL 295
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTS 413
+ ++ LD A AT++ +D + S GDL ++T+ D R V+ K SVLT+
Sbjct: 296 GQQEDINLVLDRATATFISSDKLVTSLCNGDLYVITLYADSMRAVRSFHFEKCASSVLTT 355
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
IT +S FLGSRLG+SLL+++ S ++ D PS KR + +D
Sbjct: 356 CITVCLDSYLFLGSRLGNSLLLRYYARSQSN--------------DDEPSIKRKKTDETD 401
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+D+V EL +YGS T +++SF V DS++NIGP S G A S
Sbjct: 402 --EDLV---ELEVYGSEV-QTSICLESYSFEVCDSIINIGPCSQASIGE--PAYISDEFS 453
Query: 534 SKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDD--------EYHAYLIISL 583
S + + EL+ G G +V H+S + + + Y D ++H ++I++
Sbjct: 454 SDEHDVELLCTSGHGKNGALSVLHRSIKPQLVTTFHLDGYKDMWTVHGENDFHTFMILTN 513
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
T++L+T + E+ +S Y + T+ N+ + VIQV R+L+GS Q +
Sbjct: 514 VDSTLILQTGQEINEL-DSSGYATREHTVFVCNM--NKFVIQVLRYSVRLLNGSEQLQSV 570
Query: 644 S--FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------RLLVGDPSTCTV 692
S FG S ++ S +PY +L DG + R+L+ P+
Sbjct: 571 SLDFG------------SPIIHGSSCNPYAVLLTEDGQVIVLTVKSTGRILLMRPTNFEQ 618
Query: 693 SVQTP--------AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
QT + + SS P + L GP K D ++S V + + G
Sbjct: 619 IPQTKTLAVYRDVSGLFSSTMPQAEIPLV---GP-----KLQHDHFVSDSVEDEEEMLYG 670
Query: 745 GPLDQGD--------------------------IYSVVCYESGALEIFDVPNFNCVFTVD 778
D + V+ ++G +EI+ +P+F
Sbjct: 671 DARDPSSRETPHNSVSNKNTMWWLKFLEVPTPTYWVVLTRDNGYMEIYTLPDFKI----- 725
Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK-ENIHSMKVVELAMQRWSAHHS 837
Y + +S + S EEG +K E I + +V L Q
Sbjct: 726 -----------KYRAANIDESPMILKDSLEEGCYFPKKTEIIKEILIVPLGYQ-----DK 769
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
RP +F L D ++ Y + PE T K +RF + +
Sbjct: 770 RPIMFVRL-DNEVVIYGIH--RHPEGTLK-----------------------MRFHK--M 801
Query: 898 DAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGS 953
+ ++ G P + ++ F ++GH G F+ G P ++ R LR HP DG
Sbjct: 802 TSLLTFQSRSGNPLEGTSLLRYFSKVAGHNGVFICGQNPHLILLTVRGELRCHPLHIDGP 861
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
I+ F HNVNC+ GF+Y S L+I LP+ +YD WP++KV
Sbjct: 862 IMCFAPFHNVNCSQGFLYFNSDHKLRISILPTHLSYDEPWPLRKV 906
>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Monodelphis domestica]
Length = 1449
Score = 305 bits (782), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 224/687 (32%), Positives = 354/687 (51%), Gaps = 99/687 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S +S + E K + LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303
Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
+S + P + + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
K SVLT+ + T+ FLGSRLG+SLL+++T S+ + ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAAREAPSREVSDKD 423
Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539
Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
+WTV ++++G A+ SS D + H +LI+S E TM+L+T +
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGAEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 124/264 (46%), Gaps = 49/264 (18%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ ESGA+EI+ +P++ VF V F G+ +VD+ + T+ ++ EE T
Sbjct: 790 WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L ++ +RP+L + D +L Y+A+ +
Sbjct: 846 QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + S L+ +RF + P + RE+ P + R
Sbjct: 887 -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGTEEGAGARGRVARFR 938
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 939 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 999 QGELRISVLPAYLSYDAPWPVRKI 1022
>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Pan paniscus]
gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
troglodytes]
Length = 1442
Score = 305 bits (782), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1009 PWPVRKI 1015
>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Pan troglodytes]
Length = 1296
Score = 305 bits (781), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNEALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 66/141 (46%), Gaps = 5/141 (3%)
Query: 860 GPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA-PCQRITIFK 918
G E + DD S S S S+ R S+ P D R+ P A P + +
Sbjct: 732 GSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPAD---RDPAPFRAEPTHWCLLVR 788
Query: 919 NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
F+ G P W +V R LR+HP DG + +F HNVNC GF+Y QG
Sbjct: 789 ENGTMXXXFICGPSPPWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGE 848
Query: 978 LKICQLPSGSTYDNYWPVQKV 998
L+I LP+ +YD WPV+K+
Sbjct: 849 LRISVLPAYLSYDAPWPVRKI 869
>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
sapiens]
gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
subunit 1; AltName: Full=Cleavage and polyadenylation
specificity factor 160 kDa subunit; Short=CPSF 160 kDa
subunit
gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo
sapiens]
gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
CRA_c [Homo sapiens]
gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
Length = 1443
Score = 305 bits (781), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S DD+ H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 784 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 840 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1010 PWPVRKI 1016
>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like isoform 1 [Equus caballus]
Length = 1444
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 219/675 (32%), Positives = 350/675 (51%), Gaps = 80/675 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ + T+ FLGSRLG+SLL+++T +S ++E E + P +K+ R S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430
Query: 473 -------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 VGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 FLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 546
Query: 555 --HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
++ +G + S+ A D H +LI+S E TM+L+T + E+ S + QG
Sbjct: 547 EQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + S ++ ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655
Query: 670 YVLLGMSDGSIRLLV 684
YV++ ++G + + +
Sbjct: 656 YVVIMSAEGHVTMFL 670
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 891 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 950
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 951 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1010
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1011 PWPVRKI 1017
>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
Length = 1442
Score = 305 bits (780), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S DD+ H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 784 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 840 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1010 PWPVRKI 1016
>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Sarcophilus harrisii]
Length = 1449
Score = 305 bits (780), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 224/687 (32%), Positives = 353/687 (51%), Gaps = 99/687 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S +S + E K + LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303
Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
+S + P + + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
K SVLT+ + T+ FLGSRLG+SLL+++T SS + ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSAREAPSREVSDKD 423
Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539
Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
+WTV ++++G + SS D + H +LI+S E TM+L+T +
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGPEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 124/264 (46%), Gaps = 49/264 (18%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ ESGA+EI+ +P++ VF V F G+ +VD+ + T+ ++ EE T
Sbjct: 790 WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L ++ +RP+L + D +L Y+A+ +
Sbjct: 846 QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + S L+ +RF + P + RE+ P + R
Sbjct: 887 -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGAEEGAGARGRVARFR 938
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 939 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 999 QGELRISVLPAYLSYDAPWPVRKI 1022
>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1428
Score = 305 bits (780), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 221/679 (32%), Positives = 347/679 (51%), Gaps = 90/679 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424
Query: 468 RRSSS------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
++S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + +
Sbjct: 425 DATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAM 480
Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
G L I + + + + K ++V ELPGC +WTV
Sbjct: 481 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 540
Query: 555 --------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+ G ++ A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 541 PVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GF 599
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ +
Sbjct: 600 ATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCA 649
Query: 666 IADPYVLLGMSDGSIRLLV 684
+ADPYV++ ++G + + +
Sbjct: 650 VADPYVVIMSAEGHVTMFL 668
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 115/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1009 PWPVRKI 1015
>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
subunit 1; AltName: Full=Cleavage and polyadenylation
specificity factor 160 kDa subunit; Short=CPSF 160 kDa
subunit
gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit
[Bos taurus]
gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
Length = 1444
Score = 305 bits (780), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 221/678 (32%), Positives = 345/678 (50%), Gaps = 86/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 309
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 369
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T S+ E D E KR+
Sbjct: 370 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRV 427
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 428 DATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 484 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 543
Query: 557 SSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 544 VRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFA 602
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + S ++ ++
Sbjct: 603 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAV 652
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 653 ADPYVVIMSAEGHVTMFL 670
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 891 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 950
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 951 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1010
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1011 PWPVRKI 1017
>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1442
Score = 304 bits (778), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 221/679 (32%), Positives = 347/679 (51%), Gaps = 90/679 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424
Query: 468 RRSSS------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
++S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + +
Sbjct: 425 DATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAM 480
Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
G L I + + + + K ++V ELPGC +WTV
Sbjct: 481 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 540
Query: 555 --------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+ G ++ A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 541 PVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GF 599
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ +
Sbjct: 600 ATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCA 649
Query: 666 IADPYVLLGMSDGSIRLLV 684
+ADPYV++ ++G + + +
Sbjct: 650 VADPYVVIMSAEGHVTMFL 668
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 115/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1009 PWPVRKI 1015
>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
Length = 1260
Score = 303 bits (777), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 281/1017 (27%), Positives = 447/1017 (43%), Gaps = 159/1017 (15%)
Query: 57 NLVVTAANVIEIY--VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
NLVV ANV+ ++ + E+ ++ G+ + LE + Y L G V
Sbjct: 29 NLVVAGANVLRVFRLIPNTDEKMLRKESADGQPPK---------MKLECLASYNLFGKVM 79
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A +S G+ +D+I+++F AK+S++E+D L+ S+H FE L G
Sbjct: 80 SIAAVSLPGSS----QDTILMSFAHAKLSLIEYDPVSDNLKTLSLHNFEVVSIL--DEGI 133
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE-SS 233
S + P ++VDP+GRC +L++ + IL F + + SS
Sbjct: 134 GSNHKIPEIRVDPEGRCAALLIFRNTLAILP--------------FRKDSAHDSNVTLSS 179
Query: 234 HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I L DL+ + +V D F+HGY EP ++IL+E T+ GR++ + TC + A+S++T
Sbjct: 180 YIIKLTDLEERVDNVIDVQFLHGYYEPTLIILYEPVGTFPGRIAVRQDTCNMVAVSLNTQ 239
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLD 350
+ HP+IWS +LP D +LL VP P+GG L++ N++ Y +QS +++N+ A
Sbjct: 240 QRVHPIIWSLNSLPFDCSQLLPVPKPLGGALIMAVNSVIYVNQSVPPYGVSVNSIADHCT 299
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPS 409
S P + LD A A +LQ D +LS K G+L +LT+ D R V++ L K S
Sbjct: 300 SFPLKPYEGSRIGLDCARAAFLQYDRVVLSLKGGELYVLTLFADSMRSVRKFHLEKAAAS 359
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
VLT+ + N LF LGSRLG+SLL+ F + + A P ++
Sbjct: 360 VLTTCLCICDNYLF-LGSRLGNSLLLAFQ--------TKDYNQYATPFAAKKPKMEQFSL 410
Query: 470 SSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L D ++ EE+ LYG +T+S ++ F V DSL+NIGP + G
Sbjct: 411 LFDQEL-DHLDEEEIDNYLYGEDHESTDSKAISYQFEVCDSLLNIGPCGQMAVG---EPA 466
Query: 528 ASATGISKQSNYELVELP-----GCKGIWTVYHKSSRGHNADSSRMAAYDDEY------- 575
++ T K+S VE+ G G V ++ + + + D +
Sbjct: 467 STCTDFDKKSPDPDVEIVTTSGYGKNGAICVLQRTMKPQVVTTFELPEVSDMFTVFASRN 526
Query: 576 ------HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
H YL++S TMVL+T + E+ +S + V TI A NL R ++QV
Sbjct: 527 NEDAIMHTYLLLSRADSTMVLQTGQEINEMDQS-GFSVTSPTILAANLGNNRFIVQVCPT 585
Query: 630 GARILDGS-YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV----------LLGMSDG 678
R+LD + + Q+L + + S S +DPYV LL +G
Sbjct: 586 SVRLLDATATVIQELVM----------DSDFLITSASASDPYVAVLTENGRIGLLTFVEG 635
Query: 679 SIRLLV------GDPSTCTVSVQTPAAIESSKKPVS---SCTLYHDKGPEPWLRKTSTDA 729
S ++ P C + + + ++ P + T H +K D
Sbjct: 636 SQLEMIFPVLSKNSPVVCVCLYRDISGLFNTTIPETDSPETTKLHTANKSLNAKKEMDDE 695
Query: 730 ----WLSTGVGEAIDGADGGPLDQGDIYSVVCY--------------ESGALEIFDVPNF 771
+ T E+ D V+ Y ++G LEI+ +
Sbjct: 696 EDYLYGDTNTEESRPTEDKTHTKFTPQQKVIDYFREIKPTFWLSIIRQNGTLEIYSLAGQ 755
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
+ V+ F + H+ + +K ET + SS+ +VE+ +
Sbjct: 756 S---VVETFQTVHVHLGHRLIFN-MKADETSLPSSTH-------------CNIVEMGIFG 798
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN-- 889
H RP L +D +L Y+A PV S+ + + +L +
Sbjct: 799 LGHLHRRPLLMIRTSDFGVLLYEAI----------PALPVYDSKQKNELKIRFRKLNHSL 848
Query: 890 -LRFSRTPLDAYTREE------TPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRE 941
LR ++T Y R+ P+ + F NI+G+ G F+ G P W M R
Sbjct: 849 LLRETKT----YVRKGGQSVVLEPYAWKTNQFKYFSNIAGYTGVFIGGPYPHWLFMTSRG 904
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
LR+HP DGSI F HNVNC GFIY+ + L+IC LP+ YD WPV+KV
Sbjct: 905 ELRLHPMSIDGSIKCFACFHNVNCAQGFIYLNRKDELRICLLPTLFNYDAPWPVRKV 961
>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 isoform 2 [Cricetulus griseus]
Length = 1388
Score = 303 bits (776), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------HK 556
L I + + + + K ++V ELPGC +WTV +
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 557 SSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
+ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 APRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1006 PWPVRKI 1012
>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
[Heterocephalus glaber]
Length = 1440
Score = 303 bits (776), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 220/678 (32%), Positives = 347/678 (51%), Gaps = 89/678 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEGLTKNDKTTEGKSHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 424
Query: 468 RRSSSDA-----LQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
++ A QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 425 DSAAGWAGNKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 480
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 540
Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 541 VRKEEEETPKAEGSEQEPSAPEAQDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 115/246 (46%), Gaps = 14/246 (5%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SSGQPTTQGEARKEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W +V LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGLRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAP 1007
Query: 993 WPVQKV 998
WPV+K+
Sbjct: 1008 WPVRKI 1013
>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Equus caballus]
Length = 1450
Score = 303 bits (775), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 219/681 (32%), Positives = 350/681 (51%), Gaps = 86/681 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ + T+ FLGSRLG+SLL+++T +S ++E E + P +K+ R S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430
Query: 473 -------------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP +
Sbjct: 431 VGWSGSPRAAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANA 486
Query: 519 SYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
+ G L I + + + + K ++V ELPGC +WTV
Sbjct: 487 AMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 546
Query: 554 Y-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
++ +G + S+ A D H +LI+S E TM+L+T + E+ S
Sbjct: 547 IAPVRKEQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS- 605
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ QG T+ AGN+ R ++QV G R+L+G L F P + S ++
Sbjct: 606 GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQ 655
Query: 664 VSIADPYVLLGMSDGSIRLLV 684
++ADPYV++ ++G + + +
Sbjct: 656 CAVADPYVVIMSAEGHVTMFL 676
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 791 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 846
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 847 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 896
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 897 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 956
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 957 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1016
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1017 PWPVRKI 1023
>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 isoform 1 [Cricetulus griseus]
Length = 1441
Score = 303 bits (775), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
++ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1008 PWPVRKI 1014
>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 isoform 3 [Cricetulus griseus]
Length = 1449
Score = 302 bits (774), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
++ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1008 PWPVRKI 1014
>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Canis lupus familiaris]
Length = 1460
Score = 301 bits (770), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 212/630 (33%), Positives = 331/630 (52%), Gaps = 72/630 (11%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+
Sbjct: 84 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 139
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
H FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 140 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 193
Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 194 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 253
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 254 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 313
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG
Sbjct: 314 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 373
Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
R V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+ E
Sbjct: 374 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAA 431
Query: 456 DIEADAPSTKRLRRSS-----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSL 509
D E KR+ ++ QD V+ E+ +YGS A + T+ A T+SF V DS+
Sbjct: 432 DKEEPPSKKKRVDCAAGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSI 487
Query: 510 VNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---EL 544
+NIGP + + G L I + + + + K ++V EL
Sbjct: 488 LNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFEL 547
Query: 545 PGCKGIWTVY-------HKSSRGHNA--DSSRMAAYDD-EYHAYLIISLEARTMVLETAD 594
PGC +WTV ++S+G A +SS + A DD H +LI+S E TM+L+T
Sbjct: 548 PGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQTGQ 607
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 608 EIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL---- 659
Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 660 ---GSPIVQCAVADPYVVIMSAEGHVTMFL 686
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 801 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 856
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 857 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 906
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 907 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 966
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 967 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1026
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1027 PWPVRKI 1033
>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Felis catus]
Length = 1432
Score = 300 bits (768), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 211/632 (33%), Positives = 334/632 (52%), Gaps = 76/632 (12%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+
Sbjct: 56 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 111
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
H FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 112 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 165
Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 166 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 225
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 226 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 285
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG
Sbjct: 286 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 345
Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
R V+ K SVLT+ + T+ FLGSRLG+SLL+++T +S ++E
Sbjct: 346 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA-- 402
Query: 456 DIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
+ + P +K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V D
Sbjct: 403 -ADKEEPPSKKKRVDSTVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCD 457
Query: 508 SLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV--- 542
S++NIGP + + G L I + + + + K ++V
Sbjct: 458 SILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTF 517
Query: 543 ELPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLET 592
ELPGC +WTV ++S+G A+ S + A DD H +LI+S E TM+L+T
Sbjct: 518 ELPGCYDMWTVIAPVRKEQEETSKGEGAEQEPSTLEAEDDGRRHGFLILSREDSTMILQT 577
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 578 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-- 631
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 632 -----GSPIVQCAVADPYVVIMSAEGHVTMFL 658
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 122/264 (46%), Gaps = 49/264 (18%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G LEI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 773 WCLLVRENGTLEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 828
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 829 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 871
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
L N+ +RF + P + RE E GA + R
Sbjct: 872 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 921
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 922 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 981
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 982 QGELRISVLPAYLSYDAPWPVRKI 1005
>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
Length = 1434
Score = 299 bits (766), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 283/1050 (26%), Positives = 477/1050 (45%), Gaps = 179/1050 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y RV + +++ R M LE V YRL+GN++S+
Sbjct: 29 SLVTGGANVLKVY--RVIPDADPATRDKYTAARPPNM------KLECVASYRLNGNIKSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ RD+++++F DAK+SV++FD L+ S+H FE + ++ G
Sbjct: 81 QSVSLAGS----LRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
P+V+VDP RC +LVYG ++++L + S L + + A+ I
Sbjct: 134 HYHIPMVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIELQDVKPIKKAPMQLVAKTPI 193
Query: 231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S++I L+DLD K +V D F+HGY EP ++IL+E T+ GR++ + TC + ALS+
Sbjct: 194 LASYIIELKDLDEKIDNVIDIQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 253
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
+ + HP+IW+ +LP D + + + PIGG LV+ N++ Y +QS Y VS
Sbjct: 254 NIQQRVHPVIWTVNSLPFDCIQAIPINKPIGGCLVMCVNSLIYLNQSVP------PYGVS 307
Query: 349 LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQR 400
L+SS + P+ + LDAA +++ + +LS K G+L +LT+ D R V+
Sbjct: 308 LNSSADHSTSFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRN 367
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+K SVLTS I + FLGSRLG+SLL++F + +++ ++ G +E +
Sbjct: 368 FHFNKAAASVLTSCICVCEDEYLFLGSRLGNSLLLRFKEKDESLVITI---DDSGAVEKE 424
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
KR R + +YGS T ++ F V D+++NIGP+ +
Sbjct: 425 P---KRPRLEEEEL----------EVYGSGY-KTSVQLTSYIFEVCDNVLNIGPIAHMAV 470
Query: 521 GLRINADASATG-----ISKQSNYELVE--------------------------LPGCKG 549
G R+ + + + + + E+V L GC
Sbjct: 471 GERVAEEDAENQPDVQIVQNKLDIEVVTSSGHGKNGALCVLQSSIKPQVITSFGLSGCVD 530
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+WTV+ ++ +R A HA++I+S E TMVL+T + + E+ E+ +
Sbjct: 531 VWTVFDEA-------VARRAEDGPSTHAFMILSQEGGTMVLQTGEEINEI-ENTGFATTV 582
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TI GN+ R ++QV + R+L G+ + Q++ + SV+I DP
Sbjct: 583 PTIHVGNIGTNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVAIVDP 632
Query: 670 YVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTLYHDK--- 716
YV + S+G + L G P T+S TPA + S+ + VS L+ K
Sbjct: 633 YVCVRSSEGRVITLALREGKGTPRLAVNKNTIS-PTPAVVAISAYRDVSG--LFTKKIED 689
Query: 717 --------------------GPEPWLRKTSTDAWLSTGVGE----------AIDGADGGP 746
PEP ++ + L G AI G GG
Sbjct: 690 VYDLSRGGAASAYSSGFGSMKPEPHMKIEDEEDLLYGESGRSFKMTSMADMAIAGKSGGS 749
Query: 747 LD---------QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT--YMREA 795
D + + ++G LEI+ +P+ V+ + +G + D+ ++
Sbjct: 750 ADFWMKYMQQVKPTYWLFAARDNGTLEIYSMPDLKLVYLITNVGNGNKVLSDSMEFVPLP 809
Query: 796 LKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA 855
+ S ++ ++SS G G ++ +++ +A+ ++ SRP LF I + +L Y+
Sbjct: 810 MGKSASQEDASSAFGASFGVSASLLPKEILMVAL---GSYGSRPLLF-IRLEHDLLIYRV 865
Query: 856 YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP----- 910
+ + SK + R LS S V+ R S E+ A
Sbjct: 866 FRY------SKGHLKLRFKR-LSTS-VTCPVFRTPEPSGAGATEAANEQQQARATKVLYE 917
Query: 911 -CQRITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
I F N+SG+ G + G +P + + LR H + AF +NVNC +G
Sbjct: 918 NISMIRYFANVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFNNVNCPNG 977
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
F+Y Q LKI P+ +YD+ WPV+K+
Sbjct: 978 FLYFDEQYELKISIFPTYLSYDSVWPVRKI 1007
>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Otolemur garnettii]
Length = 1441
Score = 299 bits (766), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 217/678 (32%), Positives = 344/678 (50%), Gaps = 89/678 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEVLTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASATRESADKEEPPSKKKRV 424
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 425 DPSVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAS 540
Query: 555 -----HKSSRGHNADSSRMAAYDDE---YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
++ +G + +E H +LI+S E TM+L+T + E+ S +
Sbjct: 541 VRKEEEETPKGEGTEQESGVPEGEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 122/264 (46%), Gaps = 49/264 (18%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 880
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
L N+ +RF + P + RE T GA + R
Sbjct: 881 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGVRGRVARFR 930
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 931 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 990
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 991 QGELRISVLPAYLSYDAPWPVRKI 1014
>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex
subunit cft1 cpsf subunit [Desmodus rotundus]
Length = 1444
Score = 299 bits (765), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 216/675 (32%), Positives = 347/675 (51%), Gaps = 80/675 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTSGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRTTLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
S + T+ FLGSRLG+SLL+++T +S ++E + + PS+K+ R +
Sbjct: 375 SSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEAPASAVREA---ADKEEPSSKKKRVDPT 430
Query: 473 -------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 VGWSGGQSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 FLSEEFQNSPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVVAPVRK 546
Query: 555 --HKSSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
++ +G + + D H +LI+S E TM+L+T + E+ S + QG
Sbjct: 547 EQEETPKGEGTEQEPITPETEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + S ++ ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655
Query: 670 YVLLGMSDGSIRLLV 684
V++ ++G + + +
Sbjct: 656 CVVIMSAEGHVAMFL 670
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 120/264 (45%), Gaps = 49/264 (18%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAFAHD------------- 881
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + L+ +RF + P + RE+ P + R
Sbjct: 882 -------SQLGQGNLK-VRFKKVPHNINFREKKPKPSKKKADGGGAEEGAGARGRVARFR 933
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 934 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 993
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 994 QGELRISVLPAYLSYDAPWPVRKI 1017
>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1
[Cricetulus griseus]
Length = 1419
Score = 298 bits (764), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 82/671 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA--DAPSTKRLRRS 470
+ + T+ FLGSRLG+SLL+++T SS ++E A + P +K+ R
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS-VREAADKASAHNEEPPSKKKRVD 430
Query: 471 SSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 PTAGWTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGE 486
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 546
Query: 555 ----HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
++ R + + ++ A D H +LI+S E TM+L+T + E+ S +
Sbjct: 547 RKEEEEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 605
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 606 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 655
Query: 668 DPYVLLGMSDG 678
DPYV++ ++G
Sbjct: 656 DPYVVIMSAEG 666
Score = 109 bits (272), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 760 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 815
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 816 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 865
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 866 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 925
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 926 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 985
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 986 PWPVRKI 992
>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1-like [Ailuropoda
melanoleuca]
Length = 1469
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 210/632 (33%), Positives = 333/632 (52%), Gaps = 76/632 (12%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+
Sbjct: 103 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 158
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
H FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 159 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 212
Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 213 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 272
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 273 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 332
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG
Sbjct: 333 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 392
Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
R V+ K SVLT+ + T+ FLGSRLG+SLL+++T +S ++E
Sbjct: 393 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA-- 449
Query: 456 DIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
+ + P +K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V D
Sbjct: 450 -ADKEEPPSKKKRVDSTVGWSGGKSMPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCD 504
Query: 508 SLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV--- 542
S++NIGP + + G L I + + + + K ++V
Sbjct: 505 SILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTF 564
Query: 543 ELPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLET 592
ELPGC +WTV ++ +G A+ S + A DD H +LI+S E TM+L+T
Sbjct: 565 ELPGCYDMWTVIAPVRKEQEETPKGEGAEQEPSALEADDDGRRHGFLILSREDSTMILQT 624
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 625 GQEIMELDTS-GFATQGPTVFAGNIGDSRYIVQVSPLGIRLLEG---VNQLHFIPVDL-- 678
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 679 -----GSPIVQCAVADPYVVIMSAEGHVTMFL 705
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 122/271 (45%), Gaps = 56/271 (20%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 820 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 875
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 876 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 918
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
L N+ +RF + P + RE E GA + R
Sbjct: 919 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 968
Query: 916 IFKNISGHQG-------FFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNH 967
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC
Sbjct: 969 YFEDIYGYSGGGGACPQVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPR 1028
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
GF+Y QG L+I LP+ +YD WPV+K+
Sbjct: 1029 GFLYFNRQGELRISVLPAYLSYDAPWPVRKI 1059
>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Apis florea]
Length = 1583
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 210/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV AN+I ++ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ E + E + K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIVLEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
G N D + +G K ++ ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
G + ++ + HA+LI+S E TM+L+T + EV +S + QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
NL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650
Query: 676 SDGSIRLL 683
DG + LL
Sbjct: 651 EDGQVMLL 658
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/250 (27%), Positives = 105/250 (42%), Gaps = 40/250 (16%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 772 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 826 -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868
Query: 875 RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
R + L P L R+E R + F NI+G+ G F+
Sbjct: 869 -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 918 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977
Query: 989 YDNYWPVQKV 998
YD WPV+KV
Sbjct: 978 YDAPWPVRKV 987
>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
putorius furo]
Length = 1399
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 210/636 (33%), Positives = 329/636 (51%), Gaps = 81/636 (12%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+
Sbjct: 21 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 76
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVL----VYGLQMIILKASQGGSGLVG 215
H FE PE L+ G P V+VDP GRC +L +YG ++++L + +
Sbjct: 77 HYFEEPE---LRDGFVQNVHAPRVRVDPDGRCAAMLTAMLIYGSRLVVLPFRRES---LA 130
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
+E G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GR
Sbjct: 131 EEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGR 190
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
V+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +
Sbjct: 191 VAVRQDTCCIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLN 250
Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
QS +ALN + + + LD A A ++ D ++S K G++ +LT++
Sbjct: 251 QSVPPYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLI 310
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++ T L
Sbjct: 311 TDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY-----TEKLQEAPA 365
Query: 452 EEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSF 503
+ + D P +K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF
Sbjct: 366 GAVRETDKDEPPSKKKRVESAVGWSGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSF 421
Query: 504 AVRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYEL 541
V DS++NIGP + + G L I + + + + K ++
Sbjct: 422 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 481
Query: 542 V---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTM 588
V ELPGC +WTV + + G + S + A DD H +LI+S E TM
Sbjct: 482 VTTFELPGCYDMWTVIAPARKEQEETPKGDGAEQEPSALEADDDGRRHGFLILSREDSTM 541
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 542 ILQTGQEIMELDTS-GFATQGPTVFAGNIGDGRYIVQVSPLGIRLLEG---VSQLHFIPV 597
Query: 649 NSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
+ S ++ ++ADPYV++ ++G + + +
Sbjct: 598 DL-------GSPIVQCAVADPYVVIMSAEGHVTMFL 626
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 741 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 796
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 797 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 846
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 847 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAAARGRVARFRYFEDIYGYSGVFICGPS 906
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 907 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 966
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 967 PWPVRKI 973
>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
Length = 1420
Score = 296 bits (759), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 287/1054 (27%), Positives = 462/1054 (43%), Gaps = 200/1054 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N + Y +QS Y V
Sbjct: 243 ----------------------KVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 335 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 394
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 395 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 453
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 454 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 513
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 514 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 564
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 565 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 613
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 614 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 673
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 674 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 733
Query: 741 GADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G +
Sbjct: 734 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 793
Query: 789 DTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
D + + E +S+ G Q ++ +S +EL++ + RP L + T
Sbjct: 794 DAMEFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTR 849
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
+L YQ +F P+ K R + N+ + ++ D E+
Sbjct: 850 VELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQ 900
Query: 908 GAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVN 964
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 901 MQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVN 960
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 961 IPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKV 994
>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Bos taurus]
Length = 1490
Score = 295 bits (756), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 209/635 (32%), Positives = 327/635 (51%), Gaps = 82/635 (12%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+
Sbjct: 114 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 169
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLV 214
H FE PE L+ G P V+VDP GRC +L+YG ++++L ++ GLV
Sbjct: 170 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 226
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
G+ G + S++I++R LD K ++ D F+HGY EP ++IL E TW G
Sbjct: 227 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 278
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
+V+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV++ N++ Y
Sbjct: 279 KVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 338
Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
+QS +ALN+ + + + LD A A ++ D ++S K G++ +LT+
Sbjct: 339 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 398
Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
+ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+
Sbjct: 399 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 457
Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
E D E KR+ + S QD V+ E+ +YGS A + T+ A T+SF
Sbjct: 458 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 512
Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
V DS++NIGP + + G L I + + + + K ++V
Sbjct: 513 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 572
Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
ELPGC +WTV + G + A DD H +LI+S E TM+
Sbjct: 573 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 632
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 633 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 688
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 689 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 716
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 831 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 886
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 887 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 936
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 937 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 996
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 997 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1056
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1057 PWPVRKI 1063
>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
Length = 1426
Score = 295 bits (755), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 291/1065 (27%), Positives = 451/1065 (42%), Gaps = 216/1065 (20%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + + K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDAVQRQKLNPSEMRLAPKM------RLECLASYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGA----MRDALLISFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + R
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+ + T
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDT------- 248
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
+ PIGG LV+ N + Y +QS Y V
Sbjct: 249 -----------------------FFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 279
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 280 SLNSSADNSTAFPLKPQDNVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 339
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 340 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEATVEQQT 399
Query: 448 -----SGLKEE--FGDIEA-DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
L EE D+E +AP + RR + EEL +YGS + + +
Sbjct: 400 IEQSPEELAEESPVYDVEQHEAPPQSKSRR---------IEDEELEVYGSGAKASVLQLR 450
Query: 500 TFSFAVRDSLVNIGPLKDFSYG-----------LRINAD---------ASATGISKQS-- 537
F F V DSL+N+ P+ G LR +AD +ATG SK
Sbjct: 451 KFIFEVCDSLINVAPINYMCAGERVEFEEDGATLRPHADNLNDLKIELVAATGHSKNGAL 510
Query: 538 -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
N +++ EL GC +WTV+ ++R A ++R + H ++++S + T
Sbjct: 511 SVFVNCINPQIITSFELEGCLDVWTVFDDATR--KATTAR-----QDQHDFMLLSQRSST 563
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 564 LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 620
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
S V+ VSIADPYV L + +G + L + T + SS V
Sbjct: 621 --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 672
Query: 708 SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
+ Y D EP ++ + L G
Sbjct: 673 VAIAAYKDLSGLFTCKADDVLNLTGSSGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGS 732
Query: 738 AI------DGADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDK 779
A D A D + VV +SG LEI+ +P+ V+ V+
Sbjct: 733 AFKLNSMADLAKQSKQKNSDWWRRQLIQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 792
Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
+G + D E + S T+ NS + G ++ +S +EL + H R
Sbjct: 793 IGNGALVLSDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGER 848
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P L + T +L YQ + + + + L + + LD
Sbjct: 849 PLLL-VRTRLELLIYQVFRY------------AKGHLKIRFRKLEQLHLLEQQPTHIELD 895
Query: 899 AYTREETP----HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
EE Q++ F N+ G G + G PC+ + R LR+H L +G
Sbjct: 896 GEDVEEAESYNMQAKYVQKLRYFANVGGLAGIMVCGVNPCFVFLTSRGELRIHRLLGNGD 955
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ +F +NVN HGF+Y + LKI LPS +YD WPV+KV
Sbjct: 956 VRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKV 1000
>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Apis mellifera]
Length = 1415
Score = 293 bits (750), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 209/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV AN+I ++ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ + + E + K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
G N D + +G K ++ ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
G + ++ + HA+LI+S E TM+L+T + EV +S + QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
NL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650
Query: 676 SDGSIRLL 683
DG + LL
Sbjct: 651 EDGQVMLL 658
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/250 (27%), Positives = 105/250 (42%), Gaps = 40/250 (16%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 772 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 826 -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868
Query: 875 RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
R + L P L R+E R + F NI+G+ G F+
Sbjct: 869 -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 918 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977
Query: 989 YDNYWPVQKV 998
YD WPV+KV
Sbjct: 978 YDAPWPVRKV 987
>gi|301628217|ref|XP_002943254.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Xenopus (Silurana) tropicalis]
Length = 628
Score = 293 bits (749), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 204/626 (32%), Positives = 318/626 (50%), Gaps = 74/626 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S + + E K LEL+ + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNPNCESSSKGEKGSEVKGH-------KEKLELMASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F++AK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG Q+++L ++ GLVG+ G +
Sbjct: 135 NVHNPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHDGLVGE--------GQKSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
K HP+IWS NLP+D + LAVP PIGGV++ N++ Y +QS ++LN+
Sbjct: 247 IMQKVHPVIWSLTNLPYDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVSLNSLTNG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
S P+ V LD + AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTSFPLKPQEGLRVTLDCSQATFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ +T + FLGSRLG+SLL+++T S + + D P K+
Sbjct: 367 ASVLTTSMTPMEPGYLFLGSRLGNSLLLRYTEKVQDSPAGPSKDPD----KQDEPPNKKK 422
Query: 468 RRSSSDALQ-----DMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
R SS A +MV+ +E+ +YGS + + T+SF V DS++NIGP S G
Sbjct: 423 RVDSSLARPGGSKGNMVDEIDEIEVYGSEM-QSGTQLSTYSFEVCDSILNIGPCATASMG 481
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 482 EPAFLSEEFQESPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVISN 541
Query: 557 SSRGHNADSSRM------AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
+ A D H +LI+S + TM+L+T + E+ S + Q
Sbjct: 542 HKKEEQEGEKEGETPPVEAEEDTNRHGFLILSRDDSTMILQTGQEIMELDTS-GFATQDP 600
Query: 611 TIAAGNLFGRRRVIQVFERGARILDG 636
T+ AGN+ + ++QV RG R+L+G
Sbjct: 601 TVYAGNIGDNKYIVQVSPRGIRLLEG 626
>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
Length = 1532
Score = 293 bits (749), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 206/621 (33%), Positives = 328/621 (52%), Gaps = 63/621 (10%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LE + Y LHGN+ S+ + G+ +RDS++L+F DAK+SV+E+D IH LR S+
Sbjct: 32 KLECLAQYTLHGNIMSMQAVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSL 87
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-D 218
H FE E +K G + P+V+VDP+GRC +L++G ++++L + S GD D
Sbjct: 88 HYFEEEE---IKDGWTNHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLD 144
Query: 219 TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
T A I SS++I L+ L+ M +V D F+HGY EP ++IL+E T+AGR++
Sbjct: 145 TAKLTSSNKAPILSSYMIVLKSLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFAGRIAV 204
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS- 335
+ TC + A+S++ + HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS
Sbjct: 205 RQDTCAMVAISLNIQQRVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAFNSLIYLNQSI 264
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
++LN+ A + + P+ + L+ A ++ D ++S K+G+L +L++ D
Sbjct: 265 PPYGVSLNSLADTSTNFPLKPQEGVKMSLEGAQVAFISADRLVISLKSGELYVLSLFADS 324
Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEE 453
R V+ K SVLTS + ++ FLGSRLG+SLL++FT ++ + +G +
Sbjct: 325 MRSVRGFHFDKAAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNGGEIT 384
Query: 454 FGDIEADAPSTKRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
+ E++ K+ ++ +SD L D+ + EEL +YGS + +T ++ F V D
Sbjct: 385 IEENESEETPAKKAKQDFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCD 442
Query: 508 SLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV------------ 542
SL+NIGP + S G N D + +G K ++
Sbjct: 443 SLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTF 502
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
ELPGC+ +WTV N D + A + HA+LI+S E TM+L+T + EV +S
Sbjct: 503 ELPGCEDMWTVIGTL----NNDEIKTEA--EGSHAFLILSQEDSTMILQTGQEINEVDQS 556
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
+ QG T+ AGNL R ++QV + G R+L G Q + ++
Sbjct: 557 -GFSTQGSTVFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIV 605
Query: 663 SVSIADPYVLLGMSDGSIRLL 683
S ADPYV L DG + LL
Sbjct: 606 HASCADPYVTLLSEDGQVMLL 626
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 107/248 (43%), Gaps = 35/248 (14%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ T IN
Sbjct: 740 LVYRDSGTLEIYSLPDLRLSYLIRNFGFGQYVLHDSMESTTLQS--TPINEIPHP----- 792
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQ Y + P+ K
Sbjct: 793 ------DMQVREILMVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGYLK-------- 835
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
L + + R S P E+ P RI + F NI+G+ G F+
Sbjct: 836 --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNTSDTRICVMRYFSNIAGYNGVFICSD 888
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DGS+ +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 889 YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 948
Query: 991 NYWPVQKV 998
WPV+KV
Sbjct: 949 APWPVRKV 956
>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
Length = 1420
Score = 290 bits (741), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 284/1055 (26%), Positives = 447/1055 (42%), Gaps = 202/1055 (19%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + ++ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDAAQRQKLNPTEMRLAPKM------RLECLASYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S G RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGG----MRDALLISFKDAKLSVLQLDADTQALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + R
Sbjct: 136 GRYHVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N I Y +QS Y V
Sbjct: 243 ----------------------KVFPIQKPIGGCLVMTVNAIIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
K SVLTS I FLGSRLG+SLL+ FT +++++ E + +A
Sbjct: 335 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDMENAVEQQA 394
Query: 460 DAPSTKRL----------RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSL 509
+ +L + S A + EEL +YGS + + + F F V DSL
Sbjct: 395 VEQAPPQLDEEQVYDVDQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSL 454
Query: 510 VNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------NYE 540
+N+ P+ G R+ + +ATG SK N +
Sbjct: 455 INVAPINYMCAGERVEFEEDGSTLRPHAESLNEVKIELVAATGHSKNGALSVFVNCINPQ 514
Query: 541 LV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
++ EL GC +WTV+ ++R ++R E H ++++S + T+VL+T +
Sbjct: 515 IITSFELDGCLDVWTVFDDATR--KPTTAR-----QEQHDFMLLSQRSSTLVLQTGQEIN 567
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 568 EI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----------DV 616
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-- 715
S V+ VSIADPYV L + +G + L + T + SS V + Y D
Sbjct: 617 GSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAIAAYKDLS 676
Query: 716 ----------------KGP------------EPWLRKTSTDAWLSTGVGEAI------DG 741
GP EP ++ + L G A D
Sbjct: 677 GLFTCKADDVLNLTGSSGPGFVNSFGGYMKAEPHMKVEDEEDLLYGDAGNAFKLNSMADL 736
Query: 742 ADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
A D + VV +SG LEI+ +P+ V+ V+ +G + D
Sbjct: 737 AKQSKQKNSDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGALVLND 796
Query: 790 TYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
E + S T+ NS + G ++ +S +EL + H RP L + T
Sbjct: 797 AM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGERPLLL-VRTRL 851
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP-- 906
+L YQ + + + + L + + + LD EE
Sbjct: 852 ELLIYQVFRY------------AKGHLKIRFRKLEQLHLLDQQPTHIELDGDEAEEAESY 899
Query: 907 --HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNV 963
Q++ F N+ G G + G P + + R LR+H L + + +F +NV
Sbjct: 900 NMQPKYVQKLRYFSNVGGLAGIMVCGMNPVFVFLTARGELRIHRLLGNADVRSFAAFNNV 959
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
N HGF+Y + LKI LPS +YD WPV+KV
Sbjct: 960 NIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKV 994
>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
Length = 1431
Score = 289 bits (739), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 280/1068 (26%), Positives = 446/1068 (41%), Gaps = 217/1068 (20%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + ++ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDATQRQKLNPSEMRLAPKM------RLECLASYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S G RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGG----MRDALLVSFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N + Y +QS Y V
Sbjct: 243 ----------------------KVFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 335 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVESAATAAA 394
Query: 448 --------SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+ + ++ D + S A + EEL +YGS + + +
Sbjct: 395 TGAGEQQQQAIDQSPPQMDEDQVYDVEQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLR 454
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
F F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 455 KFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLTDLKIELVAATGHSKNGAL 514
Query: 538 -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
N +++ EL GC +WTV+ ++R + + E H ++++S + T
Sbjct: 515 SVFVNCINPQIITSFELDGCLDVWTVFDDATR-------KPSTARQEQHDFMLLSQRSST 567
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 568 LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 624
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
S V+ VSIADPYV L + +G + L + T + SS V
Sbjct: 625 --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSAPAV 676
Query: 708 SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
+ Y D EP ++ + L G
Sbjct: 677 VAIAAYKDLSGLFTCKADDVLNLTGSTGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGN 736
Query: 738 AI------DGADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDK 779
A D A D + VV +SG LEI+ +P+ V+ V+
Sbjct: 737 AFKLNSMADLAKQSKQKNTDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 796
Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
+G + D E + S T+ NS + G ++ +S +EL++ H R
Sbjct: 797 VGNGALVLTDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELSLVGLGQHGDR 852
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P L + T +L YQ + + L + +L L T ++
Sbjct: 853 PLLL-VRTRLELLIYQVFRY--------------AKGHLKIRFRKLEQLHLLDQQPTHIE 897
Query: 899 AYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLC 950
EET Q++ F N+ G G + G PC+ + R LR+H L
Sbjct: 898 LINEEETDEAESYNMQPKYVQKLRYFNNVGGLAGIMVCGVNPCFIFLTARGELRIHRLLG 957
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ + +F +NVN HGF+Y + LKI LP+ +YD WPV+KV
Sbjct: 958 NAEVRSFAAFNNVNIPHGFLYFDTTYELKISVLPTYLSYDAAWPVRKV 1005
>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1
[Acromyrmex echinatior]
Length = 1412
Score = 289 bits (739), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 214/665 (32%), Positives = 342/665 (51%), Gaps = 76/665 (11%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV ANVI ++ + + ++ K + ET+ LE + Y LHGN+ S+
Sbjct: 30 LVVAGANVIRVFRLIPDVDMTRREKYT-ETRP-------PKMKLECLTQYTLHGNIMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 82 AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
P+V+VDP+GRC +L++G ++++L + S + D D S S I SS+
Sbjct: 135 HHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPS--LDDGDLLDSAKLTSTNKTPILSSY 192
Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+I L+ L+ M +V D F+HGY EP ++IL+E T++GR++ + TC + A+S++
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
+ HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS ++LN+ A S +
Sbjct: 253 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADSSTN 312
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + L+ + ++ D ++S K+G+L +L++ D R V+ K SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM--LSSGLKEEFGDIEADAPSTKRLR 468
LTS + ++ FLGSRLG+SLL++FT ++ L+ + + P+ K +
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENENEETPAKKTKQ 432
Query: 469 R-----SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
+SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S G
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490
Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
N D + +G K ++ +LPGC+ +WTV
Sbjct: 491 AFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFQLPGCEDMWTVIGIV- 549
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
N D R ++ HA+LI+S E TMVL+T + EV +S + QG T+ AGNL
Sbjct: 550 ---NNDEIRT---EEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLG 602
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
R ++QV + G R+L G Q + ++ S ADPYV L DG
Sbjct: 603 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVALLSEDG 652
Query: 679 SIRLL 683
+ LL
Sbjct: 653 QVMLL 657
Score = 94.4 bits (233), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 107/248 (43%), Gaps = 35/248 (14%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ T IN
Sbjct: 771 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQ--STPINEIPHP----- 823
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 824 ------DMQVREILMVALGHHGNRPMLLVRL-DSDLQIYQAYRY--PKGYLK-------- 866
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
L + + R S P E+ P RI + F NI+G+ G F+
Sbjct: 867 --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNRNITRICVMRYFSNIAGYNGVFICSD 919
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 920 YPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979
Query: 991 NYWPVQKV 998
WPV+KV
Sbjct: 980 APWPVRKV 987
>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Gorilla gorilla gorilla]
Length = 1440
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 210/675 (31%), Positives = 336/675 (49%), Gaps = 84/675 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP G C +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGTCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T +S ++E + + P +K+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---ADKEEPPSKKK 422
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR---DSLVNIGPLKDFSYG--- 521
R ++ G + A + AV DS++NIGP + + G
Sbjct: 423 RVDATAGWSGEGRSRAGQERGQVTQGWSGAGAPLTVAVPQVCDSILNIGPCANAAMGEPA 482
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 FLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 542
Query: 555 ----HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+ G + S A DD H +LI+S E TM+L+T + E+ S + QG
Sbjct: 543 EEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 601
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADP
Sbjct: 602 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADP 651
Query: 670 YVLLGMSDGSIRLLV 684
YV++ ++G + + +
Sbjct: 652 YVVIMSAEGHVTMFL 666
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 781 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 836
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 837 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 887 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 946
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 947 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1007 PWPVRKI 1013
>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
Length = 1503
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 277/1057 (26%), Positives = 457/1057 (43%), Gaps = 185/1057 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y R+ + ++ R M LE + YRL GN+ SL
Sbjct: 42 SLVTGGANVLKVY--RIIPDADPATREKYSATRPPNM------KLECMASYRLFGNIMSL 93
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ +RD+++++F DAK+SV++FD L+ S+H FE + ++ G
Sbjct: 94 QSVSLAGS----QRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 146
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGGF---SARI 230
PLV+VDP RC +LVYG ++++L K S + D I
Sbjct: 147 HYHIPLVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEMQDVKPIKKTPTLLIAKTPI 206
Query: 231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S++I L+DLD K +V D F+HGY EP ++IL+E T+ GR++ + TC + ALS+
Sbjct: 207 LASYIIELKDLDEKIDNVIDVQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 266
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
+ + HP+IW+ +LP D + + + PIGG LV+ N++ Y +QS Y VS
Sbjct: 267 NIQQRVHPVIWTVNSLPFDCLQAVPISKPIGGCLVMCVNSLIYLNQSVP------PYGVS 320
Query: 349 LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
L+SS + P+ + LDAA +++++ +LS K G+L +LT+ D
Sbjct: 321 LNSSADHSTNFPLKPQDGVRISLDAAQVCFIESEKLVLSLKGGELYVLTLCADS------ 374
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
I FLGSRLG+SLL++F + +++ ++ G +E +
Sbjct: 375 ----------MRSICVCETEYLFLGSRLGNSLLLRFREKDESLVITI---DDSGTVEKE- 420
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
KR R + +YGS T ++ F V DS++NIGP+ + G
Sbjct: 421 --QKRQRLEEEEL----------EVYGSGY-KTSVQLTSYIFEVCDSVLNIGPIAHMAVG 467
Query: 522 LRINAD-------------------ASATGISKQSNYELVE------------LPGCKGI 550
RI + +A+G K +++ L GC +
Sbjct: 468 ERICEEEMEEGAEVQFVPNKLDVEVVTASGHGKNGALCVLQSSIKPQVITSFGLSGCLDV 527
Query: 551 WTVYHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
WTV+ +++ +R DD HA++I+S E TMVL+T + + E+ E+ +
Sbjct: 528 WTVFDEAAGPGGVTGTRKP--DDAPPPNHAFMILSQEGATMVLQTGEEINEI-ENTGFAT 584
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
TI GN+ R ++QV + R+L G+ + Q++ + SVSI
Sbjct: 585 DVPTIHVGNIGSNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVSIV 634
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD------------ 715
DPYV + S+G + L T + S+ PV + + Y D
Sbjct: 635 DPYVCVRSSEGRVITLALREGKGTPRLAVNKNTISASPPVIAISAYRDVSGMFTRKLEDS 694
Query: 716 ----KG---------------PEPWLRKTSTDAWLSTGVGEAID---GADGGPLDQG--- 750
KG PEP ++ + L G + AD D+G
Sbjct: 695 FDVSKGGGATSAYSSGFGSMKPEPNMKIEDEEDLLYGESGRSFKVTSMADMALADKGGGN 754
Query: 751 -------------DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL- 796
+ + ++G LEI+ +P+ + + +G + D+ L
Sbjct: 755 ADFWLKYMQQIKPTYWLLAARDNGNLEIYSMPDLKLAYLISNVGNGNKVLSDSMEFVPLP 814
Query: 797 --KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
K ++ ++S G G S+ E+ M ++ SRP LF I + +L Y+
Sbjct: 815 MAKPGTSQEEATSAFGASFGSGGVPVSLLPKEILMVALGSYGSRPILF-IRLEQDLLIYR 873
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR- 913
+ + + S+ + V A RL NL + A T P+G Q
Sbjct: 874 VFRYAKGHLKLRFKRLTSSVTCPAFRTVPA-RLANLP-DKPATGATTDATEPNGKDTQEH 931
Query: 914 -----------ITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLH 961
I F N+SG+ G + G +P + + LR H + AF +
Sbjct: 932 ATKVQYENISMIRYFGNVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFN 991
Query: 962 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
NVNC +GF+Y Q LKI LP+ +YD+ WPV+K+
Sbjct: 992 NVNCPNGFLYFDEQYQLKISILPTYLSYDSVWPVRKI 1028
>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
CRA_a [Homo sapiens]
gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
CRA_a [Homo sapiens]
gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
CRA_a [Homo sapiens]
Length = 1365
Score = 286 bits (733), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 202/620 (32%), Positives = 322/620 (51%), Gaps = 80/620 (12%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYA 346
++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLT 226
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 227 TGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDK 286
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADA 461
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE +
Sbjct: 287 AAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRV 346
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + +
Sbjct: 347 DATAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAV 402
Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
G L I + + + + K ++V ELPGC +WTV
Sbjct: 403 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 462
Query: 555 --------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVD 604
+ G + S DD+ H +LI+S E TM+L+T + E+ S
Sbjct: 463 PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-G 521
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
+ QG T+ AGN+ R ++QV G R+L+G L F P + + ++
Sbjct: 522 FATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQC 571
Query: 665 SIADPYVLLGMSDGSIRLLV 684
++ADPYV++ ++G + + +
Sbjct: 572 AVADPYVVIMSAEGHVTMFL 591
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 706 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 761
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 762 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 811
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 812 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 871
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 872 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 931
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 932 PWPVRKI 938
>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1439
Score = 286 bits (731), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 219/726 (30%), Positives = 349/726 (48%), Gaps = 105/726 (14%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ +C H + P ++ NLVV
Sbjct: 2 YAFYREIHPPTGVEHC---VYCHFFS------------------PDQQ------NLVVAK 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+ + +Y + + + +K + + K + LE + + G V S+ Q
Sbjct: 35 GSELTVYSM-ITVDSNKPTDKESKPKNK----------LEEAATFHIFGKVMSM----QS 79
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
RD+++L+F +AK+S++E+D ++H L+ SMH FE E K G P+
Sbjct: 80 AQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHPV 136
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
VKVDP RC +L YG ++++L + GLV D D S + S+VI L ++D
Sbjct: 137 VKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEMD 194
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ +V D F+HGY EP ++IL+E TWAGRV+ + TC I ALS++ K HP+IWS
Sbjct: 195 DPICNVLDIQFLHGYYEPTLLILYEPLRTWAGRVAVRQDTCSIVALSLNMAQKVHPIIWS 254
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS----SQELP 356
+LP+D ++ AVP PIGGVL++ N++ Y +QS + Y VSL+S S P
Sbjct: 255 QSSLPYDCMQVQAVPKPIGGVLILAVNSLLYLNQS------IPPYGVSLNSLTDWSTAFP 308
Query: 357 ---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + +D AT++ D LS K G++ +LT++ DG R V+ L K SVLT
Sbjct: 309 LKTQEGVKLSMDCTQATFISYDRLALSLKDGEIYVLTLLVDGMRSVRGFHLDKAAASVLT 368
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ I +G+ FLGSRLG+SLL+++T + S K E + PS K +S
Sbjct: 369 TCICPMGDGFLFLGSRLGNSLLLKYTEKVSETSPSDASKTEEPKPGEEPPSKKMRSDDAS 428
Query: 473 DALQD----MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
D + + + +EL +YG T + ++SF + DSL+NIGP + G
Sbjct: 429 DWMASDTKFLDDPDELEVYGKQVQKTGTQLTSYSFEICDSLLNIGPCGNMIMGEPAFLSE 488
Query: 522 -LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTV--YHKSSRGH 561
+ N D + +G K +++ LPGC +WTV K+
Sbjct: 489 EFQGNVDPDLELVTTSGYGKNGALSVLQRTIRPQVVTTFNLPGCLDMWTVKSLKKAKADE 548
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
++ S + D + HA+LI+S + +MVL+T +TEV + Q TI A N+ R
Sbjct: 549 KSEESETSPEDKDRHAFLILSKQDSSMVLQTGQEITEVAAG-GFSTQAPTIFASNMGDDR 607
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
++QV + +++G Q + S + S+ADPY+LL +G
Sbjct: 608 YIVQVMNKSICLMEGVEQIQHMVL----------DVGSPIKQCSLADPYLLLLTENGDPI 657
Query: 682 LLVGDP 687
L+ P
Sbjct: 658 LMTLKP 663
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 112/257 (43%), Gaps = 32/257 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V C E+G LE++ +P+ F V F G +VD S S TG
Sbjct: 776 WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 820
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+++ +V E+ + + ++ A++ D I+ Y+A+ P NT + +
Sbjct: 821 DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 875
Query: 873 TSRSLSVSNVSASRLRNLRFSRTP----------LDAYTREETPHGAPCQRITIFKNISG 922
R + + + + R S+ P + R+ F N+
Sbjct: 876 V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKTETETEAESDTKTQTRRVNRLREFHNVQT 934
Query: 923 HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+SGS P W V R LR HP DG+I F HNVNC +GF+Y + L+IC
Sbjct: 935 YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 994
Query: 982 QLPSGSTYDNYWPVQKV 998
LPS +YD WPV+KV
Sbjct: 995 VLPSHLSYDAPWPVRKV 1011
>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
[Macaca mulatta]
Length = 1436
Score = 286 bits (731), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 216/704 (30%), Positives = 343/704 (48%), Gaps = 116/704 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424
Query: 468 RRSSS------DALQDMVNGEELSLYGSASNN--------------------TESAQKTF 501
++S QD V+ E+ +YGS + + ++ Q+
Sbjct: 425 DATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLATYSFEVRLRQQGPHPSQCPQRPL 482
Query: 502 SFAVR---DSLVNIGPLKDFSYG--LRINADASATGISKQSNYELV-------------- 542
+FAV DS++NIGP + + G ++ + S + + E+V
Sbjct: 483 TFAVPQVCDSILNIGPCANAAMGEPAFLSEEVPRVVNSPEPDLEIVVCSGHGKNGALSVL 542
Query: 543 ------------ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
ELPGC +WTV + G ++ A DD H +LI
Sbjct: 543 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 602
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
+S E TM T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 603 LSREDSTM---TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 655
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 656 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 692
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 52/87 (59%), Gaps = 1/87 (1%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
R F++I G+ G F+ G P W +V R LR+HP DG + +F HNVNC GF+Y
Sbjct: 923 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLY 982
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 983 FNRQGELRISVLPAYLSYDAPWPVRKI 1009
>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
tauri]
gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
[Ostreococcus tauri]
Length = 1473
Score = 285 bits (730), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 226/848 (26%), Positives = 364/848 (42%), Gaps = 176/848 (20%)
Query: 260 MVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+ IL+E+ TWAGR + TC I ALS+ ++ +IW NLP +YKL A+ P+G
Sbjct: 6 LAILYEKTPTWAGRYNLAKDTCEIVALSVDVDKQKSTVIWRRQNLPSSSYKLTALLPPLG 65
Query: 320 GVLVVGANTIHYHSQSASCALALNNYA------------------VSLDSSQELPRS--- 358
GVLV + + + SQ +S AL LN + + D+ P +
Sbjct: 66 GVLVFSQDFLLHESQESSSALCLNTFGRGGPQEGNDAETVARLAGMGEDAVANPPPACAA 125
Query: 359 -----SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
+ LD A A + D L++TK G L LL + DGR ++R+ L + +VL+S
Sbjct: 126 RAVDCGLEITLDGAQAAVVSEDRVLVTTKMGALFLLALHTDGRSLRRMMLQRAGGAVLSS 185
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSSGLKEEFGDIEADAPSTKRLRRS 470
+ + L FLGSR+GDSLLV+FT S + ML G +E E + S KR +
Sbjct: 186 GMCLLSRDLLFLGSRIGDSLLVKFTPKSEPAAPLMLPKGEDDEETVDEVEKGSGKRSKSG 245
Query: 471 SSDALQDMV-----------------NGEELS--LYGSAS-------NNTESAQKT---- 500
A++ + +EL LYG+ T++A+K
Sbjct: 246 DGAAIRKRAKSTEDPPPAPSTPSPEDDDDELEALLYGTTKAESVIGDETTQTAEKKREGL 305
Query: 501 -----------FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQ---- 536
+ F V+DSL+ + P+ D + G + D +A G K
Sbjct: 306 AGVVPGLKVAGYDFKVKDSLLGVAPVVDITVGASAPVGTDTAERTELVTACGQGKNGALA 365
Query: 537 -----------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
+ E LP +G+W ++ + + +R + +H +L++ L+
Sbjct: 366 ILTRGVQPELVTEVEAGTLPTLQGLWALHDRK------EGTR--EVREPFHNHLLLKLQ- 416
Query: 586 RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
EV+ S+++ T+AA N FG +Q+ E RIL QD++
Sbjct: 417 ------------EVSASLEFITDQATLAAANFFGHFCSLQITETSIRILKSGMKVQDVTL 464
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ GS + S I DPY+++ +SDG++RLL GD TVS+ A+ +S +
Sbjct: 465 ADIKAPKGS-----VIASAEILDPYIMIRLSDGTLRLLAGDEKKMTVSLMESGAMPTSSR 519
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
G W+ +++T+ ++ G GA +Q + + E G+LE+
Sbjct: 520 RTRLVEALKKSG---WIHRSATNGTITGLEGSKKSGAS----NQKEAIVAIAREGGSLEL 572
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
F +P+ ++ D G + T SE I ++V
Sbjct: 573 FSLPSCTRIWNADGLSEGSRVLSPTRPVH----SELRIP------------------EIV 610
Query: 826 ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
++ + + H RP L A+ DGT+L Y+ ++ S++P++
Sbjct: 611 DIRIDSFEEAHERPLLTAVRGDGTLLLYRGFIVPAGTTCEGSEEPLARG----------- 659
Query: 886 RLRNLRFSRTPLD-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
LRFSR +D A ++ G RI+ G QG F++G
Sbjct: 660 ---ELRFSRVNIDVEGSGLNVAGVGVAGQVRDSLAGTRLTRISNVGEGQGLQGIFVAGPN 716
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W +V R R+ P +G IVAFT HNVNC +GFI T+ G ++ICQ+PS Y+
Sbjct: 717 PLWLIVRRSRVLALPTRGEGEIVAFTDFHNVNCPYGFILGTAVGGVRICQMPSKMHYEAA 776
Query: 993 WPVQKVVF 1000
WPV+K+
Sbjct: 777 WPVRKIAL 784
>gi|431908147|gb|ELK11750.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 671
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 212/661 (32%), Positives = 328/661 (49%), Gaps = 114/661 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 309
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT+V DG R V+ K
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLVTDGLRSVRAFHFDKAA 369
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC----GSGTSMLSSGLKEEFGDIEADAPS 463
SVLTS + T+ FLGSRLG+SLL+++T +++ + KEE P
Sbjct: 370 ASVLTSSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPASTVREAADKEE--------PP 421
Query: 464 TKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
+K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP
Sbjct: 422 SKKKRVDSTVGWSGGKSVAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPC 477
Query: 516 K-------------------------DFSYGLRINADASATGISKQSNYELV-------- 542
+ + GL + + S + + E+V
Sbjct: 478 ANAAMGEPAFLSEEVPVWEVQGGGGVECTVGLWPHPSLAQFQNSPEPDLEIVMCSGYGKN 537
Query: 543 ------------------ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDDE 574
ELPGC +WTV ++ +G + S+ A D
Sbjct: 538 GALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEQEETPKGEAVEPEPSAPDADDDGR 597
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L
Sbjct: 598 RHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLL 656
Query: 635 D 635
+
Sbjct: 657 E 657
>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
musculus]
Length = 1311
Score = 284 bits (726), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 196/601 (32%), Positives = 310/601 (51%), Gaps = 68/601 (11%)
Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
+RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G P V+VDP
Sbjct: 11 KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPRVRVDPD 67
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HV 246
GRC +L+YG ++++L + + +E G G + S++I++R LD K ++
Sbjct: 68 GRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNI 124
Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K HP+IWS +LP
Sbjct: 125 IDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPF 184
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELD 365
D + LAVP PIGGV++ N++ Y +QS +ALN+ + + + LD
Sbjct: 185 DCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLD 244
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFF 424
A A ++ D ++S K G++ +LT++ DG R V+ K SVLT+ + T+ F
Sbjct: 245 CAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 304
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-----SSDALQDMV 479
LGSRLG+SLL+++T SS E D E KR+ + QD V
Sbjct: 305 LGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVGWTGGKTVPQDEV 362
Query: 480 NGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------L 522
+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G L
Sbjct: 363 D--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDL 418
Query: 523 RI------NADASATGISKQSNYELV---ELPGCKGIWTVYH----------KSSRGHNA 563
I + + + + K ++V ELPGC +WTV K+
Sbjct: 419 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQE 478
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
S+ A D H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R +
Sbjct: 479 PSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYI 537
Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+QV G R+L+G L F P + + ++ ++ADPYV++ ++G + +
Sbjct: 538 VQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMF 587
Query: 684 V 684
+
Sbjct: 588 L 588
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 703 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 758
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 759 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 808
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 809 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 868
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 869 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 928
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 929 PWPVRKI 935
>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Anolis carolinensis]
Length = 1294
Score = 283 bits (725), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 213/680 (31%), Positives = 336/680 (49%), Gaps = 89/680 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + +S S E K LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDSESTTKSDRSSEGKSH-------KEKLELVAAFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + DE G G + S++I
Sbjct: 135 NVHIPKVRVDPDGRCAVMLIYGTRLVVLPFRRD---TLTDEHEGVVGEGQKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F++GY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DIRELDEKLLNIIDMQFLYGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSLNSLTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTVFPLRIQEGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT+ + T+ FLGSRLG+SLL+++T +++ K+ E KR
Sbjct: 366 AASVLTTCMITMDPGYLFLGSRLGNSLLLRYTEKLQEPPVNAA-KDATEKTEEPPVKKKR 424
Query: 467 LRRSSS-----DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ + ++ A QD V+ E+ +YGS + + + T+SF V DS++NIGP + + G
Sbjct: 425 VEQQANWAGGKSAPQDEVD--EIEVYGSEAQSG-TQLSTYSFEVCDSILNIGPCANAAMG 481
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 482 EPAFLSEEFQNSLEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 541
Query: 557 SSRGHNADSSRMAAY----------DDEYHAYLIISLEARTMVLETADLLTEVTESVDY- 605
D+ +A D + H +LI+S E TMV + +D
Sbjct: 542 QKAEQEEDAQGESAEKEPSPPEPPDDGKRHGFLILSREDSTMVNPANGPTGQEIMELDTS 601
Query: 606 -FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
T AGN+ R ++QV G R+L+G L F P + S ++
Sbjct: 602 GLAPRSTQDAGNIGENRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQC 651
Query: 665 SIADPYVLLGMSDGSIRLLV 684
++ADPYV++ S+G + + V
Sbjct: 652 AVADPYVVIMSSEGQVTMFV 671
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/227 (29%), Positives = 104/227 (45%), Gaps = 19/227 (8%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ E+G +EI+ +P + VF V F G+ +VD+ + +E + EE
Sbjct: 786 WCVLVRENGTMEIYQLPEWRLVFLVKNFPMGQRVLVDSSFGQPASQAE----AKKEEVIR 841
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
Q + + +V L ++ SRP+L + D +L Y+A+ + S+
Sbjct: 842 QTEMPLVKEVLLVALGNRQ-----SRPYLL-VHVDQELLIYEAF-----NHDSQLGQTNL 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREE--TPHGAPCQRITIFKNISGHQGFFLSG 930
R V + R + R S+ ++ EE P G R F++I G+ G F+ G
Sbjct: 891 KVRFKKVPHNINFREKKPRPSKKKTESAGGEEASVPRGR-VARFRYFEDIYGYSGVFICG 949
Query: 931 SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
P W +V R LR+HP DG I +F HNVNC GF+Y QG
Sbjct: 950 PSPHWLLVTSRGALRLHPMTIDGPIESFAPFHNVNCPKGFLYFNRQG 996
>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
[Harpegnathos saltator]
Length = 1214
Score = 281 bits (720), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 232/835 (27%), Positives = 371/835 (44%), Gaps = 128/835 (15%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
M +V D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + HP+IWS
Sbjct: 1 MDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVHPIIWSVS 60
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQELPRSSFS 361
NLP D Y+ + V P+GG L++ N++ Y +QS ++LN+ A + + P+
Sbjct: 61 NLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTNFPLRPQDGVK 120
Query: 362 VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGN 420
+ L+ A +L D ++S K+G+L +L++ D R V+ K SVLTS + +
Sbjct: 121 ISLEGAQVAFLSADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASVLTSCVCMCED 180
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRRS------SSD 473
+ FLGSRLG+SLL++FT ++ S E D + + P K+ ++ +SD
Sbjct: 181 NYLFLGSRLGNSLLLRFTEKEPETIKSLDDGEINIEDNDNEEPPAKKAKQDFLGDWMASD 240
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----------R 523
L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S G
Sbjct: 241 VL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFLSEEFAHN 298
Query: 524 INAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
N D + +G K ++ ELPGC+ +WTV G + ++
Sbjct: 299 QNPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI-----GSLNNDEQV 353
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
+ + HA+LI+S E TMVL+T + EV +S + QG T+ AGNL R ++QV +
Sbjct: 354 KSETEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLGANRYIVQVTQ 412
Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
G R+L G Q + ++ S ADPYV+L DG + LL
Sbjct: 413 MGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVILLSEDGQVMLLTLREV 462
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD--------------------------------- 715
T + A + + + Y D
Sbjct: 463 RGTAKLHAQTANLLFRPQIEALCAYRDVSGIFTTQLPESAEDEQTEEEHNVEEPSIIGNI 522
Query: 716 -------KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY-ESGALEIFD 767
G P + + + G + I + Y ++ Y +SG LEI+
Sbjct: 523 DNEDDLLYGDAPAFQMPTPSHPKTDGTTKKIPWWQKHLQEIKPTYWLLVYRDSGTLEIYS 582
Query: 768 VPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVEL 827
+P+ + + F G+ + D+ L+ + + E ++V E+
Sbjct: 583 LPDLRLSYLIRNFGYGQYMLHDSMESTTLQSAPINETLNPE-------------LQVREV 629
Query: 828 AMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRL 887
M H +RP L L D + YQAY + P+ K L + +
Sbjct: 630 LMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGHLK----------LRFKKLDHGII 676
Query: 888 RNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSRPCWCMVF-RERL 943
SR P E+ P A RI + F NI+G+ G F+ P W + R L
Sbjct: 677 PG-HLSRKP----KEEDVPVNANETRICMMRYFSNIAGYNGVFICSDYPHWIFLTGRGEL 731
Query: 944 RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
R HP DGS+ +F +N+NC GF+Y + L+IC LP+ +YD WPV+KV
Sbjct: 732 RTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKV 786
>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
[Bos grunniens mutus]
Length = 1417
Score = 280 bits (717), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 206/635 (32%), Positives = 320/635 (50%), Gaps = 89/635 (14%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LELV + GNV S+A + GA +RD+++L SV+E+D H L+ S+
Sbjct: 65 KLELVASFSFFGNVMSMASVQLAGA----KRDALLL-------SVVEYDPGTHDLKTLSL 113
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLV 214
H FE PE L+ G P V+VDP GRC +L+YG ++++L ++ GLV
Sbjct: 114 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 170
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
G+ G + S++I++R LD K ++ D F+HGY EP ++IL E TW G
Sbjct: 171 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 222
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
RV+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV++ N++ Y
Sbjct: 223 RVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 282
Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
+QS +ALN+ + + + LD A A ++ D ++S K G++ +LT+
Sbjct: 283 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 342
Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
+ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+
Sbjct: 343 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 401
Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
E D E KR+ + S QD V+ E+ +YGS A + T+ A T+SF
Sbjct: 402 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 456
Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
V DS++NIGP + + G L I + + + + K ++V
Sbjct: 457 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 516
Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
ELPGC +WTV + G + A DD H +LI+S E TM+
Sbjct: 517 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 576
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 577 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 632
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 633 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 660
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 775 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 830
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 831 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 880
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 881 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 940
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 941 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1000
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1001 PWPVRKI 1007
>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Nomascus leucogenys]
Length = 1177
Score = 280 bits (716), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 202/651 (31%), Positives = 321/651 (49%), Gaps = 115/651 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++ T++L
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKY--------------------------TEKL 400
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
+ + A+++ + EE S +K R++A
Sbjct: 401 QEPPASAVREAADKEE----------PPSKKK-----------------------RVDAT 427
Query: 528 ASATGISKQSNYELV----ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD- 573
+G ++S V ELPGC +WTV + G + S A DD
Sbjct: 428 VGWSGEGQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTPEADDDC 487
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+
Sbjct: 488 RRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRL 546
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L+G L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 547 LEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 587
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 702 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 757
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 758 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 807
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 808 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 867
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 868 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 927
Query: 992 YWPVQKV 998
WPV K+
Sbjct: 928 PWPVXKI 934
>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Saimiri boliviensis boliviensis]
Length = 1390
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 203/645 (31%), Positives = 322/645 (49%), Gaps = 105/645 (16%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--------------------------GGV 321
++ T K HP+IWS +LP D + LAVP PI GGV
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGENPGGAEGSAGRGAVSLPTSLCPPPGGV 226
Query: 322 LVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
++ N++ Y +QS +ALN+ + + + LD A AT++ D ++S
Sbjct: 227 VIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVIS 286
Query: 381 TKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 287 LKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTE 346
Query: 440 G----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNT 494
+++ +G KEE + +T QD V+ E+ +YGS A + T
Sbjct: 347 KLQEPPASAVREAGDKEEPPSKKKRVDATAGWSAGGKSVPQDEVD--EIEVYGSEAQSGT 404
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATG 532
+ A T+SF V DS++NIGP + + G L I + + +
Sbjct: 405 QLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSV 462
Query: 533 ISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYL 579
+ K ++V ELPGC +WTV + G + S A DD H +L
Sbjct: 463 LQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEENPKGEGTEQEPSTPEADDDSRRHGFL 522
Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
I+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 523 ILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNVGDDRYIVQVSPLGIRLLEG--- 578
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 579 VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 123/264 (46%), Gaps = 49/264 (18%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 731 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 787 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
LS N+ +RF + P + RE E GA + R
Sbjct: 830 ----LSQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSAEEGAGARGRVARFR 879
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG + +F HN+NC GF+Y
Sbjct: 880 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNR 939
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 940 QGELRISVLPAYLSYDAPWPVRKI 963
>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 isoform 2 [Cavia porcellus]
Length = 1387
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 210/676 (31%), Positives = 333/676 (49%), Gaps = 88/676 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + + A + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + I ++SV+E+D H L+ S+H FE PE L+ G
Sbjct: 83 ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T S+ E D E KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPAST--VREAADKEEPPSKKKRV 423
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479
Query: 522 --------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH--- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 480 EPAFLSEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVR 539
Query: 556 ------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+ G + S A DD H +LI+S E TM+L+T + E+ S + Q
Sbjct: 540 KEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQ 598
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
G T+ AGN+ R ++QV G R+L+G L F P + + ++ ++AD
Sbjct: 599 GPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVAD 648
Query: 669 PYVLLGMSDGSIRLLV 684
PYV++ ++G + + +
Sbjct: 649 PYVVIMSAEGHVTMFL 664
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 779 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 834
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 835 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 884
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 885 VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 944
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 945 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1004
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1005 PWPVRKI 1011
>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Papio anubis]
Length = 1389
Score = 271 bits (693), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 204/646 (31%), Positives = 321/646 (49%), Gaps = 108/646 (16%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-------------------------GGVL 322
++ T K HP+IWS +LP D + LAVP PI GGV+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGEYPGSGWGCVEGALSLPTSLCPPPGGVV 226
Query: 323 VVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
V N++ Y +QS +ALN+ + + + LD A AT++ D ++S
Sbjct: 227 VFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRIQEGVRITLDCAQATFISYDKMVISL 286
Query: 382 KTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 287 KGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-- 344
Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS------DALQDMVNGEELSLYGS-ASNN 493
+ E D E KR+ ++S QD V+ E+ +YGS A +
Sbjct: 345 EKLQEPPASAVREAADKEEPPSKKKRVDATASWSAGGKSVPQDEVD--EIEVYGSEAQSG 402
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASAT 531
T+ A T+SF V DS++NIGP + + G L I + + +
Sbjct: 403 TQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALS 460
Query: 532 GISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAY 578
+ K ++V ELPGC +WTV + G ++ A DD H +
Sbjct: 461 VLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGF 520
Query: 579 LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 521 LILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-- 577
Query: 639 MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 578 -VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 615
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 115/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 730 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 785
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 786 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 835
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 836 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGXRGRVARFRYFEDIYGYSGVFICGPS 895
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 896 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 955
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 956 PWPVRKI 962
>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
Length = 1626
Score = 271 bits (692), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 313/1247 (25%), Positives = 478/1247 (38%), Gaps = 314/1247 (25%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +H PT + +C + ++TH++ Q P+P+LVV +
Sbjct: 7 AVCTQVHPPTAVTHCTAAWLTHAQRQ--------QGSGSADGDDGGGSGDPLPDLVVVRS 58
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
+E+Y VR E G + ++ A SL+ + RL G ES+A+L +G
Sbjct: 59 TQLELYSVRGSEAGGPATTHT-------------AQSLDQLASCRLFGVAESVAVL-RGR 104
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
A +RD ++L F DAK+SVL +D H L +S+H FE LK GR F PL
Sbjct: 105 APG--QRDVLLLTFRDAKLSVLHWDAGRHELAPSSLHYFEGDA--SLKLGRTVFPYPPLA 160
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG-------------------- 223
DP GRCG V+++ Q+ +L A D + FG G
Sbjct: 161 VTDPLGRCGAVIIFRHQLAVLPAV--------DSELFGLGLSAAEEDEEEAAATAALGLA 212
Query: 224 --------------------------GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIE 257
+A + +S+V NL +K V+D F+HGY E
Sbjct: 213 PPDGGGAADGEAGAPRGGAAAAAAGLPAAAAAVGNSYVDNLGKAGIKEVRDACFLHGYSE 272
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
PV+++LHE E TWAG + K TC+++ALS++ T K HP IW A LP DAY+L A +P
Sbjct: 273 PVLMVLHEAEPTWAGNLRQKKDTCVLTALSLNLTRKHHPKIWGAQELPSDAYRLSA--AP 330
Query: 318 IGGVLVVGANTIHYHSQSASCALALN---------------------------------- 343
GGVLV+ + + ++ Q + L+
Sbjct: 331 CGGVLVLCQHLVLHYRQGQQSGVVLHPSALPPAAAPPPLLFDPQAMAEAGGPGPASAAYA 390
Query: 344 -NYAVSL------------DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
+AV + D+SQ ++ V D A WL + ALL ++G L+ L
Sbjct: 391 RQHAVDVHPETVPAAVRFCDASQA---AALKVTADGASVCWLSPESALLCLRSGQLLQLA 447
Query: 391 VVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS----------------------LFFLG 426
++ G + L +++ + S ++ + L FLG
Sbjct: 448 LLPQQAGGSARHLAVARAGAAPHPSCCCSLSGAHRAPHMPGSAAAAAAGQAPQPALVFLG 507
Query: 427 SRLGDSLLVQFT----CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--- 479
S GDSLLV+ T G+ ++ D AD P++KRLR +
Sbjct: 508 SAAGDSLLVRATPAAAAGTKRPAEAATGAAGEEDGTADEPASKRLRLEGIEVGSAAAAVE 567
Query: 480 --------------------------------NGEELSLYGSA--------------SNN 493
EE +YG+A +
Sbjct: 568 ATAAAAAAAQGAAAAAAEARAAAGGGPAGSDSEDEEALIYGTALYSSAAGVAPAAAAAVP 627
Query: 494 TESAQ-KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWT 552
T S Q + + V DSL NIGPL+DF+ A A G + G G T
Sbjct: 628 TPSWQLQRYQLKVLDSLANIGPLRDFAVA---EPAAGAGGEAVPPALVGCSGEGKGGTLT 684
Query: 553 VYHKS----------------SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
V +S G + A + +HAYL++S + T VL T + L
Sbjct: 685 VLRRSVVPDVITEHRGAASASGGGSGQAAGEAAGQEGGHHAYLLLSFQGATKVLATGEEL 744
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD-----LSFGPSNSE 651
EVTESV++ V T+AAG++ RR+ Q F +G R+LDG QD L+ + +
Sbjct: 745 REVTESVEFAVDTPTLAAGSVCCGRRIAQAFPQGLRLLDGEESVQDVWASELAAPAAAAA 804
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQT--------------- 696
+G ++S + DPYVLL ++DG+ R L DP C +S +
Sbjct: 805 AGGAPGGGAIVSADMCDPYVLLYLADGTARFLTADPVACRLSAASAAGAGPEAAAAAEAA 864
Query: 697 -----PAAIESSKKPVSSCTLYHD-------KGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
P A E +++C+L+ D + P+ + G A
Sbjct: 865 EAALRPVAAEER---ITACSLFADSCGWLAARLPQTQQQTQQQQQQQGQQDGGTTAQAAA 921
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
G +Y+VVC SGA +++ +P + VF+ ++G ++ T A +
Sbjct: 922 SGGGCGAVYAVVCRASGACQLYALPAWQPVFSSSTSLAGGPALL-TGSGGAGGVAAAAAA 980
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-----------------PFLFAILTD 847
+++ E +VVE+ + + + P L A+ D
Sbjct: 981 AAAAAAAAGVEDEMDGPGEVVEVRLVSFGPAAAGRRDAAAARASPAPACEPPLLLALTAD 1040
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-----NLRFSRTPLDAYTR 902
+L YQA+ ++ T R + L LR R
Sbjct: 1041 HQLLAYQAFSASPGSGGTRGSSGSGTPRFRRLRLDLPPLLPPAGGPQLRLRRLHCFEGLG 1100
Query: 903 EETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD---------GS 953
EE P + G F++G P W + R L HP
Sbjct: 1101 EEAP----------------YSGVFVAGQHPHWLVASRGGLLPHPHFLPQPAGPGAAAVG 1144
Query: 954 IVAFTVLHNVNCNHGFIYVTS--QGILKICQLPSGSTYDNYWPVQKV 998
FT HNVNC HGFI TS + ++I QLP + D WP Q+V
Sbjct: 1145 AAGFTPFHNVNCPHGFIVATSGARSGIQISQLPPRTRLDAPWPRQRV 1191
>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 isoform 1 [Cavia porcellus]
Length = 1440
Score = 270 bits (691), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 209/678 (30%), Positives = 332/678 (48%), Gaps = 90/678 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + + A + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + I ++SV+E+D H L+ S+H FE PE L+ G
Sbjct: 83 ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 480 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 539
Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 540 VRKEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 598
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 599 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 648
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 649 ADPYVVIMSAEGHVTMFL 666
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 781 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 836
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 837 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 887 VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 946
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 947 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 1007 PWPVRKI 1013
>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Callithrix jacchus]
Length = 1394
Score = 266 bits (681), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 201/672 (29%), Positives = 322/672 (47%), Gaps = 124/672 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSAEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ ++ F C +G +
Sbjct: 367 ASVLTTSVSGTEG----------------FLCAAGGKSVP-------------------- 390
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------ 521
QD +E+ +YGS + + + T+SF V DS++NIGP + + G
Sbjct: 391 --------QD--EXDEIEVYGSETQSG-TQLATYSFEVCDSILNIGPCANAAMGEPAFLS 439
Query: 522 ----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 440 EEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEE 499
Query: 555 -HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
+ G + S A DD H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 500 ENPKGEGTEQEPSTPEADDDSRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 558
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV+
Sbjct: 559 FAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFVPVDL-------GAPIVQCAVADPYVV 608
Query: 673 LGMSDGSIRLLV 684
+ ++G + + +
Sbjct: 609 IMSAEGHVTMFL 620
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 735 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 790
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ S +
Sbjct: 791 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLSQGNLK 840
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 841 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 900
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 901 PHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 960
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 961 PWPVRKI 967
>gi|55725165|emb|CAH89449.1| hypothetical protein [Pongo abelii]
Length = 565
Score = 263 bits (672), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 182/536 (33%), Positives = 282/536 (52%), Gaps = 65/536 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 538
>gi|281205270|gb|EFA79463.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
Length = 1395
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 209/734 (28%), Positives = 340/734 (46%), Gaps = 109/734 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV+ +++++Y +R ++ + +++ D + LEL +L +ESL
Sbjct: 31 NLVIAKTSLLQVYTIRYDRIEQQQQQQQQTNEQQSQQDTLKPW-LELNLELQLFSIIESL 89
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G D DS+IL+F DAK+S+++++ + L I S+H FE LK GR++
Sbjct: 90 NCVRLPGDD----IDSLILSFRDAKVSIVKYNKATEKLDIRSLHYFEGNS--ELKGGRKT 143
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG-------------------DE 217
F PL++VD Q RC +L+Y + +L + S L DE
Sbjct: 144 FRTPPLIRVDYQQRCAVMLLYDRHLAVLPFPRSFSILDDEEEEEEEEAAVVADQQQQHDE 203
Query: 218 D-----------TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
+ S + S+VI+L L +++VKDF F+H Y EP ++ LHE
Sbjct: 204 NEQQQPQDDQQQQQTSEKNKKKKQSESYVISLNSLGIENVKDFCFLHTYYEPTLLFLHEP 263
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
TW R+S K T +++A+S++ +Q P+IWS +LP++ +L+ VP P+GG +V+
Sbjct: 264 SQTWTSRISSKKFTNVLTAVSLNIAQRQQPVIWSIEHLPYNCERLVPVPDPLGGAMVLTP 323
Query: 327 NTIHYHSQSASCALALNNYA-VSLDSSQELPRSSFSVE----LDAAHATWLQNDVALLST 381
N + Y +QS+ L N YA + + P S S LD A+ +L D L S
Sbjct: 324 NILFYFNQSSRYGLECNEYAQIDTGDQFQFPIDSSSTNLVFTLDCANFIFL-GDRLLGSL 382
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--- 438
K G+L++ ++ DGR VQR+ ++K SVL+S + ++L FLGSRLGDSLL+Q+T
Sbjct: 383 KGGELLIFHLISDGRNVQRISITKAGASVLSSTSCVLTDNLLFLGSRLGDSLLLQYTEKI 442
Query: 439 ----CGSGTSMLSSGLKE----EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
LS+ K+ E D+ D + S +D + +E ++
Sbjct: 443 IDVDSSDNVENLSNPYKKKKTSEVFDLFDDEERNSKTGASDADGNGQSLFDDEDDIF--- 499
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN-ADASATGISKQSNYELV------- 542
N+ ++ K++ + D + NIGP+ D G+ + A S +Q + ELV
Sbjct: 500 -NDKKNQLKSYRLNICDHITNIGPVSDLITGVSYDHASVSNDESFEQRSLELVACSGHGK 558
Query: 543 -------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
ELPG + WT+Y+ + S + +
Sbjct: 559 NGALTILQYGVRPELNTSFELPGVRQSWTLYYDDPLAASQSGSSASNAAASAASKKRQHE 618
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
E T+V +T L EV + TI N+FGRRR+ V + G ++L G S +TQ+
Sbjct: 619 EDSTLVFQTGGQLKEVAK-----FDHATITVANMFGRRRIALVHQNGIKLLSGHSNITQE 673
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS-TCTVSVQTPAAIE 701
+ +V I DPYVL+ DG+I L G+ T + + P
Sbjct: 674 IKL-------------KSVKMAYIVDPYVLILHKDGTISLYQGNTGITQLLEYELPQP-- 718
Query: 702 SSKKPVSSCTLYHD 715
K V SC+++HD
Sbjct: 719 --KDGVMSCSMFHD 730
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 90/184 (48%), Gaps = 26/184 (14%)
Query: 823 KVVELAMQRW-SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
K+VE+ + ++ HS P+L + G IL Y+A K D + ++ L
Sbjct: 872 KIVEIVIHYLHNSPHSSPYLMILNEFGDILIYKAI---------KYKDSMDNTKEL---- 918
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPC 934
+R ++ + L + RE + P ++I F NI GH+G F+ G R
Sbjct: 919 -----IRFIKHTDQNLHSKQREYSYGIDPSSESSFYIRKIVAFDNIGGHKGVFMCGKRSL 973
Query: 935 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
W + LR HP + +FT HN+NC++GFIY T +G+L+I QL + ++N W
Sbjct: 974 WFFCEKNYLRAHPMNFKDPVTSFTCFHNINCSYGFIYFTEKGVLRINQLSNMMNFENEWA 1033
Query: 995 VQKV 998
++K+
Sbjct: 1034 IRKI 1037
>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
Length = 1453
Score = 248 bits (634), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 198/738 (26%), Positives = 332/738 (44%), Gaps = 119/738 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS-AASLELVCHYRLHGNVES 115
NLV++ N +++Y + K KN T ++ + + SLEL+ +L G +ES
Sbjct: 31 NLVLSKNNTLQVYKI-------KYVKNENTTTQQKQIKKVEIKPSLELLIELKLFGTIES 83
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+A + G + +DS++L F DAKISVL+++ I I S+H +E+ E+ K GR
Sbjct: 84 MASVRYPGEN----KDSLLLTFRDAKISVLDYNIDIMDFEIRSLHFYENDEF---KNGRI 136
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
F P++K+D Q RC +L+Y +++L Q S L +++ ++
Sbjct: 137 HFKHPPILKIDTQQRCATMLLYDRNIVVLPFKQISSILDDEDEEEKDEEDEKENDNANQD 196
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
D +F F++GY EP ++ LHE TW R++ K T ++A+SI+ + K
Sbjct: 197 YTEEFDDDDDDNNFCFLYGYYEPTILFLHEPSQTWTSRIAVKRLTSQLTAISINFSTKLA 256
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA-VSLDSSQE 354
+IW N+P++ +L++VP P+ G LV+ N + + +Q++ LA+N YA + + E
Sbjct: 257 SIIWHTSNMPYNCDQLVSVPEPLSGALVITPNIMFHVNQTSKYGLAVNEYANIDIGDKFE 316
Query: 355 LPRS---SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
P + LD ++ +L+ D + S K G+L++ ++ DGR VQR+ +SK SVL
Sbjct: 317 FPLDETLNLVFTLDRSNFVFLEADKFIGSLKGGELLIFHLISDGRTVQRIHVSKAGGSVL 376
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
+ + + ++L FLGSRLGDSLL+Q+T S T +E + E + K+ + S
Sbjct: 377 ATCMCVVSDNLLFLGSRLGDSLLLQYTEKSIT--------DESLEHENFSNPYKKQKTSE 428
Query: 472 SDAL-----------QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ L D V EE L+ N +S Q + D ++N+GP+ D
Sbjct: 429 QEKLLNQQQQQQKDEMDEVLDEEDELFKEKKNQLKSYQ----LGICDQILNVGPVGDMVI 484
Query: 521 GLRINADASATGISKQSNY--ELVELPGCKG----------------------------- 549
G +N + Y +EL C G
Sbjct: 485 GQALNPTYDLNTLPSDPAYMPRFLELVTCSGYGKNGSISILQNSVKPEIVGAFDSEGVVN 544
Query: 550 -IWTVYHKSSRGHNADSSR------------------------MAAYDDEYHAYLIISL- 583
WTVY+K+S D +++Y YL IS+
Sbjct: 545 SFWTVYNKASSSIKEDEEEKLIGKKRTINEIIKEEQQYEQQQQKQPIEEDYLDYLYISMS 604
Query: 584 EARTMVLETADLLTEVTESVDYF---VQGRTIAAGNLFGRRRVIQVFERGARIL-DGSYM 639
T +L+T T E F + RT+ GNLF +RR++ + E ++L D + +
Sbjct: 605 NGTTNILDT----TSSEEGKLTFKGEFEYRTLDMGNLFNKRRIVLINENSIKLLNDYNNI 660
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
Q++ + S I DPYVL+ SD SI+L D ++ +
Sbjct: 661 VQEIKLS------------KPIKSTFIQDPYVLVHYSDNSIQLFKCDYKLLKLNQFNFSL 708
Query: 700 IESSKKPVSSCTLYHDKG 717
+ V + +L+ DK
Sbjct: 709 NHGDEGKVLTSSLFFDKN 726
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 43/182 (23%), Positives = 87/182 (47%), Gaps = 27/182 (14%)
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
++++VE++++ ++S+P+L G ++ Y+++ E + K + R LS
Sbjct: 876 ENLEIVEISLE--ILNNSQPYLLLKNRIGDLIVYKSFKKENGDLRFKKYNHNFILRDLSN 933
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S + D Y + + I K S + G F+ G +P W +F
Sbjct: 934 NSKSINS-----------DGYRK---------KSIVNIKLSSKNNGVFIGGQKPVW--IF 971
Query: 940 RER--LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQ 996
E+ +R+H DG+IV+ HN +C +GF+Y T + +KI L ++N + ++
Sbjct: 972 NEKGYIRLHSMDFDGAIVSLKPFHNADCPNGFLYYTEDKQHIKIGYLNGLMNFENEYAIR 1031
Query: 997 KV 998
+V
Sbjct: 1032 RV 1033
>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
Length = 2290
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 142/412 (34%), Positives = 222/412 (53%), Gaps = 38/412 (9%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---GISAASLELVCHYRLHGNV 113
NL V +E+YV++ +E+ ++ N + ++ D G A+L+ V Y L+GNV
Sbjct: 31 NLAVVKGTQLELYVLKEEEKKHSKTCNGKQNGQKAAGDSGHGHGGATLQCVGRYDLNGNV 90
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
ES+A + G R RD + L F DAK+S+LE+D+SI + S+H FE E +++G
Sbjct: 91 ESMAFVRLPG----RNRDHLFLVFRDAKLSILEYDNSIDDIVNVSLHLFEDDE---IRKG 143
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------------- 220
R SF R PL++VDP RC +LVY +M+++ GS L D++
Sbjct: 144 RVSFGRAPLLRVDPLQRCAALLVYESKMVVIPFKHKGSDLEEDDEILTQPNKKFKSESAS 203
Query: 221 -------GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
G+ I ++V++L + +KHV DF F+ GY EP + LHE TWAGR
Sbjct: 204 SNTVTRLGAPSDNKLGILPTYVVDLDEAGIKHVVDFTFLDGYYEPTISFLHENSRTWAGR 263
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
++ + T MI+ +S++ + ++ P+IWSA LPH++ ++A+P+P GGV+VV +N + Y +
Sbjct: 264 LAVSNFTGMITTVSLNISQRRQPIIWSASKLPHNSRHIVALPAPAGGVVVVSSNALIYRN 323
Query: 334 QSASCALALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
CAL LN YA++ D + + D H L+ L S TG+ ++ V
Sbjct: 324 HEQKCALKLNEYAIAAGDGGNRFDTAGDIICFDTVHPVRLEGYQMLFSLVTGESYIMGVQ 383
Query: 393 Y--DGRVVQRLDLS----KTNPS-VLTSDITTIGNSLFFLGSRLGDSLLVQF 437
DG ++ L L K +PS S + +G+S FLGSRLGDS LV+
Sbjct: 384 LDTDGNTIKALTLDLVDVKLSPSGGFASIMCRVGDSYLFLGSRLGDSSLVKM 435
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 135/314 (42%), Gaps = 77/314 (24%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------------ 542
+ F + D+L NIGP+ G R++A G K+ + ELV
Sbjct: 587 YRFELCDTLTNIGPI-----GSRLDA-----GAVKKDSVELVTASGGLQYGKLGVLQRSL 636
Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE----YHAYLIISL--EARTM 588
LP + +WTV+ +++ + D ++E HAY++IS + T+
Sbjct: 637 NPVVMTAVPLPDAQAVWTVFGPTAKAADEDMEEDGNEEEEQSAGMHAYMVISQGNDKGTI 696
Query: 589 VLETADLLT-EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
VL+ +L + E VD+ V +T+ GN+FG +R++QV +L+G Q+L
Sbjct: 697 VLKGRELEEFDEDEQVDFEVDAKTVCVGNIFGNQRIVQVTPWNVYVLNGPRKEQELPV-- 754
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
+G+G + +++ I DPY+ L + DG + LLVGD S+ V+ + +
Sbjct: 755 ---VAGNGLQ---IVAAYIRDPYIALILQDGRLNLLVGDASSMQVNY-----VSHEIHNI 803
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
++ + D P+ GEA D Q D+ +G +++
Sbjct: 804 TAACFFLDPIPD----------------GEANDDP-----QQRDVMLAAAPRNGHFQLYT 842
Query: 768 VPNFNCVFTVDKFV 781
+P+ V+ FV
Sbjct: 843 LPSLELVYDAADFV 856
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 8/92 (8%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD--GSIVAFTVLHNVNCNHGFI 970
R+ G +G ++ +P + R R+HP D + + +N+ C G +
Sbjct: 1068 RLMPLGGAGGLEGVLIAARQPAVVLFGRGLPRIHPWKLDRGEGVRSAARFNNLQCKDGIV 1127
Query: 971 YVT------SQGILKICQLPSGSTYDNYWPVQ 996
+ ++G+LKIC +P G + D WP++
Sbjct: 1128 CIADKGRDRAKGVLKICNIPEGISGDTPWPLR 1159
>gi|9794904|gb|AAF98386.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 507
Score = 237 bits (605), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 162/497 (32%), Positives = 258/497 (51%), Gaps = 49/497 (9%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRI 524
SL+N+ P+ G R+
Sbjct: 489 SLMNVAPINYMCAGERV 505
>gi|58702050|gb|AAH90169.1| LOC564406 protein, partial [Danio rerio]
Length = 416
Score = 229 bits (584), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 129/342 (37%), Positives = 203/342 (59%), Gaps = 14/342 (4%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
++LN+ + P+ + LD + A+++ +D ++S K G++ +LT++ DG R
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMR 355
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 356 SVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT 397
>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Hydra magnipapillata]
Length = 1122
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 193/664 (29%), Positives = 305/664 (45%), Gaps = 124/664 (18%)
Query: 57 NLVVTAANVIEIYVV----RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
NLV + +Y + V +G + SK ++D + LEL+ + L GN
Sbjct: 29 NLVTAGGQRLNVYRLCDADMVVSDGDQSSK---------IVDSVGKRRLELLASFTLFGN 79
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ ++ ++ G S RDS++LAF+ AK+S++EFD H L+ SMH FE+ E+ K
Sbjct: 80 IINMQVVRLG----SNVRDSLLLAFKHAKLSIVEFDPLSHDLKTDSMHYFENDEF---KG 132
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
G PLV+VDP+ RC +L+Y +++L + DE S G +
Sbjct: 133 GLSHNIYLPLVRVDPEQRCACMLIYNRHLVVLPFKHD---IKLDESEELSDGEHIKSVLP 189
Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I+L L+ + ++ + F+HGY +P ++ L E T GRV+ + T +SA+S++
Sbjct: 190 SYMIDLHSLEQPLLNITELQFLHGYHQPTLMFLFEPVQTSTGRVAVRQDTFCVSAISLNM 249
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
T K HP+IWS NLP D + L + PIGGVLV +N++ Y +QS + Y VSL+
Sbjct: 250 TEKVHPVIWSVTNLPFDCHMLRPIEKPIGGVLVFASNSLIYLNQS------IPPYGVSLN 303
Query: 351 SSQE----LP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
S E P + + L + + D +LS K G++ +L+++ DG R V+
Sbjct: 304 SITEGSTMFPLKIQEDVVITLAESSCDAIATDQFILSLKGGEIYVLSLLSDGLRTVRSFH 363
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
K SVL S + I + FLGSRLG+SLL+++T E D+
Sbjct: 364 FEKAAGSVLASCVCWIEHGFVFLGSRLGNSLLLRYT-------------------EKDSA 404
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
S +S ++ M G DSL+NIGP+ + G
Sbjct: 405 SIA--EKSKEAKVEKMYGGGVGGGIIVC----------------DSLLNIGPITKAALGE 446
Query: 523 RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
G S+Q + E+V ELPGC +WTV K
Sbjct: 447 PAFLSEEFFG-SRQIDLEMVCCSGYGKNGTLTVLQRSIRPQVVTTFELPGCVNMWTVCGK 505
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
SS+ + YH+YLI+S + TMVL+T +TE+ S + VQ TI A N
Sbjct: 506 SSKESV----------ENYHSYLILSRDDSTMVLKTGAEITELDNS-GFNVQQPTIFACN 554
Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
+ ++QV + +L+ + +S + + SI+DPYV++ S
Sbjct: 555 HLSNKYILQVCPQSIHLLEDTVQINSISL----------QDTIKITQCSISDPYVVMVDS 604
Query: 677 DGSI 680
G +
Sbjct: 605 TGQL 608
>gi|147799623|emb|CAN68460.1| hypothetical protein VITISV_027523 [Vitis vinifera]
Length = 558
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/175 (68%), Positives = 142/175 (81%), Gaps = 5/175 (2%)
Query: 424 FLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
F GS+LGDSLLVQFT S+ SS +++ GDIE B PS KR RRSSSDALQDMVNG++
Sbjct: 331 FEGSQLGDSLLVQFT-----SIPSSSVEKRVGDIEGBVPSAKRSRRSSSDALQDMVNGDK 385
Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE 543
L LYGSA N+TE++QKTFSF+V DSL+++GPLKDF+YGLRINAD ATGI KQ VE
Sbjct: 386 LPLYGSAPNSTETSQKTFSFSVNDSLIDVGPLKDFAYGLRINADLKATGIVKQKMITEVE 445
Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
LPGC+ IWTVYHK++RGHNADS++M DDEY AYLIIS E+RTMVLET +LL E
Sbjct: 446 LPGCERIWTVYHKNTRGHNADSTKMITKDDEYCAYLIISPESRTMVLETVELLGE 500
>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Ornithorhynchus anatinus]
Length = 831
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 215/738 (29%), Positives = 342/738 (46%), Gaps = 141/738 (19%)
Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I++R LD K ++ D F+HGY EP ++IL+E TW GRV+ + TC I A+S++
Sbjct: 8 SYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILYEPNQTWPGRVAVRQDTCSIVAISLNI 67
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS + Y VSL+
Sbjct: 68 LQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQS------VPPYGVSLN 121
Query: 351 S----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
S + P R + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 122 SLTAGTTAFPLRLREGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFH 181
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA- 461
K SVLT+ + T+ FLGSRLG+SLL+++T S +E D AD
Sbjct: 182 FDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPAGSA-REPARDSGADKQ 240
Query: 462 -PSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R + A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 241 EPPVKKKRVEQALSWAGGKSAAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 296
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 297 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 356
Query: 548 KGIWTVYHK-------SSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLT 597
+WTV S +G A+S D + H +LI+S E TM+L+T +
Sbjct: 357 YDMWTVIAPVRKEEGDSPKGEGAESEPTPPEPEDDGKRHGFLILSREDSTMILQTGQEIM 416
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 417 ELDTS-GFATQGPTVYAGNIGDDRYIVQVSPLGLRLLEG---VNQLHFIPVDL------- 465
Query: 658 NSTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPVSSCTL 712
S ++ ++ADPYV++ ++G + LL D T +++ P + S K ++ C +
Sbjct: 466 GSPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRTHRLALHKP-PLHSQSKVIALC-V 523
Query: 713 YHD-----------KGP--EPWLRKTSTDAWLSTGVGEAIDGADG--------------- 744
Y D GP +P LR S L + +D +
Sbjct: 524 YRDVSGMFTTESRASGPRDDPSLRGQSEAEPLLQELSHTVDDEEEMLYGDSSSLFSPSRD 583
Query: 745 -------GPLDQGDI--------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
P D+ + V+ ++GA+EI+ +P + VF V F G+ +VD
Sbjct: 584 EPRRSSLPPADRDAPQYRAEPTHWCVLVRDNGAMEIYQLPEWRLVFLVKNFPMGQRVLVD 643
Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAI----- 844
+ + S + + EE QG + + +V L ++ +RP+L +
Sbjct: 644 SSFGQPAA-SAAQAEAKKEEPARQGELPLVKEVLLVALGNRQ-----TRPYLLRLKWAIR 697
Query: 845 ---LTDGTILCYQAYLFE 859
LT T + Q Y+ +
Sbjct: 698 DSELTSITFIDMQLYIHQ 715
>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Amphimedon queenslandica]
Length = 1408
Score = 224 bits (571), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 235/880 (26%), Positives = 376/880 (42%), Gaps = 207/880 (23%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ +C S HS + V V +
Sbjct: 2 YAVYREVHPPTGVEHCTSCHFVHSEKEQV---------------------------AVAS 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+++ I+ V ++ +N G+ K L + HGN++SL +
Sbjct: 35 TSLLRIFDV------AQLQRNDGKAK------------LVQCLEFSFHGNIQSLDKVRLR 76
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
+D RD ++L+F DAK+S++E++ +GL+ SMH FE E ++ G P+
Sbjct: 77 HSD----RDCLLLSFNDAKLSIVEYNPETNGLKTVSMHQFEDEE---IRGGILHNDSRPV 129
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
VKVDP+GRC +L++G + + Q L D S + I ++ I+LRDL
Sbjct: 130 VKVDPEGRCAVMLLFGSHLAVCPFQQD---LSIDTPLSPSPSLDTHDILPTYTISLRDLP 186
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ +KD F+ GY P ++ L E TWAGR+S + + M+ LS++T+ K H +IW+
Sbjct: 187 EPLPVIKDMTFIEGYTSPTLLFLSEVSPTWAGRISLRQDSMMLLGLSLNTSDKSHTVIWT 246
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALN---NYAVSLDSSQELP 356
NLP D+ L VP P+GGVLV GANT+ Y +QS+ L+LN +Y E
Sbjct: 247 LKNLPFDSSYLHPVPKPLGGVLVFGANTLIYLNQSSPPYGLSLNSITDYTTRFLLKNE-- 304
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSD 414
S + LD + + ++ N+ L+S ++GD+ ++T+ D R V+R+ K S+L+S
Sbjct: 305 -GSLGIRLDCSQSVFISNEQLLVSLQSGDIYIVTLFPDSGMRGVKRITFDKAASSILSSC 363
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
I +I FLGSRL +SLL+++ ++ + + E G A
Sbjct: 364 ICSIKPHFLFLGSRLANSLLLRY-----STTVKQNIVEPIG-----------------GA 401
Query: 475 LQDMVNGEELSLYGSASNNTESAQK----TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ D+ +++ +YG ++ + ++ +S V DSL+ IGP+ + G A S
Sbjct: 402 ILDL---DDIEVYGESAVSQSTSSSSLLTNYSLEVCDSLLCIGPVVKATIGE--PAFLSE 456
Query: 531 TGISKQS-NYELV--------------------------ELPGCKGIWTV---------- 553
+ K + ELV ELPGC +WTV
Sbjct: 457 EFVDKSDLDLELVLCSGHGKNGALSVLQRTIRPQVVTTFELPGCIDMWTVKSEGEEEEKG 516
Query: 554 ----YHKSSRGHNADSSRMAAYDDE-YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+ G D SR H YLI+S TMVL+T +TE+ +S + Q
Sbjct: 517 EETKEEGQNEGGEKDQSREKEEKGSGQHDYLILSRSDSTMVLQTGQEITELDQS-GFATQ 575
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSV 664
T+ AGN+ ++Q R+L G Y+ D+ G V V
Sbjct: 576 SATVFAGNV--GSFIVQATRTDIRLLKGIKQLCYVALDMGGG--------------VKCV 619
Query: 665 SIADPYVLLGMSDGSIRLL--------VGDPST--------------------CTVSVQ- 695
+ PYV++ + +G I LL + PS T S+Q
Sbjct: 620 DVCSPYVIVLLMEGEIGLLKLVDESLVLSWPSLGNNTPVNHISAYTDTSGLFDVTSSLQF 679
Query: 696 ----------TPAAIESSKKPVSSCTLYHDK-----GPEPWLRKTSTDAWLSTGVGEAID 740
P A K+P S +L +D+ GP K + + +
Sbjct: 680 EGDGSEKEEEVPIAPPPVKRPHLSSSLLYDEDELLYGPVKTEVKEENASPMEASLAAE-- 737
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
+ P + ++C E GALEI+ VP F VF V F
Sbjct: 738 -PEAPPPITPTHWCLLCKEDGALEIYSVPEFQFVFAVRNF 776
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 1/83 (1%)
Query: 917 FKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
F NI+G+ G F+ G P W M R L +HP DG + +F NVNC GF+Y +
Sbjct: 894 FSNIAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGPVQSFAPFDNVNCPSGFLYFNKE 953
Query: 976 GILKICQLPSGSTYDNYWPVQKV 998
L+I LP+ +YD+YWPV+KV
Sbjct: 954 SELRISVLPTQLSYDSYWPVRKV 976
>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 880
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 161/497 (32%), Positives = 252/497 (50%), Gaps = 62/497 (12%)
Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 8 SYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNI 67
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAVSL 349
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 68 TQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGT 127
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 128 TAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAA 187
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SVLT+ + T+ FLGSRLG+SLL+++T S+ E D E KR+
Sbjct: 188 SVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVD 245
Query: 469 RS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 246 ATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 301
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHKS 557
L I + + + + K ++V ELPGC +WTV
Sbjct: 302 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 361
Query: 558 SR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 362 RKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFAT 420
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + S ++ ++A
Sbjct: 421 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVA 470
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 471 DPYVVIMSAEGHVTMFL 487
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 602 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 657
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 658 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 707
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 708 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 767
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 768 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 827
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 828 PWPVRKI 834
>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 1434
Score = 219 bits (559), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 246/990 (24%), Positives = 412/990 (41%), Gaps = 182/990 (18%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + H RL V+SLA+ + S++L F+ AK+SV+ F + L+ S+H
Sbjct: 107 LECIIHVRLLAPVKSLAV---ARIPQNPSCSSLLLGFDTAKLSVVGFSAAERSLKTISLH 163
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G + P+++VDP RC +L+YG + +L L
Sbjct: 164 CFEEE---MLKDGYVTDLPSPVIRVDPAQRCAVMLIYGRYLAVLPFDDTSPHL------- 213
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
++ + L +D + ++ D F+ GY EP ++ L+E T AGR ++
Sbjct: 214 -----------HTYTVALSSIDPRLVNIIDIAFLDGYYEPTLLFLYEPAQTTAGRACVRY 262
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
T + +S++T + H +W NLP D ++L +P PIGG L++GAN + Y +QS
Sbjct: 263 DTVCMLGVSLNTKEQVHASVWQLNNLPMDCNQVLMIPRPIGGALIIGANELIYLNQSVPP 322
Query: 338 CALALNNYAVSLDSSQELPRSS---FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
C LN+ +D + P S ++ LD A + + ++ ++G L +LT+V D
Sbjct: 323 CGSLLNS---CMDGFTKFPLKSEKEMALTLDGCAACVISTNKVVVCARSGALFILTLVVD 379
Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
V+ ++ + +T F+GSR+GDSL +++ S L
Sbjct: 380 STNSVKSIEFKHEFDVSIPHTVTACSPGYLFVGSRVGDSLFIEYV---------SEL--- 427
Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ---KTFSFAVRDSLV 510
+ D P K+L+ + QD + E+L LYG A + S + F V D ++
Sbjct: 428 ---VPVDDPIEKKLK---VEVPQDDLEDEDLELYGKALPSVISQDVSVEKMRFRVLDRML 481
Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIW--------------TVYHK 556
N+ P K + +G S+ N L E P ++ ++ +
Sbjct: 482 NVAPCKKMT-----------SGCSEGLNSYLQEQPRLDPVFDRVCACGHGKDSSICIFQQ 530
Query: 557 SSRGHNADSSRMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
S R SS + +D+ H Y+I S E ++ LET + L E+ V +
Sbjct: 531 SIRPDIITSSSIEGVIQYWAVGRREDDTHMYIIASKELGSLALETDNDLVELEAPV-FIT 589
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
TIAAG L +QV ++ Q + VLS SI
Sbjct: 590 SESTIAAGELADGGLSVQVTTSSIVVVAEGQQIQLIPL----------QLTFPVLSASIV 639
Query: 668 DPYVLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH----------- 714
DP+V + +G + L L P +V P I +K P+++ +Y
Sbjct: 640 DPFVAICTQNGRLLLYELDNTPHVHLKAVDLPGNIIHNKSPITALCIYRDMSGTIRFCSS 699
Query: 715 -------------------DKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP--------- 746
D + L S + I G P
Sbjct: 700 SSAASHGANAINTKQHIDIDDFDDMLLYGDSKNKQKEAKKKRKIVGTRQNPGETPHLETD 759
Query: 747 -LDQGDI----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREA--LKDS 799
+D I + V+ E+G L I+ +P V+ V K +H+ D + E L D
Sbjct: 760 VVDPNTIVPSHWIVMARENGNLYIYSIPEMQLVYMVKKL----SHLPDVAIDEMNYLGDE 815
Query: 800 E---TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
++I S++ + E I +VE+ + + RP LF ++ D + Y+ +
Sbjct: 816 SVVASDIASNTLNEALVAKPEEI----IVEVLLTGMGMNQGRPMLFVVV-DDMVSVYEMF 870
Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS----RTPLDAYTREETPHGA--- 909
+++ N V R L + V+ R+ RF R P++A R+ +
Sbjct: 871 MYD---NGVVEHLAVRFKR-LPYTTVT----RSCRFQGNDGRAPVEA-ARDTVRYRTALH 921
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
P +RI N G F+ S PC ++ LR+HP +G I++FT +NV C +GF
Sbjct: 922 PFERIGNILN-----GVFICSSYPCVFLMDSGILRMHPLNLEGPILSFTAFNNVLCPNGF 976
Query: 970 IYVTS-QGILKICQLPSGSTYDNYWPVQKV 998
IY+T + ++I +LP+ D+ PV+K+
Sbjct: 977 IYLTEREWAMRIAKLPTDVELDSSLPVRKI 1006
>gi|119602515|gb|EAW82109.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
CRA_b [Homo sapiens]
Length = 377
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 205/369 (55%), Gaps = 31/369 (8%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDIT 416
SVLT+ ++
Sbjct: 367 ASVLTTSVS 375
>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
Length = 1025
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 245/975 (25%), Positives = 407/975 (41%), Gaps = 149/975 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS +L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARISQNPDCDSFLLGFDDAKLSIVAVNPADRCLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + + L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 S-CALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F ++ LD A T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + C S L
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS---ASNNTESAQKTFSFAVRDS 508
+TK+++ S+ + E+ LYG + ++ + V D
Sbjct: 389 -----------ATKKIKLSTEPNANE--EDEDFELYGEMLPKVAKPDITEELLNIRVLDK 435
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G + +S R SS
Sbjct: 436 LLNVGPCKKITGGCPSISAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 496 SIEGVVQYWAIGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554
Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSES-----GSGSENSTVLSVS 665
+QV G +I Y+ L+F N+ ++N +L
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTFPVRNASIVDPYIAICTQNGRLLMYE 611
Query: 666 IAD-PYVLLGMSDGSIRL------------------LVGDPSTCTVSVQTPAAIESSKKP 706
+ + P+V L D S RL ++ S +S Q A + P
Sbjct: 612 LTNHPHVHLKEIDISKRLRHETSPITSLSVYRDMSGIIRFCSAANMSQQQQATGANMHIP 671
Query: 707 -------VSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGPLDQGDI----Y 753
V LY D RK + G+ E D +D I +
Sbjct: 672 EQEDFEDVDDLLLYGDSKKS---RKETLSKRRIVGMKLTEQNTHFDTDVIDPNTIVPSHW 728
Query: 754 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG- 812
+ E+G + I+ +P + V+ V K +H+ D + D E ++ EGT
Sbjct: 729 IAIARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDE----PATGEGTDA 780
Query: 813 -QGRKENIHSMK----VVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
G + ++K ++EL + + RP LF +L D T+ Y+ + + N
Sbjct: 781 MSGTMTDTFAVKPEEVIMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQ 835
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQ 924
+ L + V+ R+ RF T D E+ A + + F+ I
Sbjct: 836 GHLAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLN 889
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQL 983
G F+ S PC + R+HP DG I++FT +N C +GFIY+T + ++++ +L
Sbjct: 890 GVFICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERDRLMRVAKL 949
Query: 984 PSGSTYDNYWPVQKV 998
PS D +PV+++
Sbjct: 950 PSDMILDASYPVKRI 964
>gi|147827332|emb|CAN62175.1| hypothetical protein VITISV_001516 [Vitis vinifera]
Length = 1989
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 121/228 (53%), Positives = 148/228 (64%), Gaps = 49/228 (21%)
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
+G+L+LLT+V DGRVV +L LSK+ SV TS I IG+SL F GS+LGDSLLVQF
Sbjct: 1657 SGELLLLTLVCDGRVVYKLGLSKSRASVFTSGIAAIGSSLSFPGSQLGDSLLVQF----- 1711
Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
T++ SS ++++ GD E D PSTKR RRSSSDALQDM NG++L LY
Sbjct: 1712 TAIPSSSVEKKVGDSEGDVPSTKRSRRSSSDALQDMDNGDKLPLY--------------- 1756
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
V DSL+N+GPLKDF+YGLRIN D ATGI KQSNYEL
Sbjct: 1757 --VSDSLINVGPLKDFAYGLRINTDLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRP 1814
Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMA-AYDDEYHAYLIISL 583
VELPGCKGIWTVYHK++RGHNADS +M+ +D E+ A++ SL
Sbjct: 1815 ERITEVELPGCKGIWTVYHKNTRGHNADSIKMSHVFDLEFRAFIFFSL 1862
>gi|38014465|gb|AAH60475.1| LOC398931 protein, partial [Xenopus laevis]
Length = 363
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 178/303 (58%), Gaps = 23/303 (7%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL+ + GN+ S+A + GA +RD+++L+F++AK+SV+E+D H L+ S+H
Sbjct: 66 LELMASFSFFGNIMSMASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVG 215
FE PE L+ G P V+VDP GRC +L+YG Q+++L ++ GLVG
Sbjct: 122 YFEEPE---LRDGFVQNVHIPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHEGLVG 178
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GR
Sbjct: 179 E--------GQKSSFLPSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGR 230
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
V+ + TC I A+S++ K HP+IWS +LP+D + LAVP P+GGV++ N++ Y +
Sbjct: 231 VAVRQDTCSIVAISLNIMQKVHPIIWSLNSLPYDCTQALAVPKPVGGVVIFAVNSLLYLN 290
Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
QS ++LN+ S P+ + LD + AT++ D ++S K G++ ++T++
Sbjct: 291 QSVPPYGVSLNSLTNGTTSFPLKPQEEVRITLDCSQATFISYDKMVISLKGGEIYVVTLI 350
Query: 393 YDG 395
DG
Sbjct: 351 TDG 353
>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium
dendrobatidis JAM81]
Length = 1673
Score = 198 bits (503), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 181/744 (24%), Positives = 330/744 (44%), Gaps = 152/744 (20%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
AA LEL +R+HGN+ SL ++ + S + D+++L+F++AK+S++E+ L
Sbjct: 87 AACLELAAQFRVHGNITSLGVVPM---NYSGKADALLLSFKEAKMSLVEYSQFTQKLVTV 143
Query: 158 SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
SMH FE E+ L S R P +KVDPQG C + +YG ++ IL Q G+ L+ D
Sbjct: 144 SMHYFEREEFKKLG----SIDRPPPEIKVDPQGYCAAMRIYGDRLAILPFKQDGADLLND 199
Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
+ S F I V+ DLD ++++ DF F+ GY P + I+++ E TW R+
Sbjct: 200 LNDANSKYPFRPSI----VLPFLDLDKSIRNIIDFTFLFGYAVPTIAIMYQTEQTWTARL 255
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--- 331
+ T I+ +S+ T + +P+++ LP++ L++VP+PIGG++V+ N I +
Sbjct: 256 GIRKDTVSIAVISLDTAEESYPVLYKIEKLPYNCTMLVSVPTPIGGLIVLSHNAIIFTDQ 315
Query: 332 -HSQSASCA----------LALNNYAVSLDSSQELP---------------RSSFSVELD 365
H+ +C + L Y + LD Q P ++ LD
Sbjct: 316 IHAPGIACIVNAYFDSETNIMLTPYELQLDMVQPRPPRPPSVFFAQNKYTDYKELAISLD 375
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRV----------VQRLDLSK---------- 405
+ ++ D+ LL + G+++ + ++ + V V+ L++
Sbjct: 376 GSRGMFISPDIFLLVLRDGEMIQVDLIGEEGVGRSWKRRKGGVKSFQLTRLGIRMTAPVH 435
Query: 406 -------TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE 458
+NP L+ +++ FLGSR G L + S + + L +F ++E
Sbjct: 436 LFPLADASNPLSLSGRNSSVPLGGSFLGSR-GSKLRYNYLFASSRTTDACLL--QFVEVE 492
Query: 459 ADAPSTKRLRRSSSDALQDMVNGE----ELSLYGSASNNTES------------------ 496
A S+ + +++ + + NGE + LYG ++ ++
Sbjct: 493 EFAKSSVSMNGAAN--MNNTDNGEDDELDKDLYGDSTTAKQTDTDMSALLSSDEHGHGEI 550
Query: 497 -AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD---------------ASATG-------- 532
+++T F + DS+ + PL+DF+ GL +ATG
Sbjct: 551 VSEQTLRFRLCDSVTVVSPLRDFAVGLPAETSEHRFSPKIGGCDLEIVAATGHGPHGHLA 610
Query: 533 -ISKQSNYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
+++ ++V ELP + +WT+ + D ++ D +H Y+I+S + T
Sbjct: 611 ILNRSVRPQIVTTFELPQIEEMWTI---RCAKFDKDYRLVSEPTDAFHKYVILSHSSGTS 667
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF--- 645
+L+ + TE+ ++ ++ G T+ G L ++QV G + D + D +
Sbjct: 668 ILKAGEAFTEMDDTT-FYQAGPTVGVGALLDETIIVQVHPNGVILFD--FSKYDFTIIDR 724
Query: 646 --------------GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
G E G ++ V+S S DPY +L M+ G I LL D +T
Sbjct: 725 LNTNRMHALYIFVEGTKLQEMRVGDDDIWVISCSFMDPYAMLLMNTGHIVLLSLDETTHQ 784
Query: 692 VSVQTPAAIESSKKPVSSCTLYHD 715
++ + E K+ VS+ +LY D
Sbjct: 785 ITQIS----EYKKRLVSTFSLYCD 804
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/313 (25%), Positives = 124/313 (39%), Gaps = 85/313 (27%)
Query: 753 YSVVCYESGALEIFDVPNFN--CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ V ++G L ++ +P+F C F + F + +D + + T N++ +E
Sbjct: 930 WCFVYTDTGHLLVYTLPDFKECCAFPL--FSTLPVLAMDVPLWRSRSIDSTFANTTGDE- 986
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
+ VV L S P+L + +G + Y+ +F P TS +DD
Sbjct: 987 --------FEEILVVNLGN---SKDRQTPYLVCLAANGDLAVYK--IFVCP--TSSNDDD 1031
Query: 871 VS--------TSRSLSVSNVSASRLRN---LRFSRTPLDAYTRE---------------E 904
S SR+ + + A L+ +R R P D TR+ +
Sbjct: 1032 TSFVNSGTFKQSRTPAELELDAQNLKKRLAIRLVRIPHDQITRDLQFYTDNEGDKIDLVQ 1091
Query: 905 TPHGAPC----QRITIFKNI--SG---HQGFFLSGSRPCWCMVFRER------------- 942
P P Q + F I SG + G ++GSRPCW MV +
Sbjct: 1092 EPQHQPTFLKRQHLKPFDAIGWSGGNMYSGVVVTGSRPCWIMVALQSRQQDLDVISFDNS 1151
Query: 943 -----------------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
LR HP DG + F LHNVN HGF+Y+ +G+ +ICQLP
Sbjct: 1152 VACSTKLPPVPLLGTNMLRFHPMPVDGPMKCFAPLHNVNVAHGFLYINWKGLFRICQLPP 1211
Query: 986 GSTYDNYWPVQKV 998
+D+ WPV KV
Sbjct: 1212 QFNFDHDWPVCKV 1224
>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1628
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 156/585 (26%), Positives = 266/585 (45%), Gaps = 128/585 (21%)
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
+++++++VKDF F+HGY EP ++ LHE TW R++ K TC ++A+S++ K I
Sbjct: 281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W+ N P++ L++VP P+GG LV+ AN + Y +Q++ LA+N YA S+D+S +
Sbjct: 341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399
Query: 359 SFS----------VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
F LD ++ +L++D + S K G+L++ ++ DGR VQR+ +SK
Sbjct: 400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT------SMLSSGLKEE-------FG 455
SVLTS I + N+L FLGSRLGDSLL+Q+T S T S+ K++
Sbjct: 460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYTEKSITDDQLEHENFSNPYKKQKTSEVFDLF 519
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELS-------------LYGSASNNTESAQKTFS 502
D ++ + +++ Q+ + ++ L+ N +S Q
Sbjct: 520 DENSETNNNNNSNNNNNKENQEKSSSSSIASKLLEEIEDEEDQLFKEKKNQLKSYQ---- 575
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY-----ELV--------------- 542
+ D ++NIGP+ D G I+ T Q Y ELV
Sbjct: 576 LGICDQIINIGPIGDIVVGQSIDPTYDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQ 635
Query: 543 -----------ELPGCKGIWTVY------------------HKSSRGHNADSSRMAAY-- 571
ELPG +WTVY K SR N ++ +
Sbjct: 636 NNIKPELVMAFELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQE 695
Query: 572 ----------------DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAA 614
D +H YL +SL + T++ ET L EV + +++
Sbjct: 696 DNEDNEEEEEEEKMQKDKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDI 750
Query: 615 GNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
GNLFGR+R++ +++ G ++++G + Q++ N + S I DP++LL
Sbjct: 751 GNLFGRKRIVVIYQGGIKLINGFDRVIQEIQI------------NEPIKSSYICDPFILL 798
Query: 674 GMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
+G+I++ G D + + + + + S +L+ D+
Sbjct: 799 QFHNGTIQIFKGIDEENQLIQFSINSISNNLNQSIFSSSLFFDRN 843
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 89/168 (52%), Gaps = 18/168 (10%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGISAA-------SLELVC 105
NLV+ NV++IY +R ++ E +S+ + ++ I+ SLEL+
Sbjct: 32 NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91
Query: 106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+L GN+ES+A + ++ RDS+IL F DAKISVL++D + I S+H FE
Sbjct: 92 EKKLFGNIESMASVRYPNSE----RDSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147
Query: 166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
E+ K GR F PL+KVD Q RC +L+Y + +L + S L
Sbjct: 148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSIL 192
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 50/104 (48%), Gaps = 17/104 (16%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ----------------LCDGSIV 955
+RI F +ISG +G F+ G +P W + LR+H ++
Sbjct: 1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181
Query: 956 AFTVLHNVNCNHGFIYVTSQG-ILKICQLPSGSTYDNYWPVQKV 998
FT +N++C GFIY + + ++KIC L + ++N ++++
Sbjct: 1182 TFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRRI 1225
>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1744
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 229/978 (23%), Positives = 387/978 (39%), Gaps = 249/978 (25%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 278 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 337
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A +
Sbjct: 338 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 397
Query: 352 SQELPRS------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
P S +V L +LQ LL+ +G + +L++ Y+
Sbjct: 398 QSVFPLSEAVYETPEHETVQLNVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSR 457
Query: 395 ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
GR L L S+ S + F+GSR GDS+L
Sbjct: 458 GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 516
Query: 444 SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ K+E I + AP K S A ++ + ++L LYG+A E A
Sbjct: 517 TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 571
Query: 498 QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
+ + + +R D L +IG + G+ NAD
Sbjct: 572 ATSSTECTNGVGVSSVKTEENGAPEQDTGSYDYELRQIDVLPSIGQITSIELGVENNAD- 630
Query: 529 SATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHN 562
S + ELV EL GC+ +WTV
Sbjct: 631 -----SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATR 685
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
+ R Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R
Sbjct: 686 SSDGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGSTLAAANLFNKQR 739
Query: 623 VIQVFERGARIL------------------DGSYM----------------TQDLSFGPS 648
++Q+F++GAR++ +G+ TQ+++
Sbjct: 740 IVQIFKQGARVMMEVPEEETSNGQEKSAKTEGAEDEEEDDEDDGPRVKLVCTQEITLEGD 799
Query: 649 NSESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP--------- 697
G + S+V +SV + DPY+LL ++DGS+RLL+GD +SV P
Sbjct: 800 VECGGMNVDTSSVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDLELSVIDPEIDYAEGIS 859
Query: 698 ---AAIESSKKPVSSCTLYHD--------------------------------------- 715
+ + SK SS L++D
Sbjct: 860 EANGSADMSKHGSSSACLFYDWAGMFVENAWVEEEQEERHEATQSRAKRAEDDDDMDALY 919
Query: 716 -KGPEPWLRKT-STDAWLSTGVGEAIDGADGGPLDQ---GDIYSVVCYESGALEIFDVPN 770
P P + T +T + ST DG+ PL Q + +C+ G+L +F +P+
Sbjct: 920 SSKPSPKVATTNATKSTPSTATPRNEDGSVSIPLLQQKDAKMMCGMCFGDGSLHVFSLPD 979
Query: 771 F--------------NCVFTVDKFVSGRTHIVDTYMREALKDSETEIN-SSSEEGTGQGR 815
F + V T++ + GR V L +N S+S G+ +
Sbjct: 980 FKKRGVFPYLTFAPQSLVNTLEHYQVGRNKTVK------LSAPVLGLNASTSSANDGRIK 1033
Query: 816 KENIHSMKVVELAMQRW--------SAHHSRPFLFAILTDGTILCYQAY-LFEGPENTSK 866
K + + V ++ + R + + SR + L +G +L Y A FE + +
Sbjct: 1034 KSHTINSPVADIVIHRVGPSEGQHNAQYLSRMVMLVFLANGDLLMYSAAPKFESLKPRAN 1093
Query: 867 SD-DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR---------ITI 916
+ PV + ++ L + +A E A + +T
Sbjct: 1094 GEIAPVFHFVRVGTELITRPFLPPKARTNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTC 1153
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNHGFI 970
F N++ G F G+ P W + R P +C+ + +++FT H+ NC +GFI
Sbjct: 1154 FHNVNNMSGAFFRGAHPMWILGDRGHASFVP-MCNAAPRVSVPVLSFTSFHHWNCPNGFI 1212
Query: 971 YVTSQGILKICQLPSGST 988
Y S+G L++C+LPS T
Sbjct: 1213 YFHSRGALRVCELPSSKT 1230
>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 1561
Score = 196 bits (498), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 227/964 (23%), Positives = 384/964 (39%), Gaps = 234/964 (24%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 108 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYYLTVISINMK 167
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 168 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 227
Query: 350 -------DSSQELPR---SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
D+ E P + +V L +LQ+ LL+ G + +L++ Y+
Sbjct: 228 QSVFPLSDAVYETPEHETAQLNVVLYDCQFEYLQDKELLLTMPCGQVYVLSLPYEDTSSR 287
Query: 395 ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
GR L L S+ S + F+GSR GDS+L
Sbjct: 288 GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 346
Query: 444 SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ K+E I + AP K S A ++ + ++L LYG+A E A
Sbjct: 347 TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 401
Query: 498 QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
+ + + +R D L +IG + G+ NAD
Sbjct: 402 ATSSTECTNGVGVSSVKTEENGAPEQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD- 460
Query: 529 SATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHN 562
S + ELV EL GC+ +WTV
Sbjct: 461 -----SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATR 515
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
+ R Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R
Sbjct: 516 SSDGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQR 569
Query: 623 VIQVFERGARIL------------------DGSYM----------------TQDLSFGPS 648
++Q+F++GAR++ +G+ TQ+++
Sbjct: 570 IVQIFKQGARVMMEVPEEETSNGQEKSGKAEGAEDEEEDDEDDGPRVKLVCTQEITLEGD 629
Query: 649 NSESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP--------- 697
G + S+V +SV + DPY+LL ++D S+RLL+GD +SV P
Sbjct: 630 VECGGMNVDTSSVGIVSVDVVDPYILLLLTDVSVRLLMGDEEDLELSVIDPEIDYAEGIS 689
Query: 698 ---AAIESSKKPVSSCTLYHD-------------KGPEPWLRK-TSTDAWLSTGVGEAID 740
+ + SK SS L++D P P + +T + ST D
Sbjct: 690 EANGSADMSKHGSSSACLFYDWAEDDDDMDALYSSKPSPKVATMNATKSMPSTATPRNED 749
Query: 741 GADGGPLDQ---GDIYSVVCYESGALEIFDVPNF--------------NCVFTVDKFVSG 783
G+ PL Q + +C+ G+L +F +P+F + V T++ + G
Sbjct: 750 GSVSIPLLQQKDAKMMCSMCFGDGSLHVFSLPDFKKRGVFPYLTFAPQSLVNTLEHYQVG 809
Query: 784 RTHIVDTYMREALKDSETEIN-SSSEEGTGQGRKENIHSMKVVELAMQRW--------SA 834
R V L +N S+S G+ +K + + V ++ + R +
Sbjct: 810 RNKTVK------LSAPALGLNASTSSANDGRIKKSHTINSPVADIVIHRVGPSEGQHNAQ 863
Query: 835 HHSRPFLFAILTDGTILCYQAY-LFEGPENTSKSD-DPVSTSRSLSVSNVSASRLRNLRF 892
+ SR + L +G +L Y A FE + + + PV + ++ L
Sbjct: 864 YLSRMVMLVFLANGDLLMYSAAPKFESLKPRANGEIAPVFHFVRVGTELITRPFLPPKAR 923
Query: 893 SRTPLDAYTREETPHGAPCQR---------ITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
+ +A E A + +T F N++ G F G+ P W + R
Sbjct: 924 TNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTCFYNVNNMSGAFFRGAHPMWILGDRGHA 983
Query: 944 RVHPQLCDGS-------------------IVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
P S +++FT H+ +C +GFIY S+G L++C+LP
Sbjct: 984 SFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSCPNGFIYFHSRGALRVCELP 1043
Query: 985 SGST 988
S T
Sbjct: 1044 SSKT 1047
>gi|194374339|dbj|BAG57065.1| unnamed protein product [Homo sapiens]
Length = 330
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 114/302 (37%), Positives = 170/302 (56%), Gaps = 35/302 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS Y V+L
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVAL 300
Query: 350 DS 351
+S
Sbjct: 301 NS 302
>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
Length = 1440
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 175/623 (28%), Positives = 297/623 (47%), Gaps = 93/623 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++AF+DAK+S++ ++ ++ S+H FE+ +L+ G + P+V+ DP+
Sbjct: 91 QDSILMAFDDAKLSIITINEKERNMQTISLHAFENE---YLRDGFVKYFHPPIVRTDPEN 147
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + S RI S ++I L+ +D + +V
Sbjct: 148 RCAASLVYGKHIAILPFHEN-----------------SKRIHS-YIIPLKQIDPRLDNVA 189
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 190 DIVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 249
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL +P P+GG +V G+NTI Y +Q+ C + LN+ D + P S +
Sbjct: 250 CATLLPIPKPLGGAIVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKSMKMT 306
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + T+ G+L LL +V G V+ L+ SK + + +T
Sbjct: 307 LDCSTSVYMEDGRIAVGTRDGELFLLRLVTSSGGATVKSLEFSKVWDTSIAYTLTVCAPG 366
Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
FLGSRLGDS L++++ T S+ +++E +EA+ +
Sbjct: 367 HLFLGSRLGDSQLLEYSLIKTTRESVKRHKMEQEQNHVEAE------------------L 408
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ ++L LYG A +++ E ++ F+ D L NIGP+K G R N ++ +
Sbjct: 409 DEDDLELYGGAIEEQQNDDEEQITESLQFSELDRLRNIGPVKSMCVG-RPNYMSNDLVDA 467
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++++ G G V+ +S R SS + ++E H YLI+
Sbjct: 468 KRRDPVFDVITASGHGKNGSLCVHQRSLRPEIVTSSLLEGAEQLWAVGRKENESHKYLIV 527
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQVFERG-ARILDGSYM 639
S T+VLE + L E+ E + FV G+ T+AAG L +QV A + DG +
Sbjct: 528 SRIRSTLVLELGEELIELEEPL--FVTGQPTVAAGELSQGAFAVQVTSTSIALVTDGQQL 585
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSV--- 694
+ N V+ SI DPYV + +G + L LV +P +
Sbjct: 586 AE-----------VKIDSNFPVVQASIVDPYVAVLTQNGRLLLYTLVSNPYMQLQEIDLA 634
Query: 695 QTPAA--IESSKKPVSSCTLYHD 715
QTP + I S ++S ++Y D
Sbjct: 635 QTPFSTFIAQSASQITSISMYAD 657
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 118/291 (40%), Gaps = 32/291 (10%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS 782
++ DA S GE D D + V+ +E+G L + +P V+ + +F +
Sbjct: 736 KRLGHDAIQSGRGGEQSDAIDPSSYTSISHWLVLAHENGRLSVHSLPEMELVYQIGRFPN 795
Query: 783 GRTHIVD-----TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
+VD +K +S EE K+N +++E + + S
Sbjct: 796 VPELLVDLTPEEEEKERRIKAQLAAKEASDEEQLNAEMKKNCE--RIMEAQIVGMGINQS 853
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
P L AI+ D ++ Y+ + +P S L ++ LR S
Sbjct: 854 HPILMAIV-DEQVIMYEMFA-----------NPNSQPGHLGIAFRKLPHFICLRSSPYLK 901
Query: 898 DAYTR------EETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQ 948
R EE P I F+ +S + G + G+ P +V+ ++ HP
Sbjct: 902 SDGKRAAFQIVEEDGKRYPL--IHSFERVSTVNNGVIIGGAVPT-LLVYGAWGGMQTHPM 958
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQKV 998
DGSI AFT + N +GF+Y+T + L+I ++ + Y+ +PV+K+
Sbjct: 959 TIDGSIKAFTPFNIDNVPYGFVYMTQKKSELRIAKMHADFDYEMPYPVKKI 1009
>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata
TFB-10046 SS5]
Length = 1422
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 221/981 (22%), Positives = 410/981 (41%), Gaps = 164/981 (16%)
Query: 52 IGPVPNLVVTAANVIEIYVVRVQ------------EEGSKESKNSGETKRRVLMD----- 94
+G NLVV N++ ++ VR++ +E + + + V MD
Sbjct: 40 LGVATNLVVARQNLLRVFEVRIEAAPLPSQEKLLADEQGRGRRGMEAVEGEVEMDVGGEG 99
Query: 95 ----GI-------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAF 137
GI + L LV +RLHG V L + Q A + D ++++F
Sbjct: 100 FVSAGIVKSAGQHARQRQRTVTRLYLVRQHRLHGIVTGLGRV-QTMASLEDKLDRLLVSF 158
Query: 138 EDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLV 196
+DAKI++LE+ + H L S+H +E +P+ L R +++DP RC + +
Sbjct: 159 KDAKIALLEWSEVSHDLSTISIHTYERAPQMLAFDSARALTE----LRIDPNSRCAALTL 214
Query: 197 YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFV 252
G + IL + + L D D GG S + S +++L ++D ++++ D F+
Sbjct: 215 PGDAVAILPFYESQAELDMDVDQ----GGVSRDVPYSPSFILSLPEVDNDIRNIIDIAFL 270
Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
G+ P + +L E + TW GR++ T + L++ + +P+I S LP+D+ +L+
Sbjct: 271 PGFNNPTLAVLFETQRTWTGRLAEFKDTVRLRILTLDVVTRTYPIIGSVDGLPYDSMRLV 330
Query: 313 AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
A P+ +GGV+V+ AN + + QS + A+A N +A + S P L+ + A +
Sbjct: 331 ACPAALGGVIVLTANAVLHIDQSGKNVAVAANGWAARV-SEFPTPAPERDETLEGSRAVF 389
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ + LL + G +V + ++ +GR+V ++D+ + + + + + + + L +GS G
Sbjct: 390 VSDKTFLLVYRDGSIVPVELILEGRMVTKIDMGQRLAQTTIPTVVCAVQDDLVLVGSTAG 449
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-----DMVNGEELS 485
S+L++ T E DI DA S + ++ ++ D ++ E+
Sbjct: 450 PSILLKVT-------------HEEEDITPDAGSARENGAANGNSTNGATYDDPMDSEDED 496
Query: 486 LYGSASNNTESAQKTFS--------------FAVRDSLVNIGPLKDFSYGLRINAD---- 527
LYG S T + A+ DSL GP+ D ++ L N +
Sbjct: 497 LYGGTDMMVTSTSGTLTVGGTAALEKRRILRLALADSLCGHGPISDMAFILGRNGERHVP 556
Query: 528 -------ASATG--------ISKQSNYELVELPGCKGIWTVYHKSS---RGHNADSSRMA 569
TG + + +L + G +G+W+ + + G N + A
Sbjct: 557 ELLAGVGVGHTGGLARFQRDLPARVKRKLHRISGNRGVWSFPVRRAVKVAGMNIERPTGA 616
Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ--GRTIAAGNLFGRRRVIQVF 627
A D +I+S +A + + + + VD + TI AG F R ++QV
Sbjct: 617 ADWD----TVIVSTDATPSPGLSRVAVKDSSTDVDILTRLPAITIGAGPFFQRTAILQVV 672
Query: 628 ERGARIL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
R+L DGS + +DL + + + + SI+DP+V++ D ++ L
Sbjct: 673 NNAIRVLEADGSERQVIKDLD---------GTTPRAKIRACSISDPFVVVVREDDTLGLF 723
Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK------GPEPWLR-KTSTDAWLSTGVG 736
VG+ + + + + + + Y D G L+ K +A ST +
Sbjct: 724 VGETGKGKLRRKDMSMLGDKASRYLAASFYQDHSGLFQVGTARSLKGKEKANAPASTTIE 783
Query: 737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL 796
A+D +G + V+C G +EI+ +P VF+ + DT+
Sbjct: 784 AAMDEG------RGSQWLVLCRPQGVVEIWALPKLTLVFSCGGVSDIPPVMADTF----- 832
Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ S ++ Q ++ E+ + RP L +L GT+ Y
Sbjct: 833 ---DLATPSPVQDPPRQAEDHDVE-----EILISPIGETTPRPHLLVLLRSGTVAVYDTA 884
Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
E P PV+T R + ++ R+ + P++ + P AP I
Sbjct: 885 PVELP--------PVATGREAGL-QLAFVRIMSRAVDTAPIERAEKRGAP--APRHLIPF 933
Query: 917 FKNISGHQGFFLSGSRPCWCM 937
++S G FL+G +P W +
Sbjct: 934 STSVS---GVFLTGGKPGWIL 951
>gi|134025022|gb|AAI35011.1| LOC564406 protein [Danio rerio]
Length = 348
Score = 189 bits (480), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 104/283 (36%), Positives = 165/283 (58%), Gaps = 13/283 (4%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
++LN+ + P+ + LD + A+++ +D ++S
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVIS 338
>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
Length = 1451
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 182/695 (26%), Positives = 313/695 (45%), Gaps = 77/695 (11%)
Query: 57 NLVVTAANVIEIYVVR--VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
NL V A N++ +Y +R V E G+ + EL + L G V
Sbjct: 45 NLAVAAGNMLSVYRIRSSVDEAGNHFDR------------------FELCDEFELWGIVV 86
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
+ L G+ RDS++L+ E++K ++E++ L SMH F+ + L+RG
Sbjct: 87 CMTRLRLAGS----VRDSLLLSIEESKCVIVEYEPDTGSLSTISMHFFQDED---LRRGF 139
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS-GLVGD--EDTFGSGGGFSARIE 231
+ L +VD RC VLVYG + +L + L G + F GF A
Sbjct: 140 RKLSSMALARVDGFNRCAAVLVYGSYLAVLPFRRSTERDLSGQRHQAVFYENSGFIA--- 196
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
++I+L+ L +K V DF F+ GY +P +++L+E TW GRV+ + TC + ALSI+
Sbjct: 197 --NMIDLQSLPVKIASVLDFQFLEGYNDPTILLLYEALPTWTGRVTERQDTCGMVALSIN 254
Query: 290 TTLKQHPLIWSAMNLPH-DAYK--LLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNY 345
+ HP+IW LP + Y L +P P+GG L+ N++ Y QS +ALN+
Sbjct: 255 LIDETHPVIWQMAGLPFPNPYSSALFPIPKPLGGSLLFATNSLIYLDQSVPPYGVALNSL 314
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
+ + + + L A L +D +S ++GD+ ++T+ D V+R L
Sbjct: 315 PLGCTNFALKTQDVAPLNLQNCKACMLSDDSICVSLESGDVYIITLKKDSLNNVRRFYLD 374
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+ SV+ + ++ + ++L FLGSRLG+SLL+++ C + S+ L+ D
Sbjct: 375 QVASSVIPTTLSKLSDNLIFLGSRLGNSLLLRYKCKENSKKSSTSLENGEKDGVEIENKE 434
Query: 465 KRLRRSSSDALQDMVNG------EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
+ + + + NG +++ YG N + ++ F D+L NIGP
Sbjct: 435 EEKNELNFEIEKSSENGSPENKRKKMRYYGDEIFNLD-VNTSYDFETMDNLSNIGPCGPV 493
Query: 519 SYGLRINADASATGI---SKQSNYELVELPGCK----GIWTVYHKSSRGHNA-------- 563
N + + + ++ N ++ L G G TV HKS R A
Sbjct: 494 ELIHTANHNDNYDHVGSDARDRNIDVCVLSGKDKTGFGSITVLHKSVRPSIASQFPFPMN 553
Query: 564 --DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV-TESVDYFVQGRTIAAGNLFGR 620
D + ++E H+ L+++ + +TMV +T +L E+ E +TI +
Sbjct: 554 FSDMWTLRRSEEETHSLLVMTKKDQTMVFQTGAILEELKKEECGLATNAKTIFCATIGNG 613
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS- 679
+ ++QV R ++D TQ+ SG ++ V+ DPYV++ S G+
Sbjct: 614 KYIVQVLPRAVVLVDMD--TQETIQNKPFDLSGQ------IIQVA-CDPYVVILASKGTI 664
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
I L++ + S T ++T A E + + H
Sbjct: 665 ISLVLFENSDGTAMLKTSTAPECKNQDDPEKKIMH 699
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 107/242 (44%), Gaps = 35/242 (14%)
Query: 760 SGALEIFDVPNFNCVFTV-DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
+G+LEI+ +P+ C+ D+ + I++T S EG+ +GR+ +
Sbjct: 814 NGSLEIYSLPD--CLLRFGDRNFANAPRILET---------------SRFEGS-EGRRVD 855
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
+ + V E+ + S P++ ++ D ++ Y F N +++ PV + R +
Sbjct: 856 V--LDVQEMNVFNMGPS-SLPYIVVMIGDQLMI----YRFRATLNRFQTESPVLSGRFIK 908
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ + + + LR D ++ + + ++ F NIS H G FL G+ P W
Sbjct: 909 LQD----KTKLLRRIPGVHDESSKTKNRNNKIMRQ---FMNISDHNGIFLGGAYPTWIFC 961
Query: 939 FRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQ 996
+ RL +H +G + AFT N C GF+Y S L + L YD WP +
Sbjct: 962 GQNGRLNIHSMWQEGFVNAFTPFDNEKCADGFLYFRHSTKTLTVANLQPFLKYDADWPFK 1021
Query: 997 KV 998
K+
Sbjct: 1022 KI 1023
>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
Length = 1454
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 150/576 (26%), Positives = 266/576 (46%), Gaps = 81/576 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G ++ P+V+ DP
Sbjct: 92 QDSILMTFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFTTYFNPPIVRTDPAN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + ++ S++I L+ +D + +V
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLEGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL++P P+GG +V G+NTI Y +Q+ C + LN+ D + P +
Sbjct: 251 CNSLLSIPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + ++ GDL LL +V G V+ L+ SK + + +T
Sbjct: 308 LDCSTSVYMEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG 481
F+GSRLGDS L+++T ++ S K+ R + + ++
Sbjct: 368 HLFVGSRLGDSQLLEYTL-----------------LKVTKESAKKQRLEQQNPSEIELDE 410
Query: 482 EELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
+++ LYG A +++ E ++ F D L+N+GP+K +G R N ++ +K+
Sbjct: 411 DDIELYGGAIEMQQNDDDEQISESLQFRELDRLLNVGPVKSMCFG-RPNYMSNDLIDAKR 469
Query: 537 SN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLIISL 583
+ ++LV G G V+ +S R SS + ++E H YLI+S
Sbjct: 470 KDPVFDLVTASGHGKNGALCVHQRSMRPEIITSSLLEGAEQLWAVGRKENESHKYLIVS- 528
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQD 642
R+ ++ E + T+AAG L +QV A + DG M Q+
Sbjct: 529 RVRSTLILELGEELVELEEQLFVTNEPTVAAGELLQGALAVQVTSTCIALVTDGQQM-QE 587
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
+ N V+ SI DPYV + +G
Sbjct: 588 VHI----------DSNFPVVQASIVDPYVAVLTQNG 613
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 132/300 (44%), Gaps = 38/300 (12%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
++ DA +S+ GE D +D YS VV +++G + I +P+ V+
Sbjct: 737 KRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLVVAHDNGRITIHSLPDLELVYQ 791
Query: 777 VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT--------GQGRKENIHSM------ 822
+ +F + +VD + E K+ + + ++ E+ + ++ ++S
Sbjct: 792 IGRFSNVPELLVDMTVEEEEKEKKAKQTAAQEKEKETEKKKDDAKNEEDQVNSEMKKLCE 851
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
KVVE + + + P L AI+ D ++ Y+ + P+ V+ + + +
Sbjct: 852 KVVEAQIVGMGINQAHPVLIAII-DEEVVLYEMFASYNPQPGHLG---VAFRKLPHLIGL 907
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE 941
S N+ R P + E HG I F+ IS + G + G+ P +V+
Sbjct: 908 RTSPYVNIDGKRAPFEM----EMEHGKRYTLIHPFERISSINNGVMIGGAVPT-LLVYGA 962
Query: 942 --RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKV 998
++ H DGSI AFT +N N HGF+Y+T Q L+I ++ YD +PV+K+
Sbjct: 963 WGGMQTHQMTIDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKKI 1022
>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
Length = 1376
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 165/616 (26%), Positives = 286/616 (46%), Gaps = 74/616 (12%)
Query: 99 ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
AS ELV +++G + S+AI G + D I+LA +DAK+SV+ +D H L S
Sbjct: 65 ASFELVLSEQVYGRLASVAIARLTGF----QLDVILLAIDDAKLSVVGYDIETHSLVTLS 120
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVG 215
MH +E + K G F P++++DP+ RC + +YG +++L + S S +
Sbjct: 121 MHYYEDDLF---KLGFTRFEIPPMLRMDPERRCAAMTIYGAHLVVLPLVRESLYESMNIV 177
Query: 216 DEDTFGSGGGFSARIESSHV-INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
D G FS R+ S V N D M +V D F+HG+ EP +++L+E T AGRV
Sbjct: 178 DPSQ-RPGWPFSLRLTSYTVAFNAIDAKMHNVTDMCFLHGFYEPTVLLLYEPTQTTAGRV 236
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
+ T I A+S++ K H +IW+ NLP DA+ LLA+P P+GGVL+ N+I Y +Q
Sbjct: 237 VVRQDTYQILAVSLNPKDKTHAVIWTLGNLPFDAFALLALPKPLGGVLLFSVNSIIYLNQ 296
Query: 335 SASCA-LALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
S C + +N+ + RS V LD +HA + + A L ++G + ++++++
Sbjct: 297 SVPCCGILINDNGRGFTNYPLRDRSELMVTLDGSHAALIDSANAALVLRSGLVFVVSLLF 356
Query: 394 DG-RVVQRLDLSKTN-----PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
D +V+ + L+ ++ PS +++ + ++ F+GS +G+S L + +++
Sbjct: 357 DRLNMVKEILLTASSVRGAAPSTVSA---CVSSNCLFVGSAIGNSALYAYEAIEQVDVVA 413
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN--NTES-AQKTFSFA 504
L R + + L DM LYG TE+ Q F F
Sbjct: 414 VTLPA---------------RDTGLNLLDDM------QLYGELIRPCTTETLVQTKFEFR 452
Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
D L ++GP + + G A + S++ + PG G +TV +S R
Sbjct: 453 RLDQLASLGPCRAITVGESSVAMVNNFYEDYVSDWLVAGGPGTDGSFTVMQRSVRPRLLT 512
Query: 565 SSRMAAYDDEYHA-----------------YLIISLEARTMVLETADLLTEVTESVDYFV 607
+R+ + + Y++++ + RT+V + +TE+ ++ + +
Sbjct: 513 QTRVEDVLNAWSVGAQLIGSVDRSASPRPQYMLLTTKQRTVVFTLSSGITEIFDT-GFEI 571
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
+ TIA G++ V+QV + +L Q ++ V S+
Sbjct: 572 RFETIACGDMMNGAYVVQVTKENLVLLHRGQQVQCINL----------RVFEEVCQASVI 621
Query: 668 DPYVLLGMSDGSIRLL 683
DPYV L + G + L
Sbjct: 622 DPYVALIVRHGHVLLF 637
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 88/197 (44%), Gaps = 38/197 (19%)
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSK----------------------------SDD 869
RPFLFA++ + +L Y+A+ + P+ + +DD
Sbjct: 755 RPFLFAVVEE-QLLIYEAFHYPYPQQRYRLSVRFKKVRHTAILQRFRRIGRDDFKLLADD 813
Query: 870 -----PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISG 922
R S + + SR R R S +A+ E + AP ++++ F+N++G
Sbjct: 814 FQFSEQYRRRRKRSKHDSNRSR-RGDRHSGRRQEAHEHEPYRLTYEAPARQLSPFENVAG 872
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+ G P +C + ++ LR+HP DG +VAF + F Y T+ G++++
Sbjct: 873 YAGLFIGGGYPYFCFLSKQGDLRLHPMHIDGPVVAFAPYCSPKQLRAFAYFTADGMMRVS 932
Query: 982 QLPSGSTYDNYWPVQKV 998
LPS +D P KV
Sbjct: 933 SLPSKFDFDRSIPSMKV 949
>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
Length = 1506
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 180/690 (26%), Positives = 297/690 (43%), Gaps = 128/690 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV NV+ +Y + VQ +G+ + + E +DG+ + V R GN
Sbjct: 29 NLVTVQGNVLSVYNL-VQAQGAADKRCHLEADISFTLDGVP----QDVATVRPRGN---- 79
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLK 171
RD +I F+DA+++++ FD + L S+H FE + W +
Sbjct: 80 ------------SRDLLIFTFKDARVAIVRFDPKMRDLETVSLHAFEDTDTKLGGWHSEQ 127
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF-GSGGGFSARI 230
R R V VDP RC ++VYG ++I++ S G + + DT + F++R
Sbjct: 128 RLR--------VCVDPLHRCAALMVYGCKLIVISFSSGTATAAPEADTQEDTEQSFTSR- 178
Query: 231 ESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
VI+L L + V D F+ GY P + ILH+ W G ++ T ++ALS+
Sbjct: 179 ----VIDLLSLPSTIGRVDDMAFLDGYDVPCLAILHQPRPAWVGHMAKTKDTAHVTALSL 234
Query: 289 S------------TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
+ P++W NLP D + L VP+P+GGV+V+G N + Y +QS
Sbjct: 235 ALDEMTARRAPTAPPPPPPPVVWHQENLPSDTFALQPVPAPLGGVVVIGVNVLFYVTQSL 294
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
+LALN Y+ + ++ ++ S++LD AH L L + +GD+ LLT+V
Sbjct: 295 VRSLALNGYSRASTNAPIQEQTGISLDLDGAHHALLTPTQILFALPSGDIHLLTIVCTDV 354
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--K 451
V L + K SV+ SDI T+G F+ SR SLL+++ + T + SG+ +
Sbjct: 355 TVDGLRMDKLATSVIGSDICTLGRRHIFIASRHATSLLLEWAPIPLSATTHIDVSGVSGR 414
Query: 452 EEFGDIEADAPSTKRLRRSSS------------DALQDMVNGEELSLYGSASNNTESAQK 499
++ G + ST L S+S D D+V+G +G S
Sbjct: 415 DDAGLYGTSSDSTAALNTSASRDGSSTGGDDLDDVYGDVVDGGTTGAHGIGSGGR---VM 471
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE---------------- 543
T RD+L + P+K + G +A +S YELV
Sbjct: 472 TVKLMARDALPTVAPIKSTAVG--TSAQGVVPHADPRSQYELVSCIGHDKNGALANISYS 529
Query: 544 -----------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
L K W V+ +S+ +H +++ S +TMV
Sbjct: 530 LKPQVLLTEDALSSVKDCWAVHSNNSK---------------HHTHVVFSKPKKTMVFRV 574
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
A ++ + + T+ AGN+ GR+ V+QV + +LD +D F
Sbjct: 575 AGDFEQLRHPRGFDTEASTVFAGNVMGRQLVLQVTAKHVMLLDD----RDCVF------D 624
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ + VS+ADPY+ L ++D + ++
Sbjct: 625 ERMKKGVRITKVSVADPYIALLLNDATTKV 654
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 58/270 (21%), Positives = 95/270 (35%), Gaps = 44/270 (16%)
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN------------ 804
C ++G L IF VP+ VF F D+ R+ + E E
Sbjct: 807 CDKNGVLSIFQVPDMREVFCCTVFSVLPNVAWDSVYRKEIGPVELEPEMPLKRAKTMDEK 866
Query: 805 -----------SSSEEGTGQGRKENIHSMKVVELAMQRWSA----HHSRPFLFAILTDGT 849
+ E G+ Q ++ ++ E+ + A SRP LF
Sbjct: 867 GQSVFVEADEEADDESGSAQAEEDEQDRLQRKEMTIVELLAIGLGRGSRPHLFLRNETQH 926
Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
++ Y+ + TS S R RLR T +D + +
Sbjct: 927 VIVYEIF-------TS------SYKRHEKYEGRLQIRLRKRHQHPTWIDERLAQSS--SI 971
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
P F +ISG G F+ RP W M + +R H DG++ FT L +
Sbjct: 972 PPAAFRPFADISGCDGVFVCARRPSWFMCDHTHKVVRHHAMRFDGAVQCFTQLKHAMHTS 1031
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
F+Y T +G++++ +G P ++
Sbjct: 1032 CFLYFTGKGVMRMATTAAGQVLSTPLPSRR 1061
>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
Length = 1431
Score = 186 bits (471), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 255/1069 (23%), Positives = 430/1069 (40%), Gaps = 213/1069 (19%)
Query: 57 NLVVTAANVIEIY-VVRVQEEGS--------------KESKNSGETKRRVL-MDGISAAS 100
NL+V + ++++ +VRVQ + +E+ ET +++ + +
Sbjct: 29 NLIVAKDDFLQVFDIVRVQRDSDDVEDAFGSSMNLRMEENDAFMETNMQLIRTHEHTVYT 88
Query: 101 LELVCHYRLHGNVESLAILSQ--GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L LV R+ G ++ LA++ GG D ++L AK+S+L +D L S
Sbjct: 89 LRLVYQTRVFGTIKDLAVVKPKLGGFTT----DLLVLLTNYAKVSILVWDSLTQQLSTVS 144
Query: 159 MHCFES---PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
MH +ES P+ + E+ A+ + VDP+ C + YG M I+ + +
Sbjct: 145 MHYYESVVPPKPI----AEETPAQ---LIVDPESTCCVLRFYGDMMAIIPFRKPEDLEME 197
Query: 216 DEDTFGSGG-GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
D + S V+ LD + V D F+ GY E + +L+ E T
Sbjct: 198 DANAQSEKPVDVQCVYLPSFVLTASQLDYSIARVLDSKFLEGYREATLALLYCPEQTSTV 257
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY- 331
+ + T ++ +++ + +I S NLP+D Y +L VP+P+GG L++G N I Y
Sbjct: 258 FLPVRKDTVSLAVITLDIEQRASAVITSIHNLPYDIYCILPVPAPLGGSLLLGGNEIIYV 317
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDL 386
S ++ +A+N + + + Q RSSF +EL+ L ++ LL TG L
Sbjct: 318 DSAGSTVGIAVNPFYRNATNFQLEDRSSFQLELEGTIGVPLSSPRTESVSVLLIHPTGQL 377
Query: 387 VLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTC 439
L + DG+ V+ LDL + N ++L S +T + + FLGS+ GDS LVQ++
Sbjct: 378 FYLDFLMDGKNVKNLDLHPASDELNNALLQSGVTCALPVADHELFLGSQTGDSYLVQWSR 437
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
S + + D E DA D L D +Y + S K
Sbjct: 438 RSINNQTQEEGTLTYKDEENDA-------DEEVDELDD--------IYDTGSKEKAKRNK 482
Query: 500 -----TFSFAVRDSLVNIGPLKDFSYG---------------LRINADASATGIS----- 534
V D L N+GP+ +F G L + + TG S
Sbjct: 483 FVELGPLRLEVHDVLSNVGPIIEFCTGKAGSLAYFPQDNHGPLEVTC-VTGTGKSGSLVV 541
Query: 535 -KQSNYELVE----LPGCKGIWTVYHKSSRGHNADSSRMAAY-DDEYHAYLIISLEARTM 588
++S +VE GC+ +WT+ H + R N S Y DDEY YL++S E +
Sbjct: 542 FRRSISPVVEGKFNFEGCQSLWTI-HVTGRLKNPRSHGSERYLDDEYDTYLVVSKEKESF 600
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGP 647
V + EV +S D+ +G TI G L G R++Q+ R+ D + ++ Q ++ G
Sbjct: 601 VFTAGETFDEVEDS-DFNTKGSTINVGGLLGGMRIVQICTTSLRVYDPNIHLVQRINLG- 658
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
+ V++ S+ DPYV+L + G I L D T + + K V
Sbjct: 659 ---------KKQNVVAASVCDPYVVLVLLGGRILLYSMDAETQRL---IKMDLHKQLKNV 706
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWL-----STGVGE-AIDGADGGP--------------- 746
+ +LY +P +++ ++ L S G + +DG D P
Sbjct: 707 KAASLYSTN--DPVMQELFSELDLGRNNSSPGKSDIQMDGVDTQPDRPSMPAGNQVTETN 764
Query: 747 ---LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
LD+ + V ++ G L++ +P ++CV D F + T + + L
Sbjct: 765 VSTLDEQSFAAHFVLFVLHDDGRLKVLHLPTYSCVLECDVF------DLPTVLYDGLSSE 818
Query: 800 E-TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
TE++ SS+E +VE+ L I Y+ ++
Sbjct: 819 RVTEMHESSQE--------------LVEVLATDLGDEAKEAHLLIRSRMNEITVYKPFVC 864
Query: 859 EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET------------- 905
P T K++ LRFS+ P + TRE T
Sbjct: 865 SNPV-THKTE---------------------LRFSKIPQEGMTRESTECSLQDLVAETEQ 902
Query: 906 ---PHGAPCQ------------RITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQL 949
P A Q R+ + I H F++G++P + + + + HP L
Sbjct: 903 ENAPKDASEQKPQKSSSTVDKPRMVALQRIGNHSAVFITGAKPFFLLKTAHSVAKFHPLL 962
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ I++ H + G+I+V + IC+ YD+ W +KV
Sbjct: 963 SECRILSLASFHTEHAPKGYIFVDENYDINICRFQDDINYDHRWGYKKV 1011
>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
Length = 1454
Score = 186 bits (471), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 162/589 (27%), Positives = 273/589 (46%), Gaps = 84/589 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G + + PLV+ DP
Sbjct: 92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + S RI S +VI L+ +D + ++
Sbjct: 149 RCAACLVYGKHIAILPFHEN-----------------SKRIHS-YVIPLKQIDPRLDNIA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSS---FSVE 363
+LL +P P+GG LV G+NT+ Y +Q+ C L LN+ D + P +
Sbjct: 251 CSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS---CYDGFTKFPLKDLKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + ++ GDL LL ++ G V+ L+ SK + + +T
Sbjct: 308 LDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD--ALQDMV 479
F+GSRLGDS L+++T T + KRL+ + D A + +
Sbjct: 368 HLFVGSRLGDSQLLEYTLLKTTRDC----------------AVKRLKIDNKDPAAAEIEL 411
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ +++ LYG A +++ E ++ F D L N+GP+K G R N ++ +
Sbjct: 412 DEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVGPVKSMCVG-RPNYMSNDLVDA 470
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++LV G G V+ +S R SS + ++E H YLI+
Sbjct: 471 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQLWAVGRKENESHKYLIV 530
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
S R+ ++ E + T+AAG L +QV A + DG M
Sbjct: 531 S-RVRSTLILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 588
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDP 687
Q++ N V+ SI DPYV L +G + L LV +P
Sbjct: 589 QEVHI----------DSNFPVIQASIVDPYVALLTQNGRLLLYELVMEP 627
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 57/262 (21%), Positives = 111/262 (42%), Gaps = 34/262 (12%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD-TYMREALKDSETEINSSSEEGTGQ 813
+V +E+G L I +P V+ + +F + +VD T E + ++ E
Sbjct: 777 IVSHENGRLSIHSLPEMEVVYQIGRFSNVPELLVDLTVEEEEKERKAKAQQAAKEASVPT 836
Query: 814 GRKENIHSM------KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
E +++ +V+E + + + P L AI+ D ++ Y+ + S
Sbjct: 837 DEAEQLNTEMKQLCERVLEAQIVGMGINQAHPILMAIV-DEQVVLYEMF---------SS 886
Query: 868 DDPVSTSRSLSVSNV-------SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
+P+ +S + ++S L N R P + + +G I F+ +
Sbjct: 887 SNPIPGHLGISFRKLPHFICLRTSSHL-NSDGKRAPFEM----KINNGKRFSLIHPFERV 941
Query: 921 SG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QG 976
S + G + G+ P +V+ ++ H DG I AFT +N N HG +Y+T +
Sbjct: 942 SSVNNGVMIVGAVPTL-LVYGAWGGMQTHQMTVDGPIKAFTPFNNENVLHGIVYMTQHKS 1000
Query: 977 ILKICQLPSGSTYDNYWPVQKV 998
L+I ++ Y+ +PV+K+
Sbjct: 1001 ELRIARMHPDFDYEMPYPVKKI 1022
>gi|260835073|ref|XP_002612534.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
gi|229297911|gb|EEN68543.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
Length = 318
Score = 184 bits (468), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 107/297 (36%), Positives = 168/297 (56%), Gaps = 38/297 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LVV + +Y ++ E S++ K +ELV + ++GN+ S+
Sbjct: 29 SLVVAGTTQLHVYRLKGDMEKSRKQK------------------MELVASFSMYGNIMSV 70
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G+D RD+++L+F DAK+S++E+D H L+ SMH FE E +K G S
Sbjct: 71 ESVQLAGSD----RDALLLSFMDAKLSIVEYDPGTHDLKTASMHYFEEEE---VKDGYVS 123
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+GRC +L+YG ++++L + G+ DE +G S I +++I
Sbjct: 124 NYHAPMVRVDPEGRCAVMLIYGKRLVVLPFRKEGAV---DEAEMSAGSKSS--ILPTYMI 178
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L+DLD + +V D F+HGY +P ++IL+E TW GRV+ + TC I A+S++ +
Sbjct: 179 KLQDLDERLINVVDLQFLHGYFDPTLLILYEPLQTWPGRVAVRQDTCCIVAVSLNIAQRV 238
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
HP+IWS NLP D + +AVP PIGGVLV N++ Y +QS Y VSL+S
Sbjct: 239 HPIIWSVGNLPFDCKQAVAVPKPIGGVLVFAVNSLLYLNQSVP------PYGVSLNS 289
>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Metaseiulus occidentalis]
Length = 1500
Score = 183 bits (465), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 199/404 (49%), Gaps = 61/404 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS----AASLELVCHYRLHGN 112
NLVV VI++Y R++ DG++ A LE + GN
Sbjct: 29 NLVVAGGTVIKVY--------------------RLVCDGLNETDDKAKLEHQQTFNCFGN 68
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ + + + RDS++ F++ KIS++E+D + H L+ ++ E E+ K
Sbjct: 69 ISGMEKIRLNAS-----RDSLLFVFKETKISLVEYDPATHELQTLAIRSLEKEEY---KE 120
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFS 227
G +F L+KVDP RC VL+YG + I+ A+ + + T + GF
Sbjct: 121 GFYNFVGNTLIKVDPLNRCAAVLIYGKHLAIIPFVKKDATDLSDPIASSKSTQTNTSGFL 180
Query: 228 ARIESSHVINLRDLD----MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
+ I L DLD + ++ D F++GY EP +++L+E TW GRV+ + TC I
Sbjct: 181 ----EYYTIRLIDLDEEKGVNNIHDMTFLNGYYEPTLLLLYEPIRTWTGRVAIRQDTCSI 236
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
ALS++ + HP +WS LP +++K+L VP PIGGVL++ N + Y +QS
Sbjct: 237 MALSLNVYQRVHPPVWSFSGLPFNSFKVLPVPKPIGGVLILSVNALLYLNQSVPA----- 291
Query: 344 NYAVSLDSSQELPRSSFSVE--------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
Y VSL+ E +SF ++ LD +L LLS GDL +L++ DG
Sbjct: 292 -YGVSLNCFTEC-STSFPLKDQAGPPLTLDCCRCEFLSETKILLSVANGDLYVLSLFTDG 349
Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
R + + + K + + + I+ F+GSR+G+SLL+++T
Sbjct: 350 MRSINQFEFKKIATTTVATCISLCEPGYLFVGSRIGNSLLLRYT 393
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 140/522 (26%), Positives = 217/522 (41%), Gaps = 114/522 (21%)
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
ELPGC +WTV S+R + D ++ H +LI+S TM+L+T + E+ S
Sbjct: 603 ELPGCTDLWTVRSSSTRSPDVD--------EDSHQFLILSRPDSTMILQTGQEINELDHS 654
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
+ Q TI AGNL R +IQV R+L+G Q + S ++
Sbjct: 655 -GFCTQSPTIFAGNLADGRYIIQVCPNSVRLLEGVKQLQQVPI----------DVGSPLV 703
Query: 663 SVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTPAAIESSKKPVSSCTLYHD---- 715
S SIAD +VL+ DG + L GD +T +SV P +K +++ +Y D
Sbjct: 704 SASIADLHVLVMSQDGLVIQLTLRGDDTTGYKLSVLKPQ-FPGAKSKITALCIYKDVSGL 762
Query: 716 -----KGPEPWLR-KTSTDAWLSTGV------------------GEAIDGAD--GGPLDQ 749
+ PE + KT + T V G ++D D G L+
Sbjct: 763 FVTKIQKPEDIAKPKTEAKTKVKTEVAKKVLRSADFDDEDELLYGSSVDIKDLVAGGLNA 822
Query: 750 GDI-----------------------------YSVVCYESGALEIFDVPNFNCVFTVDKF 780
+I + + E+GALEI+ P++ + V F
Sbjct: 823 ANIVPTTQTKDTAEEEDYEENVRKIAPVEPTFWVFLARENGALEIYSFPDYKLRYFVKNF 882
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+ + ++ A +T S+SE KV+E+ + H SRP
Sbjct: 883 -----PLCNKILQNAAATGQTTSASTSEAQLP----------KVMEIFVCALGMHQSRPL 927
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
LFA + D + Y+AY F ++ + RL++ + P Y
Sbjct: 928 LFARV-DSELHIYEAYPF--------------VNQKEGHLKLQFRRLQH-AVTMEPRRVY 971
Query: 901 TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTV 959
++E + I F+++ G+ G F+ G RP W + R LR HP L DG I +F
Sbjct: 972 KQKEGDPTLSLRWIRAFQDVCGYNGVFVCGRRPHWIFLTARGELRAHPMLNDGRIYSFAT 1031
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVFF 1001
HNVNC GF++ G L+IC LPS YD WP++K+ +
Sbjct: 1032 FHNVNCEKGFLFFNKYGELRICALPSYLNYDAPWPMRKIPIY 1073
>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1223
Score = 182 bits (461), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 141/456 (30%), Positives = 221/456 (48%), Gaps = 60/456 (13%)
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
RV+ + TC I ALS++ K HP+IWS +LP+D ++ AVP PIGGVL++ N++ Y
Sbjct: 11 RVAVRQDTCSIVALSLNMAQKVHPIIWSQSSLPYDCMQVQAVPKPIGGVLILAVNSLLYL 70
Query: 333 SQSASCALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGD 385
+QS + Y VSL+S S P + + +D AT++ D LS K G+
Sbjct: 71 NQS------IPPYGVSLNSLTDWSTAFPLKTQEGVKLSMDCTQATFISYDRLALSLKDGE 124
Query: 386 LVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
+ +LT++ DG R V+ L K SVLT+ I +G+ FLGSRLG+SLL+++T +
Sbjct: 125 IYVLTLLVDGMRSVRGFHLDKAAASVLTTCICPMGDGFLFLGSRLGNSLLLKYTEKVSET 184
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----MVNGEELSLYGSASNNTESAQKT 500
+ K E + P+ K +SD + + + +EL +YG T + +
Sbjct: 185 SPTDASKTEEPKPGEEPPTKKMRSDDASDWMASDTKFLDDPDELEVYGKQVQKTGTQLTS 244
Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELVE---- 543
+SF + DSL+NIGP + G + N D + +G K +++
Sbjct: 245 YSFEICDSLLNIGPCGNMIMGEPAFLSEEFQGNVDPDLELVTTSGYGKNGALSVLQRTIR 304
Query: 544 --------LPGCKGIWTVYHKSSRGHNAD----SSRMAAYDDEYHAYLIISLEARTMVLE 591
LPGC +WTV KS + AD S + D + HA+LI+S + +MVL+
Sbjct: 305 PQVVTTFNLPGCLDMWTV--KSLKEAKADEKSEESEASPEDKDRHAFLILSKQDSSMVLQ 362
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
T +TEV + Q TI A N+ R ++QV + +++G Q +
Sbjct: 363 TGQEITEVAAG-GFSTQAPTIFASNMGDDRYIVQVMNKSICLMEGVEQIQHMVL------ 415
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
S + S+ADPY+LL +G L+ P
Sbjct: 416 ----DVGSPIKQCSLADPYLLLLTENGDPILMTLKP 447
Score = 106 bits (265), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 117/257 (45%), Gaps = 32/257 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V C E+G LE++ +P+ F V F G +VD S S TG
Sbjct: 560 WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 604
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+++ +V E+ + + ++ A++ D I+ Y+A+ P NT + +
Sbjct: 605 DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 659
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPL-DAYTREETPHGAP---------CQRITIFKNISG 922
R + + + + R S+ P + T+ ET A R+ F N+
Sbjct: 660 V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKPETETEAESDTKTTSRRVNRLREFHNVQT 718
Query: 923 HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+SGS P W V R LR HP DG+I F HNVNC +GF+Y + L+IC
Sbjct: 719 YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 778
Query: 982 QLPSGSTYDNYWPVQKV 998
LPS +YD WPV+KV
Sbjct: 779 VLPSHLSYDAPWPVRKV 795
>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
Length = 990
Score = 182 bits (461), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 167/640 (26%), Positives = 276/640 (43%), Gaps = 83/640 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + G+ L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F + LD T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + T
Sbjct: 333 TDATNSVKSLELKFQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------- 384
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDS 508
++ AP +L + +A +D E+ LYG +SA++ + V D
Sbjct: 385 -----LDESAPKKIKL-NTELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDK 434
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G ++ +S R SS
Sbjct: 435 LLNVGPCKKITGGCPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSS 494
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 495 SIEGVVQYWAVGRREDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGEL 553
Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
+QV ++ Q + V S SI DPY+ + +
Sbjct: 554 ADGGLAVQVTTSSLVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQN 603
Query: 678 GSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
G + + L P + + P++S ++Y D
Sbjct: 604 GRLLMYELTSHPHVHLKEIDISKRLRHETSPITSLSIYRD 643
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 62/250 (24%), Positives = 111/250 (44%), Gaps = 29/250 (11%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEEGTG 812
E+G + I+ +P + V+ V K +H+ D + D E + S++ T
Sbjct: 733 ENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMTDTF 788
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+ E + ++EL M + RP LF +L D T+ Y+ + + N
Sbjct: 789 AAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGHLAV 839
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGFFLS 929
+ L + V+ R+ RF LD E+ A + + F+ I G F+
Sbjct: 840 RFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFIC 893
Query: 930 GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGST 988
S PC + R+HP DG I++FT +N C +GFIY+T + ++++ +LP+
Sbjct: 894 SSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLPNDMI 953
Query: 989 YDNYWPVQKV 998
D +PV+++
Sbjct: 954 LDTSYPVKRI 963
>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
Length = 1105
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 187/727 (25%), Positives = 314/727 (43%), Gaps = 166/727 (22%)
Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T +
Sbjct: 2 VLTLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT----EKLQ 57
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSD--------ALQDMVNGEELSLYGS-ASNNTESA 497
+ ++E + E + + +R S+ L D ++ E+ +YGS A + T+ A
Sbjct: 58 ETPMEEGKENEEKEKEPPNKKKRVDSNWAGCPKKGNLPDELD--EIEVYGSEAQSGTQLA 115
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV-- 542
T+SF V DS++NIGP S G + N + +G K ++
Sbjct: 116 --TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQK 173
Query: 543 ----------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLI 580
ELPGC +WTV + + G + + + D + H +LI
Sbjct: 174 SIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLI 233
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
+S E TM+L+T + E+ S + QG T+ AGN+ + +IQV G R+L+G
Sbjct: 234 LSREDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---V 289
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------------RLLVG 685
L F P + S ++ S+ADPYV++ ++G + RL +
Sbjct: 290 NQLHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQ 342
Query: 686 DPSTCT------------------------------VSVQTPAAIESSKKPVSSCT---- 711
P T ++++T + E+ + +S+
Sbjct: 343 KPQIHTQSRVITLCAYRDVSGMFTTENKVSFLAKEEIAIRTNSETETIIQDISNTVDDEE 402
Query: 712 --LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVP 769
LY + P K + + G + + ++ E+G +EI+ +P
Sbjct: 403 EMLYGESNPLTSPNKEESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLP 462
Query: 770 NFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAM 829
++ VF V F G+ +VD+ + S T+ EE T QG +I +K E+A+
Sbjct: 463 DWRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVAL 513
Query: 830 QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN 889
HSRP+L A + + +L Y+A+ ++ + + SN+
Sbjct: 514 VSLGYSHSRPYLLAHV-EQELLIYEAFPYD---------------QQQAQSNL------K 551
Query: 890 LRFSRTPLDAYTREET--------PHG---------APCQRITIFKNISGHQGFFLSGSR 932
+RF + P + RE+ P G R F++ISG+ G F+ G
Sbjct: 552 VRFKKMPHNINYREKKVKVRKDKKPEGQGEDSLGVKGRVARFRYFQDISGYSGVFICGPS 611
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R +R+HP DG+I +F+ HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 612 PHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDA 671
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 672 PWPVRKI 678
>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
Length = 1448
Score = 181 bits (458), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 155/578 (26%), Positives = 263/578 (45%), Gaps = 86/578 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++AF+DAK+S++ ++ ++ S+H FE+ +L+ G ++ P+V+ DP
Sbjct: 92 QDSILMAFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFINYFHPPIVRTDPSN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + ++ S++I L+ +D + +V
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL +P P+GG LV G+NTI Y +Q+ C + LN+ D + P +
Sbjct: 251 CTSLLPIPKPLGGALVFGSNTIVYLNQAVPPCGVVLNS---CYDGFTKFPLKDMKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD A + ++++ + + G L LL +V G V+ ++ S+ + + +T
Sbjct: 308 LDCATSVYMEDGRIAVGGRDGVLYLLRLVTSSGGATVKSMEFSRVWETSIAYCLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
F+GSRLGDS LV++T T S ++++ G+IE D
Sbjct: 368 HLFIGSRLGDSQLVEYTLLKMTKESAKRQKIEKDPGEIELDE------------------ 409
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+++ LYG A +++ E ++ F D L N+GP+K +G R N +S
Sbjct: 410 --DDMELYGGAIEMQLNDDEEQILESLEFRELDRLRNVGPVKSMCFG-RPNYMSSDLAEM 466
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++LV G G V+ +S R SS + ++E H YLI+
Sbjct: 467 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSILEGAEQLWAVGRKENESHKYLIV 526
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
S R+ ++ E + T+AAG L +QV A + DG M
Sbjct: 527 S-RVRSTLVLELGEELVELEEQLFVTNEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 584
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
Q++ N V+ SI DPYV + +G
Sbjct: 585 QEVHI----------DSNFPVVQASIQDPYVAVLTQNG 612
Score = 59.3 bits (142), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 121/289 (41%), Gaps = 26/289 (8%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
R+ DA +S+ GE D +D YS +V +++G L I +P+ V+
Sbjct: 741 RRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLMVAHDNGRLSIHSLPDMELVYQ 795
Query: 777 VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR----W 832
+ +F + ++D E K+ + + ++++ + K+ E M+
Sbjct: 796 IGRFSNVPELLMDMTTDEEEKERKAKAQQAAKDTAADEDQLTTEMKKLCERVMEAQIVGM 855
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ S P L AI+ D ++ Y+ + P+ ++ + + S N
Sbjct: 856 GINQSHPVLMAIV-DEQVVMYEMFSHYNPQAGHLG---IAFRKLPHFICLRTSSHLNSDG 911
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR-ERLRVHPQLC 950
R P + E +G I F+ IS + G + G+ P + ++ H
Sbjct: 912 KRAPFEM----EVENGKRYTLIHPFERISSINNGVMIGGAVPTLVVYGAWGGMQTHQMTI 967
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKV 998
DG I AFT +N N HGF+Y+T Q L+I ++ Y+ +P++K+
Sbjct: 968 DGPIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYEMPYPMKKI 1016
>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
Length = 1468
Score = 179 bits (454), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 163/650 (25%), Positives = 277/650 (42%), Gaps = 97/650 (14%)
Query: 88 KRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
K+ ++ + LELV ++++G + ++ + DS++L F DAK+S+LE+
Sbjct: 87 KKGGMISDTTLGRLELVAQFKMNGIITTMGTVRTNSPRGREGCDSLLLGFSDAKMSLLEW 146
Query: 148 DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL---VKVDPQGRCGGVLVYGLQMIIL 204
S + + S+H +E E+ ++ F P + +DPQ RC Y ++ +L
Sbjct: 147 SSSTNSIITVSIHYYERDEF------KKEFLTNPYPSAIHIDPQQRCAVFNFYDNKLAVL 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
Q S + + G S +I+L LD +K+V D F+ Y EP + I
Sbjct: 201 PFRQ--SDKLDERQGEGEEDEEKWPYYPSFIIDLATLDSRIKNVIDMTFLSDYYEPTLAI 258
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
L + E TW GR+ T + +S+ T K +P+I+S LP+D +KL+A+P P+ G+L
Sbjct: 259 LFQPEQTWTGRLGNNKDTVSLVVISLDITAKIYPIIYSIDKLPYDCFKLVAMPKPVTGML 318
Query: 323 VVGANTIHYHSQ-SASCALALNNYAVSLDSSQELPRSSFS-------VELDAAHATWLQN 374
V+ AN+I + SQ S +A+N Y + + P + + L+ A A
Sbjct: 319 VIAANSILHVSQGSPGMGVAVNGYT---KKTTDFPGMIYEPSLIELGLSLEGAKALAFGG 375
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN---------------PSVLTSDITTIG 419
D L+ + G L+ V DG V + +S+ P +L S + +
Sbjct: 376 DRCLIFMQNGHWALVEVRRDGNKVVGMAISEIKHDLPVMEKKPPRFDTPPLLASVPSCVT 435
Query: 420 N----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
N FFLGSR+GDSLL+++ + D + + D +
Sbjct: 436 NVKAGEYFFLGSRVGDSLLIKYDANRVNHQSVAPPVFRVCDTMLNTGPIVDMAVGDVDTV 495
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
+ + +L L S+ + A F +I P F++ D+ A
Sbjct: 496 EQQEDWPQLELVSSSGHGKNGALCVFQ-------RHIYPQTSFAFH---QFDSQA----- 540
Query: 536 QSNYELVELPGCKGIWTVY-HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
IW++ K+ + N DD++ L IS T+VL D
Sbjct: 541 --------------IWSIKCRKNDQQQNE--------DDDFDKLLFISKSKSTLVLSAGD 578
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSES 652
L EV ++ +G TIA LF R++QV+ G +L +G + ++
Sbjct: 579 ELQEV--KTGFYTRGSTIAVSTLFDATRIVQVYATGVMVLTPEGKRI-----------QT 625
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-TVSVQTPAAIE 701
+ ++ SI DPY+LL + + I L GD ST + +Q P I+
Sbjct: 626 VPIPRGAKIVEASIHDPYILLTLDNNKILALQGDASTKDIIHIQLPNHIK 675
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 107/259 (41%), Gaps = 40/259 (15%)
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
+G L I+ +P+F F +F IVD DS G K I
Sbjct: 811 TGILRIYSLPDFKEHFACPQFSIAPDLIVD--------DS--------------GVKSRI 848
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ + E+ M P L I+ Y+A+ + + + S + V
Sbjct: 849 PTNNIQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYV 908
Query: 880 SNVSASRLRNLRFSRTPLDAYTREET--------------PHGAPCQR--ITIFKNISGH 923
S S+S + R +D + +T QR + F +++G+
Sbjct: 909 SRKSSSHESKPKKKRGIIDEFEIPDTDLNEEEEDLKLSTKKMDKKIQRKLLIPFTDVAGY 968
Query: 924 QGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQ 982
G F++G++P W M + +RVHP + IV FT HNVNC HGFI V S+ +++ +
Sbjct: 969 AGVFVAGAQPAWLMCSCKSFVRVHPMKTEHEIVGFTQFHNVNCQHGFITVDSKSTIQLSR 1028
Query: 983 LPS-GSTYDNYWPVQKVVF 1000
L + G YD W +QKV+
Sbjct: 1029 LRTEGINYDLDWVIQKVLL 1047
>gi|49619061|gb|AAT68115.1| cleavage and polyadenylation specificity factor 1 [Danio rerio]
Length = 312
Score = 179 bits (453), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 149/253 (58%), Gaps = 18/253 (7%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFR---NDTLADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G S++I++R+LD + ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKFSFLPSYIIDVRELDETLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVP- 294
Query: 339 ALALNNYAVSLDS 351
+ VSL+S
Sbjct: 295 -----PFGVSLNS 302
>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
Length = 1323
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 169/647 (26%), Positives = 278/647 (42%), Gaps = 96/647 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAIARISQNPDC---DSLLLGFDDAKLSIVAVNPADRCLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + + L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSTQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F ++ LD A T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + C S L
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDS 508
+TK+++ S+ + E+ LYG + ++ + V D
Sbjct: 389 -----------ATKKMKLSTEPNANE--EDEDFELYGEVLPKVAKPDVTEELLNIRVLDK 435
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G + +S R SS
Sbjct: 436 LLNVGPCKKITGGCPSVSAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 496 SIEGVVQYWAVGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554
Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
+QV G +I Y+ L+F V S SI DPY
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTF--------------PVRSASIVDPY 597
Query: 671 VLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ + +G + + L P + + P++S ++Y D
Sbjct: 598 IAICTQNGRLLMYELTNQPHVSLKEIDISKRLRHETSPITSLSIYRD 644
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 110/253 (43%), Gaps = 29/253 (11%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
+ E+G + I+ +P + V+ V K +H+ D + D E + S +
Sbjct: 731 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPVTGEGIDAMSGTMT 786
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
T + E + ++EL + + RP LF +L D T+ Y+ + + N
Sbjct: 787 DTFAVKPEEV----IMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQGH 837
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
+ L + V+ R+ RF T D E+ A + + F+ I G
Sbjct: 838 LAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 891
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQLPS 985
F+ S PC + R+HP DG I++FT +N C +GFIY+T + +++ +LPS
Sbjct: 892 FICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLPS 951
Query: 986 GSTYDNYWPVQKV 998
D +PV+++
Sbjct: 952 DMILDASYPVKRI 964
>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1418
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 224/969 (23%), Positives = 380/969 (39%), Gaps = 145/969 (14%)
Query: 99 ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
+ L LV Y L G V SLA + D D++++AF DAK+S++E+D + H L S
Sbjct: 81 SKLVLVAEYPLEGTVLSLARIK--ALDTKSGGDALLIAFRDAKMSLVEWDPANHALSTIS 138
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-GDE 217
+H +E E + + DP RC + + IL Q G LV GD+
Sbjct: 139 IHYYEGEELHGAPWDADLGHYHNFLAADPSSRCAALKFGARHLAILPFRQLGDDLVEGDD 198
Query: 218 ---------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+ +G +SS ++L +D + H F+H Y EP
Sbjct: 199 YDPDFDEPMDAPAAKEKATNGDVAQTPYKSSFALSLPQIDPALTHPVHLDFLHEYREPTF 258
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
I+ + A + + + ++ K + S LP+D +K++ +P P+GG
Sbjct: 259 GIISANKAAAASLLYERRDLLTYTVFTLDLEEKASTALLSVAGLPYDTHKVIPLPLPVGG 318
Query: 321 VLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVA 377
L++GAN IH + A+A+N++A S +S ++ L+ A L +N
Sbjct: 319 ALLLGANQFIHVDQAGKTSAVAVNDFAKQCSSFPMSDQSELAMRLEGASIELLSPENGDL 378
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLG 430
L+ K G L +++ DGR V L + K + S T++G + F+GS G
Sbjct: 379 LVVLKDGSLAVISFKLDGRSVSGLSIRKISEEKGGHVVPTAASCTTSLGRNRMFIGSEDG 438
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
DS+L+ +T + LS K ++ AD D +G + SA
Sbjct: 439 DSVLLGWT--KKAAQLSR--KRSHAEMLADDAELSFDEEDLEDDDDLYGDGPSTAKTASA 494
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN-YELV------- 542
S+ S ++F + D ++++ P+KD + D + + + ++ +LV
Sbjct: 495 SSEA-SDPSNYTFRIHDIMLSLAPIKDVALASHKVTDTAIGTLERAADQLDLVVSTGRGA 553
Query: 543 -------------------ELPGCKGIWTVYHK--SSRGHNADSSRMA----AYDDEYHA 577
E + +W+V+ K + +G A S+ A A D +Y
Sbjct: 554 AGGLALMRREIDPVILRKGEFSNARAVWSVHAKKPAPKGMVAAGSQDAEAKLAADVDYDQ 613
Query: 578 YLIISL------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+LI+S E + TA E + TI G + G R++QV +
Sbjct: 614 FLIVSRSNGDGGEESAIFNITATGFEETNKGDFEREDAATINVGTIAGGTRIVQVLKAEI 673
Query: 632 RILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC 690
R D + Q L P E+GS ++S S ADPY+L+ D S+ +L D +
Sbjct: 674 RSYDSELGLDQIL---PMEDENGS---ELRIISASFADPYILVIRDDSSVIVLQADANGE 727
Query: 691 TVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
+ + S+K WLS + ++ +
Sbjct: 728 MEEIDRGDTLLSTK-------------------------WLSGCIHQSQSTGEKA----- 757
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ + G L IF++P+ + V + ++ L T SSS
Sbjct: 758 --LAYLLSAEGGLHIFELPDLSKPVYVAASLG--------FLPPTLTADFTPRRSSS--- 804
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
K + + V EL + + P+L + ++ YQ Y F E
Sbjct: 805 -----KAALTEVIVAELG----DSTYKTPYLIVRTSSNDLVIYQPYHFPAHEVVKP---- 851
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
+L + RL FS P A E+T G TI N+ G+ F++G
Sbjct: 852 --FFENLRWLKIPQPRLPE--FSEEP--ALESEDTGIGKESILTTI-ANVGGYSAVFMAG 904
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
+ P + + L ++ S+ + H C+ GF Y+ + G L++CQLP G Y
Sbjct: 905 TSPSFILKESSSLPRVIKMRTKSVKNLSSFHRAECDRGFAYINADGNLRVCQLPRGYRYG 964
Query: 990 DNYWPVQKV 998
D W V+K+
Sbjct: 965 DAGWAVKKI 973
>gi|291232724|ref|XP_002736306.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 304
Score = 173 bits (439), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 175/354 (49%), Gaps = 62/354 (17%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTGI +C G EE NL++
Sbjct: 2 YALYRQIHPPTGIEHCVYGH-------------FFSKEE--------------KNLIIAG 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
A + +Y + + + SK+ K+ E R + L GN+ SL
Sbjct: 35 ATDLHVYRL-LSDVDSKQKKSKLEHLRS----------------FSLFGNIMSLQTTRLA 77
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
GA RD+++L+F+DAK+SV+E+D H L+ S+H FE LK G S P
Sbjct: 78 GAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEEA---LKEGYVSNYYIPQ 130
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
V VDP RC +L+YG ++++L + G+ D+D G S+ + S++INL+D+D
Sbjct: 131 VVVDPDNRCAVMLMYGSKLVVLPFRREGAA--EDQDGVLPGSSKSSFL-PSYIINLQDID 187
Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
K ++ D F+HGY EP + IL E TW GRV+ + TC I A+S++ + HP+IWS
Sbjct: 188 QKLINIIDIKFLHGYYEPTLFILFEPLRTWPGRVAVRKDTCCIVAISLNIEQRVHPVIWS 247
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
NLP D K + VP PIGGVLV +++ Y +QS Y VSL+ E
Sbjct: 248 LNNLPFDCIKAIPVPKPIGGVLVFAVDSLLYLNQSVP------PYGVSLNGLTE 295
>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
Length = 1468
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 234/1077 (21%), Positives = 412/1077 (38%), Gaps = 227/1077 (21%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVL----------------MDGI 96
N++V ++++I+ E ++K G+ RR+L
Sbjct: 29 NVLVAKTSLLQIFTTTTYETELNSALADAKQPGDIDRRILDADEEQTFAADIALQRSQVE 88
Query: 97 SAASLELVCHYRLHGNV---ESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
S L LV Y L G+V + + +LS GG ++++ +F+DAK S++E+D
Sbjct: 89 SVTKLVLVAEYPLSGSVTGLQRIKLLSTRSGG-------EAVLASFKDAKCSLMEWDPET 141
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGVLVYGLQMII 203
+ + S+H +E RE F P+V DP RC + G + I
Sbjct: 142 NSITTISLHYYE----------REEFC-SPVVSDGLPTELVADPGSRCAALRFSGDMLAI 190
Query: 204 LKASQ------------------------------------GGSGLVGDED--TFGSGGG 225
+ Q G ++G+ D T + G
Sbjct: 191 IPFRQREDEELSLGRGDADEVMGDEDGDNDDWDPEMAGTARGEDTIMGEGDVKTTDATEG 250
Query: 226 FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
S V+++ LD + HV F+H Y EP IL+ TW G ++ + I
Sbjct: 251 KDRPYHPSFVLSVSQLDDAISHVISLTFLHEYREPTFGILYSPRRTWTGLLAAEGRKDTI 310
Query: 284 SALSISTTLKQHP--LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
S + I+ L+Q I S LP+D +K++ + P GG L+VG N IH + +
Sbjct: 311 SYIVITLDLEQKASTPILSVSGLPYDIFKVVPLAPPTGGSLLVGGNELIHVDQAGKTTGV 370
Query: 341 ALNNYAVSLDSSQELP-RSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
A+N + L +S +EL+ + L+++ LL TK G+ V++ DGR
Sbjct: 371 AVNPFCRRSTGFAGLADQSDLCLELEGSQVVELESEGGDMLLFTKRGEGVIVGFRMDGRN 430
Query: 398 VQRLDLSKTNP---SVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
V + ++K N S++ ++T +G F+G GD+ ++++ G+K
Sbjct: 431 VSGVKITKLNNHPGSIVGGRVSTAVGLGGRRLFVGCIEGDARVLKWRRKGERKKAGEGIK 490
Query: 452 EE----------FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
EE +G +E SS + NG N+ +Q +
Sbjct: 491 EEVLENEDEDDVYGALEDMDDDLYGGGGDSSFRKDSLTNGRR--------NSEAKSQGEY 542
Query: 502 SFAVRDSLVNIGPLKDFSYG------------------LRINADASATGISKQSNYELV- 542
F D L N+GP +D + G L + + + S+ S ++
Sbjct: 543 IFQTHDRLTNLGPFRDITLGKPTFPEESRERQKGVSPELELVTTSGPSNTSEDSGISIIR 602
Query: 543 -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE-----YHAYLIISLEAR 586
+ P C+ +WTV +S+ NA DD + +L ++
Sbjct: 603 KSISPTIVGRFDFPQCQALWTVRARSANTSNAAVGLGGEEDDRSVEESFDRFLFVTKNDE 662
Query: 587 TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
+ V D EV D+ +G TI G + R++QV R+ D +
Sbjct: 663 SQVFRVGDTFEEV-RGTDFESEGETIEVGVVGNGMRIVQVVSEQVRVYDCDLQLSQII-- 719
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
P E +G E V + DPY+LL DGS + D + ++ + AI+ K
Sbjct: 720 PMFDEE-TGEEGPNVHRARVCDPYILLIKVDGSPAVYKMDSTNLELAEERADAIKFDKYQ 778
Query: 707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ-GDIYSVVCYESGALEI 765
S C K G+ +D P++ D + G L+I
Sbjct: 779 -SGCIYASTK-----------------GIFIPLD----APVENVKDYLLFLLTVEGGLQI 816
Query: 766 FDVPN-FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKV 824
+D+ N +F+ + F +T D+ T ++ E+ K+ I + V
Sbjct: 817 YDLSNPVTPLFSAESF--------NTLYPLLRTDNPTSPTANREK---HRSKQLIIEILV 865
Query: 825 VELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS--KSDDPVSTSRSLSVSNV 882
++ + P+L A ++ + Y+ ++ P KS +P S LS+S
Sbjct: 866 ADMG----DSIFKEPYLIARSSNNDLTFYKPFISSSPSTLRFIKSPNPHIASNELSLSAG 921
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRE 941
+ + R L T N++G+ FL G+ P + + +
Sbjct: 922 TKNIFRPL------------------------TAVYNLAGYSAVFLPGADPSFVIKTAKS 957
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
R+H +L + + + H+ + GF+YV S GI+++ +P+ T+D W +KV
Sbjct: 958 SPRIH-KLAGTGVRSLSSFHSAGADRGFVYVDSLGIVRVALMPAEFTFDGNWGYKKV 1013
>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Pongo abelii]
Length = 1388
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 102/269 (37%), Positives = 152/269 (56%), Gaps = 29/269 (10%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
T K HP+IWS +LP D + LAVP PI
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPI 275
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 167/347 (48%), Gaps = 56/347 (16%)
Query: 379 LSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+S K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++
Sbjct: 285 ISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY 344
Query: 438 TCG----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASN 492
T +++ + KEE + +T + QD V+ E+ +YGS A +
Sbjct: 345 TEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVD--EIEVYGSEAQS 402
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASA 530
T+ A T+SF V DS++NIGP + + G L I + +
Sbjct: 403 GTQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGAL 460
Query: 531 TGISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHA 577
+ + K ++V ELPGC +WTV + G + S A DD H
Sbjct: 461 SVLQKSIRPQVVTTFELPGCYDMWTVIAPLRKEEEDNPKGEGTEQEPSTPEADDDGRRHG 520
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 521 FLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG- 578
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 579 --VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 110/251 (43%), Gaps = 49/251 (19%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 731 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 787 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
L N+ +RF + P + RE T GA + R
Sbjct: 830 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFR 879
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG + +F HNVNC GF+Y
Sbjct: 880 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNR 939
Query: 975 QGILKICQLPS 985
Q ++ PS
Sbjct: 940 QEPQRLSGSPS 950
>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
C-169]
Length = 1137
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 148/477 (31%), Positives = 227/477 (47%), Gaps = 68/477 (14%)
Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
S + +S+++ L L + V+D +F+H Y EPV+++LHE + +W G++ T ++
Sbjct: 18 ALSTTVGNSYMLKLAKLGISEVRDAVFLHRYSEPVLLVLHETKPSWGGQLRNSKDTMEVT 77
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A S++ K+H +WS NLP DA+KL+ VP GG LV+ N + Y SQ A+ A A
Sbjct: 78 AFSLNVAHKRHTRLWSIGNLPSDAFKLIEVPG--GGGLVICQNLLIYVSQEAAAAAASGA 135
Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
F ++L WL ++ LL +G L+L+ V +G +RL +S
Sbjct: 136 PRA----------EGFELDLTDCSGAWLADNSLLLGLASGQLILVNVQLEGS--KRLKVS 183
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
K + S + +G L FLGS + +SLL++ G ++L G +E+ EADA
Sbjct: 184 KAQGAPPPSCMCRLGPELLFLGSWVANSLLIR-AVPEGQTLLLGGPEEQAS--EADATHA 240
Query: 465 KRLRRSSSDALQDMVN--GEELSLY-----GSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ R DA D+ N +E+SL A +T A K +S V DSLV+IG ++D
Sbjct: 241 SKRPRLDPDA-ADLGNEDEDEVSLIYRTDAQPALPSTTGASK-YSLQVVDSLVSIGIVQD 298
Query: 518 FSYGLRINADASATGISKQSN-------------------------YEL---VELPGCKG 549
G + A ++K EL V LPG
Sbjct: 299 LVTG-EASTSAPQEWVAKTERGPPKLLAAVGSDKFGAVAVLRSSLVPELVTEVPLPGVDQ 357
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES-VDYFVQ 608
+W V H G D S + YHA+L ++ ++ T VL T + L E S VD+ +
Sbjct: 358 MWAV-HFQPEGLPVDDSLL------YHAFLFLNEKSGTKVLRTGEELDETDSSQVDFILS 410
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
RT+ AGNL G R++QV RG +L GS QDL + G N+T+++ S
Sbjct: 411 SRTVFAGNLLGNSRIVQVHARGVVLLSGSSRVQDLPV-----QDLIGVSNTTIVAAS 462
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 67/169 (39%), Gaps = 31/169 (18%)
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
PFL +L DGT L Y+A F P V RL + P
Sbjct: 553 PFLLLLLADGTFLAYRA--FHTPRG-----------------RVCFKRLSLPAHAHCPPQ 593
Query: 899 AYTREETPHGAPCQRITIFKNISGHQ-----GFFLSGSRPCWCMVFRERLRVHPQLCDGS 953
+ T AP +T F + + G F+SG RP W + R L H +G
Sbjct: 594 DRRSKTT---APSSSMTRFDGLGESKEHVNSGMFVSGERPLWLVASRGTLVAHAMDVEGR 650
Query: 954 IVAFTVLHNVNCNHGFIYV----TSQGILKICQLPSGSTYDNYWPVQKV 998
+ T HN+NC GFI LKICQLP + D WP+QK+
Sbjct: 651 VSGMTPFHNINCPLGFITACMAENDGETLKICQLPMRTRLDTPWPLQKI 699
>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1
[Clonorchis sinensis]
Length = 1741
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 141/482 (29%), Positives = 225/482 (46%), Gaps = 67/482 (13%)
Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
R DS++L+F +AK++V+ FD + L+ S+H +E + +LK GR F+ P+++VDP
Sbjct: 27 RLDSLLLSFTEAKVAVMGFDPVQYELKTLSLHNYE---FENLKSGRTHFSHLPILRVDPL 83
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDE--------DTFGSGGGFS------ARIESSH 234
RC VLVY + +L + + GD+ +T G S A + ++
Sbjct: 84 QRCAVVLVYDRHLAVLPFRRSEALAAGDKYLAKPVTNNTARGAGSLSWERRATAPLLATF 143
Query: 235 VINLRD---LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
L + +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 144 TTCLSSSTGEKINNVLDMQFLNGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQ 203
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---V 347
+ +P+IW +LP+D + +VP PIGGVL++ N+I Y Q+ SC L LN YA
Sbjct: 204 KRTNPVIWFQESLPYDCTYVHSVPEPIGGVLILATNSIIYMKQTLPSCGLPLNCYAQVTT 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT--VVYDGRVVQRLDLSK 405
+ Q++P+ + LD + + L+ T+TG + LL+ V + + V L L +
Sbjct: 264 NFPMRQDVPQCG-PLTLDGCRIVTMTDSQFLIVTRTGKMCLLSLWVEHTTQTVSSLLLHE 322
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS------------GTSMLSSGLKEE 453
SV + + F+GSRL DS+L+ T + + + + +
Sbjct: 323 IGCSVPPYSVALLDKGYVFVGSRLCDSVLLHLTASTMFVNTLGRIVDLDETTTADNFRTD 382
Query: 454 FGDIEADA---------PSTKRLRRSSSDALQD----MVNGE------ELSLYGSASNNT 494
IE DA P+ K SS +V+G ++ LYG N
Sbjct: 383 IPMIERDAESIPVDKNNPTEKEAENVSSGTPSKPSGSIVHGPYVFDEVDVELYGDTILNP 442
Query: 495 ESAQK---TFSFAVRDSLVNIGPL-----KDFSYGLRINADASATG-ISKQSNYELVELP 545
S + T+ F V D LVN GP+ + Y N D + I+ Q+ VEL
Sbjct: 443 PSDVRELNTYKFEVADRLVNFGPMGLLTSGEVPYLAPGNTDPTDEALIAAQAEMHHVELL 502
Query: 546 GC 547
C
Sbjct: 503 AC 504
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 114/272 (41%), Gaps = 30/272 (11%)
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
D+ ++ + + +G LEI+ +P+F ++ V F +VD A + ++ E+N +
Sbjct: 1007 DKSRYFAFIVFTNGVLEIYSLPDFTLLYEVHHFSDLPAMLVDC---RAGQGNKVEVNLEN 1063
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
++NI V+E+ + + RP L + T I ++A S +
Sbjct: 1064 IPNCPAAEEDNIPP-TVLEITVFPIGRNRDRPVLL-VRTSQEIAFFEALC------PSHN 1115
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET-PHGAPCQRITI--------FK 918
+ S S S + R R L PL A R T P A Q + F+
Sbjct: 1116 EAHPFASESWSQEGL---RWRRLPIP-CPLVAPRRVRTDPKIADVQSTMLTRKNLLRPFE 1171
Query: 919 NISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+I GH G F+ G+ P W +RV DG + +F L+ C GF+Y T
Sbjct: 1172 DIDGHCGVFVCGATPIWLFSSDTGHIRVFNHSIDGIMGSFAPLNTDICPSGFVYFTYSNE 1231
Query: 978 LKICQLPSGSTYDNY----W-PVQKVVFFLYF 1004
+++ L G ++ + W P++ +FL +
Sbjct: 1232 MRLATLLPGYSFKEHLGMRWVPLELTPYFLQY 1263
>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
Length = 1441
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 238/1041 (22%), Positives = 402/1041 (38%), Gaps = 191/1041 (18%)
Query: 57 NLVVTAANVIEIYVVRV--------QEEGSKESKNSGETKRRVLMD--GISAA------- 99
NLVV +++++I+ ++ Q+ ++N+G + R+ D G+ A+
Sbjct: 28 NLVVAKSSLLQIFRTKIVSTEIDASQQGSGARTRNAGRYESRLANDDDGLEASFLGGDSL 87
Query: 100 ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
L LV L G + LA + + N R D++++AF+DA++S++E+D
Sbjct: 88 AFKTDRTNNTKLVLVSEISLSGTITGLAKIK---SQNLRSGGDALLVAFKDARLSLVEWD 144
Query: 149 DSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII 203
H L S+H +E E W +F + DP GRC + G+ + I
Sbjct: 145 AERHDLSTVSIHYYEQDELQGSPWAPPLSNFTNF-----LAADPGGRCAALKFGGMNLAI 199
Query: 204 LKASQG---------------GSGLVGDEDTFGSGGGF--SARIESSHVINLRDLD--MK 244
L Q G V E +GG S V+ L +LD +
Sbjct: 200 LPFKQADEDIDMDDDWDEDLDGPRPVKQEAAVVNGGSSIKETPYSPSFVLRLSNLDPSLL 259
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
H F+H Y EP IL + + + K H + ++ + I S L
Sbjct: 260 HPVHLAFLHEYREPTFGILAST-VNASNSLGRKDHLAYM-VFTLDLQQRASTTILSVPGL 317
Query: 305 PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
P D +++ +P+P+GG L+VGAN IH +A+N S +S ++
Sbjct: 318 PQDLFRVQPLPAPVGGALLVGANELIHIDQSGKPNGVAVNPLTKQCTSFGLSDQSDLNLR 377
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT-- 417
L+ L + L+ G + L+T DGR V LD+ S+T S++ ++T
Sbjct: 378 LEECTIDVLSAEELLVILSDGRMALVTFRIDGRTVSGLDVKLLPSETGGSLIPGRVSTLS 437
Query: 418 -IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDAL 475
IG S+ F GS GDSL+ +T S ++ G DI+ D
Sbjct: 438 RIGKSVMFAGSEEGDSLVFGWTKKQNQSGRKKSRLQDVGLDIDMADEEDLDEDEDEDDLY 497
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ 529
+ ++ ++ +ASN E +F + D L++I P++ +YG ++A S
Sbjct: 498 AEEPTPKQQAV-ATASNVKEG---DLTFRIHDRLLSIAPIQSMTYGQPVDAPGSEEEQNS 553
Query: 530 -----------ATGISKQSNYELV------------ELPGCKGIWTVYHKS--SRGHNAD 564
G +K S ++ E P +G WTV K + D
Sbjct: 554 AGVRSELQLVCGVGRNKSSAMAIMNLAIPPKVIGRFEFPEARGFWTVCAKKPVPKSLQGD 613
Query: 565 SSRMAAYDD-----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIA 613
A +D +Y ++I++ E + TA +T + G TI
Sbjct: 614 KGPGAIGNDYGTSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLTGTEFDPAAGFTIE 673
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AG + R+IQV + R DG + P E +T S SIADPY+L+
Sbjct: 674 AGTMGKDNRIIQVLKSEVRCYDGDLGLSQIV--PMMDEETGAEPRAT--SASIADPYLLI 729
Query: 674 GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
+D S+ + S+ +E +K DK +T WL+
Sbjct: 730 IRNDQSVFI---------ASIHDDNELEEVEK--------EDK-------TLATTKWLTG 765
Query: 734 GVGEAIDGADGGPLDQGD--------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
+ +G G + GD I + SGAL I+ +P+
Sbjct: 766 CLYTDTNGVFGE--ESGDKKAKLPESILMFLLSASGALYIYRLPDL-------------- 809
Query: 786 HIVDTYMREALKDSETEINS--SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
Y+ E L T +++ ++ +GT KE + + V +L P+L
Sbjct: 810 -CKPVYVAEGLSYIPTGLSADYAARKGTA---KETVSEILVADLG----DTTAKSPYLIL 861
Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
+ + Y+ Y ++ + ++L + S L +++P + E
Sbjct: 862 RHANDDLTMYEPYRYQLGAG-------LEFPKTLFFQKIPNSVL-----AKSPAEETDDE 909
Query: 904 ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
E H A C + NI G+ FL G P + + + + L ++ A + H
Sbjct: 910 EVTHQAKCLALRRCNNIGGYSTVFLPGPSPSFIIKSSKSMPKVLPLQGAAVTAISSFHTE 969
Query: 964 NCNHGFIYVTSQGILKICQLP 984
C HGFIY S I+++ QLP
Sbjct: 970 GCEHGFIYADSHNIVRVSQLP 990
>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
protein 1
Length = 1456
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 221/972 (22%), Positives = 388/972 (39%), Gaps = 144/972 (14%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + ++ + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ ++ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+ +
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSVFI 742
Query: 683 LVGDPSTCTVSVQTPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG 741
P + I +S K ++ C LY D ++ + VG+
Sbjct: 743 AEMSPKLLELEEVEKEDQILTSTKWLAGC-LYTD----------TSGVFADETVGKGT-- 789
Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
+ +I + SG L I+ +P+ V + +S Y+ L
Sbjct: 790 -------KDNILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPPGLS---- 830
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
+ ++ +GT KE++ + V +L H P+L + + YQ Y +
Sbjct: 831 -ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-- 880
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIF 917
+ + P S S + ++ N F++ P + ++ PH A P +R +
Sbjct: 881 ---ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS-- 928
Query: 918 KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
NISG+ FL GS P + + + L + A + H C HGFIY + GI
Sbjct: 929 -NISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGI 987
Query: 978 LKICQLPSGSTY 989
++ Q+P+ S+Y
Sbjct: 988 ARVTQIPTDSSY 999
>gi|147864212|emb|CAN80950.1| hypothetical protein VITISV_016701 [Vitis vinifera]
Length = 262
Score = 166 bits (421), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 89/148 (60%), Positives = 105/148 (70%), Gaps = 26/148 (17%)
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLV 510
+++ GDIE D PS KR RRSSSDALQDM N ++L LYG A N+TE++QKTFSF+V DSL+
Sbjct: 53 RKKVGDIEGDVPSAKRSRRSSSDALQDMFNSDKLPLYGLAPNSTETSQKTFSFSVSDSLI 112
Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYEL--------------------------VEL 544
N+GPLKDF+YGLRINAD ATGI KQSNYEL VEL
Sbjct: 113 NVGPLKDFAYGLRINADLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRPERITEVEL 172
Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYD 572
PGCKGIWTVYHK++RGHNADS +M + D
Sbjct: 173 PGCKGIWTVYHKNTRGHNADSIKMVSAD 200
>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1911
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 152/568 (26%), Positives = 253/568 (44%), Gaps = 123/568 (21%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+LD+K + DF F+ GY+EP ++ILHE + +GR + + T ++ LSI+
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
+ HP+IW NLP D ++++ PIGG L++ N I Y +Q+ + LN +A
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501
Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
+ + + LP +S S+ L H +L + LLS
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561
Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
L +L + RV + + T +L S +T N F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621
Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
L SGT ++S SG G D + +A ++L+ S D +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681
Query: 481 GEELSLYG-----SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGLR 523
++ LYG ++ + FSF+ D + IG + G++
Sbjct: 682 EDDAFLYGPTLSQESTGGAMPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGVQ 741
Query: 524 INADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNA 563
N+D++ + G SK + ++ EL GC+ +WTV SS +
Sbjct: 742 SNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPES 801
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+R Y +YLI+S+ RTM+L T + + + + ++ G T+ A NLF +RR+
Sbjct: 802 QVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRRI 855
Query: 624 IQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSEN 658
+QVF++G R++ + + TQ++ F + ESG + N
Sbjct: 856 VQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTAN 914
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGD 686
++SV DPY+LL ++DGSIRLL GD
Sbjct: 915 VGIVSVDTIDPYILLLLTDGSIRLLEGD 942
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 66/305 (21%), Positives = 123/305 (40%), Gaps = 64/305 (20%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
+CY G+L ++ VP+F + +++T E+ + S +T + S+ G
Sbjct: 1085 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1144
Query: 811 ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
T +GR H++ V ++A+ R H+S+ F L L +G +
Sbjct: 1145 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1204
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
+ Y+ P S D + + V+ +R ++ + +A T +E G+
Sbjct: 1205 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1260
Query: 910 P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
+T F N++ + G F G+ P W + + + L +
Sbjct: 1261 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1320
Query: 949 LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
D + +++FT H+ NC +GF+Y S G L++C+LPS N + +
Sbjct: 1321 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1380
Query: 996 QKVVF 1000
QKV F
Sbjct: 1381 QKVRF 1385
>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1912
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 153/569 (26%), Positives = 252/569 (44%), Gaps = 124/569 (21%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+LD+K + DF F+ GY+EP ++ILHE + +GR + + T ++ LSI+
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
+ HP+IW NLP D ++++ PIGG L++ N I Y +Q+ + LN +A
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501
Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
+ + + LP +S S+ L H +L + LLS
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561
Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
L +L + RV + + T +L S +T N F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621
Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
L SGT ++S SG G D + +A ++L+ S D +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681
Query: 481 GEELSLYG------SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGL 522
++ LYG S + FSF+ D + IG + G+
Sbjct: 682 EDDAFLYGPTLSQESTGGGKLPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGV 741
Query: 523 RINADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHN 562
+ N+D++ + G SK + ++ EL GC+ +WTV SS
Sbjct: 742 QSNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPE 801
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
+ +R Y +YLI+S+ RTM+L T + + + + ++ G T+ A NLF +RR
Sbjct: 802 SQVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRR 855
Query: 623 VIQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSE 657
++QVF++G R++ + + TQ++ F + ESG +
Sbjct: 856 IVQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTA 914
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
N ++SV DPY+LL ++DGSIRLL GD
Sbjct: 915 NVGIVSVDTIDPYILLLLTDGSIRLLEGD 943
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 66/305 (21%), Positives = 123/305 (40%), Gaps = 64/305 (20%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
+CY G+L ++ VP+F + +++T E+ + S +T + S+ G
Sbjct: 1086 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1145
Query: 811 ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
T +GR H++ V ++A+ R H+S+ F L L +G +
Sbjct: 1146 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1205
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
+ Y+ P S D + + V+ +R ++ + +A T +E G+
Sbjct: 1206 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1261
Query: 910 P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
+T F N++ + G F G+ P W + + + L +
Sbjct: 1262 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1321
Query: 949 LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
D + +++FT H+ NC +GF+Y S G L++C+LPS N + +
Sbjct: 1322 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1381
Query: 996 QKVVF 1000
QKV F
Sbjct: 1382 QKVRF 1386
>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1486
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 233/979 (23%), Positives = 391/979 (39%), Gaps = 159/979 (16%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 95 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVDRNT 154
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 155 LETISIHYYEKEELVGSPWVAPLHHYPTLLLADPASRCAALKFSERNLAILPFKQPDEDM 214
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 215 DMDNWDEELDGPRPKKDLSGAIANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 274
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 275 YRDPTIGVLSSTKTASNSLGHRDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 332
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 333 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 392
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---IGNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T+ +G S F
Sbjct: 393 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRVGRSTVF 452
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + K DI+ D
Sbjct: 453 VGSEEGDSVLLGWTRRQGQTQKR---KSRIQDIDLDLDLDDEDLEDDD-----------D 498
Query: 485 SLYGSASNNTE---SAQKT-----FSFAVRDSLVNIGPLKDFSYGLRINADAS------- 529
LYG S + E SA K +F + D L++I P++ +YG + S
Sbjct: 499 DLYGEESTSPEQAISAAKAVKSGELNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSE 558
Query: 530 ----------ATGISKQSNYELV------------ELPGCKGIWTVYHKS--SRGHNADS 565
A G K S ++ E P +G WTV K + D
Sbjct: 559 GVRSDLQLVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDK 618
Query: 566 SRMAA-YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
M+ YD ++H ++I++ E + TA +T + G T+ AG
Sbjct: 619 GPMSNDYDTSGQHHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGT 678
Query: 617 LFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
+ R++QV + R DG ++Q + + E+G+ V + SIADP++LL
Sbjct: 679 MGKDCRILQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIR 733
Query: 676 SDGSIRLLVGDPSTCTV-SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
D S+ + P + V+ + + K ++ C LY D +TG
Sbjct: 734 DDFSVFVAEMSPKLLELDEVEKEDQMLTGTKWLAGC-LYTD----------------TTG 776
Query: 735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMRE 794
V A + A G D +I + SG L I+ +P+ V + +S Y+
Sbjct: 777 V-FADEAAGKGTKD--NILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPP 825
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
L + ++ +GT KE++ + V +L H P+L + + YQ
Sbjct: 826 GLS-----ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQ 873
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----P 910
Y + + + P S S + ++ N F++ P + ++ H A P
Sbjct: 874 PYRVK-----ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPVEDDELHNAQRFLP 921
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
+R T NISG+ FL GS P + + + L + A + H C HGFI
Sbjct: 922 MRRCT---NISGYSTVFLPGSSPSFILKTAKSSPRVLGLQGSGVQAMSSFHTEGCEHGFI 978
Query: 971 YVTSQGILKICQLPSGSTY 989
Y + GI ++ Q+P+ S++
Sbjct: 979 YADTNGIARVTQIPTDSSF 997
>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 1241
Score = 165 bits (417), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 173/646 (26%), Positives = 268/646 (41%), Gaps = 113/646 (17%)
Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+LT+ DG R V+ + K SVLT+ +T FLGSRLG+SLL+ +T M
Sbjct: 198 VLTLFNDGMRSVRNFNFDKAAASVLTTSMTLCEEGYLFLGSRLGNSLLLHYT-EKAAEME 256
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
+G KE+ ++ D +++ +EL +YGS + T+ +++F V
Sbjct: 257 EAGKKED---------------KAEGDVNVALIDPDELEVYGSETLATKQL-TSYTFEVC 300
Query: 507 DSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV----------- 542
DSL+NIGP G N+D + G K ++
Sbjct: 301 DSLINIGPCGKICMGEPAFLSEEFTQNSDPDLELVTTAGYGKNGALCVLQRSVRPQVVTT 360
Query: 543 -ELPGCKGIWTVYHKSSRGHNA------DSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
ELPGC +WTV + D A HA+LI+S +M+L+T
Sbjct: 361 FELPGCVHMWTVMGPPTEKKKKEASEESDEQAADATLTNTHAFLILSRADSSMILQTDQE 420
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ S + Q T+ AGNL R V+QV G R+LDG+ Q +
Sbjct: 421 INELDHS-GFSTQNPTVFAGNLGDGRYVLQVCPMGVRLLDGTRQLQHIPL---------- 469
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTP--AAIESSKKPVSSC 710
S ++ S+ADP+VL+ G + L GDP++ C ++V P A+ S + +C
Sbjct: 470 DVGSPIVGGSLADPHVLIRSEGGLVVHLTLRGDPASGCRLAVLRPQLTAVVSHRANALTC 529
Query: 711 --------------TLYHD----KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
LY D + + +R T+ + + + + P
Sbjct: 530 HCIAVSGVLDDEDELLYGDSEDTRATKEPVRVTAMET--ESETANVFELKEVKP----TF 583
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGT 811
+ V E+G LEI+ +P++ F V F G+ +VD+ A +++E ++ S E
Sbjct: 584 WVFVARENGVLEIYSLPDYKLCFLVKNFPMGQRVLVDSVQMTAPSGTKSEKLSDMSHE-- 641
Query: 812 GQGRKENIHSMKVV-ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
M VV E+ M SRP L A + D +L Y+A+ F +
Sbjct: 642 ---------CMPVVHEILMVGLGVRQSRPLLLARV-DEDLLIYEAFPFYETQREGH---- 687
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
L ++ + R +T EE + + F +ISG+ G FL G
Sbjct: 688 ----LKLRFKKLNHDIILRSRKYKTQKPENEEEEKAFQSRLW-LQPFSDISGYSGVFLCG 742
Query: 931 SRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
RP W M R LR HP DG + F HNVNC GF++ Q
Sbjct: 743 HRPHWLFMSSRGELRYHPMFVDGPVYCFAPFHNVNCPKGFLHFNKQ 788
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+++VDP RC +LV+ + ++ + + +E G G + + + L +
Sbjct: 58 PMIRVDPCNRCAAMLVFSRTIAVVPFRKDTAA---EEQETGPTFGNKPPLLDWYPVALTE 114
Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
LD K +V D F+HGY EP ++IL+E TW G+
Sbjct: 115 LDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGK 149
Score = 40.0 bits (92), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+ L +LD K +V D F+HGY EP ++IL+E TW G V + M S + +
Sbjct: 158 VALTELDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGYVLTLFNDGMRSVRNFNFDKA 217
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
++ ++M L + Y L S +G L+ +HY ++A
Sbjct: 218 AASVLTTSMTLCEEGYLFLG--SRLGNSLL-----LHYTEKAA 253
>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666
SS1]
Length = 1438
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 233/1006 (23%), Positives = 416/1006 (41%), Gaps = 167/1006 (16%)
Query: 47 PSKRGIGPVPNLVVTAANVIEIYVVRV------------QEEGSKESKNSGETKRRVLMD 94
P + P+ NLVV +N++ I VR +E K + + V MD
Sbjct: 31 PDSQKALPLFNLVVARSNLLRILEVREVPTLRPIHLDDERERRGNVRKGTEPVEGEVEMD 90
Query: 95 ---------GISAAS----------LELVCHYRLHGNV---ESLAILSQGGADNSRRRDS 132
G S AS V YRLHG V E++ I+S D
Sbjct: 91 EQGEGYVNMGASTASNGAPRPTVLRFYFVRDYRLHGTVTGLETVRIMSS----LEDEMDR 146
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRC 191
++++F+DAKI++LE+ H L S+H +E +P+ L L +S ++ DP +C
Sbjct: 147 LLVSFKDAKIALLEWSTDTHSLSTVSIHTYERAPQLLSL----DSNMFTAQLRTDPLSQC 202
Query: 192 GGVLVYGLQMIILKASQGGSGL-VGDED-TFGSGGGFSARIESSHVINLR---DLDMKHV 246
+ + IL Q L V D+D T +S S +++L D +++V
Sbjct: 203 AALSLPKDAFAILPFYQTQVDLDVMDQDQTRARDVPYSP----SFILDLAAEVDERIRNV 258
Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
DF+F+ G+ P + +L + + TW GR+ T + +++ + +P+I S LP+
Sbjct: 259 VDFVFLPGFSHPTVAVLFQAQQTWTGRLKEYKDTMRLFIFTLNVVTRSYPIITSVEGLPY 318
Query: 307 DAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS---- 361
D ++ P+ +GGV+V+ +N+ IH S ALA+N + + ++P ++ +
Sbjct: 319 DCLSVVPCPAALGGVVVLTSNSVIHIDQASRRVALAVNGW---MPRVSDMPVTALAQGDQ 375
Query: 362 --VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSD 414
+EL+ + T++ + + K G + + DG+VV +L +S T PSV
Sbjct: 376 GRLELEGSRMTFVDDKTLFIVLKDGTIHPVEFFVDGKVVSKLSISPPLAQTTTPSV---- 431
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
I I N FF+GS G S L++ SG++E+ D + + K + D+
Sbjct: 432 IRKITNEHFFVGSTAGPSALLKV----------SGVEEDIQD-DVEEIDGKTAPAAVVDS 480
Query: 475 LQDMVNGEELSLYGSA--------------SNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ M ++ LYGS+ + +T + ++ DSL GP+ D ++
Sbjct: 481 VDGMDIDDDDDLYGSSKADPTPTANGNAVETTSTTRKRTVIHLSLCDSLPAHGPISDMTF 540
Query: 521 GLRINAD------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGH 561
+ N D +ATG + L + LP G +G+W++ + +
Sbjct: 541 SMTKNGDRAVPELVAATGSGLLGGFTLFQRDLPIRTKRKLHAIGGARGVWSLPVRQAVRV 600
Query: 562 NADSSRMAAYD-DEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQG-RTIAAGNL 617
N S + + +IIS +A + A ++ ++ + G T+ A
Sbjct: 601 NGVSYQTPQNPLRSDNDTIIISTDATPSPGISRIATRSSKTDLNITTRIPGVTTVGAAPF 660
Query: 618 FGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
F ++ V R+L DGS P G+ + + + + SI DPYV +
Sbjct: 661 FQGTAILHVLSNAIRVLEPDGSERQ------PIKDMDGN-NYRAKIKNCSICDPYVFVLR 713
Query: 676 SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKGP-----EPWLRKTSTD 728
D +I L +G+ + + +P ++S+ ++ C G L ++T
Sbjct: 714 EDETIGLFIGETERGKIRRKDMSPMGDKTSRY-IAGCFFSDTTGTFQAHVNSSLNGSNTT 772
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
+T +++ A Q + ++ G +EI+ +P VF+ + + +V
Sbjct: 773 KQNATSTLQSVMNA-----GQKTQWLLLVRPQGVMEIWTLPKLTLVFSTTALATLQPLLV 827
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D+ AL S Q RK + + ++ + RP LF +L G
Sbjct: 828 DSLDPPAL----------SSLPQDQPRKP--QELDIDQILVAPLGETSPRPHLFVLLRSG 875
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--P 906
+ Y+A FE P + DP SR S+ V ++ + F D +E++
Sbjct: 876 QLAIYEAVSFELP-----TGDPEPASRP-SILPVKLVKVLSRAFDIQHPDEQPQEKSVLA 929
Query: 907 HGAPCQRITI-FKNISGHQ----GFFLSGSRPCWCM-VFRERLRVH 946
QR+ I F + G F +G RPCW + + +RVH
Sbjct: 930 ELKKIQRLFIPFVTSPAPEKTFTGVFFTGDRPCWILGTDKGGIRVH 975
>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Aspergillus oryzae 3.042]
Length = 1389
Score = 163 bits (412), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 221/947 (23%), Positives = 372/947 (39%), Gaps = 153/947 (16%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
P S +G+ FLGS DS+L+ G S+ SSG K+ +
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E D+ +S D +D + E+ + G + ++F D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
GPLKD ++G + S ELV +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560
Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
C +WT +S H ++ + A + E H Y+++S +A + E +++ +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613
Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
+ F + TI G L G+ RV+Q+ R DG DL P E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
SE +S S+ DPYV + D ++ LL D S V+ I +SK +SC
Sbjct: 669 --DTSEERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
LY DK TG+ +I A L Q + + + L I+ +P+
Sbjct: 725 LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
+ + G + E K S T +E + + V +L
Sbjct: 768 KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
WS S P+L + Y+ ++ T +P + L +N+ R+
Sbjct: 809 WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
+ D + EE P + I NISG F G P + + L
Sbjct: 858 -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
G + + C GFI + S+G++ +CQ+P G D W +Q++
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQI 960
>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1424
Score = 163 bits (412), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 204/968 (21%), Positives = 383/968 (39%), Gaps = 149/968 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
N+VV +NV+ I+ VR +E ++ E RR V MDG
Sbjct: 47 NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105
Query: 96 ----------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKI 142
+ + V + LHG V E++ I+S N D ++++F+DAKI
Sbjct: 106 GTVKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFKDAKI 161
Query: 143 SVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQM 201
++LE+ D IH L S+H +E +P+ + L +S ++VDP RC + + +
Sbjct: 162 ALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLPKDAI 217
Query: 202 IILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIE 257
IL Q + L V ++D S +++L D +++HV DF+F+ G+
Sbjct: 218 AILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLPGFNN 274
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + +L + E TW+GR+ T + ++ +P+I + LP D L+ +
Sbjct: 275 PTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVPCVAS 334
Query: 318 IGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAAHATW 371
+GGV+++ +NTI Y S AL +N ++ S S +P +S ++ L+ HA
Sbjct: 335 LGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGCHAVL 393
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLG 430
+ + + K G + + +V DG+ V +L ++ + + S + I FLGS +G
Sbjct: 394 VDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLGSIVG 453
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ L ++A + ++ + + +
Sbjct: 454 ASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH---------S 504
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG------------ 532
+N + ++ DSL GP+ D ++ L D +ATG
Sbjct: 505 NNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFTLFQR 564
Query: 533 -ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-- 588
+ ++ +L + G +GIW+ + R + R + + +IIS +A
Sbjct: 565 DLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDANPSPG 624
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLSFGP 647
V A ++ ++ + G TI AG+ F R ++ V R+L+ G + +DL
Sbjct: 625 VSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDLD--- 681
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIESSKK 705
+ + SI DP+VL+ D +I L +G+ + + +P +SS+
Sbjct: 682 ------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDKSSRY 735
Query: 706 PV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+SC P D +++ + ++ + + ++
Sbjct: 736 LAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLILVRP 789
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
G +EI+ +P F+ + D+Y AL + RK N
Sbjct: 790 QGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPRKSN- 837
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ V ++ + P+L L G I+ Y+A P + S+
Sbjct: 838 -DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------SIPP 884
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLS 929
S VS +++ ++ + + EET ++ I + S G F +
Sbjct: 885 SRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSGVFFT 944
Query: 930 GSRPCWCM 937
G RP W +
Sbjct: 945 GDRPSWIV 952
>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
Length = 1450
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 225/962 (23%), Positives = 390/962 (40%), Gaps = 157/962 (16%)
Query: 76 EGSKESKNSGETKRRVLMDGISAASLEL--VCHYRLHGNV---ESLAILSQGGADNSRRR 130
EG E GE + +S + + + +RLHG V E + ILS
Sbjct: 89 EGEVEMDTQGEGFVNMASKPLSMTTYQFHFIREHRLHGIVTGLEPVKILSS----TEDSL 144
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQG 189
D ++++F+DAK+++LE+ +H L S+H +E +P+ L + + G L +VDP
Sbjct: 145 DRLLVSFKDAKLALLEWSPELHDLVTVSIHTYERAPQMTFLDPSKFT---GQL-RVDPLS 200
Query: 190 RCGGVLVYG--LQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
RC + + L ++ SQ LV + T +S + N D +++V
Sbjct: 201 RCAALSLPCDCLAILPFYHSQVDLDLVDADQTVSRDIPYSPSF-ILDLFNQVDHRIRNVI 259
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
DF F+ G+ P + +L + + TW GR+ TC + ++ +P+I S NLPHD
Sbjct: 260 DFAFLPGFNNPTLAVLFQTQHTWTGRLKEFKDTCNLFIFTLDLVTHMYPIITSVENLPHD 319
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYA--VSLDSSQELPRSSFS--V 362
+ +L S +GGV+++ N++ Y Q S L +N +A VS Q+L + +
Sbjct: 320 CFAMLPCDSSLGGVVIISCNSLIYVDQASRKTVLPVNGWAARVSDMPMQQLRPEEMNRDL 379
Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI 418
L+ AHAT++ + + T+ G ++ + +V DGR RL L S+T L ++
Sbjct: 380 HLEGAHATFVDSRTFFIITRDGLVLPVEIVMDGRTALRLALHPAMSQTTTPALVRNVAFR 439
Query: 419 GNS--------LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF------GDIEAD-APS 463
S + F+GS +G S+L++ T ++EE GDI A A +
Sbjct: 440 SASGDQAPRSQILFVGSTVGPSVLLRVTW----------VEEEIQKDKQQGDIPAAVADN 489
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS---FAVRDSLVNIGPLKDFSY 520
+ D + V E + +G + +++A +T S ++ DSL GP+ ++
Sbjct: 490 PMAVDFDDEDDIYGDVAKETQTTHGQPTAASQAAVETKSVIHLSLCDSLSAYGPINSMAF 549
Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTVYHKSSRGH 561
L N D +ATG ++ + L + + G +GIW + + S
Sbjct: 550 ALTRNGDRPTAELVAATGYARLGGFTLFQRDVPTRSKRKLHAVGGARGIWCIPVRQSLKV 609
Query: 562 NAD--SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIAAG 615
N S + E +I+S +A T + D + R TI A
Sbjct: 610 NGSERSRNLLPGSSEVVDTVIVSTDANPSPGLTR--FAAKSSRNDIAITARRTETTIGAA 667
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
F R +I V R+L+ D S + ++ + I+DP++L+
Sbjct: 668 PFFQRTAIIHVTTDLIRVLE-----PDCSERQCIRDMDGSNKRPKIRFCCISDPFILVIR 722
Query: 676 SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKG------PEPWLRKTST 727
D S+ L VGD + + TP E + + C G E ++
Sbjct: 723 EDESLGLFVGDAERGRIRRKDMTPMG-EKVSRYSAGCFFLDQSGIFELHMSESSPTTGTS 781
Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
D G G D +G + V+C G +EI+ +P VF+ + +
Sbjct: 782 DDKQRMGTGSLESAVDA---QRGTQWLVLCRPQGVVEIWTLPKLALVFSTSSLKDLPSVV 838
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
D++ AL S E+ + ++ +I ++ ++ + P L +L
Sbjct: 839 SDSFDPPAL--------SLPEDPPRKPQEADIELLQFAQIG-----ELYPHPHLIVMLRC 885
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSV----------------------SNVSAS 885
G + YQA + K D P ST R+ ++ S+V A
Sbjct: 886 GQLAIYQAVAVD------KDDFPESTVRTSTLKIKFIKMGTRSFEPRQLEPAEKSSVIAE 939
Query: 886 RLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLR 944
+ R LR S P E K +S G F++G PCW + ++ L+
Sbjct: 940 QRRALR-SLVPFIVSPNSE-------------KRVS---GVFVTGDEPCWIVATDKDGLK 982
Query: 945 VH 946
+H
Sbjct: 983 IH 984
>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1428
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 204/972 (20%), Positives = 383/972 (39%), Gaps = 153/972 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
N+VV +NV+ I+ VR +E ++ E RR V MDG
Sbjct: 47 NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105
Query: 96 --------------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFE 138
+ + V + LHG V E++ I+S N D ++++F+
Sbjct: 106 GTVKSTGKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFK 161
Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
DAKI++LE+ D IH L S+H +E +P+ + L +S ++VDP RC + +
Sbjct: 162 DAKIALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLP 217
Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
+ IL Q + L V ++D S +++L D +++HV DF+F+
Sbjct: 218 KDAIAILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLP 274
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
G+ P + +L + E TW+GR+ T + ++ +P+I + LP D L+
Sbjct: 275 GFNNPTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVP 334
Query: 314 VPSPIGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAA 367
+ +GGV+++ +NTI Y S AL +N ++ S S +P +S ++ L+
Sbjct: 335 CVASLGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGC 393
Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLG 426
HA + + + K G + + +V DG+ V +L ++ + + S + I FLG
Sbjct: 394 HAVLVDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLG 453
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
S +G S+L++ L ++A + ++ + +
Sbjct: 454 SIVGASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH------ 507
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------- 532
++N + ++ DSL GP+ D ++ L D +ATG
Sbjct: 508 ---SNNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFT 564
Query: 533 -----ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR 586
+ ++ +L + G +GIW+ + R + R + + +IIS +A
Sbjct: 565 LFQRDLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDAN 624
Query: 587 TM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDL 643
V A ++ ++ + G TI AG+ F R ++ V R+L+ G + +DL
Sbjct: 625 PSPGVSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDL 684
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIE 701
+ + SI DP+VL+ D +I L +G+ + + +P +
Sbjct: 685 D---------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDK 735
Query: 702 SSKKPV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV 755
SS+ +SC P D +++ + ++ + + +
Sbjct: 736 SSRYLAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLI 789
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
+ G +EI+ +P F+ + D+Y AL + R
Sbjct: 790 LVRPQGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPR 838
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
K N + V ++ + P+L L G I+ Y+A P +
Sbjct: 839 KSN--DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------ 884
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQG 925
S+ S VS +++ ++ + + EET ++ I + S G
Sbjct: 885 SIPPSRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSG 944
Query: 926 FFLSGSRPCWCM 937
F +G RP W +
Sbjct: 945 VFFTGDRPSWIV 956
>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
Length = 1396
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 211/970 (21%), Positives = 378/970 (38%), Gaps = 172/970 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS------------------- 97
N+V N + IY VR + SK + K+ M+G+
Sbjct: 38 NVVTARGNTLSIYEVREETATSKSPTEAKSQKKDDAMEGVKEERQTPVVQVRSLSKKTYP 97
Query: 98 ---------AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFD 148
+ LV +RLHG V L + + D ++++F+DAKI++LE+
Sbjct: 98 DSDSHSQPLSTKFHLVREHRLHGVVTGLQAVKIISSLEDHL-DRLLVSFKDAKIALLEWS 156
Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
+ L S+H +E + + +F ++VDPQ RC + + IL Q
Sbjct: 157 TATQDLLTVSIHTYERAIQM-VATDISAFTSE--LRVDPQSRCAALSLPKDAFAILPPCQ 213
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHE 265
+ D S ++NL + +++V DF F+ G+ P + +L+E
Sbjct: 214 VSDSVCRD-----------VPYSPSFILNLPSEVESGIRNVIDFTFLPGFSNPTVAVLYE 262
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
TW GR++ + T ++ ++ +++P+I A LP D +LA PS GGV+VV
Sbjct: 263 TYQTWTGRLNEQKDTVKMAFFTLDIVNRRYPVIGLATGLPCDCLSVLACPS-TGGVMVVA 321
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELP------RSSFSVELDAAHATWLQNDVALL 379
+N+I Y QS + N + S LP + ++EL+ + + ++ + A +
Sbjct: 322 SNSIIYVDQSGRKVVLPVNAWIPRMSDIALPTNLTPEEQARTLELEGSRSIFVDDKTAFI 381
Query: 380 STKTGDLVLLTVVYDGRVVQRLDL-----SKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
K G + + +V GRVV +L L T PS+L I N +GS GDS
Sbjct: 382 ILKDGTIYPVELVTAGRVVSKLALGTPLAKTTIPSILRR----INNDYLLVGSASGDS-- 435
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL--QDM-VNGEELSLYGSAS 491
++LS+ EE D + D + +S AL QD+ ++ ++ +YG +
Sbjct: 436 ---------ALLSTSWVEEVIDDDVDMEAN-----TSVAALEQQDIEMDDDDDDIYGPSI 481
Query: 492 NNTESAQK------------TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGI 533
T ++QK + D+L GP+ D ++ + N D +ATG
Sbjct: 482 IKTGTSQKESAAPMSKKTRSVLRLSFCDALPAYGPIADLTFTVGKNGDRPVAELVTATGS 541
Query: 534 SKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
+ L + LP G +G+W++ + S S+ +A +D LI
Sbjct: 542 GHLGGFTLFQKDLPLRKKKKLPIISGARGVWSLPIRRS-----SSAAVAEHDT-----LI 591
Query: 581 ISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
IS +A R V T L+ V+ V G TI AG F R ++ V R+
Sbjct: 592 ISTDANPSPGFSRLAVRATKGDLSVVSR-----VNGMTIGAGPFFQRTAILHVMTNAIRV 646
Query: 634 L--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
L DG+ + +D+ + + S SI DPYVL+ D +I L +G+ +
Sbjct: 647 LEPDGNERQIIKDME---------GNVPRAKIKSCSICDPYVLIFREDDTIGLFIGETTR 697
Query: 690 CTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGPL 747
+ + + + ++ + D + + DA T + +D +
Sbjct: 698 GKIRRKDMSPMGEKSSRYTAGGFFTDTASVFRVYHQNADANTETTIPMHSVVDASSKSQ- 756
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
+ V+ G +EI+ +P VF+ + + + D+ AL +
Sbjct: 757 -----WLVLVRPQGVVEIWTLPKLTLVFSTTLLATLQNVLTDSQEPPALSPPQDPPRKPQ 811
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
E + + ++ + +P L +L G + Y+A+
Sbjct: 812 E-------------LDIEQILLTNLGQSDPKPHLLVLLRSGHLAIYEAFATNQAPIVEPP 858
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
P ++S + +++ R T ++ + F G F
Sbjct: 859 LKPRASSLQIQFVKIASKAFEMQRTDETEKGILAEQK----KALRTFVPFACAGAPAGVF 914
Query: 928 LSGSRPCWCM 937
+G RP W +
Sbjct: 915 FTGDRPHWIV 924
>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
[Phytophthora sojae]
Length = 1752
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 162/634 (25%), Positives = 260/634 (41%), Gaps = 155/634 (24%)
Query: 235 VINLRDLD-MKHVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+L+ M V D F+ GY+EP +++LHE + + GR++ T I+ +SI+
Sbjct: 277 LLRLRELEIMGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 336
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
+ HP IW+ NLP D +KL +P+GGV+V+ AN Y +Q+ LA N +A
Sbjct: 337 TRLHPKIWTVKNLPSDCFKLFPCRAPLGGVVVLSANAFLYFNQTQFHGLATNVFASKTVN 396
Query: 350 -------DSSQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
D+ E P + + L +L LL+ GD +L++ Y+ +
Sbjct: 397 QSVFPLSDAVYETPDHEMAQLHIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDTSSR 456
Query: 400 RL----DLSKTNPSVLTSDITTIG-----------NSLFFLGSRLGDSLLVQFTCGSGTS 444
L S + + L+ + G F+GSR GDS+L TS
Sbjct: 457 GLYGFGGASSSRNASLSLRMLRSGIQAHCLCVNEEKKTLFVGSRSGDSVLYALDQKKLTS 516
Query: 445 MLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA--------SN 492
K EE E + A ++ + ++L LYG+A S
Sbjct: 517 AGGEASKQQEDEEMLIKEEVVKEEVTAEVKAEPAEEEEEDEDDLFLYGAAPTKEEPTTSG 576
Query: 493 NTESAQKTFSFAVR----------------------DSLVNIGPLKDFSYGLRINADASA 530
+TE+ T AV+ D L +IG + G+ NAD
Sbjct: 577 STEAVNGTNGSAVKKEENGHAVEEESGPYDYVLHQIDVLPSIGQITSIELGIENNAD--- 633
Query: 531 TGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNAD 564
S + ELV EL GC+ +WTV +
Sbjct: 634 ---SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSS 690
Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
R Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R++
Sbjct: 691 DGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIV 744
Query: 625 QVFERGARIL------------DGS----------------------YMTQDLSFGPSNS 650
Q+F++GAR++ DG+ TQ+++
Sbjct: 745 QIFKQGARVMMEVPDEETSNGNDGAEKTAKPEDEEVDDEDDGPKVKLVCTQEITLEGDVE 804
Query: 651 ESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP----------- 697
G + +TV +SV + DPY+LL ++DGS+RLL+GD ++V P
Sbjct: 805 CGGMNVDTATVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDMELTVIDPEIDYLDGVTES 864
Query: 698 -AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
++SK SS L++D W +AW
Sbjct: 865 NGTADASKHGSSSACLFYD-----WAGMFRENAW 893
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNH 967
+T F N++ G F G+ P W + R + P +C + +++FT H+ NC +
Sbjct: 1159 LTTFYNVNNMSGAFFRGAHPMWILGDRGQPTFIP-MCSAAPKVSVPVLSFTPFHHWNCPN 1217
Query: 968 GFIYVTSQGILKICQLPSGST 988
GFIY S+G L++C+LPS T
Sbjct: 1218 GFIYFHSRGALRVCELPSSKT 1238
>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
B]
Length = 1440
Score = 156 bits (395), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 218/990 (22%), Positives = 403/990 (40%), Gaps = 178/990 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDGISAASLE- 102
N+VV ++++ I+ VR +E S+ E +RR V MDG L
Sbjct: 49 NVVVARSSLLRIFEVR-EEPAPISSQKEDERERRASVRKGTEAVEGEVEMDGSGEGFLNM 107
Query: 103 ---------------------LVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFE 138
L+ +RLHG +E + I++ R D ++++F+
Sbjct: 108 GSVKSTAQNGSVQPPTINRFYLIREHRLHGIVTGIEGVRIVTS----LEDRLDRLLVSFK 163
Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
DAKI++LE+ D++H L S+H +E +P+ + L +S P ++ DP RC +L+
Sbjct: 164 DAKIALLEWSDAVHDLVTVSIHTYERAPQLMAL----DSSLFRPTLRADPLSRCAALLLP 219
Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
+ IL Q + L V ++DT S +++L D +++V DF+F+
Sbjct: 220 RDSIAILPFYQSQAELDVVEQDT---SQLRDVPYSPSFIVDLSAEVDDRIRNVIDFVFLP 276
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
G+ P + +L +++ TW GR+ T + ++ + +P+I S LP+D + + A
Sbjct: 277 GFNNPTIAVLFQKQQTWTGRLREYKDTVSLYIFTLDLVTRNYPVITSTEGLPYDCFAVAA 336
Query: 314 VPSPIGGVLVVGANTIHYHSQSA-SCALALNNY-------AVSLDSSQELPRSSFSVELD 365
+ +GGV+++ +N I Y QS+ AL +N + V S+QE R + L+
Sbjct: 337 CSTALGGVVILASNAIIYVDQSSRRVALPVNGWPPRVSDMPVQALSAQEQLR---DLRLE 393
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGN 420
+H ++ + + K G + + +V DG+ V +L +S T P+V + +
Sbjct: 394 GSHFVFVDDRTLFIILKDGTVYPVELVLDGKSVSKLTMSSAVARTTIPTV----VRRVQT 449
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
F+GS +G S+L++ T+ + + +E D+E T + D+ M
Sbjct: 450 DHLFIGSTVGPSVLLK------TARVEEDIADE--DVEMSVAPT-----AVVDSTDTMDL 496
Query: 481 GEELSLYGSASNNTE------------SAQKT-FSFAVRDSLVNIGPLKDFSYGLRINAD 527
+E LYGS T S ++T ++ DSL GP+ D ++ L N D
Sbjct: 497 DDEDDLYGSTKETTHRVDGLVNGAADASKKRTVVHLSLCDSLPAHGPIADMTFALAKNGD 556
Query: 528 ------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRM 568
+ATG + L + LP G +G+W++ + + N +
Sbjct: 557 RAVPELVAATGSGTLGGFTLFQRDLPTRVKRKLHAIGGGRGMWSLPVRQAVKVNGSTYEK 616
Query: 569 AAYDDEYHAY---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
A + +H+ +IIS +A + A ++ + G TI A F +
Sbjct: 617 PA--NPFHSVNDSVIISTDANPSPGLSRIASRNQNGDITITTRIPGTTIGAAPFFQGTAI 674
Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+ V ++ + D S + + + SI DP+VL+ D +I L
Sbjct: 675 LHVMYNVTNVI--RVLEPDGSERQIIKDVDGNVARPKIRACSICDPFVLIIREDDTIGLF 732
Query: 684 VGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDG 741
+G+P + + + + + + + ++ C G + L + +T ++
Sbjct: 733 IGEPERGKIRRKDMSPMGDKTSRYLTGCFFTDTTGTFQTHLNPLAAGTEAATSTLQS--A 790
Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
+ G Q + ++C G LEI+ + F+ S + +VDTY L
Sbjct: 791 INAGSRSQ---WLILCRPQGTLEIWTLSKLTLAFSTTLIPSLESVVVDTYDVPHL----- 842
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
S ++ + ++ +I + V L RP+L L G + Y+ P
Sbjct: 843 ---SLPQDPPRKPQELDIEQIVVAPLG-----ESSPRPYLTVFLRSGQLAVYETIPVAPP 894
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
DP+ SRS ++ +RF + A+ ++ + K IS
Sbjct: 895 A------DPLPNSRSCTIL---------VRFRKVLSKAFDIQQQNEEVEKSVLAEQKRIS 939
Query: 922 --------------GHQGFFLSGSRPCWCM 937
G F +G RPCW +
Sbjct: 940 RLLIPFVTSPNPGQTLSGVFFTGDRPCWIL 969
>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
[Schistosoma mansoni]
Length = 1825
Score = 155 bits (393), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 146/586 (24%), Positives = 252/586 (43%), Gaps = 123/586 (20%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +K + PT + NC +TH + + NLV+T
Sbjct: 15 AVFKHISPPTAVDNCLYCHLTHPK---------------------------LKNLVITRG 47
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
IEIY V+ S SGET+ V ++ N+ + + G
Sbjct: 48 GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 87
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
S++L+F +AK++V+ F+ LR S+H +E + +LK GR +F + P++
Sbjct: 88 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 140
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
++DP RC +LVY + +L + + + D + + + R + +
Sbjct: 141 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 200
Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+ +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 201 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 260
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
+ +P+IW +LP D +++VP PIGGV+V+ AN+I Y Q+ SC L LN YA +
Sbjct: 261 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 320
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
Q++P S + +D L L+ T++G+L LL++ + + V L K
Sbjct: 321 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 379
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
+V + + + F+GSR DS+L++ + G+ ++
Sbjct: 380 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 439
Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
S L + +E D + STKR +D + D
Sbjct: 440 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 499
Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
+ ++ LYG + + S + +SF V D L+N+GP+ + G
Sbjct: 500 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 545
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 101/251 (40%), Gaps = 27/251 (10%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ + + +G LEI+ +P+F ++ V F ++D + + ++ +
Sbjct: 1086 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1142
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
++I ++E+ + RP L + T I ++A L P+
Sbjct: 1143 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1190
Query: 873 TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
S L RLR R PL A R T P Q + F+NI
Sbjct: 1191 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1249
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
H+G F+ G P W +LRV P DG + +F L+ C+ GF+Y T +++
Sbjct: 1250 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1309
Query: 982 QLPSGSTYDNY 992
LP G +++ +
Sbjct: 1310 TLPPGYSFNEH 1320
>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
bisporus H97]
Length = 1413
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 218/1011 (21%), Positives = 400/1011 (39%), Gaps = 128/1011 (12%)
Query: 57 NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD---------- 94
N+VV +N++ I+ VR + + E + G+T+R V MD
Sbjct: 49 NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108
Query: 95 --GISAASLELVCHY------RLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
I L V + RLHG V E + I+ A + D ++++F+DAKI+
Sbjct: 109 KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D+IH L S+H +E +P+ + L R L +VDP RC + + +
Sbjct: 165 LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V ++D S S +++L + ++++V DF+F+ G+ P
Sbjct: 221 ILPFYQTQAELDVMEQDQSQSK---DVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ IL + + TW GR+ T + ++ + +I S LP+DA+ LL + I
Sbjct: 278 TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337
Query: 319 GGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
GGV+V+ N++ Y QS+ +L +N +A + P ++ + L+ + + +
Sbjct: 338 GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDAALKLHLEGCRSAMVDD 397
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
L K G + + ++ DG+ V +L ++ P++ + I T+ + F+GS +G
Sbjct: 398 KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ G K + D + D D + G+
Sbjct: 455 PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505
Query: 491 SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
+N +KT ++RD L GP+ ++ L +N + +ATG + L
Sbjct: 506 ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565
Query: 542 VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
+ LP K +Y SRG + R + H + LI+S + +
Sbjct: 566 FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625
Query: 596 LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
+ + S++ G TI A F R ++ V R+L DG+ + +
Sbjct: 626 VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
+ SIADP+VL+ D SI + V D +P +SS+ ++ C
Sbjct: 679 KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737
Query: 711 TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
G L + + D + +T + GA + ++ G LEI+ +
Sbjct: 738 FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786
Query: 769 PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
P + F+ S ++ + DT+ A Q + + ++
Sbjct: 787 PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833
Query: 829 MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
+ P L L G + Y+A + +N D P +TS + ++A
Sbjct: 834 LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
R + + +T + + G F +G RP W + R ++V+P
Sbjct: 892 IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ AFT F+ T G + + +P +D P++ +
Sbjct: 952 S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPDFQ-FDGPLPMRSI 1000
>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
Length = 1460
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 242/1057 (22%), Positives = 409/1057 (38%), Gaps = 211/1057 (19%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
NLVV +++++++ +V + S +SG R DG+ A+
Sbjct: 28 NLVVAKSSLLQVFRTKVVSTELEASPDSGHRSRNAARYESRLANDDDGLEASFLGGDSLA 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDS 150
L LV L G V LA + A + DS+++A +DA++S++E+D
Sbjct: 88 LRTDRANVTKLVLVAETPLAGTVTGLARIKTPHARHGC--DSLLIALKDARLSLVEWDAE 145
Query: 151 IHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMIIL 204
H L S+H +E E + S PL ++ DP RC + + IL
Sbjct: 146 RHALATVSIHYYEQEEL------QGSPWAAPLSHYVNFLEADPGSRCAALKFGARNLAIL 199
Query: 205 KASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN-----------------LRDLD--MK 244
Q + +GD D G A+ +SS VI+ L +LD +
Sbjct: 200 PFRQADEDIDMGDWDG-ELDGPRPAKDQSSAVIDGASNIEDTPYSPSFVLRLSNLDPSLL 258
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
H F+H Y EP IL H T M+ L + K I S L
Sbjct: 259 HPVHLAFLHEYREPTFGILASTASASNSLGRKDHFTYMVFTLDLQQ--KASTTILSVGGL 316
Query: 305 PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
P D ++++ +P+P+GG L+VG+N IH + +A+N S + +S ++
Sbjct: 317 PQDLFRVVPLPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTRQCTSFGLVDQSELNLR 376
Query: 364 LDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT 417
L+ L D+ L+ G + L+T DGR V L+L + + S++ +TT
Sbjct: 377 LEGCVVDVLTADLGELLVILNDGRMALVTFRIDGRTVSGLELRMLPASSGGSIIPGRVTT 436
Query: 418 ---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+G + F G GDS+L FG + + + +R R+
Sbjct: 437 LSRVGRNAMFAGLEEGDSVL-------------------FGWAKKQSQAGRRRPRAKDAV 477
Query: 475 LQ------------DMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGPLK 516
LQ + + ++L A+ S+ + +F + D LV+I P++
Sbjct: 478 LQMDEEAGEEEEEEEDEDEDDLYGEEPAARQQPSSTASSLMTGDLTFRIHDRLVSIAPIQ 537
Query: 517 DFSYGLRINADAS-----------------ATGISKQSNYELV------------ELPGC 547
+YG + S A G K ++ + E P
Sbjct: 538 AMTYGQPVWLPGSEEERNSAGVHSDLQLVCAVGRDKSASLATINLAIAPKVIGRFEFPEA 597
Query: 548 KGIWTV-----YHKSSRGHNADSSRMAAYDD--EYHAYLII------SLEARTMVLETAD 594
+G WT+ KS +G A +S YD +Y ++I+ E + TA
Sbjct: 598 RGFWTMCAKKPIPKSLQGDKAGASLGNGYDTSGQYDKFMIVGKVDLDGYEKSDVYALTAA 657
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
+ + G TI AG + R+IQV + R DG + ++Q L P E
Sbjct: 658 GFESLGGTEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIL---PMQDEE- 713
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
+G+E V S S+ADP++L+ D S+ + D S + + S+ K ++ C LY
Sbjct: 714 TGAEPRAV-SASVADPFLLIIRDDSSVFIARIDSSNELEELDKDDPVLSTTKWLTGC-LY 771
Query: 714 HDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
D S + +G+ A + + SGAL I+ +P+
Sbjct: 772 AD----------SAGVFAEESMGKPASTAQC-------VLMFLLSASGALYIYRLPDLAR 814
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
V + +S Y+ L + + +GT KE + + V +L
Sbjct: 815 PIYVAEGLS--------YIPPGLS-----ADYAGRKGTA---KETLAEILVADLG----D 854
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
+ H P+L + + YQ + S+ + S +L V N +
Sbjct: 855 STHKSPYLILRHANDDLTLYQPF-------RSRKATEQAFSETLFFQKVP-----NTALA 902
Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDG 952
++P +A +E H + N+ G+ F+ G+ P + + + + RV P L
Sbjct: 903 KSPQEA-DEDEASHQPRFLSMRRCDNVGGYSTVFVPGASPSFIIASSKSMPRVMP-LQGS 960
Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
++A + H C HGFIY S+ I ++CQ P G Y
Sbjct: 961 GVIAMSPFHTEGCEHGFIYADSRRIARVCQFPDGCIY 997
>gi|224135031|ref|XP_002321966.1| predicted protein [Populus trichocarpa]
gi|222868962|gb|EEF06093.1| predicted protein [Populus trichocarpa]
Length = 180
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/152 (55%), Positives = 100/152 (65%), Gaps = 22/152 (14%)
Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
TG+ EAIDGADGG DQGDIY V+CYE+GALEIFDVPNFN VF VDKFVSG+TH+VD++M
Sbjct: 4 TGISEAIDGADGGAHDQGDIYRVICYETGALEIFDVPNFNSVFIVDKFVSGKTHLVDSFM 63
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
E +D +N EE G GRKE +V L F ILT GTILC
Sbjct: 64 GEPPRDLTKGMN---EEVAGAGRKE------IVLL-------------FFGILTYGTILC 101
Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA 884
Y A LFEGP+ SK +DPVS S+ S++SA
Sbjct: 102 YHACLFEGPDGNSKLEDPVSAQNSVGDSSISA 133
>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1413
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 217/1011 (21%), Positives = 399/1011 (39%), Gaps = 128/1011 (12%)
Query: 57 NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD---------- 94
N+VV +N++ I+ VR + + E + G+T+R V MD
Sbjct: 49 NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108
Query: 95 --GISAASLELVCHY------RLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
I L V + RLHG V E + I+ A + D ++++F+DAKI+
Sbjct: 109 KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D+IH L S+H +E +P+ + L R L +VDP RC + + +
Sbjct: 165 LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V ++D S +++L + ++++V DF+F+ G+ P
Sbjct: 221 ILPFYQTQAELDVMEQD---QSQAKDVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ IL + + TW GR+ T + ++ + +I S LP+DA+ LL + I
Sbjct: 278 TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337
Query: 319 GGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
GGV+V+ N++ Y QS+ +L +N +A + P ++ + L+ + + +
Sbjct: 338 GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDATLKLHLEGCRSAMVDD 397
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
L K G + + ++ DG+ V +L ++ P++ + I T+ + F+GS +G
Sbjct: 398 KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ G K + D + D D + G+
Sbjct: 455 PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505
Query: 491 SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
+N +KT ++RD L GP+ ++ L +N + +ATG + L
Sbjct: 506 ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565
Query: 542 VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
+ LP K +Y SRG + R + H + LI+S + +
Sbjct: 566 FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625
Query: 596 LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
+ + S++ G TI A F R ++ V R+L DG+ + +
Sbjct: 626 VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
+ SIADP+VL+ D SI + V D +P +SS+ ++ C
Sbjct: 679 KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737
Query: 711 TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
G L + + D + +T + GA + ++ G LEI+ +
Sbjct: 738 FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786
Query: 769 PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
P + F+ S ++ + DT+ A Q + + ++
Sbjct: 787 PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833
Query: 829 MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
+ P L L G + Y+A + +N D P +TS + ++A
Sbjct: 834 LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
R + + +T + + G F +G RP W + R ++V+P
Sbjct: 892 IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ AFT F+ T G + + +P +D P++ +
Sbjct: 952 S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPDFQ-FDGPLPMRSI 1000
>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
mansoni]
Length = 1958
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 147/586 (25%), Positives = 255/586 (43%), Gaps = 106/586 (18%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +K + PT + NC + H + +D+ L + NLV+T
Sbjct: 15 AVFKHISPPTAVDNCLYCHLKH----------ISPPTAVDNCLYCHLTHPKLKNLVITRG 64
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
IEIY V+ S SGET+ V ++ N+ + + G
Sbjct: 65 GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 104
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
S++L+F +AK++V+ F+ LR S+H +E + +LK GR +F + P++
Sbjct: 105 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 157
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
++DP RC +LVY + +L + + + D + + + R + +
Sbjct: 158 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 217
Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+ +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 218 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 277
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
+ +P+IW +LP D +++VP PIGGV+V+ AN+I Y Q+ SC L LN YA +
Sbjct: 278 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 337
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
Q++P S + +D L L+ T++G+L LL++ + + V L K
Sbjct: 338 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 396
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
+V + + + F+GSR DS+L++ + G+ ++
Sbjct: 397 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 456
Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
S L + +E D + STKR +D + D
Sbjct: 457 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 516
Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
+ ++ LYG + + S + +SF V D L+N+GP+ + G
Sbjct: 517 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 562
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/259 (22%), Positives = 104/259 (40%), Gaps = 27/259 (10%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ + + +G LEI+ +P+F ++ V F ++D + + ++ +
Sbjct: 1103 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1159
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
++I ++E+ + RP L + T I ++A L P+
Sbjct: 1160 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1207
Query: 873 TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
S L RLR R PL A R T P Q + F+NI
Sbjct: 1208 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1266
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
H+G F+ G P W +LRV P DG + +F L+ C+ GF+Y T +++
Sbjct: 1267 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1326
Query: 982 QLPSGSTYDNYWPVQKVVF 1000
LP G +++ + ++ +
Sbjct: 1327 TLPPGYSFNEHLGIKWITL 1345
>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
Length = 1305
Score = 152 bits (385), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 169/673 (25%), Positives = 300/673 (44%), Gaps = 98/673 (14%)
Query: 55 VPNLVVTAANVIEIY-----VVRVQE---EGSKESKNSGETKRRVLMDGISAASLELVCH 106
V NLVV +N +++Y V VQ +GS S +T+ L+L+
Sbjct: 37 VRNLVVARSNFLQVYEVLEEPVPVQSSVTDGSSASMREDQTR------------LQLLAE 84
Query: 107 YRLHGNVESLAILSQGGADNSRR--RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
+ HG V LA LS ++R+ R ++++F DAK++V+E+ D +H L SMH FE
Sbjct: 85 HVCHGIVTGLARLS---TLDTRQDGRHRLVISFRDAKMTVMEWSDQLHDLAPVSMHSFE- 140
Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGG 224
L +G + A +++VD RC +L+ + IL Q S L ED G
Sbjct: 141 -RLPQLSQG-DLGAFQAVLRVDQASRCVALLLPDNTLGILPFFQDLSEL---ED-MTREG 194
Query: 225 GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCM 282
S S I+L ++ +++V DF F+ G+ EP + IL +R+ TW GR+ +
Sbjct: 195 LQSLPYAPSLTIDLSEIGPGIRNVVDFAFLPGFSEPTIAILFQRKPTWTGRIDFAKDITS 254
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALA 341
+ +++ + +P+I+ A LP+DA L P +GGV+++ AN+ +H S +A
Sbjct: 255 LVMVTLDIGSRNYPVIFEADGLPYDALSLSVCPRELGGVVILCANSLVHIDQSSKMTGIA 314
Query: 342 LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
+N + +L ++ R + + L+ A ++ VA+L T+TG+ L + DGR V +
Sbjct: 315 VNGWTSTLTDARLDSRPTLRLVLEGAQCAFVGQQVAVLCTRTGETFSLHLEKDGRNVSSM 374
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
D + + + I T+G + F+GS G S+L+++ SG DI
Sbjct: 375 DCRPRAVTCIPACIETVGAAYVFVGSAQGQSVLLRWASQSGAG----------ADILDIT 424
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSA-SNNTESAQ-----KTFSFAVRDSLVNIGPL 515
S L + SDA+ D LY +A ++N Q K + D+L G +
Sbjct: 425 ESGTGLVQ--SDAMDD-------DLYATAGAHNGNGHQIAPTGKDVQLELCDTLPGYGTI 475
Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
+ + D ++ + + S + G+ T++ D A D +
Sbjct: 476 RHIAV-----LDHTSASLDEPSLVACTGVQAMAGLTTIHRHVPSVRQVDLDLPTARDIRH 530
Query: 576 HAYLIISLEAR----------TMVLETADLLTEVTESVDYFVQGRT---------IAAGN 616
+ LE R ++ T + + ++D Q T +AAG+
Sbjct: 531 --IWTVGLEQRQKMGRGPITHQIICSTGS--SSMVYTLDQDTQAATLARKSAEVPLAAGS 586
Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
F R +V++V E R+ G +E+ G ++ + V+++DP+V + +
Sbjct: 587 FFSRSQVLEVTEDMLRLYSPD--------GQITTEAPHGQADA--IDVTVSDPFVAVLSA 636
Query: 677 DGSIRLLVGDPST 689
++ + GDP+T
Sbjct: 637 ARNVTVFFGDPTT 649
>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
Length = 1495
Score = 152 bits (384), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 235/986 (23%), Positives = 386/986 (39%), Gaps = 153/986 (15%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S L LV Y L G V SLA + +++ +S++L+F+DAK+S++E+D HGL
Sbjct: 147 STTKLVLVGEYALAGTVTSLARIKI--SESKSGGESLLLSFKDAKLSLVEWDPERHGLST 204
Query: 157 TSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS 211
S+H +E E W ++ + DP+GRC + + IL QG
Sbjct: 205 VSIHYYEQEEIGGSPWDPYLSNCFNY-----LTADPRGRCAALKFGARNLAILPFRQGDE 259
Query: 212 GLVGDE--------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
D+ T + G S V+ L LD + H F++ Y
Sbjct: 260 DTTMDDWDEELDGPRPTTAIITSENKGHEDTPYAPSFVLRLSSLDPTLIHTVHLAFLYEY 319
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL + + + ++ K I LP+D +K++ +P
Sbjct: 320 REPTFGILSSTLSPSSSLLDERKDQLSYMVFTLDLNQKASTTILVVTGLPYDLFKVIPLP 379
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
SPIGG L+VG N IH + +A+N A S S + +SS + L+ L
Sbjct: 380 SPIGGALLVGGNELIHIDQSGKANGVAVNALAKSCTSFGLVDQSSLQMRLEGCAVEQLSA 439
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFL 425
D L+ TG+L +L+ DGR V L+L + PS S + I ++ F+
Sbjct: 440 DNGEMLIILNTGELAVLSFRMDGRSVSGLNLRRV-PSESGICMGAQASCTSLINHNSMFI 498
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE-- 483
GS DS+++ ++ S + + + + D +D + GE
Sbjct: 499 GSEDTDSIVLGWSRKSKQAGRRRS-QPTIDAGDDADVDGTDEDQEDEDEDEDDLYGESTA 557
Query: 484 -LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----RINADASATGISKQSN 538
+ L G + + S ++F + DSLVNI PL+D + R + D AT IS +SN
Sbjct: 558 AIPLKGEVAADANSKAGDYAFRIHDSLVNIAPLRDVTLSKPETPREDEDEEAT-ISTRSN 616
Query: 539 YELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
+ELV E P +GIWT+ K + + A
Sbjct: 617 FELVGVTGRNTSGSLAFLRREIEPNVIGRFEFPEARGIWTLCAKRPLIKGLEPEKSEAIL 676
Query: 573 D-------EYHAYLIISL-------EARTMVLETADLLTEVTESVDYF-VQGRTIAAGNL 617
D ++ +I+S E+ VL +A E ++ G TI G +
Sbjct: 677 DPESELGAQFDRLMIVSKSTEDTPEESSVYVLTSAGF--EALADTEFEPAAGATIKCGTV 734
Query: 618 FGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
RV+Q+ + R DG + Q L + E+G+ +++ SI DPYVLL
Sbjct: 735 GNGMRVVQILKSEVRSYDGDLGLAQILPM--FDDETGA---EPKIVAASIVDPYVLLIRD 789
Query: 677 DGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG 736
D SI + D ++ + K +S C LY+D STG+
Sbjct: 790 DASIFVASCDSDNDLEEIERGDDSLLTNKWLSGC-LYND----------------STGMF 832
Query: 737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--R 793
++G + I S++ E GAL ++ +P+ + ++ + T I Y R
Sbjct: 833 AETALSNGTVSKKSVIMSLLNSE-GALFMYALPDLSKPIYQANGVSFIPTTISPDYATRR 891
Query: 794 EALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCY 853
+ ++ TE+ L A P+L ++ + Y
Sbjct: 892 STVAETLTEV-----------------------LLADLGDATSKSPYLIFRASNDDLTIY 928
Query: 854 QAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR 913
+ F+ P S+ P S+SL + + T + A E G+P +
Sbjct: 929 EP--FQVP-----SEAPRPLSKSLHFQKIHNPHVAKTANPETEV-AADAESAKRGSPMRA 980
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
I N+ G FL G P + + + L + + + H C+ GFIYV
Sbjct: 981 IA---NVGGLSSVFLPGDSPSFVVKSSKSTPRVVGLRGHGVRSLSGFHTEGCDRGFIYVD 1037
Query: 974 SQGILKICQL-PSGSTYDNYWPVQKV 998
S+GI ++ QL P + D ++KV
Sbjct: 1038 SKGIARVSQLEPETNVTDIGLTLRKV 1063
>gi|257215708|emb|CAX83006.1| Cleavage and polyadenylation specificity factor subunit 1
[Schistosoma japonicum]
Length = 462
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 195/397 (49%), Gaps = 45/397 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV+T + IEIY ++ S SGET+ + + S+ + N+ +
Sbjct: 41 NLVITRSGFIEIYNIK--------SSVSGETR----FNWVYGTSV--------YENIADI 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G +F +AK++V+ F+ LR S+H +E + +LK GR +
Sbjct: 81 VSVRFAGDLLDSLLL----SFSEAKVAVMNFNPITFELRTLSLHNYE---FENLKSGRMN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK--------ASQGGSGLVGDEDTFGSGGGFSA 228
F + P++++DP RC +LVY + +L +++ +G + +A
Sbjct: 134 FTKLPILRLDPYQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHIGVRNFLLWQQRATA 193
Query: 229 RIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
+ ++ L + +V D F+HG+ EP +++L+E TWAGRVS + TC I A
Sbjct: 194 PLLATFTTCLSTSTGEKINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVA 253
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNN 344
LS + + +P+IW +LP D ++ VP PIGGV+++ AN+I Y Q+ SC+L LN
Sbjct: 254 LSFNLQKRTNPVIWFQESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNC 313
Query: 345 YA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQ 399
YA + Q++P S + +D L L+ T++G+L LL++ + + V
Sbjct: 314 YAQISTNFPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVT 372
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
L K +V + + + F+GSR DS+L++
Sbjct: 373 SLLFHKVGHAVPPHCMVLLESKYLFIGSRFCDSVLMK 409
>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
Length = 1003
Score = 150 bits (378), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 143/529 (27%), Positives = 215/529 (40%), Gaps = 104/529 (19%)
Query: 543 ELPGCKGIWTVY------HKSSRGHNADSSRMAAYDDEY-----------------HAYL 579
+LPGC +WTV G A+S+ + H +L
Sbjct: 107 DLPGCLDMWTVIGIPPESKPQEEGEKAESAGSEEKPEGEKEETKEEGPPDVDLTNSHGFL 166
Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
I+S E TMVL+T + E+ S + QG T+ AGN+ + +IQV G R+L G
Sbjct: 167 ILSREDSTMVLQTGKEIMELDHS-GFSTQGPTVYAGNIGNNKYIIQVSPYGIRLLQGVKQ 225
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTP 697
Q L F S+ + S+ADPY L+ DG I LL V DP +
Sbjct: 226 LQHLPFD---------SKGPAFVLASVADPYALVMSEDGQILLLTLVNDPYGSGHRLSAK 276
Query: 698 AAIESSKKPVSSCTLYHDKG------------PEPWLRKTSTDAW------LSTGV---- 735
+ K + Y D P P + K + + + TGV
Sbjct: 277 KIDMAGKSQAITVCAYRDTSGLFTVSSPSTTTPAPEVEKDAAEPAAEDAVAMETGVDDED 336
Query: 736 ----GEAIDGADGGPLDQGDI-------------------YSVVCYESGALEIFDVPNFN 772
GE G + + ++ + V+C E+G+LEI+++P+F+
Sbjct: 337 EMLYGEPSAKPSGPAVVREEVKPSTSTVQEPVVKEVEPTHWCVICRENGSLEIYNLPDFS 396
Query: 773 CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V+ V F +G +VD++ + + K+ V E+ M
Sbjct: 397 LVYLVKNFPTGMKLLVDSFQSTSSASTSQS------------DKQGDQLASVKEILMVGL 444
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
SRP L A + D +L Y+A+ P + S P T + V + + R
Sbjct: 445 GHKGSRPHLLARV-DEDLLIYEAF----PYHLS----PSYTMLKIRFKKVQHNLILRERK 495
Query: 893 SRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQL 949
A +EE+ G+ Q F +ISG+ G F+ GS P W M R LR+HP
Sbjct: 496 GGKTKKAGDQEESDGQTGSRIQHFRTFTDISGYSGLFICGSSPHWLFMTSRGALRIHPMS 555
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
DG++ F+ HNVNC GF+Y G L+I LP+ +YD WPV+KV
Sbjct: 556 IDGAVTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKV 604
>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
Length = 1393
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 215/1019 (21%), Positives = 408/1019 (40%), Gaps = 159/1019 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 ----AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD +S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
++ ES+ + V G TI+ G R+RV+QV + R D +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
TQ ++ ++ +S S+AD + + D ++ L D S V
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719
Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+ S K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
L ++ V + + ++ + G + ++ SSE G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
V +L + WSA P+L ++ Y+ ++ VST +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S+ N R P + + + + + I +ISG F+ G+ + +
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRT 905
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+L + + + L C+ GFIY+ SQ ++ C+LP + +D W +++V
Sbjct: 906 SASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRV 964
>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1406
Score = 149 bits (377), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 203/975 (20%), Positives = 381/975 (39%), Gaps = 188/975 (19%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSGETKR-------RVLMD---------- 94
N+VV +N++ I+ VR +Q + E + + +R V MD
Sbjct: 51 NVVVARSNLLRIFEVREEPCPIQNQADDERERRSKVRRGTEAVEGEVAMDEQGDGFINIA 110
Query: 95 --------GISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFEDAKIS 143
+ V + LHG +E + I+S R D ++++F+DAKI+
Sbjct: 111 KSQKCPTHTPTVTRFYFVREHHLHGIVTGIEGVKIMSS----LEDRLDRLLISFKDAKIA 166
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D++H L S+H +E +P+ + + S R L + DP RC + + +
Sbjct: 167 LLEWSDAVHDLITVSIHTYERAPQLMSID---SSLFRTEL-RTDPISRCAALSLPRHALA 222
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V D+D S +++L D ++++V DF F+ G+ P
Sbjct: 223 ILPFYQSQAELEVMDQD---QSQAKDVPYSPSFILDLPAQVDQNIRNVIDFAFLPGFNNP 279
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ +L + + TW GR+ T + ++ + +P+I S LPH+ LL + +
Sbjct: 280 TIAVLFQTQQTWTGRLREFKDTVRLVIFTLDIVTQNYPIITSVEGLPHECLALLPCGTSL 339
Query: 319 GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFSVE-------LDAAHAT 370
GGV+++ +N I Y QS+ L +N + + ++P S + E L+ + A
Sbjct: 340 GGVVIITSNAIIYTDQSSKRVVLPVNGWVSRI---SDIPLPSLTPEEQLRNICLEGSRAV 396
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLG 426
++ + + K G + L +V DG+ V +L +S P + + I ++ L F +G
Sbjct: 397 FVDDRNLFVILKDGTVYPLEIVVDGKTVSKLTMS---PPLAQTSIPSVLRKLDDDHFLVG 453
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFG---DIEADAPSTKRLRRSSSDALQDMVNGEE 483
S +G S+L++ ++ ++EE D+EA AP+T + D N
Sbjct: 454 SSVGPSVLLK----------AAHIEEEVAEDHDMEA-APATVVYDADDMEFDDDDGNLPR 502
Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQS 537
++ + ++RDSL GP+ D ++ L N D +ATG
Sbjct: 503 VA-------QPMAKPTVIHLSLRDSLPAYGPISDMTFSLAKNGDRPVPELVAATGSGFLG 555
Query: 538 NYELVE--LP-----------GCKGIWTV------------YHKSSRGHNADSSRMAAYD 572
+ L + LP G +G+W++ Y K+ A++ +
Sbjct: 556 GFTLFQRDLPVRTKRKLHVIGGARGLWSLPIRQPVKASGISYEKAVNPFQAENDSLIIST 615
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
D + +S + V+ TA + G TI A F R V+ V R
Sbjct: 616 D-INPSPGLSRAGKNDVMITAR------------IPGTTIGAAPFFQRTTVLHVMTNALR 662
Query: 633 ILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
+L+ G + +D+ + + SI+DP+VL+ D SI L +G+
Sbjct: 663 VLEPGMQIIKDMD---------GNMPRPRIRACSISDPFVLILREDDSIGLFIGETERGK 713
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKG-PEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
+ + + + SC G E ++T +++ + A++ G
Sbjct: 714 IRRKDMSPMGDK----VSCFYTDTTGLLESNFENSTTPVGVTSTLSAAVNAGSKGQ---- 765
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ ++ G +E++ +P F+ D S + +VD++ A
Sbjct: 766 --WLILVRPQGIVELWTLPKLTLGFSADGLTSLQNVLVDSHDPPA-------------PS 810
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
Q V ++ + RP L L G + Y+
Sbjct: 811 LPQDPPRKPQEFDVEQILVAPIGESSPRPHLCVFLRSGQLTIYEVLPLG----------- 859
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-------- 922
T+ +L + +++ ++ S + EE G ++ I++
Sbjct: 860 -RTTEALPKVRPAHVKIKFVKISSMAFEIQRPEEGEKGIIAEQKRIYRMFVPFVTSASPG 918
Query: 923 --HQGFFLSGSRPCW 935
G F +G RP W
Sbjct: 919 VTFSGVFFTGDRPNW 933
>gi|116182170|ref|XP_001220934.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
gi|88186010|gb|EAQ93478.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
Length = 1394
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 244/1050 (23%), Positives = 401/1050 (38%), Gaps = 204/1050 (19%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
NL V +++++I+ +V S+N+G R DG+ A+
Sbjct: 41 NLAVAKSSLLQIFRTKVIATELDTSQNNGHRTRNANRYESRLANDDDGLEASFLGGDSLA 100
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRR-RDSIILAFEDAKISVLEFDD 149
L LV + L G V L + N+R DS++LAF+DAK+S++E+D
Sbjct: 101 QRTDRANYTKLVLVAEFPLAGTVTGLVRIK---TPNARLGLDSLLLAFKDAKLSLVEWDT 157
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
H L S+H +E E + DP RC + + IL Q
Sbjct: 158 EHHTLSTVSIHYYEQEELQGSPWAAPLSHYANFLAADPGSRCAALKFGARNLAILPFKQA 217
Query: 210 GSGL-VGDEDTFGSGGGFSARIES----------------SHVINLRDLD--MKHVKDFI 250
+ +GD D G + + S S V+ L +LD + H
Sbjct: 218 DEDIDMGDWDEELDGPRPAKDLSSAVINGASNIEDTPYSPSFVLRLSNLDPSLLHPVHLA 277
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP IL H M+ L + K I S LP D ++
Sbjct: 278 FLHEYREPTFGILASTAAASNSLGRKDHFVYMVFTLDLQQ--KASTTILSVTGLPQDLFR 335
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
++ +P+P+GG L+VG+N IH +A+N S + +S ++ L+
Sbjct: 336 VVPLPAPVGGALLVGSNELIHIDQSGKPNGVAVNPMTKHCTSFGLVDQSDLNLRLEGCVI 395
Query: 370 TWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGN 420
L D+ L+ G + ++T+ DGR V L+L + + S++ ++T IG
Sbjct: 396 DVLAADLGELLIILNDGQMAVMTLRIDGRTVSGLELKILPASSGGSIVPGRVSTLSRIGR 455
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA------ 474
+ F G GDS+L FG + +R R+ +A
Sbjct: 456 NAMFAGLEEGDSVL-------------------FGWAKKQTQVGRRKPRTKDNAGDVDVE 496
Query: 475 ---LQDMVNGEELSLYGSASNNTESAQKTF--------SFAVRDSLVNIGPLKDFSYGLR 523
+ +E LYG AS S V D L+N+GP++ +Y
Sbjct: 497 EDEDIEEEEEDEDDLYGEASAPQHQPVSAVSGLLSGEASLRVHDRLINLGPIQAMTYSQP 556
Query: 524 INADAS-----------------ATGISKQS-----NYEL-------VELPGCKGIWTVY 554
+ S A G K + N E+ E P +G WT+
Sbjct: 557 VWLPGSEEERNSAGVHSDLQLVCAVGREKSASLVTMNLEIQPKVIGRFEFPEARGFWTMC 616
Query: 555 HKSSRGHNADSSRMAAY-------DDEYHAYLIIS------LEARTMVLETADLLTEVTE 601
K S + + +Y ++I++ E + TA +
Sbjct: 617 AKKPIPKTLQSDKGGNFLGKDYDVSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLGG 676
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
+ G TI AG + R+IQV + R DG + ++Q + + E+G+
Sbjct: 677 TEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIVPM--LDEETGA---EPR 731
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
+S SIADP +L+ D S+ + D S ++ ++ K ++ C LY D
Sbjct: 732 AISASIADPLLLIIRDDSSVFVAQMDSSNELEELEKEDQTLATTKWLTGC-LYAD----- 785
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
+T A+ E + G G P I + SG+L I+ +P+ + V +
Sbjct: 786 -----TTGAF-----AEEVAGKGGKPAQA--ILVFLLSASGSLYIYRLPDLSKPVYVAEG 833
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+S Y+ L + S+ +GT KE + + V +LA R H+
Sbjct: 834 LS--------YIPPGLS-----ADYSARKGTA---KETVAEILVADLA-NRSQLRHAN-- 874
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
D TI YQ + + +TS D S++L +L N F+++P +A
Sbjct: 875 -----DDLTI--YQPFRY----STSAGAD---FSKTLFF-----QKLPNAAFAKSPEEAD 915
Query: 901 TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTV 959
E T H + NI+G+ FL G+ P + + + RV P L ++A +
Sbjct: 916 EDEAT-HQPRMLSMRRCSNIAGYSTVFLPGASPSFIIKSSKSAPRVLP-LQGAGVIAMSP 973
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
H C +GFIY SQ + ++ QLP Y
Sbjct: 974 FHTEGCENGFIYADSQHMARVTQLPQDWNY 1003
>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
24927]
Length = 1407
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 226/1077 (20%), Positives = 407/1077 (37%), Gaps = 258/1077 (23%)
Query: 57 NLVVTAANVIEIYVVRVQEEGS-----KESKNSGETKRRVLM-----DGISAAS------ 100
NLVV ++++I+ + E+ E+K+ G + RRV D + S
Sbjct: 31 NLVVAKTSLLQIFRLVEYEDAEGEFALDEAKDEGGSDRRVFEGRDHEDSFTVESGMHLQR 90
Query: 101 --------LELVCHYRLHGNVESLAIL----SQGGADNSRRRDSIILAFEDAKISVLEFD 148
L+LV Y L+G+V S+ + S+ G D ++++F+ AKIS+LE+D
Sbjct: 91 ETIEKTTKLDLVAQYHLYGSVTSMVKIRIPTSKSGGD------CLLVSFDSAKISLLEWD 144
Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGV-LVYGL 199
+ H + S+H +E E+ R PL DP+ RC + L
Sbjct: 145 PAAHSISTISLHYYEGDEF-----------RSPLTPEFPINYLISDPKSRCAAFKFNHDL 193
Query: 200 QMII-LKASQGGSGLVGDEDTF-----------------------GSGGGFSARIESSHV 235
I+ + ++ + D D+F G G S V
Sbjct: 194 VAILPFRQTEDEDLEIPDNDSFTYDLEDDDDAEKPKKDVEMKDNTGEGKPSDTPYHPSFV 253
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
++ LD ++ + D +F+H Y EP I+++ + G + + +++ +
Sbjct: 254 LSASQLDESVERIIDIVFLHEYREPTFGIVYQPQQGSVGMLERRKDPTHFIVVTLDLDQR 313
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSS 352
I SA NLP D +K +A+P PIGG L++G + I H A+A+N+YA +
Sbjct: 314 ASTSIMSAKNLPFDIWKAVALPPPIGGTLLLGEHEIVHVDQAGKMSAVAVNSYAQQYSAF 373
Query: 353 QELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTN--- 407
+S + L++ A L N+ L+ T GD +L+ +GR + L + +
Sbjct: 374 NMTDQSDLELNLESCSAISLPNENGDVLIVTIAGDFAILSFKAEGRSISSLSVRRIQSKD 433
Query: 408 ----PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
S + +GN FFLGS D++L + + LS S
Sbjct: 434 GYPFTSAPCETLVEVGNRRFFLGSLDSDAMLWGYKRKGEKTSLSQK-------------S 480
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------------FSFAVRDSLVN 511
+L R + + +E LYG ++ + +K + F D L N
Sbjct: 481 EVKLERDDA-EDNVEDDDDEDDLYGESTVTPITPRKASSGNIGRGSSGEYVFRRHDRLQN 539
Query: 512 IGPLKDFSYG--------LRINADA-------SATGISKQSNYEL------------VEL 544
+GP + ++G L+++ + TG + + +
Sbjct: 540 VGPCRQMAFGRPAMLPEKLKLHQGVLPELELMATTGRGVEGAVTVFNTSICPRVSATFDF 599
Query: 545 PGCKGIWTVYHKSSRGHNA--DSSRMAAYDDE------YHAYLIISLEARTMV------- 589
C+ +W V+ K + + SS Y+++ Y YL S + T+V
Sbjct: 600 KDCQRLWAVHSKQVKKGQSMIPSSVSKGYEEQIGATEDYSTYLFASNTSETLVYKVGTKF 659
Query: 590 --LETADL-LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
LE D+ TEV ++++ G R+ QV E ++ D Q +
Sbjct: 660 EPLEGTDIETTEVCPTLEF---------GTFQDGLRIAQVCETNVKVYDSEL--QLIQII 708
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKK 705
+N E G + ++S S ADPY+LL D SI T + ++ PA I+ +K
Sbjct: 709 STNDEDPDGGPH--IVSASFADPYMLLICGDSSILACQCHERTLELDRIELPATIKDTK- 765
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY---ESGA 762
T+ L T E G V+C+ E G
Sbjct: 766 --------------------YTNGCLYTSSSEV--------FGLGTKSQVLCFLLTEEGT 797
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG-QGRKENIHS 821
L++F +PNF T++ F D ++ S E ++ I
Sbjct: 798 LQVFTLPNFELKATLEHF-----------------DMSLQLVSPDETALRFHTARDEIEE 840
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L A P+L I+ Y+ ++ G
Sbjct: 841 IIVADLGDNISKA----PYLIVKTKRDDIIIYEPFISNG--------------------- 875
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
+ ++ N + P + + +++P G P +I ++ G+ F++G P + +
Sbjct: 876 ICFKKIYN---TVLPTVSLSEQKSPSG-PLVKI---DDLGGYSVAFMAGDTPTFITKSSK 928
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
L +L G + + + + GF+Y+ S+G ++C P S ++ W Q++
Sbjct: 929 TLPKLYKLQGGMVRSLSPFNTKETERGFLYIDSKGTARVCHFPEVSM-EHTWLSQRI 984
>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 1443
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 234/1046 (22%), Positives = 414/1046 (39%), Gaps = 188/1046 (17%)
Query: 57 NLVVTAANVIEIYVVRVQ------EEGSKESKNSGETKRRVLMDGISAA----------- 99
NL+V ++++++ ++ EEG+ SK + + + DG+ A+
Sbjct: 28 NLIVAKTSLLQVFTTKITSIELGIEEGA--SKQNDKWDPSLDNDGLDASFIGADSLLRPD 85
Query: 100 -----SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
L LV Y L G + SLA + + + +++++ F DAK+S++E+D + G+
Sbjct: 86 RARRTKLVLVAEYTLSGTITSLARIKTLSSKSGG--EALLVGFRDAKLSLVEWDPARPGI 143
Query: 155 RITSMHCFESPEWLHLKRG------RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
S+H +E E L+R +ES + DP RC + G + I+ Q
Sbjct: 144 STISIHYYEQDE---LQRSPWAPNLKESVN---YLIADPGSRCAALKFGGRNLGIIPFKQ 197
Query: 209 GGSGLVGDE----------------DTFGSGGGFSARIESSHVINLRDLDMKHVKD--FI 250
+ D+ S S V+ L LD +
Sbjct: 198 DDEDVNMDDWDEEIDGPRPADKVITKATNSSNDKETPYGPSFVLRLATLDPNLINPIHLA 257
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F++ Y EP IL ++ + + + H T M+ L + + I S LP+D
Sbjct: 258 FLYEYREPTFGILSSSQMPASSLLFERRDHLTYMVFTLDLQQ--RASTTIMSVTGLPYDL 315
Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
++++ + +P+GG L++G N IH + +A+N +A S + +S + L+ +
Sbjct: 316 FEVVPLDAPVGGALLIGTNELIHIDQAGKANGVAVNVFAKQCTSFGLVDQSGLDMRLEGS 375
Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---I 418
L Q+ ++ +TG++ +L+ DGR V L + + + SV+ + ++T I
Sbjct: 376 KIEQLSIQSGEMIIFLQTGEIAILSFHMDGRSVSSLSVRRVSAEAGGSVIPARVSTLSHI 435
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
G + F+GS DS+++ + S S S K IE ++ D D
Sbjct: 436 GQNTLFVGSACADSMVLGW---SRKSNQVSRRKPRVEVIEDADDASLDELDDEDDDADDD 492
Query: 479 VNGEELSLYGSASN------NTESAQKTFSFAVRDSLVNIGPLKDFSYG---LRINADAS 529
+ GE S+ A+N S + F V DSLVNI P+ + ++G L N D
Sbjct: 493 LYGEGPSIIQDATNGVAKSDTVNSKAGDYVFQVHDSLVNIAPIVNITFGNASLSQNEDEK 552
Query: 530 ATGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGH 561
+ + ELV E P +GIWT+ K + +G
Sbjct: 553 LDSVGVRGYLELVASVGKQRAGALAVIHQNIQPKVIGRFEFPEARGIWTMSAKRPAEKGL 612
Query: 562 NADSSRMA-----AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQG 609
A + + A D +Y +I+S +A + ET+D+ + + + G
Sbjct: 613 EAKKEKSSTSGDYAIDAQYDRLMIVS-KALSDGTETSDVYALTSANFEALTGTEFEPAAG 671
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
TI AG L RVIQV + R DG+ + Q L + +G+E ++S S AD
Sbjct: 672 STIEAGTLGNGNRVIQVLKSEVRSYDGNLGLAQILPM----YDDDTGAE-PKIVSASFAD 726
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
PY+LL D SI + D + ++ + K ++ C LY D
Sbjct: 727 PYLLLFRDDSSIFVAQSDENNELEEIEREDDALLATKWLTGC-LYAD------------- 772
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHI 787
S GV + G +++ ++ + GAL I+ +P+ N V+ +
Sbjct: 773 ---SRGVFAPVQSDKGQKVEE-NVMMFLLSAGGALHIYALPDLSNAVYVAEGLC------ 822
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-PFLFAILT 846
++ L + S++ E + EL + +R P+L +
Sbjct: 823 ---FVPPVLSAAYAARRSAARE-------------TITELVVADLGDETARSPYLILRPS 866
Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
+ Y+ P +TS S S S + ++ N +R P + ET
Sbjct: 867 TDDLTIYE------PFHTS------SESSGGLASTLQFLKIHNPHLARNP--DVSAAETA 912
Query: 907 HGAPCQR---ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
G R + + N+ G+ FL G P + M + L + + H
Sbjct: 913 DGIQETRDEPMRVISNLGGYCTVFLPGGSPSFIMKSAKSTPKVISLQGLGVRGMSSFHTE 972
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTY 989
C+ GFIY G+ ++ QLP +T+
Sbjct: 973 GCDRGFIYTDVDGLARVSQLPKDTTF 998
>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1460
Score = 146 bits (368), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 237/1042 (22%), Positives = 400/1042 (38%), Gaps = 183/1042 (17%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD--GISAA------- 99
NLVV +++++++ + +Q G+ + +++ + R+ D G+ A+
Sbjct: 28 NLVVAKSSLLQVFRTKTVTTEIDTLQTNGASKGRSAARYENRLANDDDGLEASFLGGDSL 87
Query: 100 ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
L LV L G V L+ + SR +S++LAF DAK+S++E+D
Sbjct: 88 GFRADRTTNTKLVLVYETPLAGTVIGLSKIK---TSTSRSGCESLLLAFRDAKLSLVEWD 144
Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMI 202
+ L S+H +E E + S PL + DP RC + +
Sbjct: 145 AERNALGTVSIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALKFAARNLA 198
Query: 203 ILKASQGGSGL-VGDEDTFGSG------------GGFSARIES-----SHVINLRDLD-- 242
IL Q + +GD D G ++ IE S V+ L +LD
Sbjct: 199 ILPFRQVDEDIDMGDWDEELDGPRPQKDVSNAAVSNGASNIEDTPYSPSFVLRLSNLDPS 258
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
+ H F+H Y EP IL H+T M+ L + K I S
Sbjct: 259 LLHPVHLAFLHEYREPTFGILASTSSASNALGRKDHYTYMVFTLDLQQ--KASTTILSVS 316
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
LP D Y+++ +P+P+GG L+VG N IH +A+N S +S +
Sbjct: 317 GLPQDLYRVVPLPAPVGGALLVGCNELIHIDQSGKPNGVAVNPMTKQCTSFGLADQSDLN 376
Query: 362 VELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDI 415
+ L+ L D+ L+ G +VL+T DGR V L+L P +++ I
Sbjct: 377 IRLEGCIIDVLTPDLGEFLMILNDGRMVLITFRIDGRTVSGLELRLVPPASGGTIIPGRI 436
Query: 416 TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+T IG ++ F GS GDSL+ + T + + + + D
Sbjct: 437 STLSRIGKNVMFAGSEEGDSLVFGW-----TKKQTQAGRRKSKPRDDDFYMDDYEEEEEE 491
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADAS 529
D+ E S + S + SF + D L++I P++ +YG + ++
Sbjct: 492 VDEDDLYGEETTSHHQPVSAASSLLSGDLSFRIHDRLISIAPIQSMTYGQPVWMPGSEEE 551
Query: 530 ATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
I ++ +LV E +G WT+ K +
Sbjct: 552 RNSIGVHADLQLVCAVGRDKSSCLATMNLAIQPKVIGQFEFSEARGFWTMCAKKPIPKSL 611
Query: 564 DSSRMAA------YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQG 609
S + + YD +Y ++I++ E + TA + + G
Sbjct: 612 QSDKGVSVLGGNDYDTGGQYDRFMIVAKVDLDGYEKSDVYALTAAGFEGLCGTEFDPAAG 671
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TI AG + R++Q+ + R DG + + P E +G+E V + SIADP
Sbjct: 672 ITIEAGTMGKGSRIVQILKSEVRSYDGDFGLSQIV--PMMDEE-TGAEPRAV-TASIADP 727
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA 729
Y+L+ D S + D S ++ + S K +S C LY+D
Sbjct: 728 YLLIIRDDSSAFIAGIDSSNELEELRKEDKVLVSSKWLSGC-LYND-------------- 772
Query: 730 WLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIV 788
ST + P I + SGAL I+ +P+ + ++ D
Sbjct: 773 --STAIFAEETAKSSKPTQS--ILLFLLSSSGALYIYRLPDLSKPIYVTDGLA------- 821
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
Y+ AL T +GT KE I + V +L H P+L ++
Sbjct: 822 --YIPPALSSDFT-----VRKGT---PKEAITEIMVADLG----DTTHKSPYLILRHSND 867
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+ YQ Y ++ + T + S + +L N F+R P + +++ P
Sbjct: 868 DLTIYQPYRYK-----------LGTGQVFS-KTLFFQKLPNPSFARAP-EETEQDDVPPQ 914
Query: 909 APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNVNCNH 967
+ NI+G+ FL G P + + + + RV P L ++A + H C+H
Sbjct: 915 PRLLSMRRCNNIAGYSTVFLPGHSPSFILKSAKSMPRVVP-LQGAGVIAMSPFHTEGCDH 973
Query: 968 GFIYVTSQGILKICQLPSGSTY 989
GFIY S I ++ Q+P +Y
Sbjct: 974 GFIYADSHNIARVTQIPEDWSY 995
>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus
heterostrophus C5]
Length = 1385
Score = 146 bits (368), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 224/1025 (21%), Positives = 402/1025 (39%), Gaps = 180/1025 (17%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++++ ++ V G E++N+ E L S A L LV
Sbjct: 28 NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYSLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ------------ 208
W + +F + DP RC + + IL Q
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200
Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
G ++ T + G + SS V+ L +LD + H F+H Y EP I+
Sbjct: 201 DGPKESKPEQQT--ASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 258
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T ++ + S ++ K + S LP+D +++ +PSPIGG L+VG
Sbjct: 259 SRDTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVG 318
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
+N IH + +A+N +A + S +S ++ L+ L ++ L+
Sbjct: 319 SNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALRLEGCSVELLSHEAGDVLVVLN 378
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
G L++LT DGR V + + S + +G F+GS GDS+++
Sbjct: 379 NGRLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVML 438
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
+T +S L+ + + D D D+ N ++ +A+ +
Sbjct: 439 GWTS------TASHLRRKQSNANIDTDEDMSDEEDMEDMEDDLYNDTAPAVQKITAAASE 492
Query: 495 ESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATGISKQ 536
+A T++F + D L +I P+K+ S G A AS T ++++
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEVMLSTGR--GAAASITALNRE 550
Query: 537 SNYELV---ELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR- 586
+ V +LP +G W V+ + + D A + +Y YL++S
Sbjct: 551 LHPVTVATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGED 610
Query: 587 ----TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
T+V E + LTE + +G T+ G L +V+QV R D S +
Sbjct: 611 GTESTVVYEVNGNELTETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEIRTYD-SELNM 669
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
D + ESG+ V++ S ADPY+L+ D S+++ A +
Sbjct: 670 DQILPMEDEESGN---EVNVINASFADPYLLVLREDSSVKIFR-------------ATGD 713
Query: 702 SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
+ V + L S WLS + ++ ++++ + G
Sbjct: 714 GELEDVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEG 752
Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKEN 818
L +F V + V + +S ++ Y+ R A+K + TEI
Sbjct: 753 GLRVFAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI--------------- 797
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
LA A P L + ++ Y+A+ S S S
Sbjct: 798 --------LAADLGDATTKSPHLIVRTSSDNLVIYKAF----------------HSPSRS 833
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCM 937
+++ LR ++ S+ + YT + + + + +I G+ F G+ P +
Sbjct: 834 AADLWTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLTLSDIGGYSTVFQRGTTPAF-- 891
Query: 938 VFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-W 993
+F+E P+ L + + T H +C GF Y+ S L+I QLP + Y + W
Sbjct: 892 IFKESSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGW 950
Query: 994 PVQKV 998
+++
Sbjct: 951 ATRRM 955
>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
Length = 1403
Score = 146 bits (368), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 226/1032 (21%), Positives = 408/1032 (39%), Gaps = 176/1032 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + +++++ + GS ++ +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++ +AK+S++E+D H + TS+H +E + +H+
Sbjct: 81 GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133
Query: 173 GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
+ A P + VDP RC VL +G + + IL Q G LV
Sbjct: 134 WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192
Query: 215 -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
DE +G F SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 193 HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T + + + S ++ + + S LP+D +K++A+P P+GG L++G N
Sbjct: 253 ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312
Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
+H + A+ +N +A S +S + L+ + L +N LL G
Sbjct: 313 ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372
Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ +L+ DGR V + L K PS +G F GS DS+
Sbjct: 373 KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
L+ G S S+ K+ D SS + +D + E LY
Sbjct: 429 LI------GWSRPSTRPKDPPVQGAGD---DNIAELSSDEEEEDDEDIYEDDLYATPVPT 479
Query: 488 GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
G+ + + S + T ++F + D L N+GP++D + G + D S +N E
Sbjct: 480 GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 539
Query: 541 LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
LV G G W+V+ K + + S
Sbjct: 540 LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 599
Query: 575 YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
Y YL++S + +++V + E T++ ++ + RTI G L G RV+QV +
Sbjct: 600 YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 659
Query: 629 RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
R D G + Q + SE V+ S ADPYVL+ D S+ LL D
Sbjct: 660 GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 714
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
S ++ I S+ S +LY DK +T+ LS V
Sbjct: 715 SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 760
Query: 748 DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
+ ++ ES L IF +PN VFT + D +I S+
Sbjct: 761 ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 798
Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
+E++ + V ++ + P+L ++ ++ Y+ Y +T+
Sbjct: 799 EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 848
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
S + S S++ + N F + + + + GA + + + ++ G++
Sbjct: 849 STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 901
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
F+ G+ PC+ + + L ++ + + + C GF+YV + ++++C+ P
Sbjct: 902 FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 961
Query: 987 STYDNYWPVQKV 998
+ +D W +K+
Sbjct: 962 THFDGSWATRKI 973
>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
Length = 1405
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 223/1021 (21%), Positives = 400/1021 (39%), Gaps = 172/1021 (16%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++++ ++ V G E++N+ E L S A L LV
Sbjct: 28 NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ-------GGSGL 213
W + +F + DP RC + + IL Q S
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200
Query: 214 VGDEDTF----GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
G +++ + G + SS V+ L +LD + H F+H Y EP I+
Sbjct: 201 DGPKESKLEQQAASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAASR 260
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T ++ + S ++ K + S LP+D +++ +PSPIGG L+VG+N
Sbjct: 261 DTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVGSN 320
Query: 328 -TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTG 384
IH + +A+N +A + S +S ++ L+ L ++ L+ G
Sbjct: 321 EIIHVDQGGKTSGVAVNEFAKTCTSFPLSDQSDMALRLEGCSVELLSHEAGDVLIVLNNG 380
Query: 385 DLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L++LT DGR V + + S + +G F+GS GDS+++ +
Sbjct: 381 RLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVMLGW 440
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTES 496
T +S L+ + + D D D+ N ++ +A+ + +
Sbjct: 441 TS------TASHLRRKQSNANIDTDEDMSDEEDMDDMEDDLYNDTAPAVQKITAAASEPT 494
Query: 497 AQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL--- 541
A T++F + D L +I P+K+ + G + ++ A N EL
Sbjct: 495 APGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEIMLSTGRGAAAAITALNRELHPV 554
Query: 542 ----VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR----- 586
+LP +G W V+ + + D A + +Y YL++S
Sbjct: 555 TAATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGEDGTES 614
Query: 587 TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
T+V E + LTE + +G T+ G L +V+QV R D S + D
Sbjct: 615 TVVYEVNGNELTETDKGDFEREEGSTLFVGILAAGTKVVQVMRTEIRTYD-SELNMDQIL 673
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ ESG+ V++ S ADPY+L+ D S+++ A + +
Sbjct: 674 PMEDEESGN---ELNVINASFADPYLLVLREDSSVKIFR-------------ATGDGELE 717
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
V + L S WLS + ++ ++++ + G L +
Sbjct: 718 DVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEGGLRV 756
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKENIHSM 822
F V + V + +S ++ Y+ R A+K + TEI
Sbjct: 757 FAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI------------------- 797
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
LA A P L + I+ Y+A+ S S S +++
Sbjct: 798 ----LAADLGDATTKSPHLIIRTSSDNIVIYKAF----------------HSPSRSAADL 837
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRE 941
LR ++ S+ + YT + + + + +I G+ F G+ P + +F+E
Sbjct: 838 WTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLALSDIGGYSTVFQRGTTPAF--IFKE 895
Query: 942 RLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQK 997
P+ L + + T H +C GF Y+ S L+I QLP + Y + W ++
Sbjct: 896 SSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRR 954
Query: 998 V 998
+
Sbjct: 955 M 955
>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
Length = 1547
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 233/1003 (23%), Positives = 382/1003 (38%), Gaps = 174/1003 (17%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR------------DSIILAFEDAK 141
D + L LV + L G V LA + A+ + DS+++AF DA+
Sbjct: 93 DRANTTKLVLVAEFPLAGTVTGLARIRTPKANRNHDGGAGHAGHAGHGCDSLLIAFRDAR 152
Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVL 195
+S++E+D H L S+H +E E + S PL + DP RC +
Sbjct: 153 LSLVEWDAEQHTLSTISIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALK 206
Query: 196 VYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN----------------- 237
+ IL Q + +GD D G + S+ V+N
Sbjct: 207 FGARNLAILPFRQADEDIDMGDWDEELDGPRPAKDPSSNAVVNGASNIEDTPYSPSFVLR 266
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L +LD + H F+H Y EP IL H M+ L + K
Sbjct: 267 LSNLDPSLLHPVHLAFLHEYREPTFGILASATAPSNALGRKDHLVYMVFTLDLQQ--KAS 324
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQE 354
I S LP D ++++ +P+P+GG L+VG+N IH +A+N +
Sbjct: 325 TTILSVSGLPQDLFRVVPLPAPVGGALLVGSNELIHVDQSGKPNGVAVNPMTRQCTNFGL 384
Query: 355 LPRSSFSVELDAAHATWLQNDVALLST--KTGDLVLLTVVYDGRVVQRLDLSKTNPSV-- 410
+ +S ++ L+ L D+ L G ++T DGR V L++ S
Sbjct: 385 VDQSDLNLRLEGCAIDVLTPDLGELFVVLNDGRAAVVTFRIDGRTVSGLEIKMLPESAGG 444
Query: 411 -----LTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGLKEEFGDIEADAP 462
S ++ IG + F G GDSLL+ + +G L + GD++A+
Sbjct: 445 SLIPGRVSTLSRIGRNAVFAGREEGDSLLLGWAKRQAQTGRRRLRARDAAGSGDVDAEG- 503
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-------------FSFAVRDSL 509
L D + + + +E ESA + SF V D L
Sbjct: 504 --AELAEGDEDVVAEGEDEDEDEEDEDDLYGEESAPRQQPVSAASSFLSGDVSFRVHDRL 561
Query: 510 VNIGPLKDFSY----------------GLRINADASAT-GISKQSNYELV---------- 542
+++ P++ +Y G+R + + T G K + V
Sbjct: 562 LSVAPIQALTYSQPVYLAGSEEERNSAGVRSDLNLVCTVGRDKSAALATVNLAIQPRVIG 621
Query: 543 --ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYDD--EYHAYLIISLEARTMVLETA 593
E P +G WTV KS +G A +S YD +Y ++I++ + E +
Sbjct: 622 RFEFPEARGFWTVCAKKPVPKSLQGDKAGNSLSKDYDTAGQYDRFMIVA-KVDLDGYEKS 680
Query: 594 DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
D+ + G TI AG + R+IQ+ + R DG + +
Sbjct: 681 DVYALTAAGFEGLGGTEFDPAAGITIEAGTMGKGSRIIQILKSEVRCYDGDFGLSQIV-- 738
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
P E +G+E V S SI DP++L+ D S + D S + +S K
Sbjct: 739 PMLDEE-TGAEPRAV-SASIVDPFLLIIRDDSSAFIAQVDSSNELEELDKEDPTLASTKW 796
Query: 707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIF 766
++ C LY D +T A+ G+ GG L Q + + SGAL I+
Sbjct: 797 LTGC-LYAD----------TTGAFAEEAPGK------GGKLSQ-SVLMFLLSASGALHIY 838
Query: 767 DVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVE 826
+P+ + V + +S Y+ L + S+ +GT KE I + V +
Sbjct: 839 RLPDLSKPVYVAEGLS--------YIPPGLS-----ADYSARKGTA---KETIAEILVAD 882
Query: 827 LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
L H P L T+ + YQ + + NT + S++L +
Sbjct: 883 LG----DMTHKSPHLILRHTNDDLTLYQPFRY----NTGAG---LEFSKTLFF-----QK 926
Query: 887 LRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
L N F+++P +A E T H + N+ G+ FL G+ P + + + +
Sbjct: 927 LPNTVFAKSPEEADDDEAT-HQPRFLSMRRCANVGGYSTVFLPGASPSFIIKSSKSVPKV 985
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
L ++A + H C HGFIY S+ + ++ QLP +Y
Sbjct: 986 LPLQGTGVIAMSPFHTEGCEHGFIYADSRDMARVAQLPQDWSY 1028
>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
Length = 1382
Score = 145 bits (365), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 242/1027 (23%), Positives = 399/1027 (38%), Gaps = 186/1027 (18%)
Query: 57 NLVVTAANVIEIYVV-------RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
NLVV ++++IY + V E G + + N KR L+L Y L
Sbjct: 28 NLVVIKTSLLQIYNLVTETVTPSVLENGQRANDNE---KRN------ETTKLQLFAEYDL 78
Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
HG V + S+ NSR D+++L+F +AK+S++E++ I + S+H +E +
Sbjct: 79 HGTVTDI---SRINILNSRSGGDALLLSFRNAKLSLIEWNPEIQNISTVSIHYYEKEDIT 135
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-------DTF 220
+ + VDP RC VL +G++ + IL Q G LV DE D F
Sbjct: 136 LSPWAPDLSQCDSHLTVDPSSRCA-VLNFGVRNLAILPFHQAGDDLVMDEYDPDLDMDDF 194
Query: 221 GSGGGFSARIES----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
++ +S S V+ L LD + H F+H Y EP I
Sbjct: 195 TGQDKNTSHTDSKKGTEKDHTHQTPYAASFVLPLTALDPTLIHPIGLTFLHEYREPTFGI 254
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
L+ T A + + + S ++ + + S LP D ++A+P+P+GG L
Sbjct: 255 LYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALPAPVGGAL 314
Query: 323 VVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
++G+N IH + A+A+N +A + S + +S + L+ + + + LL
Sbjct: 315 LIGSNELIHVDQSGKASAVAVNEFAKQVSSFPMIDQSDLGLRLENSVVEVINKECGDILL 374
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT--------TIGNSLFFLGSRLGD 431
+ TG+LVL+ DGR V + P+ D+ ++G+ F+GS D
Sbjct: 375 TLSTGELVLVHFKIDGRSVSGPVVCPV-PTNSGGDVVGATASCSISLGSGKVFIGSEDTD 433
Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGEELSLYGS 489
SLL+ S S S E+ D + + + S A ++ VN +
Sbjct: 434 SLLLDCYVSSAVSKKSKDHGEDQFDEDMNDEDDDDMYEDDLYSSAPKEAVNK-------A 486
Query: 490 ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
SN SA + +SF V D L ++ L+ + G + D+ A +S QS +EL EL G
Sbjct: 487 VSNG--SASEDYSFRVLDKLPSLASLRSVTVGKPASRDSDAGNVS-QSVHEL-ELAAAYG 542
Query: 550 ---------IWTVYH----KSSRGHNADS------SRMAAYDDEYHAYLIISLEARTMVL 590
+ H + G ADS S + +D E+ + V
Sbjct: 543 SGRNGGVALLQRALHLDGISTMNGETADSVWNINTSTKSGRNDPSEG------ESPSYVF 596
Query: 591 ETADLLTEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARIL 634
T T+ E++ Y V G T+ G L G RV+QV R+
Sbjct: 597 LTKSNSTDNEETLVYAVNGSNLEPFSAPDVNPNGDPTVDIGTLAGNSRVVQVLTGEVRVY 656
Query: 635 DGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D + M Q P E G E V S S ADPY+L+ D S+ LL D S
Sbjct: 657 DTNLGMAQ---IYPVWDED-EGDERFAV-STSFADPYLLIIRDDSSVLLLHSDESGDLDE 711
Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
+ P I SS+ + C LY DK V E D A G+ Y
Sbjct: 712 LSKPETI-SSQSWLCGC-LYTDK----------------HNVFE--DNA------TGNTY 745
Query: 754 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 813
+ + L +F +P V + D + I SS +
Sbjct: 746 MFLLNQECKLFMFRLPTRELVSVTEGV-----------------DYVSSILSSDQPAKRL 788
Query: 814 GRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 873
+E I + V +L + P+L ++ Y+ PV
Sbjct: 789 NSRETIAELLVADLG----EISTASPYLIIRSATDDLIIYK---------------PVRE 829
Query: 874 SRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
+ + V+ ++ N + P++A + +R+ +I G+ +SG+
Sbjct: 830 NSKDEKTGVTLKYIKESNHFLPKVPIEAAATDTQQRMPGLRRLA---DIGGYAAVLMSGA 886
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + L + SI + + C G IYV ++ +++ C+L + D
Sbjct: 887 SPSLVVRTSKSLPRVFSIQSDSIRGISGFDSAGCEKGLIYVDNEHVVRTCRLHDNTQLDF 946
Query: 992 YWPVQKV 998
WP++K+
Sbjct: 947 SWPIRKI 953
>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
Length = 1187
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 144/299 (48%), Gaps = 19/299 (6%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+ I +Y + +E S + D LE + Y +G + +
Sbjct: 29 NLLTAGPTCIRVYDIIKDQEDIDLDNRSDNADNHLNKDNKLHPELEFLASYSFYGKIYGI 88
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ RDS+ + F DAK+S++E+D L S+H FE E LK G
Sbjct: 89 ----ESVRFRHHHRDSLFICFADAKLSLVEYDADNSNLTTLSLHTFEDDE---LKNGFSR 141
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-DTFGSGGGFSARIESSHV 235
P+++VDP RC ++V + + IL G + D + G + + S+V
Sbjct: 142 NLSIPIIRVDPDNRCAAMVVSNVHLAILPFRHRGPAEQQVQIDPKNTSGKYP--LMPSYV 199
Query: 236 INLRDLDMKHVKDFI---FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+++RDL + V I F+ GY EP ++IL E TW+GRV+ + TC I A+S++T
Sbjct: 200 VDVRDLGNEKVSRLIDIRFLEGYYEPTILILCEILRTWSGRVAVRQDTCSILAVSLNTID 259
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP D + VP PIGGVL+ AN + + +QS YA SL+S
Sbjct: 260 KVHPVIWSLNNLPFDCLGAITVPRPIGGVLIFAANCLLHLNQSKP------PYAESLNS 312
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/391 (22%), Positives = 159/391 (40%), Gaps = 100/391 (25%)
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT--ESAQKTFSFAVRDSLVNIGPL 515
+ D P++K+LR +++ LY + ++ T ES ++++F V D ++++GP
Sbjct: 336 DTDEPTSKKLRTDDEKEDEELE-----KLYSAHTSCTAKESYLRSYTFEVCDRILHVGPC 390
Query: 516 KDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCKG 549
+ G +T + ++S+ E+V +LPGC
Sbjct: 391 ASIAIG------QISTFVQEESDVEVVICSGHDKNGALSVLNKGIKPQVVASYDLPGCVD 444
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+WTV K R ++ + + H +LIIS + TM+L T +TEV E + + Q
Sbjct: 445 MWTV--KDIRLNDENDGDFET--ENTHKFLIISRDNLTMILRTGKEITEV-EQLGFLTQT 499
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
+T+ AGNL +IQV ++ Q L S ++ S+ DP
Sbjct: 500 KTVFAGNLDNGNCIIQVTPYEVILVSKGEKIQQLEL----------ENESPIVFCSLQDP 549
Query: 670 YVLLGMSDGSIRLL---VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------- 715
Y+ L + GSI +L + D V + + S+ +++C L+ D
Sbjct: 550 YISLLLEGGSIMMLAFELSDNGEKQVKLVNTTPLNHSR--IAACCLFQDNNGRMSVSDGI 607
Query: 716 --KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI--------------------- 752
+ P P T+ A L ID + LD D
Sbjct: 608 SIRTPSP----TNEPAELMEDEKFTIDDDELLYLDVNDTNLQTNDVPVASTSYTDNLERK 663
Query: 753 ---YSVVCYESGALEIFDVPNFNCVFTVDKF 780
+ +C ++G LE++ +P+++ V+TV+ F
Sbjct: 664 VSYWLFLCLDNGKLEVYSIPSYDKVYTVNGF 694
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%)
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
L DG + F + NC +GF+Y S+ L+IC L TYD WPV KV
Sbjct: 767 LVDGYVKCFAPFNIANCPNGFLYFNSEEDLRICVLDQRFTYDCPWPVHKV 816
>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
Length = 1402
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 227/1032 (21%), Positives = 408/1032 (39%), Gaps = 177/1032 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + +++++ + GS ++ +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++ +AK+S++E+D H + TS+H +E + +H+
Sbjct: 81 GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133
Query: 173 GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
+ A P + VDP RC VL +G + + IL Q G LV
Sbjct: 134 WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192
Query: 215 -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
DE +G F SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 193 HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T + + + S ++ + + S LP+D +K++A+P P+GG L++G N
Sbjct: 253 ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312
Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
+H + A+ +N +A S +S + L+ + L +N LL G
Sbjct: 313 ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372
Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ +L+ DGR V + L K PS +G F GS DS+
Sbjct: 373 KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
L+ G S S+ K D + SSD +D + E LY
Sbjct: 429 LI------GWSRPSTRPK----DPPVQGAGDDNIAELSSDEEEDDEDIYEDDLYATPVPT 478
Query: 488 GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
G+ + + S + T ++F + D L N+GP++D + G + D S +N E
Sbjct: 479 GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 538
Query: 541 LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
LV G G W+V+ K + + S
Sbjct: 539 LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 598
Query: 575 YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
Y YL++S + +++V + E T++ ++ + RTI G L G RV+QV +
Sbjct: 599 YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 658
Query: 629 RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
R D G + Q + SE V+ S ADPYVL+ D S+ LL D
Sbjct: 659 GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 713
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
S ++ I S+ S +LY DK +T+ LS V
Sbjct: 714 SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 759
Query: 748 DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
+ ++ ES L IF +PN VFT + D +I S+
Sbjct: 760 ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 797
Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
+E++ + V ++ + P+L ++ ++ Y+ Y +T+
Sbjct: 798 EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 847
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
S + S S++ + N F + + + + GA + + + ++ G++
Sbjct: 848 STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 900
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
F+ G+ PC+ + + L ++ + + + C GF+YV + ++++C+ P
Sbjct: 901 FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 960
Query: 987 STYDNYWPVQKV 998
+ +D W +K+
Sbjct: 961 THFDGSWATRKI 972
>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
Length = 1436
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 228/1002 (22%), Positives = 380/1002 (37%), Gaps = 165/1002 (16%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
R ++ ES G V D + L LV L G V LA + GG
Sbjct: 68 RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124
Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
+++++A++ AK+ + +D L S+H +E E LH SF ++
Sbjct: 125 ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179
Query: 186 DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
DP RC + IL Q L D+ +T G S +E
Sbjct: 180 DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETAAVANGDSDTVEPP 239
Query: 234 HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
+ + L D + H F F+H Y EP IL + H T + L
Sbjct: 240 YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLD 299
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + I S +LP D +K+LA+P+P+GG L++G N IH + +A+N+ A
Sbjct: 300 LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357
Query: 347 VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S ++ ++ L+ ++N LL G + +++ + DGR V L +
Sbjct: 358 RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSIK 417
Query: 405 ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ +VL S +T +G + FF+GS +GDS+++ +T G K D
Sbjct: 418 MVTDENGGNVLKSRASTASKLGKNTFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ + D D+ E + + + N SF + D+L++I P+KD
Sbjct: 475 DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534
Query: 518 FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
+ G + +D A G K + ++ E P +
Sbjct: 535 LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETADLLT----- 597
G WT+ K S A DEY A Y+I++ + ET+D+
Sbjct: 595 GFWTMSVKKPLPKALGGS--AGVGDEYEAFGQHDKYMIVA-KVDLDGYETSDVYALTGAG 651
Query: 598 -EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGS 654
E + ++ G T+ AG + + R+IQV + R DG +TQ L + E+G+
Sbjct: 652 FETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA 709
Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
V S SI DPY+LL D S+ L D + V+ A + K + C LY
Sbjct: 710 ---EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYA 765
Query: 715 DKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-C 773
D T + D G ++ I + +GAL ++ +P+ +
Sbjct: 766 D-----------------TEGAFQFNANDKGETEK--IMMFLLSSTGALHVYALPDLSKP 806
Query: 774 VFTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V+ + H+ D +R L KE + + V +L
Sbjct: 807 VYVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG---- 844
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLR 888
P+L + Y+ P+ R SN+SA+ ++
Sbjct: 845 DTISQSPYLILRNQTDDLTIYE---------------PIHHVRPGGESNLSAALSFKKMS 889
Query: 889 NLRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
N+ + TP + E P P +R NI+G+ FL GS P + + + +
Sbjct: 890 NVTLATTPAQTEDDDVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVI 946
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
L I + H C+ GFIY +GI ++ Q PS + +
Sbjct: 947 GLQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNF 988
>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
Length = 1427
Score = 143 bits (360), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 228/1001 (22%), Positives = 377/1001 (37%), Gaps = 163/1001 (16%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
R ++ ES G V D + L LV L G V LA + GG
Sbjct: 68 RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124
Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
+++++A++ AK+ + +D L S+H +E E LH SF ++
Sbjct: 125 ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEK-EELHGAPWEVSFDEYANYLEA 179
Query: 186 DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
DP RC + IL Q L D+ +T G S +E
Sbjct: 180 DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETATVANGDSDTVEPP 239
Query: 234 HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
+ + L D + H F F+H Y EP IL + H T + L
Sbjct: 240 YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQEPAHSLGQKDHLTYKVFTLD 299
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + I S +LP D +K+LA+P+P+GG L++G N IH + +A+N+ A
Sbjct: 300 LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357
Query: 347 VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S ++ ++ L+ ++N LL G + +++ + DGR V L +
Sbjct: 358 RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSVK 417
Query: 405 ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ +VL S +T +G + FF+GS +GDS+++ +T G K D
Sbjct: 418 MVTDENGGNVLKSRASTASKLGKNAFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ + D D+ E + + + N SF + D+L++I P+KD
Sbjct: 475 DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534
Query: 518 FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
+ G + +D A G K + ++ E P +
Sbjct: 535 LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEY-----HAYLIISLEARTMVLETADLLT------ 597
G WT+ K S A DEY H +I + ET+D+
Sbjct: 595 GFWTMSVKKPLPKALGGS--AGVGDEYETFGQHDKYMIVAKVDLDGYETSDVYALTGAGF 652
Query: 598 EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E + ++ G T+ AG + + R+IQV + R DG +TQ L + E+G+
Sbjct: 653 ETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA- 709
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
V S SI DPY+LL D S+ L D + V+ A + K + C LY D
Sbjct: 710 --EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYAD 766
Query: 716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CV 774
T + +D G ++ I + +GAL ++ +P+ + V
Sbjct: 767 -----------------TKGAFQLSASDKGETEK--IMMFLLSSTGALHVYALPDLSKPV 807
Query: 775 FTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
+ + H+ D +R L KE + + V +L
Sbjct: 808 YVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG----D 845
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLRN 889
P+L + Y+ P+ R SN+SA+ + N
Sbjct: 846 TISQSPYLILRNQTDDLTIYE---------------PIRHVRPGGESNLSAALSFKKTSN 890
Query: 890 LRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
+ + TP E E P P +R NI+G+ FL GS P + + + +
Sbjct: 891 VTLATTPAQTEDDEVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVIG 947
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
L I + H C+ GFIY +GI ++ Q PS + +
Sbjct: 948 LQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNF 988
>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
Length = 1431
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 230/993 (23%), Positives = 377/993 (37%), Gaps = 144/993 (14%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
R ++ ES G V D L L+ L G V LA + + +
Sbjct: 70 RANDDDGLESSFLGVESLIVRADPSHNTKLVLISEIPLAGTVIGLARVKI--KNTPSGGE 127
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++LA++ AK+ + E+D H L TS+H +E E L+ G V + DP
Sbjct: 128 ALLLAYKAAKMCLTEWDPQRHTLETTSIHYYEKDE---LQGAPWEMPFGDYVNYLEADPG 184
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
RC + IL +Q L D ED T G G G +
Sbjct: 185 SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPCPVKEEPPTTNGDGPGDHDLVKS 244
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
R S V+ L LD + H F+H Y EP IL + H T + L
Sbjct: 245 RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSPALGIKDHLTYKVFTL 304
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
+ + I S LP D ++++A+P+P+GG L+VG N IH +A+N+
Sbjct: 305 DLQQ--RASTTILSVTGLPQDLFRVIALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
A + S + +S + L+ L ND+ LL G L ++ DGR V ++ +
Sbjct: 363 AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIICFHIDGRTVSKISI 422
Query: 404 ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
++ +++ S ++ I G++ FLGS DS+++ ++ G K +
Sbjct: 423 RLVSAECGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476
Query: 457 IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
+ + D D + G + SL S + N S SF V+D+L++I P+
Sbjct: 477 LLDPDLALDVDDLDLDDDEDDDLYGNDSSLAKPSQTINGSSKPGEVSFRVQDTLLSIAPI 536
Query: 516 KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
+D + G D+ +SK EL + P
Sbjct: 537 RDVACGAPAFVPDSEEATLSKGVTAELELACAVGRGFSGSVAILNREIQPKVIGRFDFPE 596
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
+G WT+ K A + +Y Y+I++ E + TA +
Sbjct: 597 ARGFWTMCVKKPLSKGAAVASDYDTTAQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
++ G T+ AG + + R+IQV + R DG ++Q L P E +
Sbjct: 657 DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
T S SI DPY+LL D SI + + V P S K S C LY+D
Sbjct: 714 T--SASIVDPYLLLNRDDSSIFIAQIHSNNELEEVFKPDGTLKSTKWASGC-LYND---- 766
Query: 720 PWLRKTSTDAWLSTGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD 778
T + V + D AD I + +GAL ++ +P+ V
Sbjct: 767 -------TQGIFQSNVNKQKADAAD-------RIMMFLLSSAGALHVYALPD------VS 806
Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR 838
K + ++ EAL ++++ G KE+I + V +L A
Sbjct: 807 KPI---------FVAEALTSIPPFLSAAFVARKG-ASKESITEILVADLG----DAISQT 852
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P+L + Y+ P + D ++ L V+ S + P
Sbjct: 853 PYLIVRHASDDLTIYE------PVRCQEEGDAELSASLLFKKCVNTSLAKT-----APEV 901
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
+ E P P +R N++G+ FL G+ P + + L + +
Sbjct: 902 SEDDAEPPRFVPLRRCA---NVNGYGAVFLPGASPSFVLKSSHSEPRVIGLQGLGVRGMS 958
Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
H C+ GFIYV +GI ++ QLPS +++ +
Sbjct: 959 TFHTEGCDRGFIYVDVEGIARVTQLPSNASFTD 991
>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor
FP-101664 SS1]
Length = 1431
Score = 142 bits (358), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 183/813 (22%), Positives = 344/813 (42%), Gaps = 117/813 (14%)
Query: 103 LVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC 161
LV +RLHG V L A+ + ++ + D ++++F+DAKI++LE+ D+IH + S+H
Sbjct: 123 LVREHRLHGTVTGLEAVRTVHSLED--KLDRLLVSFKDAKIALLEWSDAIHDVMTVSIHT 180
Query: 162 FE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK--ASQGGSGLVGDED 218
+E +P+ + L RG L +VDP RC + + + IL SQ L+ E
Sbjct: 181 YERAPQLMALD---SPLFRGEL-RVDPLSRCAALSLPKDSLAILPFYQSQAELDLMEQES 236
Query: 219 TFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ +S S V++L D+D +++V DF F+ G+ P + +L + + TW GR+
Sbjct: 237 SQARDVPYSP----SFVLDLANDVDQRIRNVIDFAFLPGFNNPTVAVLCQYQQTWTGRLK 292
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ- 334
T + ++ +PLI + LP+D L + IGGV ++ +N I + Q
Sbjct: 293 EYKDTVGLFIFTLDLVTNNYPLITAVDGLPYDCLSLTPCSTAIGGVFILASNAIIFVDQA 352
Query: 335 SASCALALNNY---AVSLDSSQELPRSSF-SVELDAAHATWLQNDVALLSTKTGDLVLLT 390
S L +N + L P+ +++L+ A T++ + + K G + +
Sbjct: 353 SRRVILPVNGWPPRTSDLTMPSLTPQEQLRNLQLEGARFTFVDDKTLFVILKDGTVHPVE 412
Query: 391 VVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+V DG+ V RL ++ T P+V + + + F+GS +G S+L++ T+
Sbjct: 413 LVLDGKTVSRLSMADALARTTIPAV----VARVRDDYLFVGSMVGPSVLLR------TAH 462
Query: 446 LSSGLKEEFGDIEAD-----APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK- 499
+ +KEE D++A AP+ D NGE+ S G+ + +S +K
Sbjct: 463 VEEVIKEEDVDMDAGPATVVAPADTMDLDDDDDLYGPSGNGEQPSANGATNGTVDSVKKR 522
Query: 500 -TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------------ISKQSNY 539
++ D+L G + D ++GL N D +ATG + +S
Sbjct: 523 TVVRLSLCDALPAHGAISDMAFGLARNGDRVVPELIAATGSGELGGFHLFQRDMPTRSKR 582
Query: 540 ELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLL 596
+L + G +G+W++ ++ + R ++ +D +IIS +A + A
Sbjct: 583 KLHAIGGARGMWSLAVRQAMKVSGGTLERPSSQNDS----VIISTDANPSPGLSRIATRS 638
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVF---ERGARIL--DGS--YMTQDLSFGPSN 649
++ + G T+ A F ++ + R+L DG+ + +DL
Sbjct: 639 AHSDIAITTRIPGTTLGAAPFFQGTAILHILFNVTNAIRVLEPDGTERQIIKDLE----- 693
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
+ + S SI DP+VL+ D +I L +G+ + + + + +
Sbjct: 694 ----GTAPRPKIKSCSICDPFVLIIREDDTIGLFIGELERGKIRRKDMSPMGDKTSRYVA 749
Query: 710 CTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI-------YSVVCYESGA 762
+ D T L T V E + QG + + ++ G
Sbjct: 750 GGFFTD-----------TSGLLQTFVNEQAPAENVTSTLQGAMNAGNKSQWLILVRPQGV 798
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
+E++ +P F+ + + D+Y AL S ++ + ++ +I +
Sbjct: 799 VELWTLPKLTLAFSTTLLATLDPILTDSYDGPAL--------SLPQDPPRKPQELDIDQI 850
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQA 855
+ L R RP L +L G + Y+A
Sbjct: 851 VIAPLGESR-----PRPHLIVLLRSGQLAVYEA 878
>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
protein 1
gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1393
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 166/665 (24%), Positives = 276/665 (41%), Gaps = 102/665 (15%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
P S +G+ FLGS DS+L+ G S+ SSG K+ +
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E D+ +S D +D + E+ + G + ++F D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
GPLKD ++G + S ELV +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560
Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
C +WT +S H ++ + A + E H Y+++S +A + E +++ +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613
Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
+ F + TI G L G+ RV+Q+ R DG DL P E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
SE +S S+ DPYV + D ++ LL D S V+ I +SK +SC
Sbjct: 669 --DTSEERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724
Query: 712 LYHDK 716
LY DK
Sbjct: 725 LYFDK 729
>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
Length = 1595
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 100/325 (30%), Positives = 163/325 (50%), Gaps = 72/325 (22%)
Query: 181 PLV-KVDPQGRCGGVLVYGLQMIILK----------------ASQGGSGLVGDEDTFGSG 223
P++ + DP+GRC VL+ + +K +S G + ++ G G
Sbjct: 208 PIIGRADPEGRCAAVLLRNEEKAKVKIMPASETSTSSNYIKESSNGSKKMTTKKE--GEG 265
Query: 224 GGF-SARIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKH 278
+ A I SS +++R + V+D F+HGY EPV++IL+E TW+GR+S +
Sbjct: 266 TVYVPATIGSSFDLDVRKILGPSAAFVRDCCFLHGYGEPVLMILYESNPPTWSGRLSLRM 325
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC + A+SI T K++ ++W+ LP AY L VP+P+GGVLV+ + I Y SQS+S
Sbjct: 326 DTCKLVAVSIDCTKKKYTIVWTREKLPSAAYSLFPVPNPLGGVLVLSSGHILYESQSSSA 385
Query: 339 ALALN----------NYAVSLD------------------------SSQELPRSSFSVEL 364
+ N+A + SS E ++ F V+L
Sbjct: 386 TYISDFLGKGGPQEGNFAEEIARNNGVEGQAAHANPVPHVNSNKNVSSYETTQNEFQVQL 445
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTVVYD------------GRVVQRLDLSKTNPSVLT 412
DAA ++ +VA++S+KTG L+ TV+ + GR +R+ + K+ +VL+
Sbjct: 446 DAAKIEMIRENVAIISSKTGQLI--TVILETVGGAASVGSKVGRRCRRIRVLKSGNAVLS 503
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
S + +G L F+GSR+GDSLL+ +
Sbjct: 504 SGLAAVGKDLLFIGSRVGDSLLIGY 528
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 141/560 (25%), Positives = 230/560 (41%), Gaps = 118/560 (21%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQSNY---------ELV 542
+ F+V+DSL+ I P+ D + G + D +A G K ELV
Sbjct: 668 YKFSVKDSLLCISPVVDLTVGASAPVGTDLDPRTELVAACGHGKNGALAILTRGITPELV 727
Query: 543 ------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA--RTMVLETAD 594
LPG + W + N + R D+ + +LI+SL + TMVLET +
Sbjct: 728 TEVESGALPGLRACWAT---RTEDDNDGTVRPKRKDELFDEHLILSLSSTKTTMVLETGE 784
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG- 653
L EV++ VD+ V T+A +F R + QV + R + + F + +
Sbjct: 785 ELREVSKEVDFIVDEETLACERIFNGRAIAQVTKTKIR-----FTRKGKKFAVDDIDLAF 839
Query: 654 -SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-------ESSKK 705
G E + + I + + L +SDGSIR+++GD T T ++ ++
Sbjct: 840 LKGGEGAQITLAIIQNDAIALRLSDGSIRIILGDSKTNTFTLLEKVGELFASDNHSNTGS 899
Query: 706 PVSSCTLYHD----------------KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
V++ TLY D + P WL +T + G E D + +
Sbjct: 900 DVTAFTLYDDSVACTDSFGGGGGGLNRAP-GWLERT------ACGDREEKDESK----EN 948
Query: 750 GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
++ G L ++ +P+ +++ G RE L + T I+S
Sbjct: 949 NNVVFATISRDGTLALYSLPSLKKLWSSGGVSDG---------REILAPNSTGIDSIDFN 999
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
+ K + +++ A +A + RP L DG++L YQA F+ P +
Sbjct: 1000 DECEVEKYTVSDIRLDAFA----NAAYERPLLTCFRADGSVLAYQA--FKSPSSN----- 1048
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT--REETPHGAPCQ---RITIFKNIS--- 921
LRF+R P++ T E T + Q R+T +NI
Sbjct: 1049 -------------------ELRFARVPIEIETAGSELTNNDVSVQGGSRLTRIENIGDGR 1089
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI-VAFTVLHNVNCNHGFIYVTSQGILKI 980
G G F+SG P W +V R R+ P +G +AF HNVNC GFI T++G +++
Sbjct: 1090 GIAGVFVSGLNPIWLIVRRGRVLALPTRGEGGARIAFAPFHNVNCPKGFILATNEGGIRV 1149
Query: 981 CQLPSGSTYDNYWPVQKVVF 1000
C+LP + WPV+K+
Sbjct: 1150 CRLPGKMHIEAQWPVRKLAL 1169
>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit
1 [Botryotinia fuckeliana]
Length = 1447
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 231/1050 (22%), Positives = 405/1050 (38%), Gaps = 197/1050 (18%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
NLVV +++++I+ + + E+ S +K+ RV D G+
Sbjct: 28 NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87
Query: 98 -------AASLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
L LV Y L G V SL I S+ G + +I++ F+DAK+S++E
Sbjct: 88 QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQGGSGLVGDEDTFGSG--------------GGFSARIESSHVINLRDLDMK-----HVK 247
Q + D D G G SS V+ L LD H++
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKAAAEDGQLDTPYGSSFVLRLSSLDPSIIFPIHLE 261
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLP 305
F++ Y EP IL + + + H T M+ L + K I S LP
Sbjct: 262 ---FLYEYREPTFGILSSTMAPSSALLQERRDHLTYMVFTLDMHQ--KASTTILSVGGLP 316
Query: 306 HDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
+D ++++ + P+GG L+VG N IH + +A+N +A L ++ + L
Sbjct: 317 YDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTGFSLLDQADLDLRL 376
Query: 365 DAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT---SDI 415
+ L +N L+ +GD+ +L+ DGR V L + + + ++LT S +
Sbjct: 377 EGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGAILTGAASCV 436
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
+++G F+GS + DS+++ + SG + + E D +
Sbjct: 437 SSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVDE---AMFDEEDLEDD 493
Query: 476 QDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSYG---LRINADASA 530
D + G+ ++ + +N T S KT ++F + DS+VNI P+ + ++G L + D
Sbjct: 494 DDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAFGEAALSLGKDEEL 553
Query: 531 TGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGHN 562
QS +LV +LP +GIWT+ K + +G
Sbjct: 554 KSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIWTMSAKRPAPKGLQ 613
Query: 563 ADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLTEVTESVDYF-VQG 609
+ + D +Y +I+S +A + E+A D E ++ G
Sbjct: 614 VNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGFEALTGTEFEPAAG 673
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
TI AG L RV+Q+ + R DG + Q L + E+G+ ++S S AD
Sbjct: 674 STIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA---EPKIISASFAD 728
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
P++LL D SI + D ++ I S K ++ C LY D +D
Sbjct: 729 PFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD------YSGAFSD 781
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK---FVSGRT 785
+ S GE ++ + GAL I+ +P+ + V + FV
Sbjct: 782 SK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPVYVAEGICFVPPVL 827
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+ A +++ TEI L + P+L
Sbjct: 828 SADYAARKSAARETLTEI-----------------------LVANLGDSVSQSPYLILRP 864
Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
++ + Y+ + + S S L S + +++N ++ P + EE
Sbjct: 865 SNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHLTQAP--DVSAEEQ 910
Query: 906 PHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
GA P + I+ N+ G+ F+ G P + + + L + + +
Sbjct: 911 VDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKVLSLQGTGVRSLSS 967
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
H C+ GFIY +++GI ++ Q P +T+
Sbjct: 968 FHTEGCDRGFIYASTEGIARVAQFPPNTTF 997
>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1407
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 168/697 (24%), Positives = 292/697 (41%), Gaps = 86/697 (12%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V + S+N+ E L + A L LV
Sbjct: 28 NLVVAKNSLLQIFELKSTTTEVTPGAGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+ L G V SLA + A N++ + +++++AF DAK+S++E+D + L S+H +E+P
Sbjct: 88 FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144
Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ----------- 208
+ W + +F + DP RC + + IL Q
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLVEDDYDSD 199
Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
G + G+ G SS V+ L +LD + H F+H Y EP I+
Sbjct: 200 ADGPKETKADQANGTNGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 259
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T ++ + S ++ K + S LP+D K++ +PSPIGG L+VG
Sbjct: 260 SRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLVG 319
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
N IH + +A+N +A + S +S ++ L+ L + L+
Sbjct: 320 GNEIIHVDQGGKTNGVAVNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVLN 379
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
G L++LT DGR V + + S + +G F+GS G+S+++
Sbjct: 380 NGRLLILTFTLDGRTVSGMTIQTVAADHGGHLLKSAASCTSNLGRGRLFIGSEDGESVML 439
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
+T L++ L+ + + + D D D+ N +++ +A+ +
Sbjct: 440 GWTG------LTNQLRRKLSNADLDG-EDDSEEEEIDDMEDDLYNDTAPTMHKITAAVSE 492
Query: 495 ESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL- 541
+A T++F + D L +I P+KD + G + ++ A G + EL
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRELH 552
Query: 542 ------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EART 587
ELP G+W V+ + + D+ A D +Y YL++S E T
Sbjct: 553 PISVATKELPSAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDGT 612
Query: 588 MVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
+V E D LTE + +G T+ G L +V+QV RI D +
Sbjct: 613 VVYEVNGDKLTETDKGDFEREEGTTLLVGILAAGTKVVQVMRTEVRIYDSELNLVHIQSM 672
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
E GS E + +++ S ADPY+L+ D S+++
Sbjct: 673 EEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708
>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
972h-]
gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
Length = 1441
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 233/1061 (21%), Positives = 412/1061 (38%), Gaps = 187/1061 (17%)
Query: 57 NLVVTAANVIEIYVV-RVQEEGS-----------------KESKNSGETKRRVL-MDGIS 97
NLVV+ N + ++ + ++Q++ S ES+ ET ++ + +
Sbjct: 29 NLVVSKVNSLHLFEIEKIQKDESSFPLDDSLQNEFSTSIIDESQAFMETNMHLIRTNEQT 88
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
L LV ++ G + ++ L G++ D +I+ + AK+S LE+D
Sbjct: 89 TYVLRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTN 145
Query: 158 SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
S+H +E +K + P + VDP C +L + M+ + L +
Sbjct: 146 SLHYYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDME 199
Query: 217 EDTF-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
E S S + S V+ LD + + D F++GY EP + IL+ E T
Sbjct: 200 EAAIENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVT 259
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-H 332
+ + T + S +++ + +I + +LP+D Y +++P+P+GG L++G N + Y
Sbjct: 260 LPLRKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVD 319
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDLV 387
S + + +N+Y +S F++EL+ A L + +L +G
Sbjct: 320 SAGRTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFF 379
Query: 388 LLTVVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCG 440
L + DG+ V+ L L + N L S IT G +L FLGS+ DS L++++
Sbjct: 380 YLDFLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWSRR 439
Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
+ EE E D L ++ + DM++ E +
Sbjct: 440 TT--------NEEVRLDEGD----DTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGP 487
Query: 501 FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY---ELV--------------- 542
+ D L NIGP+ DF+ G A + Q N+ ELV
Sbjct: 488 LRLEICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFR 542
Query: 543 -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVL 590
+ GC+ +WTV S + N S A Y + E YL++S E + +
Sbjct: 543 RNIFPLIAGEFQFDGCEALWTV-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIF 601
Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSN 649
+ EV S D+ +T+ G+L R++Q+ R+ D + +TQ +F
Sbjct: 602 LAGETFDEVQHS-DFSKDSKTLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNF---- 656
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSI----------RLLVGDPSTCTVSVQTPAA 699
S+ V+S SI DP +++ G I RL+ D V+T A+
Sbjct: 657 ------SKKQIVVSTSICDPCIIVVFLGGGIALYKMDLKSQRLIKTDLQNRLSDVKT-AS 709
Query: 700 IESSKKPVSSCTLY----------------HDKGPEPWL-----RKTSTDAWLSTGVGEA 738
+ S L+ +D E L KTS + + G ++
Sbjct: 710 LVSPDSSALFAKLFTYNETLNAKGQIANGMNDSASETDLDIQPNHKTSNNDQM--GYDQS 767
Query: 739 IDGADGGP--------------LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKF 780
+ AD P LDQ + + G L+++++ +F+ + D F
Sbjct: 768 V-SADDVPEVDNTIVTEKNVSNLDQESLEKHPILFALTDEGKLKVYNLADFSLLMECDVF 826
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
T + ++ T N S S ++VEL + P
Sbjct: 827 DLPPT------LFNGMESERTYFNKES-------------SQELVELLVADLGDDFKEPH 867
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
LF I Y+A+L+ NT K + ++ ++ V + +R TP DA
Sbjct: 868 LFLRSRLNEITVYKAFLYS---NTDKHKNLLAFAK---VPQETMTREFQANVG-TPRDAE 920
Query: 901 TREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAF 957
+ E + ++T + + H F++G +P + + P + I++
Sbjct: 921 STMEKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSV 980
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
H + G+IYV ++IC+ YDN WP +KV
Sbjct: 981 APFHAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKV 1021
>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
Length = 2830
Score = 139 bits (351), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 123/476 (25%), Positives = 205/476 (43%), Gaps = 81/476 (17%)
Query: 575 YHAYLIISL-EARTMVLETADLLTEVTES--VDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+HAYL+I++ RTMVL D L +VT S ++ V T+AAGNLF ++Q G
Sbjct: 1889 FHAYLLITMGRVRTMVLRCTDGLDDVTNSPECEFLVNQPTLAAGNLFHNAVIVQACPMGL 1948
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-------------IADPYVLLGMSDG 678
R+L+G + Q+L + ++ S ADPYVL+G+SDG
Sbjct: 1949 RVLEGMTLVQELRVSDFQASRPKTAQYSFCCRTKHPIAHRAMGPIPQAADPYVLVGLSDG 2008
Query: 679 SIRLLVGDPSTCTVSVQTPAA-------IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
+ LL GDP + T+ V T AA S ++ +++ L+ D+ +W+
Sbjct: 2009 TAVLLEGDPLSLTLGVATAAAEQLMAVPARSRQQRLAAACLHRDE-----------TSWM 2057
Query: 732 STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS-------GR 784
++ + I+ +C SG LE + +P+ VF + G
Sbjct: 2058 ASATAAEAASS----GSSFSIFLWICRLSGRLECYSLPSMRLVFHSSGLAAAEEVLRMGP 2113
Query: 785 THIVDTY--MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS----- 837
+ D Y +E E++ G G G E+ VVEL ++ + S
Sbjct: 2114 AVMYDVYDLFGGGGGGAEAELDG----GGGSGIMED----PVVELRVESFLGGGSPAVPD 2165
Query: 838 --RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA----------- 884
RP L + G ++ YQ L P ++ + P + + S
Sbjct: 2166 CERPVLLVMAASGNLVAYQIALRRLPLDSLSHEAPAAMGAAAGSSGGGGGIGGGAALGPR 2225
Query: 885 -SRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
+R +L ++ +++R + ++ + + + G F++GSRP W + R L
Sbjct: 2226 MARFDHLAYTDPSSKSHSRTDI------RKYPVASQGTSYSGVFVAGSRPLWLVASRGGL 2279
Query: 944 RVHPQLCDGSIVAFTVLHNVNCNHGFIYV-TSQGILKICQLPSGSTYDNYWPVQKV 998
HP +G++ A T HN NC GFI +S+G+LK+CQLP + D W ++V
Sbjct: 2280 VPHPMFAEGAVAAMTPFHNANCPLGFISACSSRGLLKVCQLPPHTRLDTPWVTRRV 2335
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 120/215 (55%), Gaps = 25/215 (11%)
Query: 228 ARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
A + + +++NL + + ++ V+D +F+HGY EPV+++LHE + TW G + + TC ++A+
Sbjct: 1305 ATLGNGYLLNLNKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWVGMLRERKDTCCLAAI 1364
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA 346
SIS LK+H ++W +LP+D +KLLAVP VLV+ N + SQ++ A ALN+ A
Sbjct: 1365 SISLRLKRHTILWKLASLPYDCFKLLAVPY-RPAVLVISPNLLLLCSQASQHAAALNSNA 1423
Query: 347 VS--------LDSSQELP---------RSSFSVELDAA-----HATWLQN-DVALLSTKT 383
+ LD S+E P + + +V D A +AT + + +V ++
Sbjct: 1424 LPGEVPPPLILDPSREPPAATAARLAAQYALNVHPDCAPAAGRNATLMADLEVVAAGLQS 1483
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
G L+ + + ++G QR+ + +T + S + I
Sbjct: 1484 GTLLAVHLQFEGPADQRITVVRTGGGPIASAMVGI 1518
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 8/188 (4%)
Query: 56 PNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD-GISAASLELVCHYRLHGNVE 114
PNL+V N +E++ +R + + + G A LELV Y LHG VE
Sbjct: 1078 PNLIVVRTNRLEVHSLRSSAVATNAAAATATAAATASAAVGSGGARLELVVSYHLHGVVE 1137
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
SLA+LS G +S RRD+++LAF + K+SV+E++ H LR +S+H FE + + GR
Sbjct: 1138 SLAVLSGG---SSSRRDALLLAFREGKLSVVEWNPRTHSLRTSSLHYFEGDPGVQ-REGR 1193
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG---LVGDEDTFGSGGGFSARIE 231
+ P V DP GRC + Q+ +L A + +G V D G G G RI
Sbjct: 1194 IAVPLPPRVVTDPAGRCAAMSFCFSQLALLPALEVKAGAWQCVDDGGVMGVGRGERERIG 1253
Query: 232 SSHVINLR 239
H+ R
Sbjct: 1254 GVHINERR 1261
>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1433
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 234/995 (23%), Positives = 376/995 (37%), Gaps = 156/995 (15%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
R ++ ES G V D + L L+ L G V LA + + +
Sbjct: 70 RANDDDGLESSFLGGESLIVRADPSNITKLVLITEIPLAGTVIGLARVKV--KNTPSGGE 127
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++LA++ AK+ + E+ H L TS+H +E E L+ + G V + DP
Sbjct: 128 ALLLAYKAAKMCLTEWHPQRHTLETTSIHYYEKDE---LQGAPWEMSFGDYVNYLEADPG 184
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
RC + IL +Q L D ED T G G G +
Sbjct: 185 SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPRPVKEELPLTNGDGPGDHDLVKS 244
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
R S V+ L LD + H F+H Y EP IL + A H T + L
Sbjct: 245 RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSAALGIKDHLTYKVFTL 304
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
+ + I S LP D ++++A+P+P+GG L+VG N IH +A+N+
Sbjct: 305 DLQQ--RASTTILSVTGLPQDLFRVMALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
A + S + +S + L+ L ND+ LL G L ++ DGR V ++ +
Sbjct: 363 AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIVCFHIDGRTVSKISI 422
Query: 404 ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
++ +++ S ++ I G++ FLGS DS+++ ++ G K +
Sbjct: 423 RLVSAEYGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476
Query: 457 IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
+ + D D + G + SL S + N S SF ++DSL++I P+
Sbjct: 477 LLDPDLALDVDDLDLDDDEDDDLYGNDASLAKPSQTINGGSKPGEVSFRIQDSLLSIAPI 536
Query: 516 KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
+D + G + D+ +SK EL + P
Sbjct: 537 RDVACGAPALVPDSEEATLSKGVTAELELACAVGRGSSGSVAILNREIQPKVIGRFDFPE 596
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
+G WT+ K A + +Y Y+I++ E + TA +
Sbjct: 597 ARGFWTMCAKKPLSKGAAVASDFDTTGQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
++ G T+ AG + + R+IQV + R DG ++Q L P E +
Sbjct: 657 DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
T S SI DPY+LL D SI + + V P S K S C LY+D
Sbjct: 714 T--SASIVDPYLLLIRDDSSIFIAQIHSNNELEEVLKPDGTLKSTKWASGC-LYND---- 766
Query: 720 PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD-IYSVVCYESGALEIFDVPNFN-CVFTV 777
T V E D+ D I + GAL ++ +P+ + VF
Sbjct: 767 -------TQGIFQNNVNEQ-------QADETDRIMMFLLSSVGALHVYALPDVSRPVFVA 812
Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
+ S + ++ A + +G KE+I + V +L A
Sbjct: 813 EALTS-----IPPFLSAAF---------VARKGAS---KESITEILVADLG----DAISQ 851
Query: 838 RPFLFAILTDGTILCYQA--YLFEGPENTSKS---DDPVSTSRSLSVSNVSASRLRNLRF 892
P+L + Y+ Y EG S S V+TS + + VS
Sbjct: 852 TPYLIVRHASDDLTIYEPVRYQAEGDAELSASLLFKKCVNTSLAKTAPEVSED------- 904
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG 952
DA E P P +R N++G+ FL + P + + L
Sbjct: 905 -----DA----EPPRFVPLRRCA---NVNGYGAVFLPNASPSFVLKSSHSEPRVMGLQGL 952
Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 987
+ + H C+ GFIYV +GI ++ QLPS +
Sbjct: 953 GVRGMSTFHTEGCDRGFIYVDMEGIARVTQLPSNA 987
>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
Length = 1377
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 212/984 (21%), Positives = 384/984 (39%), Gaps = 156/984 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ--------D 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ + Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
++++C+ P + +D W +K+
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKI 978
>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides brasiliensis Pb18]
Length = 1343
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 231/1035 (22%), Positives = 398/1035 (38%), Gaps = 174/1035 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++Y + GS ++ +T+ + + L LV Y L G V L
Sbjct: 28 NLIVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGTVTDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++I++A +AK+S++E+D H + TS+H +E + +H+ +
Sbjct: 81 GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-------------------- 214
A P + VDP RC VL +G + + IL Q G LV
Sbjct: 138 LAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGDFDSDHDEERQIDTNHTA 196
Query: 215 --GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
DE G + SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 197 EERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQVAAS 256
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-I 329
+ + + S ++ + + S LP+D +K++ +P P+GG L+VG+N +
Sbjct: 257 SALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSNELV 316
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLV 387
H + A+ +N +A S +S + L+ L +N LL G +
Sbjct: 317 HVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNGVMA 376
Query: 388 LLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
+++ DGR V + L +T PS +G F GS GDS+L+
Sbjct: 377 VVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEEGDSMLIG 432
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES 496
++ S + + E D A+ + DA +D + ++ G S NT S
Sbjct: 433 WSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGINSRNTTS 490
Query: 497 AQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELVELPG--- 546
T + F + D L N+GP++D + G + D + S + ELV G
Sbjct: 491 VNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELVTTQGYGR 550
Query: 547 -----------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
G+ +V+ K + S Y YL++S
Sbjct: 551 AGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYDHYLLLSK 610
Query: 584 -----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+ +++V + + E T + ++ + RTI G L G RV+QV + R D +
Sbjct: 611 SKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGEVRSYDSA 670
Query: 638 YMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 694
+ L P E SE +V+ S ADPYVL+ D SI LL D S +
Sbjct: 671 NLHLGLGLAQIYPVWDE--DTSEERSVVHASFADPYVLIIRDDSSILLLQADESGDLDEI 728
Query: 695 QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ--GDI 752
+T IES+ S +LY DK ++LS +G P + ++
Sbjct: 729 ETDGIIESTT--WISGSLYQDK----------YRSFLS---------YEGTPNRKPSDNV 767
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM---REALKDSETEINSSSEE 809
+ L IF +PN + V I+ T + R ++ TEI
Sbjct: 768 LLFLLNSESKLYIFHLPNAKEPVYTAESVDLLPQILPTELPPRRTTYRECLTEI------ 821
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
L + P+L ++ Y+ Y ++
Sbjct: 822 -----------------LVADLGDSVSRTPYLILRSNSNELILYEPY-----HTVQSTEK 859
Query: 870 PVSTSRSLSVSN------VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGH 923
+S R L ++N + S L NL S L R G C T+F
Sbjct: 860 RLSDLRFLKIANHHFPKFLPESNLGNLSDSDRQL---ARPLRALGDVCGYRTVF------ 910
Query: 924 QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
+ G+ PC+ + + L ++ + + + C GF+YV + ++++C+
Sbjct: 911 ----MPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRF 966
Query: 984 PSGSTYDNYWPVQKV 998
P + +D W +K+
Sbjct: 967 PRNTHFDGSWAARKI 981
>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 1408
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 212/984 (21%), Positives = 385/984 (39%), Gaps = 156/984 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQICTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ--------D 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ ++ Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLILYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTTLSKPLR-- 895
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
++++C+ P + +D W +K+
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKI 978
>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
Length = 1408
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 212/984 (21%), Positives = 384/984 (39%), Gaps = 156/984 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ--------D 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ + Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
++++C+ P + +D W +K+
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKI 978
>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
SS1]
Length = 1433
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 217/996 (21%), Positives = 396/996 (39%), Gaps = 192/996 (19%)
Query: 57 NLVVTAANVIEIYVVR-------VQEEGSKES-----KNSGETKRRVLMD---------- 94
N+VV ++++ I+ VR Q+E KE K + + V MD
Sbjct: 42 NVVVARSSLLRIFEVREEPAPVSTQKEVEKERRAAVRKGTEAVEGEVEMDTSGEGFVNMG 101
Query: 95 ---GISAAS-------LELVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKIS 143
G++ A+ LV +RLHG V L A+ + D+ + D ++++F+DAKI+
Sbjct: 102 TSAGLNGAAHPPTVNRFYLVREHRLHGTVTGLEAVRTVHSLDD--KLDRLLVSFKDAKIA 159
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ S+H + S+H +E +P+ + + R L + DP RC + + +
Sbjct: 160 LLEWSLSLHDVITVSIHTYERAPQLIAID---SPLFRSEL-RADPLSRCAALSLPKDSLA 215
Query: 203 ILK--ASQGGSGLVGDEDTFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIE 257
IL SQ ++ E + +S S +++L D+D +++V DF F+ G+
Sbjct: 216 ILPFYQSQAELDILEQEASQARDVPYSP----SFILDLANDVDKRIRNVIDFTFLPGFHN 271
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + +L + + TW GR+ T + ++ +P+I + LP+D + L +
Sbjct: 272 PTVAVLCQYQQTWTGRLKEYKDTVGLYIFTLDFVTNNYPVITAVDGLPYDCFALTPCSTA 331
Query: 318 IGGVLVVGANTIHYHSQSA-SCALALNNYAVSLD-------SSQELPRSSFSVELDAAHA 369
IGGV+++ +N + + QS L +N + + ++QE R ++L+ A
Sbjct: 332 IGGVVILASNAVLFVDQSGRRVILPVNGWPPRVSDLPMPPLTAQEQTR---DLQLEGARF 388
Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFF 424
++ + L K G + + ++ DGR V +L +S T P+V + IG+ F
Sbjct: 389 VFVDDKKLFLILKDGTVYPIELIQDGRTVSKLTMSDALARTTIPAV----VKRIGDDHIF 444
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE---ADAPST--------------KRL 467
+GS +G S+L++ ++ ++EE D + A+ P+T L
Sbjct: 445 IGSIVGPSVLLK----------TARVEEEIHDEDVAMAEGPATVVDTSKTVDMMDDDDDL 494
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S+ A Q NG A +N + + ++ D++ GP+ D ++GL N D
Sbjct: 495 YGPSTIADQPAANGTA----NGAVDNVRT-RTVVHLSLCDAIPAHGPISDMTFGLSRNGD 549
Query: 528 ------ASATGISKQSNYELVE--LP-----------GCKGIW------------TVYHK 556
+ATG ++ L + +P G +G+W T + +
Sbjct: 550 RLVPELVAATGSGHLGSFSLFQRDMPTRFKRKLHAIGGGRGMWSLPVRQQVKTGGTTFER 609
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEART--MVLETADLLTEVTESVDYFVQGRTIAA 614
S +AD+ + D + + + R+ + + VT F QG I
Sbjct: 610 PSNPFHADNDTVIISTDANPSPGLSRIATRSSHSDITITTRIPGVTLGAAPFFQGTAILH 669
Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
+F VI+V E DG+ S + + + S SI DP++L+
Sbjct: 670 -VMFNVTNVIRVLEP-----DGTERQ-------SIKDLDGNAARPRIKSCSICDPFILII 716
Query: 675 MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
D +I L +G+ + + + + + Y D L +T +A
Sbjct: 717 REDDTIGLFIGEIERGKIRRKDMSPMGEKTSKYLAGYFYTDTS---GLFQTFLNA---EA 770
Query: 735 VGEAIDGADGGPLDQGDI--YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
GEA G ++ G+ + + G +EI+ +P F+ + I D+
Sbjct: 771 PGEAATSTLQGAMNAGNKTHWLTLVRPQGVVEIWTLPKLTLAFSTTTLATLDPVISDSLE 830
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
AL Q + V +L + H RP L +L G +
Sbjct: 831 PPAL-------------SLPQDPPRKPQELDVDQLVIAPLGESHPRPHLIVLLRSGQLAI 877
Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL-RFSRTPLDAYTREETPHGAPC 911
Y+A P DP+ +RSL++ L NL + D EE
Sbjct: 878 YEAVAASPPA------DPLPPTRSLTL-------LVNLVKVKSKAFDIQHTEEEQKSVLA 924
Query: 912 QRITIFKNI----------SGHQGFFLSGSRPCWCM 937
++ I + + + G F +G RP W +
Sbjct: 925 EQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIV 960
>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
Length = 1259
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 168/675 (24%), Positives = 284/675 (42%), Gaps = 121/675 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV--- 113
N+V TA N ++IY + + +A L L Y+LHG +
Sbjct: 30 NIVTTANNTLKIYEIDIDS-------------------NTPSAKLILRREYQLHGEIIGI 70
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
+S+ ILS +D +++AF DAKI++LE+ D I+ + S+H +E + + + +
Sbjct: 71 QSIKILST----TEDGKDRLLIAFRDAKIALLEWSDEINDIVTVSIHTYERSQQV-ISQD 125
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
F +++ DP+ RC +L+ + IL + L D D S S
Sbjct: 126 MSRFK--AILRSDPENRCSALLLPDDSLAILPVHSAHAEL-EDLDQDVSNAIKDVPYAPS 182
Query: 234 HVINLR--DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++ L+ D D+ +V D+ F+ G+ P + +L E TW GR+S TC + L++
Sbjct: 183 FILPLKSIDSDICNVIDYTFLPGFHNPTLAVLCEPRQTWTGRLSDSQDTCQVFFLTLDLV 242
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
+ +P+I + NLP+D+ L A P IGGV ++ AN IH A N +A +L
Sbjct: 243 TQVYPIIATVDNLPYDSMSLKAAPKEIGGVAILSANAIIHVDQNGRPVGRATNGWA-TLT 301
Query: 351 SSQEL--PRSSFSVELDAAHATWLQ------NDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
S++ P V L+ A +LQ + ALL G++ + +GR + R+D
Sbjct: 302 SARNFDAPPKDLFVRLEGASIEFLQPKSKQTHPQALLFLPNGEIHAVQFYREGRTISRID 361
Query: 403 LSK-----TNPS-VLTSDITTIGNS---LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
+SK + PS DI G S F+ S +G S L++ G + L K+E
Sbjct: 362 ISKPFAKGSIPSGAYRLDIDGQGLSGGQFVFIPSMVGTSFLIR--VGKSLNDLELFPKQE 419
Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS------FAVRD 507
+ + A DM + LYGS+ + ++ F + D
Sbjct: 420 ---------------KVGTTAYDDMDVDVDEELYGSSDKKADEKEEEEEISSEPPFTICD 464
Query: 508 SLVNIGPLKDFSYG---------LRINADASA------TGISKQSNYE---LVELPGCKG 549
+ + GP++D + G L+I A A T ++ +E +++ G G
Sbjct: 465 YIESYGPIQDITIGRYMQTRNSPLQILAATGAGHVGGITAFHQEVPFESKHKLDVQGNHG 524
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+WT ++ + G+ + A D + + L + + L D EV
Sbjct: 525 LWT-FNVTGVGN-----VLVATDSKSKTKISKLLPSNEVALIAED--NEV---------- 566
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TIAA R++ + ++L + Q EN V SI+DP
Sbjct: 567 -TIAADTAANSTRILMITSNAIKVLKEDGIEQ----------QSLQIENGEVQRASISDP 615
Query: 670 YVLLGMSDGSIRLLV 684
Y+L S+GSI L +
Sbjct: 616 YILTLQSNGSISLFI 630
>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1429
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 224/994 (22%), Positives = 376/994 (37%), Gaps = 150/994 (15%)
Query: 75 EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII 134
++G + S G V D + L LV L G V LA + + ++++
Sbjct: 72 DDGLESSFLGGGESMLVRTDRTNNTKLVLVAELPLTGTVIGLAKIKTKYTKSGG--EALL 129
Query: 135 LAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQG 189
LA++ AK+ + E+D + L S+H +E E W + + + ++ DP
Sbjct: 130 LAYKAAKMCLCEWDPKKNTLETLSIHYYEKDELQGAPW---EVAFDEYVN--FLEADPGS 184
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGD---EDTFGS---------GGGFSARIESSHV-- 235
RC + IL Q L D ED G G S +E+++
Sbjct: 185 RCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKESTAVANGDSDTLEAAYTPS 244
Query: 236 ----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
+ L D + H F+H Y EP IL + H T + L +
Sbjct: 245 FVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ- 303
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
+ I S +LP D YK++A+P+P+GG L++G N IH + +A+N+ A +
Sbjct: 304 -RASTTILSVTDLPRDLYKMIALPAPVGGALLIGENEFIHIDQSGKANGVAVNSMARQMT 362
Query: 351 SSQELPRSSFSVELDAA--HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS---K 405
S ++ ++ L+ +++N LL G L +++ DGR V + + +
Sbjct: 363 SFSLSDQADLNLRLEGCIIEQLYIENGELLLILNDGRLGIVSFRIDGRTVSGISIKMIPE 422
Query: 406 TNPSVL----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
N L S + +G + FF+GS GDS+++ G M ++
Sbjct: 423 ENGGRLIKSRASTASKLGKNTFFIGSETGDSVVL----GWSRKMSQEKRRKTRLVDADLG 478
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
L D D + G E + + + N SF + D+L++I P++D + G
Sbjct: 479 LDVDDLDLEDDDDEDDDLYGTETAAKPTQALNGAGKSGELSFRIHDTLLSIAPIRDLTSG 538
Query: 522 -LRINADASATGISKQ--SNYELV--------------------------ELPGCKGIWT 552
D+ +SK S+ +L E P +G WT
Sbjct: 539 KAAFLPDSEEATLSKGVVSDLQLACVVGRGNSGSLAILNRHIQPKIIGRFEFPEARGFWT 598
Query: 553 VYHK----SSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVTES 602
+ K S G N ++ Y+I++ E + TA + E+
Sbjct: 599 MCVKKPVPKSLGGNVTVGNDYETFGQHDKYMIVAKVDLDGYETSDVYALTAAGFETLKET 658
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
G T+ AG + + RVIQV + R DG +TQ L + E+G+ V
Sbjct: 659 EFDPAAGFTVEAGTMGKQMRVIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRV 713
Query: 662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 721
+S SIADPY+LL D S+ + D + V+ + S K + C LY D
Sbjct: 714 ISASIADPYLLLIRDDSSVLIAQIDSNNELEEVEKTDSTLQSTKWHAGC-LYTD------ 766
Query: 722 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKF 780
+ GV + G G D I + +GAL ++ +P+ + V+ +
Sbjct: 767 ----------TKGVFQPSVGDKGA--DTSKIMMFLLSSTGALHVYALPDLSKPVYVAEGL 814
Query: 781 VSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
H+ D +R L KEN+ + V +L P
Sbjct: 815 CYVPPHLSADYTLRRGLA------------------KENLRELLVADLG----DTVSQSP 852
Query: 840 FLFAILTDGTILCYQA--YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
+L + Y+ Y EG E T S +L+ S + L +
Sbjct: 853 YLILRNQTDDLTIYEPLRYQPEGAEPT--------LSATLTFKKTSNAALATSPVETSQE 904
Query: 898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
DA + P P + N++G+ FL G P + + + + L I
Sbjct: 905 DAV---QQPRFVPLRTCA---NVNGYSTVFLPGPSPSFILKSSKSIPRVIGLQGLGIRGM 958
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
+ H C+ GFIY +GI ++ QLPS + + +
Sbjct: 959 STFHTEGCDRGFIYADDEGIARVTQLPSETNFTD 992
>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Ovis aries]
Length = 819
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 176/384 (45%), Gaps = 54/384 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAK--------------ISVLEFDDSIHGLRITSMHCF 162
A + GA +RD+++L+F+DAK + FD + + TSM
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 140
Query: 163 ESPEWL----------------HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
E P +L L+ S R K +P + V ++
Sbjct: 141 E-PGYLFLGSRLGNSLLLKYTEKLQEPPASTTREAADKEEPPSKKKRVDATTGWAGRVRE 199
Query: 207 SQGGSGLVGDEDTFGSGG---------GFSARIESSH---VINLRDLDMK--HVKDFIFV 252
+ V + + +GS F R S +I++R LD K ++ D F+
Sbjct: 200 GELPQDEVDEIEVYGSEAQSGTQLATYSFEVRWGSEWLPGIIDVRALDEKLLNIVDLQFL 259
Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
HGY EP ++IL E TW GRV+ + TC I A+S++ T K HP+IWS +LP D + L
Sbjct: 260 HGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQAL 319
Query: 313 AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
AVP PIGGV++ N++ Y +QS +ALN+ + + + LD A A +
Sbjct: 320 AVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAF 379
Query: 372 LQNDVALLSTKTGDLVLLTVVYDG 395
+ D ++S K G++ +LT++ DG
Sbjct: 380 ISYDKMVISLKGGEIYVLTLITDG 403
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/93 (39%), Positives = 51/93 (54%), Gaps = 3/93 (3%)
Query: 907 HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
+G R+ + K H F+ G P W +V R LR+HP DG I +F HN+NC
Sbjct: 486 YGGRHHRLALHKPPLHH--VFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINC 543
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
GF+Y QG L+I LP+ +YD WPV+K+
Sbjct: 544 PRGFLYFNRQGELRISVLPAYLSYDAPWPVRKI 576
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 76/160 (47%), Gaps = 22/160 (13%)
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDL-VLLTVVYDG-RVVQRLDLSKTNPSVLTSDI 415
S SV+L A + D LLS K +LT++ DG R V+ K SVLT+ +
Sbjct: 83 SMASVQLAGA-----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSM 137
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL-------- 467
T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 138 VTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTTREAADKEEPPSKKKRVDATTGWAG 195
Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR 506
R + QD V+ E+ +YGS A + T+ A T+SF VR
Sbjct: 196 RVREGELPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVR 231
>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus A1163]
Length = 1401
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV +V++I+ +++VQ E+ + + D + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83
Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
+ + S+ G + +++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 84 ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
+ + G ++ VDP RC V +G++ + IL Q G L D+ F
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196
Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
G G ++ SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
T +S + + + ++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316
Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
+H + A+ +N +A + + + +S ++ L+ + + LL +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376
Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
+VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
+ ++ D +S DA + D+ E E G + +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
+ F D L NIGPL+D + G + + + K + EL EL +G
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551
Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
+WT G D ++ + EY Y+I+S +
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607
Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
T ET+++ T+ + F TI G L ++RV+QV R SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663
Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D+ G P E S+ +S S+ADPY+ + D ++ +L D S
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
V+ A + K SC LY DK
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 4/88 (4%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
+ I NIS F+ G RP ++ + H G V + L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKV 998
YV S+ +L+IC+ PS + +D W ++K+
Sbjct: 943 YVDSKDVLRICRFPSDTLFDYTWALRKI 970
>gi|429851266|gb|ELA26469.1| protein cft1 [Colletotrichum gloeosporioides Nara gc5]
Length = 1411
Score = 136 bits (343), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 231/1015 (22%), Positives = 393/1015 (38%), Gaps = 160/1015 (15%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSR 128
Y R+ ++ ES G V D + L LV Y + G V LA + NS+
Sbjct: 66 YDRRLNDDDGLESSFLGGDGMLVRADRTNNTKLVLVAEYPIFGVVAGLARIK---IQNSK 122
Query: 129 RR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL----- 182
+++++A A++S++++D H L S+H +E E S GPL
Sbjct: 123 SGGEALLIATRVARLSLVQWDPEKHALEDVSIHFYEKEEL------EGSPFDGPLSNYPT 176
Query: 183 -VKVDPQGRCGGV---------LVYGLQMIILKASQGGSGLVGDED---------TFGSG 223
+ DP RC + L + L + + G T G+
Sbjct: 177 HLAADPGSRCAALRFGSRYIAFLPFKLNDEDIDMDDWDEDVDGPRPAKEPSATAATNGTS 236
Query: 224 GGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
+S+V+ L LD + H F+H Y EP I+ + H +
Sbjct: 237 NLADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSMQRRSNTLPRKDHFSY 296
Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
+ L + + I S NLP D +K++A+P PIGG L+VG N IH +
Sbjct: 297 KVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPIGGALLVGTNELIHIDQSGKPNGV 354
Query: 341 ALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVV 398
A+N + S +S + L+ + + +N L+ G L ++ DGR V
Sbjct: 355 AVNAFTKETTSFPLADQSELDLRLEHCYIEQMSPENGELLMVLSDGRLAIIAFKIDGRTV 414
Query: 399 QRLDL----SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
L + ++ +V+ S I+ + + FF+GS DSL+V G + +
Sbjct: 415 SGLSVRIVPAEAGGNVVQCGASSISRLSKNAFFIGSTGSDSLVV------GVTRKQTQNA 468
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLV 510
+ + D+ + D D + GE ++ S + N SF V DSL+
Sbjct: 469 RKKTRLVDDSFADDLEDEDIDDDDDDDLYGETTTTVQSSTAANGVPKGGEISFRVHDSLL 528
Query: 511 NIGPLKDFSYG--------------------LRINA-----DASATGISKQSNYELV--- 542
++ P+KD + G L++ A +A+A I Q+ V
Sbjct: 529 SLAPVKDMTTGKQAFIPESEDEKNSVGVVADLQLAAAVGKGNAAAIAIMNQNIQPKVIGK 588
Query: 543 -ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETA 593
E P +G WT V + D AA E+ A ++I+S + ET+
Sbjct: 589 FEFPEARGFWTMCVQKPIPKSLQGDKGANAAVGSEFDASSIYDKFMIVS-KVDLDGYETS 647
Query: 594 DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSF 645
D+ + F G T+ AG + R+IQV + R DG ++Q L
Sbjct: 648 DVYALTGAGFEAFTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQILPM 707
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ E+G+ V+S SIADPY+LL D SI + D + ++ S +
Sbjct: 708 --LDEETGA---EPRVVSASIADPYLLLVRDDASIMVAQIDNNNELEEMEKQDDTILSTQ 762
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
++ C LY D +TGV I G P Q I+ + GAL I
Sbjct: 763 WLAGC-LYTD----------------TTGVFAPIQTDKGTPESQ-SIFMFLLSAVGALYI 804
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
+ +P+ + V +G T+ V ++ + + GT Q E + + V
Sbjct: 805 YALPDLSKPVYV---AAGMTY-VPPFL---------SADYAVRRGTVQ---ETLTEVLVA 848
Query: 826 ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
+L A S P+L + I Y+ E D +++L ++
Sbjct: 849 KLG----DATESSPYLILRHANDDITIYEPIRLE------SQDKSEGLAKTLHFQKIT-- 896
Query: 886 RLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
N +++P++ + E P P + NI+G+ FL G+ P + + +
Sbjct: 897 ---NPALAKSPVEVADDDANEQPRFVPLRPCA---NINGYSTVFLPGASPSFIIKSAKSA 950
Query: 944 RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
L + + H C GFIY S+G ++ QLP+ ++++ ++K+
Sbjct: 951 PKVLGLQGIGVRGMSSFHTEGCERGFIYADSEGHTRVTQLPADTSFELGVSIRKI 1005
>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
protein 1
gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
Length = 1401
Score = 136 bits (343), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV +V++I+ +++VQ E+ + + D + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83
Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
+ + S+ G + +++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 84 ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
+ + G ++ VDP RC V +G++ + IL Q G L D+ F
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196
Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
G G ++ SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
T +S + + + ++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316
Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
+H + A+ +N +A + + + +S ++ L+ + + LL +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376
Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
+VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
+ ++ D +S DA + D+ E E G + +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
+ F D L NIGPL+D + G + + + K + EL EL +G
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551
Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
+WT G D ++ + EY Y+I+S +
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607
Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
T ET+++ T+ + F TI G L ++RV+QV R SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663
Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D+ G P E S+ +S S+ADPY+ + D ++ +L D S
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
V+ A + K SC LY DK
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740
Score = 44.3 bits (103), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 4/88 (4%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
+ I NIS F+ G RP ++ + H G V + L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKV 998
YV S+ +L+IC+ PS + +D W ++K+
Sbjct: 943 YVDSKDVLRICRFPSETLFDYTWALRKI 970
>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit
A [Leptosphaeria maculans JN3]
gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit
A [Leptosphaeria maculans JN3]
Length = 1402
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 170/706 (24%), Positives = 295/706 (41%), Gaps = 99/706 (14%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V EG E N+ E + A L LV
Sbjct: 28 NLVVAKNSLLQIFEIKSTTTEVTPEGGDEVDNAAANLDTEAADVQFQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+ L G V SLA + A N++ R +++++AF DAK+S++E+D + L S+H +E+P
Sbjct: 88 FPLAGTVISLARIK---ALNTKSRGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144
Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG-------GSG 212
+ W + +F + DP RC + + IL Q S
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQSDLVEDDYDSD 199
Query: 213 LVGDEDT------FGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
L G DT SGG + + SS V+ L +LD + H F+H Y EP I
Sbjct: 200 LDGPRDTKPDQAEAPSGGETTHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHQYREPTFGI 259
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
+ ++ + S ++ K + S LP+D +++ +P PIGG L
Sbjct: 260 IAASRAAAPSLLANRKDILTYSVFTLDLEQKASTTLLSVTGLPYDISRVVPLPHPIGGAL 319
Query: 323 VVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
++G N IH + +A+N +A + S +S ++ L+ + L D +L
Sbjct: 320 LLGNNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALHLEGCNVELLSQDTGDVVL 379
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLTSDITTIGNSL---FFLGSRLGDS 432
G L+++T +GR V + + VL + + N + F+GS G+S
Sbjct: 380 VLNNGRLLIMTFTLEGRTVSGMTIQTVAADHGGHVLKAGSSCTSNLVRGRLFIGSEDGES 439
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG--SA 490
+L+ G S ++ L+ ++ D T D L D + + +A
Sbjct: 440 VLL------GWSSATASLRRRHSNVGLDGDGTSEEEEEDIDDLDDDLYNDTAPAVQKITA 493
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATG 532
+ + + T+SF + D+L +I P++D S G A + TG
Sbjct: 494 AASEPTPPGTYSFRIHDTLPSIAPIRDAVLHPGKVTDSLNRGEIMLSTGR--GAAGAITG 551
Query: 533 ISKQ---SNYELVELPGCKGIWTVYHKSS------RGHNADSSRMAAYDDEYHAYLIISL 583
+ ++ + ELP GIW V+ + D+ + D +Y YL++S
Sbjct: 552 LDRELHPVSLAASELPSTHGIWAVHARKQAPGGVVTAFGEDTEANMSTDVDYDQYLVVSK 611
Query: 584 EAR-----TMVLET-ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+ T+V E + L+E + +G T+ G L +V+QV R D S
Sbjct: 612 TSEDGSESTVVYEVHGNELSETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEVRTYD-S 670
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+ D + E+G+ V++ S ADPY+L+ D S+++L
Sbjct: 671 ELNMDQILPMEDEETGN---ELRVINASFADPYLLVLREDSSVKIL 713
>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
Length = 1388
Score = 136 bits (342), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 170/699 (24%), Positives = 293/699 (41%), Gaps = 90/699 (12%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V + S+N+ E L + A L LV
Sbjct: 28 NLVVAKNSLLQIFELKSTTTEVTPGSGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+ L G V SLA + A N++ + +++++AF DAK+S++E+D + L S+H +E+P
Sbjct: 88 FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144
Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-- 217
+ W + +F + DP RC + + IL Q LV D+
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQ--RDLVEDDYD 197
Query: 218 ------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
+ G SS V+ L +LD + H F+H Y EP I+
Sbjct: 198 SDAEVPKETKADQANDTSGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIV 257
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
T ++ + S ++ K + S LP+D K++ +PSPIGG L+
Sbjct: 258 AASRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALL 317
Query: 324 VGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
VG N IH + +ALN +A + S +S ++ L+ L + L+
Sbjct: 318 VGRNEIIHVDQGGKTNGVALNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIV 377
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSL 433
G L++LT DGR V + + S + +G F+GS G+S+
Sbjct: 378 LNNGRLLILTFTLDGRTVSGMTIQTVAADHGGHLVKSAASCTSNLGRGRLFIGSEDGESV 437
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
++ +T L++ L+ + + + D D D+ N +++ +A+
Sbjct: 438 MLGWTG------LTNQLRRKLSNADLDG-EDDSDEEEIDDMEDDLYNDTAPTMHKITAAV 490
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYE 540
+ +A T++F + D L +I P+KD + G + ++ A G + E
Sbjct: 491 SEPTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRE 550
Query: 541 L-------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EA 585
L ELP G+W V+ + + D+ A D +Y YL++S E
Sbjct: 551 LHPISVATKELPLAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGED 610
Query: 586 RTMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
T+V E D LTE + +G T+ G L +V+QV RI D +
Sbjct: 611 GTVVYEVNGDQLTETDKGDFEREEGTTLLVGVLAAGTKVVQVMRTEVRIYDSELNLVHIQ 670
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
E GS E + +++ S ADPY+L+ D S+++
Sbjct: 671 SMEEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708
>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 984
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 163/668 (24%), Positives = 264/668 (39%), Gaps = 169/668 (25%)
Query: 423 FFLGSRLGDSLLVQFT--CGSGTSMLSSGLKE---EFGDIEADAPSTKRLRRSSSDALQD 477
FLGSRLG+SLL+++ T +++G K+ + + + P+ K+ +SD +
Sbjct: 6 LFLGSRLGNSLLLKYVEKAQESTDSVTNGAKKTEEDEETNKEEPPNKKKRTDDASDWIAS 65
Query: 478 MV-----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRI 524
V + +EL +YGS + + +++F V DS++NIGP G +
Sbjct: 66 DVALLAEDVDELEVYGSQTQ-AGTQLTSYTFEVCDSIMNIGPCTKAVMGEPVFLSEEFQT 124
Query: 525 NAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSR 567
N D + +G SK ++ ELPGC +WTV + N D +
Sbjct: 125 NPDPDMELVALSGYSKNGALSVLQRSIRPQVVTTFELPGCIDMWTVVGPPEK-ENKDQPK 183
Query: 568 MAAYDD---------EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
++ HA+LI+S + +M+L T + E+ S + QG T+ AGNL
Sbjct: 184 EKTEEEGDKKPDALTNGHAFLILSRDDSSMILSTGQEIMELDHS-GFSTQGPTVYAGNLG 242
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
++QV G R+L+G Q + S ++ S++DPY LL G
Sbjct: 243 NNAYILQVSPMGVRLLEGVNQLQHIPL----------DLGSPIVLCSVSDPYALLMSEKG 292
Query: 679 SI--------------RLLVGDPSTCTVS-VQTPAAIE--------SSKKPVSSCTLYHD 715
+ RL + P +S + T A + SSK +S
Sbjct: 293 ELVLLTLKPDGFAGGHRLAISRPQIPQISRILTLCAYKDTSGMFTTSSKMESTSDETEEK 352
Query: 716 KGPEPWLRKTSTDAWLSTG-------VGEAIDGADGGPLDQGD----------------- 751
K +P + S + +S GE+ D + P + +
Sbjct: 353 KITKPSVADISMTSEISNVDDEDEMLYGES-DASLFSPTKKEEKSSFLQTREVLSETKPT 411
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
+ + E+G LEI+ +P+F F V F G +VD+Y ++ +S+ G+
Sbjct: 412 YWCAMSRENGVLEIYSLPDFKLAFLVKNFPMGFKVMVDSY----------QMTASAPGGS 461
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ +++ V EL + + + L A + D + Y+A+
Sbjct: 462 SKSDQQHDMMPIVKELLLIGLGHKNKKTHLLARV-DEDLYIYEAF--------------- 505
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
T S+ N LR LRF + F+ G
Sbjct: 506 -THDQSSLDN----HLR-LRFRKV-------------------------------FVCGP 528
Query: 932 RPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W M R LR HP DGS+ F HN+NC GF+Y G L+IC LP+ +YD
Sbjct: 529 YPHWLFMTSRGALRSHPMHIDGSVTCFAPFHNINCPKGFLYFNKHGELRICVLPTHLSYD 588
Query: 991 NYWPVQKV 998
WPV+KV
Sbjct: 589 ALWPVRKV 596
>gi|147772179|emb|CAN73417.1| hypothetical protein VITISV_017053 [Vitis vinifera]
Length = 609
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/122 (56%), Positives = 80/122 (65%), Gaps = 26/122 (21%)
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
F V DSL+N+GPLK F+Y LRINAD ATGI KQSN+EL
Sbjct: 430 FEVNDSLINVGPLKVFAYALRINADLKATGIVKQSNFELMCCSGHGKNGALCILQQSIRP 489
Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
VEL GC+ IWTVYHK++RGHNADS++M DDEY AYLIIS E+RTMVLET +LL
Sbjct: 490 EMITEVELSGCERIWTVYHKNTRGHNADSTKMVTKDDEYCAYLIISPESRTMVLETVELL 549
Query: 597 TE 598
E
Sbjct: 550 GE 551
>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
Length = 1397
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 228/1030 (22%), Positives = 397/1030 (38%), Gaps = 178/1030 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + G+ ++ D + A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSAPA---GQPDQKGRHDRLQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 ERVR---ISNSKSGGDAILVSSRNAKLSLIEWDPQKHGITTISIHYYEGEES-HMSPWVP 140
Query: 176 SFAR-GPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-DTFGSGGGFSARIES 232
+ VDP G C + +G+ + IL Q G LV D+ D +G + +
Sbjct: 141 DLGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDAIPNGDDTTDAVND 199
Query: 233 -----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
S V+ + LD + H F+H Y EP IL+ +
Sbjct: 200 AQKPAPGNAVHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSMSL 259
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ S ++ K + + LP D +K++ +P PIGG L++G N +H
Sbjct: 260 TIDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDIFKVVPLPPPIGGALLIGTNELVHVD 319
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + +S + L+ L + LL G + +LT
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCMVEQLGSGAGDVLLILSDGRMAILT 379
Query: 391 VVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS-- 441
DGR V + L ++ S++ S + ++G + F GS GDS+L+ ++ S
Sbjct: 380 FKVDGRSVAGISLHFVAEQSGGSIIKSRPSCSASLGRNKLFYGSEEGDSILLGWSKHSSA 439
Query: 442 --------------GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY 487
GT+ LS +++ D + +++ + +VNG+
Sbjct: 440 TKKPSKAAGGGNEDGTANLSDEEEQDDDDDDMYEDDLYSANPTTTQQEKQVVNGD----- 494
Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
G+A+ F+ D L ++GP +D + G + + S +EL
Sbjct: 495 GAAN---------FTLRAHDRLWSLGPYRDITLGRPPKSKSKDRQDSVPEISAPLELVAA 545
Query: 548 KGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA------------RTMVL 590
+G TV + DS +M DD Y + I ++ R ++L
Sbjct: 546 RGFGKAGGLTVLKREIDPFTIDSLKM---DDVYGVWSIRVIDPKSKDAGLSRSYDRYLLL 602
Query: 591 ETADLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARIL 634
A + ESV Y V + TI G L RV+QV R
Sbjct: 603 AKAK-GDDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGSRVVQVLRTEIRSY 661
Query: 635 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTV 692
D + + P E SE TV+ S A+PY+L D S+ +L D + V
Sbjct: 662 DCNLGLAQIY--PVWDE--DTSEERTVIQASFAEPYLLTIRDDNSLLILQADKNGDLDEV 717
Query: 693 SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
+Q AA S K VS C LY DK + S+D D +I
Sbjct: 718 EIQGSAA---SAKWVSGC-LYEDK-----TKIFSSD-------------LDTEHAATPNI 755
Query: 753 YSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
+ G L IF +PN + VD L S SSS
Sbjct: 756 LLFLLDSDGNLSIFRLPNITEPLCRVDNL--------------NLLPSNLPYESSSRRPV 801
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+E + + V +L A H P++ ++ Y+ Y G SK
Sbjct: 802 ---NRETLTELLVADLG----DAIHKSPYMILRTKHDDLVLYEPYRITGENGRSKLQ--F 852
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
+ + V ++ N +R+P +P + + ++ G++ F+SG
Sbjct: 853 IKAVNHVVMGPRTNQPMNKDINRSP------------SPSKLLRALSDVCGYKTVFMSGQ 900
Query: 932 RPCWCM---VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
PC+ + + R + +L ++ + T H C GF YV ++++ +LPS +
Sbjct: 901 NPCFILKSAIARPNVL---RLRGKAVQSLTGFHIAACERGFAYVDEDNVIRMSRLPSNTR 957
Query: 989 YDNYWPVQKV 998
+D+ W +K+
Sbjct: 958 FDSAWATRKI 967
>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1417
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 193/892 (21%), Positives = 368/892 (41%), Gaps = 124/892 (13%)
Query: 103 LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
V +RLHG V ES+ I+S D ++++F+DAKI++LE+ D+++ L S+
Sbjct: 120 FVREHRLHGTVTGMESIRIVSS----QEDGLDRLLVSFKDAKIALLEWSDAVNDLLTVSI 175
Query: 160 HCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGVLVYGLQMIILKASQGGSGL- 213
H +E +P+ + L+ PL ++ DP RC +++ + IL Q + L
Sbjct: 176 HTYERAPQMMALE--------APLFHSQLRTDPLSRCAALMLPKDSLAILPFYQSQADLD 227
Query: 214 VGDEDTFGSGGGFSARIESSHVINLR-DLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
+ ++DT S S V+++ D+D +KHV D +F+ G+ P + +L + TW
Sbjct: 228 IMEQDTQTSCRDIP--YSPSFVLDMTTDVDERIKHVIDLVFLPGFNSPTIAVLFQNTQTW 285
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
R+ T + ++ + P++ + NLP+D L+ + +GGV++V AN +
Sbjct: 286 TSRLREYKDTVGLIIFTLDLVTRNCPVLTAVDNLPYDCLYLVPCSAQLGGVVIVSANALI 345
Query: 331 YHSQ-SASCALALNNYAVSLDSSQELPR-----SSFSVELDAAHATWLQNDVALLSTKTG 384
Y +Q S L +N + + S LP+ S +++L+ ++A ++ ++ + G
Sbjct: 346 YVAQTSRRVILPVNGWQARV-SDHPLPQLTEEEKSRNLKLEGSYAVFVDDNKLFVLLSDG 404
Query: 385 DLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ + V DGR V RL + S + + + + + + F+GS G S+L++ T
Sbjct: 405 TVYPMEVHADGRTVSRLTMGSALAQTTIPAIVRRVTDENLFIGSTAGPSVLLK------T 458
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
S + +KEE +++ AP+ + D D +GE A T
Sbjct: 459 SHVEEDVKEEDVEMDT-APAAVVDEANEMDLDDD--DGELCHWVHFAKKRT-----VVHL 510
Query: 504 AVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP---------- 545
++ DS+ GP+ D ++ L D +ATG + L + LP
Sbjct: 511 SLCDSIPAYGPVSDMTFSLTRVGDRPVAELVAATGSGGLGGFTLFQRDLPSRVKRKLHAV 570
Query: 546 -GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTE 601
G +G+W++ ++ R + + R + + +IIS +A + A ++
Sbjct: 571 GGGRGMWSLAVRQAVRVNGSTYERPSNPHHGGNDAVIISTDANPSPGLSRIASRSSKSDI 630
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQDLSFGPSNSESGSGSE 657
+ + G T+ A + F ++ V R+L DG+ + +DL
Sbjct: 631 QITTRIPGTTVGAASFFQGTAILHVMSNAIRVLEPDGTERQIIKDLD---------GSVP 681
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDK 716
+ S+ DP++++ D S+ L +G+P + + + + E + K ++ C + D
Sbjct: 682 RPKIRYCSMCDPFIMVIREDDSLGLFIGEPERGKIRRKDMSPMGEKTSKYIAGC-FFMDT 740
Query: 717 GPEPWLRKTSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
R + A V + + G Q + ++ G LE++ +P VF
Sbjct: 741 TGIFQSRVNAAAAAADKNVTSTLQTVMNAGTRTQ---WLLLVRPQGVLEVWSLPKLALVF 797
Query: 776 TVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
+ + + +VD+ AL Q + V ++A+
Sbjct: 798 STSHVSALESVLVDSGDSPAL-------------SLPQDPPRKPQDLDVEQIAIAPLGES 844
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
S+ +L L G Y+A P S P + + +L V V +
Sbjct: 845 SSKLYLLVFLRCGLFAVYEAL----PAPASTDPPPPTRTSTLCVKFV--------KVVTR 892
Query: 896 PLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLSGSRPCWCM 937
D EE ++ I + + G FL+G RPCW +
Sbjct: 893 AFDIQQSEEVEKSVLAEQKRISRQLIPFVTSPTPGRAFSGVFLTGDRPCWIL 944
>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
Length = 1406
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 213/1034 (20%), Positives = 407/1034 (39%), Gaps = 179/1034 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD +S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNPEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSYMT 640
++ ES+ + V G TI+ G R+RV+QV + R D T
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISIGTQESRKRVVQVLKNEVRSYDFGKFT 664
Query: 641 -----QDLSFGPSNSES-------GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
++ + G S + ++ +S S+AD + + D ++ L D S
Sbjct: 665 PSRCRRNFADGTDLSLTQIYPIWDDDTNDERMAVSASLADSCLAILRDDSTLLFLQADDS 724
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
V + S K SC LY DK TG+ +ID P+
Sbjct: 725 GDLDEVVFGEDVASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV- 765
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
+ D++ + L + C+ + G ++ + S+ I++
Sbjct: 766 KNDMFLFLLSHDCKLFV------KCLLWSSFALRGWHLMLSKSSGLSRPRSKAAIDN--- 816
Query: 809 EGTGQGRKENIHSMKVVELAM----QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+G + + S+ ++E + + WSA P+L I+C+ EG
Sbjct: 817 ----RGDRRFVASVNLIEAIVADLGETWSAS---PYL--------IVCHH---IEG---- 854
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
+ ++ S+ N R P + + + + + I +ISG
Sbjct: 855 --------------IHSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLS 900
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
F+ G+ + + +L + + + L C+ GFIY+ SQ ++ C+LP
Sbjct: 901 AVFMPGASAGFIIRTSASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLP 960
Query: 985 SGSTYDNYWPVQKV 998
+ +D W +++V
Sbjct: 961 PMTRFDYQWTLKRV 974
>gi|115490949|ref|XP_001210102.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196962|gb|EAU38662.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 908
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 176/731 (24%), Positives = 297/731 (40%), Gaps = 148/731 (20%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++++LAF +AK+S++E+D HG+ S+H +E + + + G ++ V+P R
Sbjct: 62 EAVLLAFRNAKLSLIEWDPERHGISTISIHYYERDDLTCSPWVPDLSSCGSILDVEPSSR 121
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGD-------------EDTFGSGGGFSARIESSHVI 236
C V +G++ + I+ Q G LV D ++T ++ SS V+
Sbjct: 122 CA-VFNFGIRNLAIIPFHQPGDDLVMDDYDSDLDERKHVDQETTRESPAYATPYASSFVL 180
Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L D + H F+H Y EP IL+ + T + + S ++ +
Sbjct: 181 PLTAFDPSILHPISLAFLHEYREPTFGILYSQVATSNALLHERKDVVFYSVFTLDLEQRA 240
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQ 353
+ S LP D + ++A+P P+GG L++G+N +H + A+ +N ++ + +
Sbjct: 241 STTLLSVARLPSDLFHVVALPPPVGGSLLIGSNELVHVDQAGKTNAVGVNEFSRQVSAFS 300
Query: 354 ELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRL---------- 401
+S ++ L+ L ++ +L TG++VL+ DGR V +
Sbjct: 301 MTDQSDLALRLEGCRVERLADNSGDMILILSTGNMVLIKFKLDGRSVSGISVHPVPVHAG 360
Query: 402 -DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
DL K+ S +GN FLGS DSLL+ G S LSSG
Sbjct: 361 GDLMKS----AASSSAFLGNGEVFLGSEDADSLLL------GWSDLSSG----------- 399
Query: 461 APSTKRLR------RSSSDALQDMVNGEEL---SLYGSASNNTESAQKT---------FS 502
TKRLR S D D ++ +++ LY ++ + T ++ ++
Sbjct: 400 ---TKRLRSHKNDANDSGDVSDDNMSDDDVYEDDLYSTSPDATADGRRVSADPSSFGLYN 456
Query: 503 FAVRDSLVNIGPLKDFSYG--------LRINADASATGISKQS---NYELVEL------- 544
F + D L+NI PL+D + G + N A ++ Q N L+ +
Sbjct: 457 FRINDRLLNIAPLRDITLGKPSTFDKDRKDNVSAELELVASQGSDRNGGLIAMRREIDPE 516
Query: 545 -------PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA---- 593
+WT +SS G ++ H Y+I+S + ET
Sbjct: 517 VLASFTIDSANCVWTACVESSGGKDS------------HQYVIVSKQTNIDKEETEIFRV 564
Query: 594 ---DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---P 647
DL V+ + TI G L + RV+QV + R D DL P
Sbjct: 565 DGLDLKPIKAPEVNPN-EEVTIDVGTLAKQSRVVQVLKNEVRCYDA-----DLGLAQIYP 618
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
E S+ +S S+ DPYV + D ++ LL D S V+ P + ++ K +
Sbjct: 619 VWDE--DTSDEHPAVSASVTDPYVAILRDDSTLLLLHVDDSGDVDEVEMPDNM-AAHKWL 675
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
SSC LY DK TGV + G Q D++ + + L I+
Sbjct: 676 SSC-LYLDK----------------TGVFASNTDTKGS--RQNDMFLFLLGQDCRLFIYR 716
Query: 768 VPNFNCVFTVD 778
+P+ V T+D
Sbjct: 717 LPDLLLVSTID 727
>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
Length = 1338
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 168/728 (23%), Positives = 294/728 (40%), Gaps = 144/728 (19%)
Query: 57 NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NL+V ++++++ V V G E+ N+ E L + A L LV
Sbjct: 28 NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
W + +F + DP RC + + IL
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+A+Q G E G+ G + + SS V+ L +LD + H F+H Y EP
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
++ + T A ++ + + ++ K + S LP+D +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316
Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
L+VG+N IH + +A+N A + S ++ ++ L+ L D
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRL 429
++ G + +LT DGR V + + P+ +I T +G F+GS
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPV-PADNGGNILKTRASCSTNLGRGRLFIGSED 435
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
G+S+L+ +T ++ +LRR S+ Q + E++S
Sbjct: 436 GESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEE 471
Query: 490 AS---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD---------- 517
N+T + K T++F V D L +I P++D
Sbjct: 472 EEVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTE 531
Query: 518 -FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------G 560
+ G + ++ A G N EL ELP G+W V+ K G
Sbjct: 532 SLTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAA 614
+A+++ A+ D +Y YL++S T+V E TE D+ +G T++
Sbjct: 592 QDAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSV 649
Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
G L +V+QV R D + + P E N +++ S ADPY+L+
Sbjct: 650 GVLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQ 705
Query: 675 MSDGSIRL 682
D S+++
Sbjct: 706 REDSSVKI 713
>gi|414587800|tpg|DAA38371.1| TPA: hypothetical protein ZEAMMB73_571351 [Zea mays]
Length = 108
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 68/98 (69%), Positives = 77/98 (78%)
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
MVL+T D L EVTE+VDY VQG TIAAGNLFGR RVIQV+ +GAR+LDGS+MTQ+L+F
Sbjct: 1 MVLQTGDDLGEVTETVDYNVQGSTIAAGNLFGRCRVIQVYAKGARVLDGSFMTQELNFSM 60
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
SES SE S SIADPYVLL MSDGSIRLL+G
Sbjct: 61 HTSESSLNSEPLAAASASIADPYVLLKMSDGSIRLLIG 98
>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1377
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 230/1037 (22%), Positives = 397/1037 (38%), Gaps = 175/1037 (16%)
Query: 53 GPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
G V V ++++Y + GS ++ +T+ + + L LV Y L G
Sbjct: 4 GAVAAFRVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGT 56
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
V L + D+ ++I++A +AK+S++E+D H + TS+H +E + +H+
Sbjct: 57 VTDLGRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISP 113
Query: 173 GRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
+ A P + VDP RC VL +G + + IL Q G LV
Sbjct: 114 WTPNLAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGDFDSDHDEERQIDT 172
Query: 215 ------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
DE G + SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 173 NHTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQ 232
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
+ + + S ++ + + S LP+D +K++ +P P+GG L+VG+
Sbjct: 233 VAASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGS 292
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKT 383
N +H + A+ +N +A S +S + L+ L +N LL
Sbjct: 293 NELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLN 352
Query: 384 GDLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDS 432
G + +++ DGR V + L +T PS +G F GS GDS
Sbjct: 353 GVMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEEGDS 408
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN 492
+L+ ++ S + + E D A+ + DA +D + ++ G S
Sbjct: 409 MLIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGINSR 466
Query: 493 NTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELVELP 545
NT S T + F + D L N+GP++D + G + D + S + ELV
Sbjct: 467 NTASVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELVTTQ 526
Query: 546 G--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYL 579
G G+ +V+ K + S Y YL
Sbjct: 527 GYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYDHYL 586
Query: 580 IISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARI 633
++S + +++V + + E T + ++ + RTI G L G RV+QV + R
Sbjct: 587 LLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGEVRS 646
Query: 634 LD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
D G + Q ++ SE +V+ S ADPYVL+ D SI LL D S
Sbjct: 647 YDSGLGLAQIYPVWDEDT-----SEERSVVHASFADPYVLIIRDDSSILLLQADESGDLD 701
Query: 693 SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ--G 750
++T IES+ S +LY DK ++LS +G P +
Sbjct: 702 EIETDGIIESTT--WISGSLYQDK----------YRSFLSY---------EGTPNRKPSD 740
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM---REALKDSETEINSSS 807
++ + L IF +PN + V I+ T + R ++ TEI
Sbjct: 741 NVLLFLLNSESKLYIFHLPNAKEPVYTAESVDLLPQILPTELPPRRTTYRECLTEI---- 796
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
L + P+L ++ Y+ Y +
Sbjct: 797 -------------------LVADLGDSVSRTPYLILRSNSNELILYEPYHI-----VQST 832
Query: 868 DDPVSTSRSLSVSN------VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
+ +S R L ++N + S L NL S L R G C T+F
Sbjct: 833 EKRLSDLRFLKIANHHFPKFLPESNLGNLSDSDRQL---ARPLRALGDVCGYRTVF---- 885
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G+ PC+ + + L ++ + + + C GF+YV + ++++C
Sbjct: 886 ------MPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMC 939
Query: 982 QLPSGSTYDNYWPVQKV 998
+ P + +D W +K+
Sbjct: 940 RFPRNTHFDGSWAARKI 956
>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
protein 1
gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
Length = 1400
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 181/751 (24%), Positives = 310/751 (41%), Gaps = 127/751 (16%)
Query: 57 NLVVTAANVIEIY-VVRVQEE---GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
NLVV +V++I+ +++VQ G+ E K++ D + L L Y L G
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHLRGGTIEGKSARP-------DRVETTKLVLEREYPLSGT 80
Query: 113 VESLA---ILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
V + IL+ GG ++++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 81 VVDICRVKILNPKSGG-------EALLLAFRNAKLSLVEWDPERHGISTLSIHYYERDDL 133
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGD-------EDT 219
+ + G ++ VDP RC V +G++ + IL Q G L D +D
Sbjct: 134 TRSPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDD 192
Query: 220 FGS-----GGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
F G ++ SS V+ L LD + H F++ Y EP +L+
Sbjct: 193 FNQVSDHVGNDLKSKDRTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGVLY 252
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
+ T + + + + ++ + + S LP D +K++A+P P+GG L++
Sbjct: 253 SQIATSHALLPERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLI 312
Query: 325 GANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G+N +H + A+ +N +A + + + +S ++ L+ L + LL
Sbjct: 313 GSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHLSDSTGDLLLVL 372
Query: 382 KTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLL 434
+G++VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L
Sbjct: 373 SSGNMVLVHFQLDGRSVSGISLRPLPAQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVL 432
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASN 492
+ ++ S + ++ D +S D + D+ E E G +
Sbjct: 433 LSWSSMSSN---PKKPRPRMSNVAEDREEASVDSQSEEDVYEDDLYTAEPETPALGRRPS 489
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG-----------LRINADASATGISKQS---N 538
S + F + D L NIGPL+D + G L NA + I+ Q N
Sbjct: 490 AETSGVGVYIFQILDRLPNIGPLRDITLGKPASTVENTGRLIENACSELELIAAQGSGRN 549
Query: 539 YELV--------------ELPGCKGIWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIIS 582
LV + +G+WT G D R+ + EY Y+I+S
Sbjct: 550 GGLVLMKREIEPDVAASFDAQSVQGVWTAVVALGSGAPLVPDEQRI---NQEYRQYVILS 606
Query: 583 L-------EARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
++ + + DL E + D TI G L +RRV+QV
Sbjct: 607 KPEAPDKEQSEVFIADKQDLKPFKAPEFNPNNDV-----TIEIGTLSCKRRVVQVLRNEV 661
Query: 632 RILDGSYMTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
R SY D+ G P E S+ +S S+ADPY+ + D ++ LL D
Sbjct: 662 R----SY---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMLLQAD 712
Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
S V+ + + K SC LY DK
Sbjct: 713 DSGDLDEVELDDSTRAGK--WRSCCLYWDKA 741
>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
Length = 1375
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 168/728 (23%), Positives = 294/728 (40%), Gaps = 144/728 (19%)
Query: 57 NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NL+V ++++++ V V G E+ N+ E L + A L LV
Sbjct: 28 NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
W + +F + DP RC + + IL
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+A+Q G E G+ G + + SS V+ L +LD + H F+H Y EP
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
++ + T A ++ + + ++ K + S LP+D +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316
Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
L+VG+N IH + +A+N A + S ++ ++ L+ L D
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRL 429
++ G + +LT DGR V + + P+ +I T +G F+GS
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPV-PADNGGNILKTRASCSTNLGRGRLFIGSED 435
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
G+S+L+ +T ++ +LRR S+ Q + E++S
Sbjct: 436 GESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEE 471
Query: 490 AS---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD---------- 517
N+T + K T++F V D L +I P++D
Sbjct: 472 EEVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTE 531
Query: 518 -FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------G 560
+ G + ++ A G N EL ELP G+W V+ K G
Sbjct: 532 SLTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAA 614
+A+++ A+ D +Y YL++S T+V E TE D+ +G T++
Sbjct: 592 QDAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSV 649
Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
G L +V+QV R D + + P E N +++ S ADPY+L+
Sbjct: 650 GVLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQ 705
Query: 675 MSDGSIRL 682
D S+++
Sbjct: 706 REDSSVKI 713
>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1439
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 229/1013 (22%), Positives = 381/1013 (37%), Gaps = 173/1013 (17%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
Y R+ ++ ES G V D L LV Y + G V LA + S+ G
Sbjct: 66 YDRRLNDDDGLESSFLGGDGMLVRADRAVNTKLVLVAEYPIFGVVTGLARIKIQHSKSGG 125
Query: 125 DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
+ ++++A A++S+++++ H L S+H +E E + S GPL
Sbjct: 126 E------ALLIATRVARLSLVQWNSEKHALEDISIHYYEKEEL------QGSPFDGPLAN 173
Query: 183 ----VKVDPQGRCGGVLVYGLQMI-ILKASQG--------------GSGLVGDEDTFGSG 223
+ DP RC L +G + I L Q G + T +
Sbjct: 174 YRTHLAADPGSRCAA-LSFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPPTTAAT 232
Query: 224 GGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G S +S+V+ L LD + H F+H Y EP I+ +
Sbjct: 233 NGTSNIADVPYSTSYVLPLPQLDPSLLHPVYLAFLHEYREPTFGIISSTQRRSNTLPRKD 292
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
H + + L + + I S NLP D +K++A+P P+GG L+VG N IH
Sbjct: 293 HFSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVVALPGPVGGALLVGTNELIHIDQSGK 350
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYD 394
+A+N + + +S + L+ H + +N L+ G L ++T D
Sbjct: 351 PNGVAVNAFTKETTNFPLADQSDLDLRLEHCHIELMSAENGELLMVLSDGRLAIITFKID 410
Query: 395 GRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
GR V + + V S I+ + ++FF+GS DSL++ +T +
Sbjct: 411 GRTVSGVSVKPVAAEVGGNIVQCSVSTISKLSRNVFFVGSTGSDSLVLGWTRKQAQN--- 467
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
+ K D + D + G+ +N S +F V D
Sbjct: 468 ARRKTRLVDDSFEYDLEDEDMDDGDDDDLYGETTTTMIQPGATANGV-SKGGDLTFRVHD 526
Query: 508 SLVNIGPLKDFSYGLR-INAD------------------------ASATGISKQSNYELV 542
SL++I P+KD + G + N D A A I Q+ V
Sbjct: 527 SLLSIAPVKDMTSGKQAFNPDSEEANNSVGVVADLQLACVVGRGNAGAVAILNQNIQPKV 586
Query: 543 ----ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
E P +G WT V + D AA E+ A S+ + M++ DL
Sbjct: 587 IGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVGSEFDAS---SIYDKFMIVSKVDL- 642
Query: 597 TEVTESVDYFV-----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSY- 638
+ E+ D + G T+ AG + R+IQV + R DG
Sbjct: 643 -DGYETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLG 701
Query: 639 MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPA 698
++Q L + E+G+ V+S SIADPY+LL D SI + D + V+
Sbjct: 702 LSQILPM--LDEETGA---EPRVVSASIADPYLLLVRDDSSIMVAQIDNNCELEEVEKQD 756
Query: 699 AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY 758
S K ++ C LY D +TG + G P Q +I+ +
Sbjct: 757 DAILSTKWLAGC-LYAD----------------TTGRFAPVQTDKGTPEGQ-NIFMFLLS 798
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
+GAL I+ +P+ + V +G T++ + + GT Q E
Sbjct: 799 AAGALYIYALPDLSKPVYV---AAGLTYVPPLL----------SADYAVRRGTVQ---ET 842
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
+ + V +L + P+L + + Y+ E + T V S++L
Sbjct: 843 LTELLVADLG----DTTTTSPYLILRHANDDLTIYEPIRLESQDKT------VGLSKTLH 892
Query: 879 VSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC 936
++ N +++P++ E E P P + NI+G+ FL G+ P +
Sbjct: 893 FQKIT-----NPALAKSPVEVADDEANEQPRFVPLRPC---PNINGYSTVFLPGASPSFI 944
Query: 937 MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
+ + L + + H C GFIY S+G ++ QLP+ + +
Sbjct: 945 IKSSKSSPKVIGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADTNF 997
>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
CBS 112818]
Length = 1398
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 237/1027 (23%), Positives = 391/1027 (38%), Gaps = 166/1027 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ + + D+I+++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVRISNSKSGG--DAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141
Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
P + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 142 LGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSDA 200
Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ + + + LP D +K++ +P P+GG L++G N +H
Sbjct: 261 IDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380
Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSRP 436
Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
S T+ S K G E+ A D D + ++L AS E
Sbjct: 437 SSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLHSVNPASIRQEKQVVN 494
Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
+ F+F D L ++GP +D + G + + S + +EL +G
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKSG 554
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
TV + + DS +M DD Y + I ++ + T + + D LL +
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTRLSRSYDKYLLLAKAKGDD 611
Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
ESV Y V + T+ G L RV+QV R D Y
Sbjct: 612 KEESVVYSVGSSGLDSIDAPEFNPNEDCTVDIGTLATGTRVVQVLRTEIRSHD--YNLGL 669
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
P E SE TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 670 AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726
Query: 701 ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYES 760
S K VS C LY DK + + S E + GP +I +
Sbjct: 727 --SGKWVSGC-LYEDK----------MNIFFSDFDIE----NEAGP----NILLFLLDVD 765
Query: 761 GALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
G L IF +PN + + VD L S SSS +E +
Sbjct: 766 GNLSIFRLPNISEPLCRVDNL--------------NLLPSNLPYESSSRRPV---NRETL 808
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ + +L A H P++ ++ Y+ Y G S R L
Sbjct: 809 TELLIADLG----DAIHKSPYMILRTKHDDLVLYEPYRIAGESGHSG-------LRFLKA 857
Query: 880 SN--VSASRLR---NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
N V R N +R+P + C+ + ++ G++ F+SG PC
Sbjct: 858 VNHVVMGPRTDQGVNHDINRSP------------SSCKLLRALPDVCGYKTVFMSGHNPC 905
Query: 935 WCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
+ ++ R H G V + + H C GF YV ++++ +LPS + +D+ W
Sbjct: 906 F-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGW 964
Query: 994 PVQKVVF 1000
+K+
Sbjct: 965 ATRKIAL 971
>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1439
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 203/924 (21%), Positives = 364/924 (39%), Gaps = 138/924 (14%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ A L LV +RLHG V L + + N D ++++FEDAKI+VLE+ + H L
Sbjct: 118 TVARLRLVREHRLHGMVTGLGRIKILSSLNDGL-DRLLISFEDAKIAVLEWSEEQHDLLT 176
Query: 157 TSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
S+H +E +P+ + L S G L +VDP RC + + I+ Q
Sbjct: 177 VSIHTYERAPQLMSLN---ASLFHGWL-RVDPISRCAALALPCDAFAIIPFHQTLE---- 228
Query: 216 DEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
A S +++L D + +V D F+ G+ P + +L + TW G
Sbjct: 229 -----------EAPYAPSFILDLTSEVDQRIHNVVDMSFLPGFNNPTVAVLFQPTQTWTG 277
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
R++ T + ++ + +P+I S NLP+D + A + +GGV+V+ +N+I +
Sbjct: 278 RLTEYKDTMKLLVFTLDAVTRNYPVITSVDNLPYDCLSVHACSAAVGGVIVITSNSIIHV 337
Query: 333 SQSA-SCALALNNYAVSLDSSQELP----RSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
SQS+ AL++N +A + P ++ ++ L+ + ++ + L K G +
Sbjct: 338 SQSSRRVALSVNGWASRVTDMSLAPVQAEYATRNLALEGSRLAFVDDRTFFLFLKDGTVY 397
Query: 388 LLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+ + DG VV + + S + + +T + F+GS G S+L++ T
Sbjct: 398 PVELSLDGAVVSTISMGHALAQSAIPAVVTPVTQEHIFVGSTAGTSVLLKIT-------- 449
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDM-------VNGEELSLYGSASNNTESAQK 499
++EE D +DA + + + S + D + + SL +N T + K
Sbjct: 450 --SVEEEVEDNASDAVAAAVVDTADSMVMDDDDDIYGVSMKTDAQSLSNGHANGTHLSVK 507
Query: 500 TFS---FAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP--- 545
S ++ DSL G + D S+ L N + +ATG + L + LP
Sbjct: 508 KRSVTHLSLSDSLPGYGSISDMSFSLAKNGEKVVPELVAATGSGSMGGFTLFQRDLPART 567
Query: 546 --------GCKGIW------------TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
G +G+W T Y ++ AD+ + D A +
Sbjct: 568 KRKLHAIGGGRGMWSLSLRPTVKVNGTSYERAVNPFQADNDTVVVSTDANPAPGLSRFSH 627
Query: 586 RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQ 641
RT E S+ V G+TI A F R ++ V R+L DGS + +
Sbjct: 628 RTPRTEI---------SITTRVPGQTIGAAPFFQRTAILHVMSNAIRVLEPDGSERQVIK 678
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI- 700
DL + SI DP+VL+ D +I L +G+ + + + +
Sbjct: 679 DLD---------GNMARPKIRHCSICDPFVLIVREDDTIGLFIGESERGKIRRKDMSPMG 729
Query: 701 ESSKKPVSSCTLYHDKGPEPWLRKTSTDA---WLSTGVGEAIDGADG-----------GP 746
+ + + ++ C + G + + ++ +T + + AD G
Sbjct: 730 DKTSRYLTGCFFTDNAGVFDLRSQANGNSGADKTATSTLQGVVNADSRSQWLLLVRPQGV 789
Query: 747 LDQGDIYSVV-CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
L+ D+ + C +I+ +P + VF+V + + D+ AL
Sbjct: 790 LEASDLSPIPGCRRLNEKQIWTLPKLSIVFSVRLASTLDWVLADSGDGPAL--------- 840
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
S G R + + V + + +P L L G + YQA P S
Sbjct: 841 -SMPGESPRRPQE---LDVEQAVIAPLGETAPQPHLLLFLRSGQLAIYQAI----PMQAS 892
Query: 866 KSDDPVS-TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
D+ +S S + + V+ R + ++ +T +
Sbjct: 893 SVDESLSRPSLGVRFAKVATRVFEIQRQDDSEKSILAEQKKISRVLIPFLTSPSPTTTFS 952
Query: 925 GFFLSGSRPCWCMVF-RERLRVHP 947
G F +G PCW + R +R+HP
Sbjct: 953 GVFFTGDHPCWILKPDRSGIRIHP 976
>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
P131]
Length = 1475
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 230/1054 (21%), Positives = 399/1054 (37%), Gaps = 195/1054 (18%)
Query: 57 NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
NLVV +++++I+ R+ + +G+ +S + L D G+ A+
Sbjct: 51 NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 110
Query: 100 ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
L LV + L G + LA + S D +++AF+DAK+S++E
Sbjct: 111 RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 170
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
+D L S+H +E E + S PL + DP RC L +G +
Sbjct: 171 WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 223
Query: 201 MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
+ + + G +G +D T + G +E S V+ L
Sbjct: 224 SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 283
Query: 240 DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+LD + H F++ Y EP IL +T + ++ K H + ++ K
Sbjct: 284 NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 341
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
I S LP D +++A+P+P+GG L+VG+N IH + +A+N S S
Sbjct: 342 ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 401
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNP------ 408
+S + L+ L + D L T+V+ DGR V L + P
Sbjct: 402 QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 461
Query: 409 -SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S +T +G + F GS GDS++ + S K + D + D
Sbjct: 462 LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQ---VSKRKPKIQDPDLDLDIDYDD 518
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
D D+ E + ++++ E+ F V D +V+I P++D ++G
Sbjct: 519 LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDRMVSIAPIRDVTFGKPPPPT 578
Query: 523 ---RINADASA----------TGISKQSNYELV------------ELPGCKGIWTV---- 553
R D +A G K S+ ++ E P +G+WT+
Sbjct: 579 DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 638
Query: 554 -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
K + N + AA + +Y Y+I++ E ET+D+ +
Sbjct: 639 PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 697
Query: 607 -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
G TI AG + ++IQV + R DG +TQ + + E+G
Sbjct: 698 TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 752
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
S SIADPY+L+ D S + + + ++ I SS K + C LY D
Sbjct: 753 ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 806
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
S G A P I + +GAL I+ +P+ +
Sbjct: 807 -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 847
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L +++ G R E I + V +L + + H
Sbjct: 848 -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 895
Query: 841 LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
IL ++ + Y+ Y ++D S ++ L + +L N ++ P +
Sbjct: 896 --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRL-----RKLPNPAVAKAP-E 939
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
A E+ P + + NI+G+ F+ G P + + + + P+ L +
Sbjct: 940 ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 996
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
A + H C GFIY S G+ ++ Q+P +++
Sbjct: 997 AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSF 1030
>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
Length = 1437
Score = 130 bits (326), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 202/978 (20%), Positives = 388/978 (39%), Gaps = 153/978 (15%)
Query: 57 NLVVTAANVIEIYVVRVQE------------------------EGSKESKNSGE------ 86
N+VV +N++ I+ VR + EG E SGE
Sbjct: 45 NVVVARSNLLRIFEVREEPAPFSTQKEDERDRRASMRKGTEAVEGEVEMDASGEGFVNMG 104
Query: 87 ----TKRRVLMDGISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFED 139
T + ++ + L+ +RLHG +E + I++ ++S D ++++F+D
Sbjct: 105 SVKSTGQNGILHQPTVNRFYLIREHRLHGIVTGIEGVRIIT--SIEDSF--DRLLVSFKD 160
Query: 140 AKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGV 194
AKI++LE+ +++H L S+H +E +P+ + + PL ++ DP RC +
Sbjct: 161 AKIALLEWSEAMHDLITVSIHTYERAPQLMAID--------APLFRSQLRADPLSRCAAL 212
Query: 195 LVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIF 251
+ + IL Q + L D + S +++L D +++V DF+F
Sbjct: 213 SLPKDSIAILPFYQSQAEL--DIMEHETSQARDVPYSPSFILDLSADVDTRIRNVIDFVF 270
Query: 252 VHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKL 311
+ G+ P + +L + + TW GR+ T + ++ + +P+I + LPHD + +
Sbjct: 271 LPGFNSPTIAVLFQYQQTWTGRLKEYKDTVGLILFTLDLVTRHYPVITAIDGLPHDCFAM 330
Query: 312 LAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFS-------VE 363
+ +GGV+V+ +N+I Y Q+ L ++ + L +LP S S ++
Sbjct: 331 APCSTALGGVVVLASNSIIYVDQATRRVILPVSGW---LPRISDLPIPSLSHQDQQRDLQ 387
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS-KTNPSVLTSDITTIGNSL 422
L+ + ++ + + K G + + ++ DG+ V RL ++ + + S + + +
Sbjct: 388 LEGSQFVFVDDRTLYVVLKDGTVYPVEIIVDGKTVSRLSMAPPVARTTMPSLVRKMQDDY 447
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD--APSTKRLRRSSSDALQDMVN 480
F+GS +G S+L++ T G E + A AP+ D
Sbjct: 448 LFVGSIIGPSVLLKTT---RVEEDIEGDDVEMASVPATVVAPNNAMDLDDDDDLYGGSAV 504
Query: 481 GEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASAT 531
E+ + G N + + K + DSL GP+ D ++ L N D +AT
Sbjct: 505 IEQPHMNGITQNGSTAISKKRTVVQLSFCDSLPAYGPIADMTFTLAKNGDRAVPELVAAT 564
Query: 532 GISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
G + L++ LP G +GIW++ + + N + A + YHA
Sbjct: 565 GSGMLGGFTLLQRDLPTRTKRKMHAIGGGRGIWSLLVRQAVKVNGSTYERPA--NPYHAE 622
Query: 579 ---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
++IS +A + A + + + G TI A F ++ V +
Sbjct: 623 NDSIVISTDANPSPGLSRIASRNAQGDIQITTRIPGTTIGAAPFFQGTAILHVMINVTNV 682
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
+ + D + + + SI DP+VL+ D SI L +G+ +
Sbjct: 683 I--RVLEPDGTERQVIKDWDGNIPRPKIRFCSICDPFVLIIRDDDSIGLFIGESERGKIR 740
Query: 694 VQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
+ + + E + + ++ C + D + + + A + + G Q
Sbjct: 741 RKDMSPMGEKTSRYLAGC-FFTDTSGIFQVHQNAQAAGIEGATSTLQSVMNAGNRTQ--- 796
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++C G +EI+ +P F+ + + D Y AL S ++
Sbjct: 797 WLILCRPQGVIEIWTLPKLGLAFSTTHAAGLESVLTDLYDPPAL--------SVPQDPPR 848
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+ ++ +I + V L RP L L G + Y+ + +T +P+
Sbjct: 849 KPQELDIEQLLVAPLG-----ESSPRPHLMLFLRSGQLAVYEVH------STPVPAEPLP 897
Query: 873 TSRS--LSVSNVSA-SRLRNLRFSRTPLDAYTREET-------PHG---APCQRITIFKN 919
+RS L V V SR N++ S + E+ P +P Q +
Sbjct: 898 AARSSTLLVKFVKVLSRAFNIQHSDEVEKSVLAEQKRISHLLIPFATSPSPGQTFS---- 953
Query: 920 ISGHQGFFLSGSRPCWCM 937
G FL+G RP W +
Sbjct: 954 -----GVFLTGDRPSWLL 966
>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
Length = 1361
Score = 130 bits (326), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 205/976 (21%), Positives = 374/976 (38%), Gaps = 170/976 (17%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLR 155
S L LV Y L G + SL + NS+ D++++AF DAK+S++E+D ++H +
Sbjct: 49 SETKLVLVAEYNLAGTITSLGRVK---IPNSKSGGDAVLVAFRDAKLSLIEWDPALHSIS 105
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
S+H +E + + + + VDP RC + I+ Q L
Sbjct: 106 TLSIHYYEHHDLQSIPWQPDLSKCVSHLTVDPSSRCAAFNFGVSNLAIIPLHQVRDELAM 165
Query: 216 DE---------DTFGSGGGFSARIES-------SHVINLRDLD--MKHVKDFIFVHGYIE 257
DE + G + +S S V+ L LD + H D F+H Y +
Sbjct: 166 DEFDEVDGEVKERLSPDGQNENKHDSPDTPFKPSFVLPLTALDPGLLHPVDMAFLHEYRD 225
Query: 258 PVMVILHERELTWAGRVSWKHH----TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
P + IL+ + A R S +H + + ++ K + S LP+D Y+++A
Sbjct: 226 PTVGILY----STAARSSNMNHERRDVTIYAVYALDIGQKASTALQSVQKLPNDLYRVMA 281
Query: 314 VPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL 372
+P P+GG L++G N IH + A+A+N A S +++ ++L+ L
Sbjct: 282 LPPPVGGALLIGGNELIHIDQSGKTIAIAVNELAKEASSFPMADHANYRLKLEGCQIEHL 341
Query: 373 QNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV-------LTSDITTIGNSLF 423
N L+ KTG+L LL+ DGR+V + L + ++ T +G++
Sbjct: 342 GNPSGDMLVILKTGELALLSFRMDGRMVSSMALRRVGEGQSQGLALGASTCSTNLGSNRL 401
Query: 424 FLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-----LQDM 478
F+GS DS+L+ G T+ L + R++ + DA ++
Sbjct: 402 FIGSEESDSILL--ATGRKTTQLRR--------------TNSRIQSQADDAGLFDDNEED 445
Query: 479 VNGEELSLYGSASN----NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+E LY ++ N + +F + D L +I P+ D + A + ++
Sbjct: 446 GIEDEDDLYAELADELNGNASTDVSGHNFRLLDRLPSIAPINDVALANVGKRRAEESEVT 505
Query: 535 KQ-----------------------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
+Q S ++ G+W + +RG A
Sbjct: 506 RQELAVAYGRGHAGGLAFLSRKLEPSVTRQIKFERPIGVW-CFSSGNRGQQ------GAE 558
Query: 572 DDEYHAYLIISL-----EARTMVLETAD-LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
++ + ++IS RT +L D L + ES G I L IQ
Sbjct: 559 EENFDDLVMISQTTDDGAGRTKLLRLIDGDLNSMGESEFDESAGAAIGVFKLEATNHTIQ 618
Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
V R+ D + + F + E G + + + VS DPY+++ DGS+ LL
Sbjct: 619 VLPTELRVYDAGFALSQI-FPIVDEEEG---QTARAVKVSFVDPYLVVVKDDGSMSLLKA 674
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
D + V+ P + + + S TLY D TD T G
Sbjct: 675 DKAGELDEVELPENLRAWS--ILSATLYQD-----------TDDMFQTSRFY------NG 715
Query: 746 PLDQGDIYSVVCYESGALEIFDVPNFNC-VFTVDKFVSGRTHIV-DTYMREALKDSETEI 803
G I +++ + G + +PN + VF D TH++ D + + ++
Sbjct: 716 TATPGPILTILT-QDGHFCLLSLPNVSIQVFQCDSLPFLPTHLMQDLQLPKHWRN----- 769
Query: 804 NSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPEN 863
K+++ + + +L ++ +P+L G ++ Y+++
Sbjct: 770 ------------KDDLGEVLLADLG----NSTDRQPYLVVRNLVGDVIIYESFAMP---- 809
Query: 864 TSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGH 923
D + + R V +A L + EE + Q + N++GH
Sbjct: 810 -----DVLGSFRFKKVFTKAAGELED------------GEEVGQPSTLQPMQAVTNVAGH 852
Query: 924 QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
F+ G +P M + +L + + +H C G + V + +K C +
Sbjct: 853 ASVFIPGRQPLLIMREASTMPRVYELNPTKLKSMNSVHTGTCRQGLVLVDADDEIKFCNI 912
Query: 984 PSGSTYD-NYWPVQKV 998
P + + W +++V
Sbjct: 913 PDSTVLGLSDWVIRRV 928
>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
Length = 1452
Score = 129 bits (325), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 230/1054 (21%), Positives = 399/1054 (37%), Gaps = 195/1054 (18%)
Query: 57 NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
NLVV +++++I+ R+ + +G+ +S + L D G+ A+
Sbjct: 28 NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 87
Query: 100 ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
L LV + L G + LA + S D +++AF+DAK+S++E
Sbjct: 88 RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 147
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
+D L S+H +E E + S PL + DP RC L +G +
Sbjct: 148 WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 200
Query: 201 MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
+ + + G +G +D T + G +E S V+ L
Sbjct: 201 SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 260
Query: 240 DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+LD + H F++ Y EP IL +T + ++ K H + ++ K
Sbjct: 261 NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 318
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
I S LP D +++A+P+P+GG L+VG+N IH + +A+N S S
Sbjct: 319 ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 378
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNP------ 408
+S + L+ L + D L T+V+ DGR V L + P
Sbjct: 379 QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 438
Query: 409 -SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S +T +G + F GS GDS++ + S K + D + D
Sbjct: 439 LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQ---VSKRKPKIQDPDLDLDIDYDD 495
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
D D+ E + ++++ E+ F V D +V+I P++D ++G
Sbjct: 496 LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDLMVSIAPIRDVTFGKPPPPT 555
Query: 523 ---RINADASA----------TGISKQSNYELV------------ELPGCKGIWTV---- 553
R D +A G K S+ ++ E P +G+WT+
Sbjct: 556 DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 615
Query: 554 -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
K + N + AA + +Y Y+I++ E ET+D+ +
Sbjct: 616 PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 674
Query: 607 -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
G TI AG + ++IQV + R DG +TQ + + E+G
Sbjct: 675 TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 729
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
S SIADPY+L+ D S + + + ++ I SS K + C LY D
Sbjct: 730 ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 783
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
S G A P I + +GAL I+ +P+ +
Sbjct: 784 -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 824
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L +++ G R E I + V +L + + H
Sbjct: 825 -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 872
Query: 841 LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
IL ++ + Y+ Y ++D S ++ L + +L N ++ P +
Sbjct: 873 --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRL-----RKLPNPAVAKAP-E 916
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
A E+ P + + NI+G+ F+ G P + + + + P+ L +
Sbjct: 917 ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 973
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
A + H C GFIY S G+ ++ Q+P +++
Sbjct: 974 AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSF 1007
>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 1447
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 184/745 (24%), Positives = 307/745 (41%), Gaps = 153/745 (20%)
Query: 57 NLVVTAANVIEIYV-----VRVQEEGSKES---KNSGETKRRVL-MDGISAA-------- 99
NLVV A++++I+ V + E K+S K+ T R DG+ A+
Sbjct: 28 NLVVAKASLLQIFTTKTVSVDLDELSGKDSSTVKDVTSTDPRAHDEDGVEASFLGADSIL 87
Query: 100 ---------SLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
L L+ Y L G V SL I S+ G + ++++ F+DAK+S++E
Sbjct: 88 PRSELARTTKLVLIAEYNLSGTVTSLVRVKTISSKTGGE------ALLVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQGGSGLVGDED--------------TFGSGGGFSARIESSHVINLRDLDMKHV--KDFI 250
Q + D D + G + SS V+ L LD +
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKSAAENGILATPYGSSFVLRLSSLDPSLIFPIHLE 261
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F++ Y EP IL + + + H T M+ L I K I S LP+D
Sbjct: 262 FLYEYREPTFGILSSTMAPSSALLQERKDHLTYMVFTLDIHQ--KASTTILSVGGLPYDL 319
Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
+ ++ + P+GG L+VGAN IH + +A+N +A + L +S ++ L+
Sbjct: 320 FMIVPLAPPVGGALLVGANELIHIDQAGKANGVAVNMFAKQCTNFSLLDQSDLALRLEGC 379
Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLT---SDITTI 418
L +N L+ +GD+ +L+ DGR V L + + + +LT S ++++
Sbjct: 380 KIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGDILTGAASCVSSL 439
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
G F+GS + DS+++ ++ SG PS ++ R SS A+ D+
Sbjct: 440 GAGALFVGSEVSDSVILGWSRKSGQ------------------PSRRKSRLDSS-AIADV 480
Query: 479 ----------------VNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
+ G+ ++ +A+N T S K ++F++ DS+VNI P+ + ++
Sbjct: 481 DEAMLDEEDLEDDDDDLYGDGPTISPTAANVTASNSKAGDYTFSIHDSMVNIAPITNITF 540
Query: 521 G-----------LRINADAS------ATGISKQSNYELV------------ELPGCKGIW 551
G L++N S A G K + ++ ELP +GIW
Sbjct: 541 GEVALSSDKEEELKLNGVQSELQLLAAVGREKGGSLAVINRNIQPNVIGRFELPEARGIW 600
Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
T+ K + +G + + D +Y +I+S EA + E+A +
Sbjct: 601 TMSAKKPAPKGLQVNKEKTVIGGDYGVDAQYDRLMIVSKASEAEDAIDESAVYALTNAGF 660
Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E ++ G TI AG L RVIQV + R DG + Q L + E+G+
Sbjct: 661 EALSGTEFEPAAGSTIEAGTLGNGMRVIQVLKSEVRSYDGDLGLAQILPM--LDDETGA- 717
Query: 656 SENSTVLSVSIADPYVLLGMSDGSI 680
++S S ADP++LL D SI
Sbjct: 718 --EPKIISASFADPFLLLIRDDASI 740
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/119 (21%), Positives = 53/119 (44%), Gaps = 1/119 (0%)
Query: 872 STSRSLSVSNVSASRLRNLRFSRTP-LDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
STS +L S + ++ N ++ P + A + + + + N+ G+ F+ G
Sbjct: 879 STSPNLLSSTLQFLKIHNTHLAQAPDVSAEEQADETQQTSDKPMRAVSNLGGYSVVFMPG 938
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
P + + + L L + + H C+ GFIY ++GI+++ Q P +T+
Sbjct: 939 GSPSFIVKSSKTLPKVLSLQGTGVRGLSSFHTEGCDRGFIYADTEGIVRVAQFPPTTTF 997
>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
protein 1
gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
Length = 1383
Score = 128 bits (322), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 210/1019 (20%), Positives = 398/1019 (39%), Gaps = 178/1019 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 ----AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD +S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
++ ES+ + V G TI+ G R+RV+QV + R D +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
TQ ++ ++ +S S+AD + + D ++ L D S V
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719
Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+ S K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
L ++ V + + ++ + G + ++ SSE G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
V +L + WSA P+L ++ Y+ ++ VST +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S+ N R P + + + + + I +ISG F+ G+ + +
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFII-- 903
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
S F L N + ++ C+LP + +D W +++V
Sbjct: 904 ---------RTSASAPHFLRLRGEN--------SRSSTVRFCKLPPMTRFDYQWTLKRV 945
>gi|380488833|emb|CCF37111.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
Length = 1062
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 226/1013 (22%), Positives = 390/1013 (38%), Gaps = 173/1013 (17%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
Y R+ ++ ES G V D L LV Y + G V LA + S+ G
Sbjct: 66 YDHRLNDDDGLESSFLGGDGMLVRADRAINTKLVLVAEYPIFGIVTGLAKIKLQYSKSGG 125
Query: 125 DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
+ ++++A A++S++++D H L S+H +E E S GPL
Sbjct: 126 E------ALLIATRVARLSLVQWDPEKHALEDISIHYYEKEEL------EGSPFDGPLNN 173
Query: 183 ----VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES------ 232
+ DP RC + + L Q + D+ G A+ S
Sbjct: 174 YRTHLAADPGSRCAALRFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPSATAATN 233
Query: 233 ------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+V+ L LD + H F+H Y EP I+ + H
Sbjct: 234 GTSNIADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSTQRRSNTLPRKDH 293
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSAS 337
+ + L + + I S NLP D +K++A+P P+GG L+VG N IH
Sbjct: 294 FSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPVGGALLVGTNELIHIDQSGKP 351
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDG 395
+A+N + + +S + L+ + + +N L+ G L ++T DG
Sbjct: 352 NGVAVNPFTKETTNFPLADQSDLDLRLEHCYIELMSAENGELLMILSDGRLAIITFKIDG 411
Query: 396 RVVQ----RLDLSKTNPSVLTSDITTIGN---SLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
R V +L ++ ++ ++TI ++FF+G+ DSL++ +T +
Sbjct: 412 RTVSGVGVKLVPTEVGGGIVQCSVSTISRLSRNVFFVGTTGSDSLVLGWTRKQAQNARK- 470
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVR 506
K D D+ D D + GE + + A+ N S +F V
Sbjct: 471 --KTRLVD---DSFEYDLEDEDMEDDDDDDLYGETTTTMIQPGATANGVSKGGDLTFRVH 525
Query: 507 DSLVNIGPLKDFSYGLRI-------------------------NADASATGISKQSNYEL 541
DSL++I P+KD + G + +A A I Q+
Sbjct: 526 DSLLSIAPVKDMTSGKQAFIPDSEEEKNSVGVVADLQLACVVGRGNAGAVAIVNQNIQPK 585
Query: 542 V----ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMV 589
V E P +G WT+ KS +G ++ +A+ D +Y ++I+S +
Sbjct: 586 VIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVASEFDASSKYDKFMIVS-KVDLDG 644
Query: 590 LETADLLTEVTESVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQ 641
ET+D+ + G T+ AG + R+IQV + R DG ++Q
Sbjct: 645 YETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQ 704
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
L + E+G+ V+S SI DPY+LL D SI + D + V+
Sbjct: 705 ILPM--LDEETGA---EPRVISASITDPYLLLVRDDSSIMVAQIDNNCELEEVEKQDDTI 759
Query: 702 SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
S K ++ C LY D +TG+ + G P Q + + + +G
Sbjct: 760 LSTKWLAGC-LYTD----------------TTGLFAPMQTDKGTPEGQ-NTFMFLLSAAG 801
Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
AL I+ +PN + V +G T+ V ++ + + GT Q E +
Sbjct: 802 ALYIYALPNLSKPVYV---AAGLTY-VPPFL---------SADYAVRRGTVQ---ETLTE 845
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L + P+L + + Y+ E + T + +++L
Sbjct: 846 LLVADLG----DTTATSPYLIVRHANDDLTIYEPIRLESQDKT------LGLAKTLHFQK 895
Query: 882 VSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ N +++P++ E E P P + NI+G+ FL G+ P +
Sbjct: 896 IT-----NPALAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIV-- 945
Query: 940 RERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
+ + P+ L + + H C GFIY S+G ++ QLP+ S +
Sbjct: 946 -KSAKSSPKVVGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNF 997
>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
2508]
Length = 1437
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ I+ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 23/178 (12%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSY 949
>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
206040]
Length = 1441
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 218/1028 (21%), Positives = 388/1028 (37%), Gaps = 159/1028 (15%)
Query: 57 NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
NLVV ++++I+ V+ V ++ ES G +
Sbjct: 28 NLVVAKGSLLQIFTVKAISTELDPEFQPSQPTETETRFDRQVNDDDGLESSFLGGESMFM 87
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
D + L L+ L G V LA + + + ++++LA++ AK+ + E+D
Sbjct: 88 RTDRTNNTKLVLIAEIPLAGTVIGLARVKT--KNTASGGEALLLAYKAAKMCLAEWDPKK 145
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQGG 210
+ L S+H +E E + E F ++ DP RC + IL ++
Sbjct: 146 NELETISIHYYEK-EEMQGSPWEEVFGEYVNYLEADPGSRCAAFKFGTRNLAILPFTRSE 204
Query: 211 SGLVG---DED-------------TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFV 252
L DED G G A S V+ L LD + H F+
Sbjct: 205 EDLEMEDWDEDLDGPRPVKEHTAAANGDGNNVEAAYTPSFVLRLPLLDPSLLHPVHLTFL 264
Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
H Y EP +L + + S H + + L + + I S LPHD YK++
Sbjct: 265 HEYREPTFGVLSSSQAPASSLGSKDHLSYKVFTLDLQQ--RASTTILSVTGLPHDLYKVI 322
Query: 313 AVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AAHA 369
A+P+P+GG L+VG N IH +A+N A S +S ++ L+ A
Sbjct: 323 ALPAPVGGALLVGQNELIHVDQSGKPNGVAINPMAKLATSFNLTDQSDLNLRLESCAIEL 382
Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI---GNSL 422
++N LL G L +++ DGR V L + + +++ S +T I G +
Sbjct: 383 LAIENGELLLILNDGRLGIISFKIDGRTVSGLGVKLVGADCGGNIIKSRVTCISRLGKNA 442
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
FFLGS DS+++ + S K D + + + +
Sbjct: 443 FFLGSETSDSVVLGW---SRKQTQEKRRKSRLIDTDLALDVDELDLEDDEEDDDLYGDDS 499
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------INAD- 527
+ +N SF + D+L++I P++D + G ++AD
Sbjct: 500 ATTKPNQTANGGTVKSGDISFRIHDTLLSIAPIQDITCGQSAFLPDSEEATLNKGVSADL 559
Query: 528 --ASATG---------ISKQSNYELV---ELPGCKGIWTVYHK----SSRGHNADSSRMA 569
A A G I+++ +++ E P +G WT+ K S G NA ++
Sbjct: 560 QLACAVGRGEAGSIAVINREIQPKVIGRFEFPEARGFWTMCVKKPVPKSLGTNAGAAGDY 619
Query: 570 AYDDEYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
++ ++I++ E + TA + E+ G T+ AG + + V
Sbjct: 620 DAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGNQMVV 679
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
IQV + R +G + Q L + +G+E V S SI DPY+L+ D S+ L
Sbjct: 680 IQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDASVFL 734
Query: 683 LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA 742
D + ++ + +S K + C LY D + GV +A G
Sbjct: 735 AQIDSNNEIEEIEKTDSGLTSTKWAAGC-LYKD----------------TKGVFQANQG- 776
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSET 801
D ++ + +GAL I+ +P+ + V+ + S H+ ++ + +
Sbjct: 777 DQAKKSGEEVMMFLLNTAGALHIYALPDLSKPVYVAEGLSSIPPHLSADFVAKKV----- 831
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
+E + + V +L H P+L + + Y+
Sbjct: 832 ------------ASREALTELVVADLG----DTVHYSPYLILRHSTDDLTIYEPIRL--- 872
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
+D P SA+ + PL+ ++ P P + I N+
Sbjct: 873 ----PTDSPTRNLSDTLFFKKSANSILAKSTVEDPLEDTAQQ--PRYVPLR---ICANVG 923
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G+ FL G P + + + + + + + + C+ GFIY S+GI ++
Sbjct: 924 GYSTVFLPGPSPAFILKSSKSVPRVVGVQGLGVRGMSTFNTEGCDRGFIYSDSEGIARVT 983
Query: 982 QLPSGSTY 989
QLPS + +
Sbjct: 984 QLPSKTNF 991
>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
Length = 1437
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ I+ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEKERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 23/178 (12%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSY 949
>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
[Aspergillus kawachii IFO 4308]
Length = 1372
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 210/1020 (20%), Positives = 400/1020 (39%), Gaps = 182/1020 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVTSNAEGADAQQELTKLLLEK----------EYSLSGTVTGL 76
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NSR +++++AF +AK+S++E+D + S+H +E + +
Sbjct: 77 CRVK---VLNSRSGGEAVLVAFRNAKLSLIEWDPERRSISTISIHYYERDDLTRSPWVPD 133
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS------------ 222
G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 134 LKNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGMSTDHDL 190
Query: 223 GGG---------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
GGG + S V+ L LD + H F++ Y EP IL+ + T +
Sbjct: 191 GGGPDKAKEGIAYQTPYAPSFVLPLTALDPSILHPISLAFLYEYREPTFGILYSQVATSS 250
Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
+ + + ++ + ++ S LP D ++++A+P P+GG L++G+N +H
Sbjct: 251 ALLPERKDVVFYTVFTLDLEQQASTILLSVSRLPSDLFRVVALPPPVGGALLIGSNELVH 310
Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVL 388
+ A+ +N ++ + S +S ++ L+ L + LL TG++ +
Sbjct: 311 IDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLSTGEMAI 370
Query: 389 LTVVYDGRVVQRLD---------LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
+ DGR V + L+ N + S T IG+ FLGS GDS+L+ ++C
Sbjct: 371 MKFKLDGRSVSGISVHLLPAHAGLTSMNSAAAAS--TFIGDGKIFLGSEDGDSVLLGYSC 428
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNTESA 497
S +S ++ D AD +S D +D + + +L G + SA
Sbjct: 429 SSSSSKKHRLQAKQAIDDSADMSEED---QSEDDVYEDDLYSTSPDTTLTGRRPSGESSA 485
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WT 552
+ F + D L+NIGPL+D + G ++ + G S +++ +G
Sbjct: 486 FGLYDFRMHDKLINIGPLRDITIGRKLPTNQEKGGDRTNSTSPELQIVASQGSHKSGGLV 545
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLLTEV 599
V + H S + + D + A L EA R V+ T ++
Sbjct: 546 VMAREIDPHVVASISLESVDSIWTASLTWEEEAVSRTSENIGQRSQRCYVIATEAKASDR 605
Query: 600 TESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-MTQD 642
ES+ + V G TI G R+RV+QV + R D +TQ
Sbjct: 606 EESLIFVVDGHDLKPFRAPDFNPNEDVTINIGTQESRKRVVQVLKNEVRSYDIDLGLTQI 665
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
++ ++ +S S+AD + + D ++ L D S V + S
Sbjct: 666 YPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVLGEDVAS 720
Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 721 GK--WISCCLYSDK----------------TGLFSSIDRTLSEPV-KNDMFLFLLSHDSK 761
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
L ++ V + + ++ + + G + ++ SSE G +EN+
Sbjct: 762 LFVYRVRD-QKLLSIIEGLDGLSPLL-----------------SSEPPKRSGTRENLVEA 803
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
V +L + WSA P+L + ++ Y+ ++ + T + + +
Sbjct: 804 IVADLG-ETWSAS---PYLILRSENDDLIIYKPFV-------------IPTGPTGEIHTL 846
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
S+ N D + + + + + I +ISG F+ G+
Sbjct: 847 KFSKENNSVLPMISPDVDSTQPSGSDYRVRPLRILPDISGLSAVFMPGAS---------- 896
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ----GILKICQLPSGSTYDNYWPVQKV 998
F + + + H F+ + + ++ CQLP + +D W ++KV
Sbjct: 897 ------------AGFVLRTSASAPH-FLRLRGESPRCSTVRFCQLPPMTRFDYQWTLKKV 943
>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
Length = 1324
Score = 126 bits (317), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 142/291 (48%), Gaps = 29/291 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL +RL+G ++ ++ + RD+++L+F DAKIS ++F+ S L +
Sbjct: 34 LELAASFRLNGVATAMVAITL----PKQLRDTVVLSFADAKISAIQFEPSTRTLITQKLI 89
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
E E ++ + P+++ DP RC G LVYG +++I+ A
Sbjct: 90 NLEI-EAVYGSKVNADLP--PVLQADPLHRCIGALVYGCRLVIIPAH------------- 133
Query: 221 GSGGGFSARIESS-HVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
R VI+L L + K F F+ GY P ++LHE W GR +
Sbjct: 134 ----ALQPRTNVQFRVIDLEKLSSPLGQAKSFCFLTGYTTPTALLLHEPRPVWVGRHAVG 189
Query: 278 HHTCMISALS--ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
+C++SALS + TT P +W+ +LP D + L+ P P+GG L+V N + + +Q+
Sbjct: 190 RDSCVLSALSCELDTTDDFAPTVWAKDSLPSDCFALVPTPQPLGGALIVSPNMVLHTNQA 249
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
+S A+A+N A ++ S+ LD A T++ + A+ S ++G L
Sbjct: 250 SSSAVAVNAIAARATGYPHTTQAGLSLNLDNARVTFITSVDAIFSLQSGQL 300
>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
Length = 1440
Score = 126 bits (316), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 228/1029 (22%), Positives = 401/1029 (38%), Gaps = 170/1029 (16%)
Query: 57 NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
NLVV ++++I+ V+ V ++ ES G +
Sbjct: 28 NLVVAKGSLLQIFTVKSISTELDPEFQPNQPAEVDTRFDRQVNDDDGLESSFLGGETMFM 87
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
D + L L+ L G V LA L + + ++LA++ AK+ + ++D
Sbjct: 88 RTDRTNNTKLVLIAEIPLAGTVIGLARLKTN--KTASGGEVLLLAYKAAKMCLAQWDPKK 145
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMIILKASQ 208
+ L S+H +E E L E F G V + DP RC + IL +
Sbjct: 146 NELETISIHYYEK-EELQGSPWEEVF--GEYVNHLEADPGSRCAAFKFGTRNLAILPFRR 202
Query: 209 GGSGLVG---DEDTFG---------SGGGFSARIESSHV------INLRDLDMKHVKDFI 250
L DED G + G S +E+++ + L D + H
Sbjct: 203 SEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTPSFVLRLPLLDPSLLHPVHLT 262
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP +L + A + K H ++ + I S LPHD YK
Sbjct: 263 FLHEYREPTFGVLSSSQAP-AASLGLKDHLSY-KVFTLDLQQRASTTILSVTGLPHDLYK 320
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AA 367
++A+P+P+GG L+VG N IH +A+N A + S ++ ++ L+ A
Sbjct: 321 VIALPAPVGGALLVGQNELIHVDQSGKPNGVAVNPMAKLVTSFSLTDQADLNLRLENCAI 380
Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSD---ITTIGN 420
++N LL G L +++ DGR V RL + +V+ S I+ +G
Sbjct: 381 ELLAVENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVGADCGGNVIKSRAACISRLGK 440
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD-APSTKRLRRSSSDALQDMV 479
+ FF+GS GDS+++ + + + + + I+ D A L + D+
Sbjct: 441 NTFFVGSETGDSVVLGW-----SRRQTQEKRRKSRLIDPDLALEVDELDLEDDEEDDDLY 495
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------IN 525
+ + +N + SF + D L++I P++D + G ++
Sbjct: 496 GDDSAATKPQTTNGGAAKSGDLSFRIHDVLLSIAPIQDITCGQAACLPDSEEATLIKGVS 555
Query: 526 AD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
+D A A G + + ++ E P +G WT+ K + S+ A
Sbjct: 556 SDLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFWTMCVKKPVPKSLGSNVGVA 615
Query: 571 --YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
YD ++ ++I++ E + TA + E+ G T+ AG + +
Sbjct: 616 GDYDAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGKQ 675
Query: 621 RRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
VIQV + R +G + Q L + +G+E V S SI DPY+L+ DGS
Sbjct: 676 MMVIQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDGS 730
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
+ L D + ++ +S K V+ C LY D + GV ++
Sbjct: 731 VFLAQIDSNNEIEEMEKADGGLTSTKWVAGC-LYKD----------------TKGVFQSN 773
Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--REAL 796
+ G D+G + + +GAL I+ +P+ + V+ + S H+ ++ R A
Sbjct: 774 LNSAAGKADEG-VMMFLLNSAGALHIYSLPDLSKAVYIAEGLSSIPPHLSAGFVARRGAT 832
Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+++ TEI V +L + HS P+L + + Y+
Sbjct: 833 RETLTEI-------------------VVADLG----DSVHSSPYLILRHSTDDLTIYEPI 869
Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
T D + +S + S+++ S + + P D + P P +
Sbjct: 870 RLPTASATHALSDTLFFKKSAN-SSLAKSAVED------PSD--DTAQPPRYVPLRTCA- 919
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
N+ G+ FL G P + + + + L + + H C+ GFIY S+G
Sbjct: 920 --NVGGYSAVFLPGPSPAFIIKSSKSIPRVVGLQGLGVRGMSTFHTEGCDRGFIYADSEG 977
Query: 977 ILKICQLPS 985
I ++ QLPS
Sbjct: 978 IARVTQLPS 986
>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Piriformospora indica DSM 11827]
Length = 1324
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 215/989 (21%), Positives = 393/989 (39%), Gaps = 161/989 (16%)
Query: 55 VPNLVVTAANVIEIYVVR---VQEEGSKES--KNSGETKRRVLMDGISAASLELVCHYRL 109
V NLVV N + IY VR EE ES K+SG ++ + L LV + L
Sbjct: 36 VTNLVVGRNNRLRIYDVRRTIYTEETHVESDLKSSGPSRH--------SHRLCLVREHLL 87
Query: 110 HGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
HG + LA + D ++++F+D+K++++E+ ++++ + S+H +E L
Sbjct: 88 HGIIIGLAAVRTANPGLGSP-DRLLVSFQDSKLALMEWSNTLYDISTVSIHSYERSPLLL 146
Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
E A ++ DP RC +++ + +L Q + L D
Sbjct: 147 NSDFTECRA---YLRTDPANRCAALVMPRDNIALLPWYQPQTEL--DVQDGIQSIAEELP 201
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S+V N+ +D ++++ D +F+ G+ P + IL + + TW GR+ + +S
Sbjct: 202 YSPSYVTNVSAMDERIRNILDLVFLPGFNVPTIAILFQEQRTWTGRLKENKDNTSLFFIS 261
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYA 346
+ + + +I + LP+D+ + + +GGVLVV AN+I H S L ++ +A
Sbjct: 262 LDLVSRSYQVIATIEKLPYDSLYMSPCHAKLGGVLVVTANSILHVDQASKITTLPMSGWA 321
Query: 347 VSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ D+S + + L+ + ++ + +LS G + + + ++GR V L
Sbjct: 322 ARVSDTSHGFQDAVDDIHLEGSRMGYISDSQVILSLSNGKCLHIRIDHEGRTVWGLTAVH 381
Query: 406 T-----NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
T PSVL + + L FLGS GDS+L ++
Sbjct: 382 TFGISSPPSVLIAK-----DGLVFLGSTAGDSVLFEYA---------------------- 414
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFS- 519
QD+ + + L N +E+ +FS D+L + G S
Sbjct: 415 ---------------QDLSSHRDFML----PNASETIPTSFSLLPVDNLQDSGSYTAASF 455
Query: 520 YGLRINADA---SATGISKQSNYELV----------ELP---GCKGIWTVYHKSSRGHNA 563
+GLR + + +A G+ + V +LP G +GIW S R H
Sbjct: 456 FGLRGSEEPALIAANGLDDLGGFSTVHKTMPLRLRKKLPAIAGRQGIW-----SMRVHQG 510
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIAAGNLFG 619
+ + + ++S +A T + + T+ +D + R TIA F
Sbjct: 511 NGIELPLGHNT-----LLSTDA-TPTPGASRIATKSQARLDINITTRIPMLTIAVAPFFD 564
Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
++QV R+L T D S + + + + + +I DPYVL+ D +
Sbjct: 565 GTHLLQVTSNSLRLL-----TTDGSEKQVIPDRDNSTARARIRHAAICDPYVLILREDDT 619
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-----KGPEPWLRKTSTDAWLSTG 734
+ L VG+P+ + + + + K + T Y D K E +R+T
Sbjct: 620 LGLFVGEPTRGKLRRKDMSPLGDKKLCYWAATFYDDLTGRLKIDEDLMRETK-------A 672
Query: 735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMRE 794
VG ++G+ + +C +G LEI+ +P VF V G +
Sbjct: 673 VG-----------NRGEKWLALCRSTGTLEIWSLPKLALVF-VSSISLGPS--------- 711
Query: 795 ALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCY 853
LK D + E++S+++ G + + + +L S H L + ++ Y
Sbjct: 712 VLKHDQKKEVDSATKTELPVG-ATTLQQVIITDLGEIEPSPH-----LIVLYDSNLLIVY 765
Query: 854 QAYLFEGPENTSKSDDPVSTSRSLSVSNVS-ASRLRNLRFSRTPLDAYTREETPHGAPCQ 912
Q P K+ P RS+ +S R+ + + TP + T + +
Sbjct: 766 QMV----PLEPDKAGLPQLDRRSVPSLRISFVKRMVHHLANPTPDENQTSGGSNEKRLPK 821
Query: 913 RITIFKNISGH----QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
I F + G F++G P W + +H ++ +FT + +
Sbjct: 822 TIVPFSVLDWEGNSIYGAFVTGDNPAWILSKNHSGLLHLPCGYEAVHSFTPCSMWDFSPT 881
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQK 997
F+ T +G + Q G T+ +P K
Sbjct: 882 FLMSTEEGSC-LVQWTPGITFHGQYPCSK 909
>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 1408
Score = 125 bits (313), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 184/750 (24%), Positives = 304/750 (40%), Gaps = 121/750 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++Y + GS ++ +T+ + + L LV Y L G + L
Sbjct: 28 NLIVAKTTLLQVYNLVNVVYGSSPGQSDEKTRSQY-------SKLVLVAEYALSGTITDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++I++A +AK+S++E+D H + TS+H +E + +H+ +
Sbjct: 81 GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137
Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-------------------- 214
A P + VDP RC VL +G + + IL Q G LV
Sbjct: 138 LAACPSHLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMDDFDSDHDDERQIDTNHTA 196
Query: 215 --GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
DE G + SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 197 EERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQVAAS 256
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-I 329
+ + + S ++ + + S LP+D +K++ +P P+GG L+VG+N +
Sbjct: 257 SALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSNELV 316
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLV 387
H + A+ +N +A S +S + L+ L +N LL G +
Sbjct: 317 HVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNGVMA 376
Query: 388 LLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
+++ DGR V + L +T PS +G F GS GDS+L+
Sbjct: 377 VVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SAPVGRGKIFFGSEEGDSILI- 431
Query: 437 FTCGSGTSMLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY----- 487
G S LS+G K E G+ D + D D + E LY
Sbjct: 432 -----GWSRLSAGAKVSPAPETGE---DNVAELSEDEEDDDDDDDEEDAYEDDLYATPVT 483
Query: 488 -GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNY 539
G NT S T + F + D L N+GP++D + G + D + S +
Sbjct: 484 PGINPRNTASMNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYL 543
Query: 540 ELVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDD 573
ELV G G+ +V+ K + + S A
Sbjct: 544 ELVTTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPSQSGSLPANAGS 603
Query: 574 EYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVF 627
Y YL++S + +++V + + E T + ++ + RTI G L G RV+QV
Sbjct: 604 NYDHYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVL 663
Query: 628 ERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
+ R D G + Q ++ SE +V+ S A+PYVL+ D SI LL D
Sbjct: 664 KGEVRSYDSGLGLAQIYPVWDEDT-----SEERSVMHASFAEPYVLIIRDDSSILLLQAD 718
Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
S ++T I+S+ S +LY DK
Sbjct: 719 ESGDLDEIETDGIIKSTT--WISGSLYQDK 746
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 16/80 (20%), Positives = 41/80 (51%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV + ++
Sbjct: 900 DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 959
Query: 979 KICQLPSGSTYDNYWPVQKV 998
++C+ P + +D W +K+
Sbjct: 960 RMCRFPRNTHFDGSWAARKI 979
>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1437
Score = 125 bits (313), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 154/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + ++ + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ ++ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 23/178 (12%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSY 949
>gi|312077399|ref|XP_003141287.1| hypothetical protein LOAG_05705 [Loa loa]
Length = 316
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 132/266 (49%), Gaps = 34/266 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + G+ L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSFS 361
C ++LN+ +D + P F
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFK 298
>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
protein 1
gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
[Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
nidulans FGSC A4]
Length = 1339
Score = 124 bits (310), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 229/1008 (22%), Positives = 379/1008 (37%), Gaps = 191/1008 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++I+ +R S ++ +T+ R L L Y+L G V +
Sbjct: 28 NLIVARTSLLQIFSLR------DVSLSALDTEVRPAQHRQETCKLVLEREYQLPGTVTDI 81
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ ++ G D ++++AF DAK+S++E+D +GL S+H +E +
Sbjct: 82 CRVKILKTKSGGD------AVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPW 135
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE- 231
+ G ++ DP RC + I+ Q G LV D+ FGS + R+E
Sbjct: 136 ASDLSTCGSILSADPGSRCAIFQFGARSLAIIPFHQPGDDLVMDD--FGSEPDYENRVEG 193
Query: 232 --------------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
SS V+ L LD + H F++ Y EP IL+ + T
Sbjct: 194 NSRSHEAKDKDAAEYQTPYASSFVLPLTALDPSVIHPISLAFLYEYREPTFGILYSQVAT 253
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
+ + + +++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 254 SHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVALPPPVGGSLLIGSNEL 313
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
+H + A+ +N ++ S +S ++ L+ +D LL+ TG
Sbjct: 314 VHIDQAGKTNAVGVNEFSRQASSFSMTDQSDLALRLENCVVERFSDDNGDLLLALSTGVF 373
Query: 387 VLLTVVYDGRVVQRLD---LSKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCG 440
L++ DGR V + LS + L S ++ +GN F GS DS+L+
Sbjct: 374 ALVSFKLDGRSVSGISVRPLSGPSKEFLASTASSSAFLGNGKVFFGSESADSVLL----- 428
Query: 441 SGTSMLSSGLKEEF-GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
G S SS K+ F G D S DA +D + + N S
Sbjct: 429 -GWSSASSATKKSFSGSTSND--------ESEDDAYEDDLYSSAPAAMTDNPQNQPSNSS 479
Query: 500 TFSFA---VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVY 554
+F + D L + GP++D G A + T K ELV G G +
Sbjct: 480 VAAFGDLRIHDRLSSPGPIRDIVLGRSSEASSRDT---KDGVLELVAAQGSDEGGTMVIM 536
Query: 555 HK--------SSRGHNADS----SRMAAYDDEYHAYLIISL-------EARTMVLETADL 595
+ S A+S S + +D+ Y+I+S E+ VLE D
Sbjct: 537 KREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVILSKQEKPDKEESEVFVLE--DK 594
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
L +T T+ G L + RVIQV R D + D
Sbjct: 595 LRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVRSYDAVWDEDD------------- 641
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S+ ++ ++ DPY+ + D ++ LL D S + TL D
Sbjct: 642 SDERVAVNATLVDPYLAIIRDDSTLLLLQADDS----------------GDLDEVTLSED 685
Query: 716 KGPEPWLRKT--STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
+ WL S +A T +I + + L ++ +P+F
Sbjct: 686 VVSQKWLSACFYSDNAGFFTAPFASI--------------LFLLNQDHQLYVYRLPDF-A 730
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
V +V + V I+ T E K S T +EN+ + VVEL
Sbjct: 731 VISVIEGVGCLPPILST---EPPKRSTT--------------RENVLQIAVVELG----D 769
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
++ S PFL + ++ Y+ + E T R L +N + + N
Sbjct: 770 SYSSLPFLILRTENDDLVVYKPFFTNSKELTGL--------RFLKEANHTLPKTPNTT-- 819
Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLC 950
D E P + I NI+G F+ G P +FR P +L
Sbjct: 820 ----DELQSEMKP-------LRILPNIAGCSSIFMPG--PSAGFIFRAS-TTSPHFIRLR 865
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
G I + + GF Y+ S G L + +LP G+ W ++ V
Sbjct: 866 GGFIKGLGCFD--SPDKGFAYLDSHG-LHLAKLPEGTQLGYPWIMRTV 910
>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1513
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 173/771 (22%), Positives = 308/771 (39%), Gaps = 167/771 (21%)
Query: 48 SKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKE---SKNSGETKRRVLMDGISAASLELV 104
SK P+ NL+V + +++++ + + E+ E ++N E K + L +
Sbjct: 37 SKTRPRPITNLIVARSTLLQVFELCLVEDDQAENNHTRNHHELKNK-------NYKLFHL 89
Query: 105 CHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
C +RLHG V L L+ D ++++F+DAK+++LE+ +S L S+H FE
Sbjct: 90 CEHRLHGRVTGLQRLTTLDTQEDGL-DRLLVSFQDAKMTLLEWSNSAADLVPISLHTFEK 148
Query: 165 -PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
P+ R+ + ++VDP RC +L+ + +L Q L D+ G
Sbjct: 149 LPQITQGDLPRDFQGQ---LEVDPLSRCAVLLLPQATLAVLPFFQDQLDL----DSLGLS 201
Query: 224 GGFSARI------------ESSHVINLRD----------LDMKH--VKDFI---FVHGYI 256
GG + + SS +++ LD +H +K I F+ G+
Sbjct: 202 GGLKSALGSEQQRFQTFPYASSFILDFNQQLLNHLPPSSLDSQHRPIKSVIALKFLPGFS 261
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
EP + +L++ + TW+ R+ +T + L++ P+I NLP+DA+ L+A P
Sbjct: 262 EPTLAVLYQSQYTWSARLENHANTAALIVLTLDLGSNHFPIISHTTNLPYDAHGLVACPK 321
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS------------------ 358
+ GVLV+ A+ I + QS+ N V S ++PR
Sbjct: 322 ELAGVLVLCADMILHVDQSSKIIGLATNGWVKHTSELQIPRQDTVRLITPTNKISGHRST 381
Query: 359 ---------------------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
V L+ A + + D A + +TG++ L
Sbjct: 382 TNKSDERPEDLEDGEEQDESGVPEGHEKLLVRLENAKIVFSRADRAFVFLRTGEVFSLQF 441
Query: 392 VYDGRVVQRLDLSKTN-PSVLTSDITTIGNSLFFLGSRLGDSLL-----------VQFTC 439
+ DGR + +L L K + S++ S + + N F+GS GDS L
Sbjct: 442 LRDGRTLTKLVLEKLDLLSIIPSTVLKVNNECLFVGSMAGDSALYILDHLRPRSSSDDDN 501
Query: 440 GSGTSMLSSGLKE-----------EFG-DIEADAPSTKRLRRSSSDALQD--MVNGEELS 485
G + SS + + +F DI D T +RR+ L D NG +
Sbjct: 502 DDGHQLPSSSIIQPDKAAKNQSSLDFDEDIYGDRTETDPVRRTDHSQLYDDRPSNGADDG 561
Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
G+ ++ E + + D + GP++DF+ +ATG+ +EL
Sbjct: 562 RPGAGAHLAEPFLR-----LGDVIQAHGPIRDFTM--------AATGVENMP----LELL 604
Query: 546 GCKGI-----WTVYH-----KSSRGHNADSSRMAAYDDEYHAYLII-----SLEARTMVL 590
C G TV+H + R + +S + + + L++ S E + + +
Sbjct: 605 ACTGTGDLGGLTVFHREIPLRKRRKLSFESPSASHINALFFTSLVVESGGLSEERKVVWM 664
Query: 591 ETADLLTEVT---ES-----VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
+ TE+ ES ++ F + +T+A FG++ V+QV ++ S
Sbjct: 665 GRSGPRTEIATYGESGELSLINTFPE-KTLAVSPFFGKQFVVQVTNTAIKLFTSSL---- 719
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
++ +L SI D YV+L G + GD + T+S
Sbjct: 720 -----EEAQVIQPEPAVKILRASIVDDYVMLETHCGLKLIYQGDHDSKTLS 765
>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM
1558]
Length = 1408
Score = 123 bits (309), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 150/656 (22%), Positives = 279/656 (42%), Gaps = 94/656 (14%)
Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L L+C + LHG + LA L + D D ++++F+DAK+++LE+ S + S
Sbjct: 117 LHLLCQHTLHGWITGLAPLRTIESSVDG---LDRLLVSFKDAKMALLEW--SRGDIATVS 171
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
+H +E + + G F PL++ DP R + + + IL Q S L E+
Sbjct: 172 LHTYERCQ--QMVTGDLQFYT-PLLRSDPLSRLAVLTLPEDSLAILPVLQEQSDLDPLEN 228
Query: 219 TFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
F +S S V++L D+ +K+++D +F+ G+ P + +L+ TWAGR
Sbjct: 229 -FTKDAPYSP----SFVLSLADVAPTIKNLQDLLFLPGFHSPTLAVLYSPYHTWAGRYHS 283
Query: 277 KHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGV-LVVGANTIHYHSQ 334
+ T + + T +PL+ S LP D+ ++A P+ +GGV LV +H
Sbjct: 284 QRDTFCLEVRTFDITAGGSYPLLTSVSGLPSDSLYIVACPAELGGVVLVTTTGLLHIDQS 343
Query: 335 SASCALALNNY-----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
+ A ++N + + D S E S + L+ + + ++ LL + GD+ +
Sbjct: 344 GRTVATSVNAWWSHITTLPCDKSSE----SRKISLEGSKSVFVTERDMLLVLQNGDVHQV 399
Query: 390 TVVYDGRVVQRLDLSKTNPSV-LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
+GR + + + + + +V S + T GN F+G GDSLL +
Sbjct: 400 RFEMNGRAIGAIKVDEQSSNVPAPSSMVTTGNQAIFVGCAEGDSLLANVDIKRAVA---- 455
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE----SAQKTFSFA 504
+++ IEA+A D +D+ ++ L A+N + + +
Sbjct: 456 -IEDRKPAIEAEA---------EVDWDEDLYGDIDVPLTNGATNGAKYQAITGPANIVLS 505
Query: 505 VRDSLVNIGPLKDFSYGLRINADASAT--------GISKQSNY-------------ELVE 543
D L +G + D +G+ + + T G SK+S + E
Sbjct: 506 PADVLTGVGKIVDMEFGIASTDEGTRTYPQLVTIGGGSKRSTFNAFRRGIPISKRRRFNE 565
Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMVLETADLLTEVT 600
L + +W + + G + + + ++ ++ S EA R L ++
Sbjct: 566 LFNTESVWFLPIQRPSGQH-----LKSIPEDRRTTMLFSSEATQTRIFSLSAKPNPEQIG 620
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
+ G+++ G F R V+ V + +LD TQ G+E
Sbjct: 621 R-----ISGKSLTVGPFFQRSNVLVVTQTEVLLLDSDGKTQ----------QSIGNEGEE 665
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHD 715
++S SI+DPYV++ +GS + VGD +S V+ P+ +S + P + ++ D
Sbjct: 666 IVSASISDPYVVIRRVNGSGSMFVGDTVARQLSEVKIPS--DSLQPPYQAIEVFSD 719
>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
clavigera kw1407]
Length = 1461
Score = 123 bits (308), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 218/980 (22%), Positives = 371/980 (37%), Gaps = 169/980 (17%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S + L LV + L G V LA + G + +++++A +DA++S+LE+D + L
Sbjct: 100 SISKLVLVAEFPLAGTVTGLARIKIPGTKSGG--EAVLVALKDARLSLLEWDPDQNDLTT 157
Query: 157 TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
S+H +E E + DP RC + + IL Q V
Sbjct: 158 ISIHYYEQEELQGAPWAAPLSDYANFLVADPGSRCAALKFGARNLAILPFRQADEEDVDM 217
Query: 217 ED-------------------TFGSGGGFS-ARIESSHVINLRDLD--MKHVKDFIFVHG 254
+D G G G S V+ L +LD + H F+H
Sbjct: 218 DDWDEELDGPRPAKDPSSAAVVSGPGDGIEDTPFAPSFVLRLSNLDTTLLHPVHLAFLHE 277
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y EP IL T A V + ++ K I S NLP D ++++ +
Sbjct: 278 YREPTFGILSSSVSTSA--VIGRRDKLSYLVFTLDLQQKASTTILSVANLPQDLFRVVPI 335
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
PSPIGG ++VGAN IH + +A+N + S +S ++ L+ L
Sbjct: 336 PSPIGGAILVGANELIHIDQSGRANGVAVNPFTKQSTSFGLADQSDLALRLEGCTVDVLS 395
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLS----KTNPSVLTSDITT---IGNSLFF 424
+ L+ G L +LT+ DGR V L + + V+ S IT IG + F
Sbjct: 396 AEAGELLIVLHDGQLAVLTIRVDGRTVSGLSVKMVRREAGGDVIQSGITCLSRIGRQMLF 455
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVNGEE 483
GS DS+++ ++ G + G D+ AD R + D + +
Sbjct: 456 AGSDQADSVVLGWSRKQGQTARRKPRANRAGLDLGADEEYFDDEREEGEELDDDEDDDDL 515
Query: 484 LSLYGSAS------NNTESAQKTFSFAVRDSLVNIGPLKDFSYG-------LRINADAS- 529
SA+ N T SF + D L++I P++D G L +D +
Sbjct: 516 YGDGPSAAQTLGIDNTTGRGGDDLSFRIHDRLLSIAPIRDMVIGKPALVGELAKRSDQAT 575
Query: 530 ---------ATG---------ISKQSNYELV---ELPGCKGIWTV-----YHKSSRGHNA 563
A G +S++ N + + E + +WTV ++ +G
Sbjct: 576 IHSELNLVCAVGSGRAGALALLSREINPDPLGAFEFAEAQALWTVSSSKPIPRTIQGEKG 635
Query: 564 DSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQGRTIAA 614
++ Y+ + Y+I++ E ET+D+ + G T+ A
Sbjct: 636 GATVGEDYESPAMHDKYMIVAKEDDDG-FETSDVYAVTASGFETLKGTEFEPAAGFTVQA 694
Query: 615 GNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
G + RR+IQV + R DG ++Q L + +G+E VL SIADPY+LL
Sbjct: 695 GTMGRNRRIIQVLKSEVRCYDGDLGLSQILPM----VDEDTGAE-PRVLFASIADPYLLL 749
Query: 674 GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
D S+ + + ++ +S K V+ C LYHD KTS A+L +
Sbjct: 750 IRDDASVLVAEMNKDFELEELERDDGSLASTKWVAGC-LYHDTA--SVFSKTSILAFLLS 806
Query: 734 GVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS--GRTHIVDTY 791
SG I+ +P+ V + ++ R + D
Sbjct: 807 A-------------------------SGTFYIYALPDLKQPVYVAEGLNYVPRLFLPDHT 841
Query: 792 MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
+R + KE + + V +L A P+L + +
Sbjct: 842 VRRGMA------------------KEPLTEILVADLG----DAVSKAPYLIVRHANDDLT 879
Query: 852 CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGA 909
YQ P+ T SL + S L+ N F+++P+ + + ++
Sbjct: 880 IYQ---------------PLRTPSSLGSLSESLRFLKVPNPVFAKSPV-SISSDDASSQL 923
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCN 966
+ + +NI G+ FL GS + + + + P+ L ++ + + H +
Sbjct: 924 RAMPLRVCENIGGYSTVFLPGSSASFVL---KSAKSQPRVVSLQGTAVRSLSPFHTESSE 980
Query: 967 HGFIYVTSQGILKICQLPSG 986
FIYV +G ++C +P+G
Sbjct: 981 RSFIYVDVEGSGRVCSMPAG 1000
>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
Length = 1441
Score = 122 bits (307), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 222/999 (22%), Positives = 379/999 (37%), Gaps = 156/999 (15%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
+V ++ ES G V + + L L+ L G V LA L + + +
Sbjct: 68 QVNDDDGLESSFLGGETMLVRTERTNNTKLVLITEIPLAGTVIGLARLRT--SRTASGGE 125
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++A++ AK+ + E+D + L S+H +E E L E F G V + DP
Sbjct: 126 VLLIAYKAAKLCMAEWDPRKNELETISIHYYEK-EELQGAPWEEVF--GEYVNHLEADPG 182
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVG---DEDTFG---------SGGGFSARIESSHV- 235
RC + + IL + L DED G + G S +E+++
Sbjct: 183 SRCAALKFGTRNLAILPFRRSEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTP 242
Query: 236 -----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
+ L D + H F+H Y EP +L + A + H + + L +
Sbjct: 243 SFVLRLPLLDPSLLHPVHLTFLHEYREPTFGVLSSSQAPAASLGARDHLSYKVFTLDLQQ 302
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
+ I S LPHD Y+++A+P+P+GG L+VG N IH S +A+N A
Sbjct: 303 --RASTTILSVTGLPHDLYRVIALPAPVGGALLVGQNELIHVDQSGKSNGVAVNPMAKLA 360
Query: 350 DSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDL 403
S +S + L+ A ++N LL G L +++ DGR V RL
Sbjct: 361 TSFSLTDQSDLKLRLENCAIEVLAIENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVG 420
Query: 404 SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ +VL S T + G + F+GS DS+++ + S K D +
Sbjct: 421 ADCGGNVLKSRATCVSRLGKNTLFVGSETSDSVVLGW---SRRQTQEKRKKSRLIDPDLA 477
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ + + + N + +F + D L++I P++D +
Sbjct: 478 LEVDELDLEDDEEDDDLYGDDSVATKPQQLPNGGPAKSGDLTFRIHDVLLSIAPIQDVTC 537
Query: 521 GLR--------------INAD---ASATGISKQSNYELV------------ELPGCKGIW 551
G + AD A A G + + ++ E P +G W
Sbjct: 538 GQAAFPPDSEEATLNRGVRADLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFW 597
Query: 552 TVYHKSS--RGHNADSSRMAAYDD--EYHAYLIIS------LEARTMVLETADLLTEVTE 601
T+ K + A++ YD ++ ++I++ E + TA + E
Sbjct: 598 TMCVKKPVPKSLGANAGVAGDYDTPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKE 657
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
+ G T+ AG + + VIQV + R +G + Q L + +G+E
Sbjct: 658 TEFEPAAGFTVEAGTMGKQMVVIQVLKSEVRCYNGDLNLIQILPM----LDEETGAEPRA 713
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
V S SI DPY+ + DGS+ L D + ++ + +S K V+ C KG
Sbjct: 714 V-SASIVDPYLFIVRDDGSVFLAQIDSNNEIEEMEKTDSSLTSTKWVAGCLYKDTKG--- 769
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDK 779
+ + +D+ T EA+ + +GAL IF +P+ + V+ +
Sbjct: 770 IFQSSYSDSTKQTS--EAV-------------MMFLLNSTGALHIFALPDLSKAVYVAEG 814
Query: 780 FVSGRTHIVDTYM--REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
S H+ Y R A +++ TEI V +L A H+
Sbjct: 815 LSSIPPHLSAGYAARRGATRETLTEI-------------------VVADLG----DAVHA 851
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP- 896
P+L + + Y+ P N T+ +LS L F ++P
Sbjct: 852 SPYLILRHSTNDLTIYEPIRL--PAN--------ETAHTLS---------DTLFFKKSPN 892
Query: 897 -LDAYTREETPHGAPCQR-----ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
+ A + E P Q + I N+ G+ FL G P + + + L
Sbjct: 893 AVLAKSAVEDPSDDTAQPPRYVPLRICANVGGYSSVFLPGPSPAFVIKSSRSVPRVVGLQ 952
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
+ + H C+ GFIY S+GI ++ QLPS + +
Sbjct: 953 GHGVRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNF 991
>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
Length = 1620
Score = 122 bits (306), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 158/662 (23%), Positives = 265/662 (40%), Gaps = 102/662 (15%)
Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L LV +R+HG V L + G D D ++++F+DAKI++LE+ D+I+ L S
Sbjct: 136 LHLVREHRMHGFVTGLEKVRTLASGEDGM---DRLLVSFKDAKIALLEWSDAIYDLSTVS 192
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-- 216
+H +E + E PL++ DP+ RC +L+ + IL Q + D
Sbjct: 193 LHTYERSSQVSTSEASE---HRPLLRADPESRCAALLLPKDALAILPFVQRTGLDLADPA 249
Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
D ++ S+V L D D ++HV DF F+ + P + IL++ W GR+
Sbjct: 250 RDKEREHQPYTP----SYVFPLSDADDTLRHVLDFCFLPSFHTPTLAILYQPAQNWTGRL 305
Query: 275 SWKHHTCMISALSISTTLK----------QHPLIWSAMNLPHDAYKLLAVP--SPIGGVL 322
S ++ +++ K +I LP+DA+ LL S GGV+
Sbjct: 306 SQTKDNTSLAIVTLDLVGKGAAAGGGAGGGGAVISRTHGLPYDAFSLLPAREGSTFGGVV 365
Query: 323 VVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--------LDAAHATWLQ 373
V+ N++ H LA + + S+ P +F+ E L+ + W
Sbjct: 366 VLAGNSVLHVDPAGRIVGLAASGWHAQ-SSALRFPLWAFTAEEGETEERKLEGSRLCWAG 424
Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLD----LSKTN-PSVLT-------SDITTIGNS 421
+L G L V +GR V L L +T+ P+VL + G
Sbjct: 425 EQQLILVGAQGWARELKVGVEGRNVSSLSAGRRLGRTSAPAVLCPVGEQSGRALKPTGRD 484
Query: 422 LFFLGSRLGDSLLVQFTCGSG--TSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQD 477
L +L S G S+L+Q G + +G ++E D+E DA S K +D L D
Sbjct: 485 LVWLASEAGQSVLLQVHKGEPRVEEVKPNGEEKEIEGEDMEIDADSDK------NDDLAD 538
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS-------- 529
+ L ++ A + V D+L G + D S+ L +
Sbjct: 539 IYGDSGLPAAAASGVTAGPALPWLTLEVLDALQGHGQIADMSFALSFRSGPDRPTPKLVC 598
Query: 530 -------------ATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY----D 572
G+ + + + G +GIW++ + R D
Sbjct: 599 STPEGERGAWTVYENGLPIRVKRRVPAVAGTRGIWSLRVRRGDRARRGGRRERGEREWAD 658
Query: 573 DEYHAYLIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
E LI+S +A RT+ +++ L ++ + T+AAG F V+Q
Sbjct: 659 GEERDNLIVSTDATPSPGISRTITVDSRGELQIISR-----LPALTLAAGVFFSHTCVMQ 713
Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
V +LDG ++L N E S ++ + DP+V++ +GS+ L +G
Sbjct: 714 VTPDSLHLLDGD--GKELQVLKDNE---GNKEASPIIKACVEDPWVVVTRENGSVALYLG 768
Query: 686 DP 687
DP
Sbjct: 769 DP 770
>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana
RWD-64-598 SS2]
Length = 1490
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 196/913 (21%), Positives = 349/913 (38%), Gaps = 130/913 (14%)
Query: 57 NLVVTAANVIEIYVVR-------VQEEGSKESKN---------SGETKRRVLMDG----- 95
NLV +N+I IY VR Q E KE K+ GE + DG
Sbjct: 40 NLVTARSNIIRIYEVREDAASLSSQVEAEKERKSHVRKGTEAVEGEVEMDTGGDGWVNMG 99
Query: 96 ---------ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLE 146
+ V + +HG V + + + + N R D ++++F+DAKI++LE
Sbjct: 100 SVKSTSSGPPTVTRFHFVREHVVHGIVTGMDCI-RTISSNEDRMDRLLVSFKDAKIALLE 158
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+ D+ H L S+H +E E L R L +VDP RC + + + IL
Sbjct: 159 WSDAAHDLITVSIHTYERSE--QLMSIDAPLFRSSL-RVDPLSRCAALSLPNNALAILPF 215
Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVIL 263
Q + E + G S +++L D + +V DF F+ G+ P + +L
Sbjct: 216 YQTQAEFDVIEGEGETEGMRDVPYSPSFILDLPVDVDSSLCNVIDFAFLPGFNNPTLAVL 275
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP------ 317
+ E TWAGR+ T ++ ++ P++ + LP DA+ L P
Sbjct: 276 CQSEQTWAGRLKEHRDTTLVVTFTLDLLSCTFPILSTLRGLPSDAFSLSPATLPPDFTSG 335
Query: 318 -------IGGVLVVGANTIHYHSQSASCA-------------LALNNYAVSLDSSQELPR 357
GV+V+ + + Y Q A C L+++N ++ ++++
Sbjct: 336 LSGGASNAHGVVVLTPDAVLYADQ-ARCVGAAVSGWATRTSDLSISNAYLTGGTAKDAEG 394
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDIT 416
+ L+ A L LL ++G++ ++ +V +GR V R+D+ +V+ + +
Sbjct: 395 DVKPLALEGAFPLLLTPTALLLVLRSGEMHVVRLVTEGRSVGRVDVGPCVGQTVMPATVV 454
Query: 417 TIGNSLFFLGSRLGDS--------LLVQFTCGSGTSMLSSGLKEEFGDIEADA--PSTKR 466
+ LG G+ + V G T +LS+ EE + S
Sbjct: 455 RVKAPQRALGQGQGEGEKAKERRMVFVGSIVGPAT-LLSAERVEETAAANGNGVNGSGAN 513
Query: 467 LRRSSSDALQDM--VNGEELSLYGSASNNTE----SAQKTFSFAVRDSLVNIGPLKDFSY 520
+ DA +M ++ LYG + ++ SA++ FA D++ GP+ D ++
Sbjct: 514 GHVENKDAGMEMDVDLDDDDDLYGPTTLTSQPSSGSAEEALRFAFCDAIPAHGPILDMAF 573
Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTVYHKSS-RG 560
L D ++TG + L + L G +GIW++ K S RG
Sbjct: 574 ALGKWGDRYVPELVASTGAEHLGGFTLFQRDLPIRTKRKLHVLGGARGIWSISVKQSPRG 633
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLF 618
A S+ + + ++IS +A V A T ++ + G T+ AG F
Sbjct: 634 SAASSAGAGPNPELANDTVVISTDANPSPGVSRIATRSTRTDLAIPTRIPGTTVGAGPFF 693
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
GR ++ V R+L+ D + S ++ + + SI DP VL+ D
Sbjct: 694 GRTAILHVMTNSIRVLE-----PDGTERQSIKDTDGNMPRAKIRWCSICDPVVLIIREDD 748
Query: 679 SIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKG-----PEPWLRKTSTDAWLS 732
++ L +G+P + + + + E S + ++ C G +P S+
Sbjct: 749 TLGLFIGEPERGRIRRKDMSPMGEKSSRYIAGCFFADTSGLFEAFMDPKAAAASSKGDKD 808
Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
G + + + + V+ G LEI+ +P VF+ + D+Y
Sbjct: 809 KGATQTMQSVVNAATNSQ--WLVLVRPQGVLEIWTLPKLTLVFSTTLIATLDNVCADSYD 866
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
AL Q + V + M + + P L L G +
Sbjct: 867 PAALS-------------LPQDPPRKPQELDVENIVMAQLGESNPTPHLMVFLRSGQVAI 913
Query: 853 YQAYLFEGPENTS 865
Y+ P + S
Sbjct: 914 YETVHHPPPPDPS 926
>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
Length = 1383
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 181/751 (24%), Positives = 305/751 (40%), Gaps = 137/751 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++IY + + + +N + V + A L L Y L+G V +
Sbjct: 28 NLIVVKTSLLQIYTLVAETSTTLILENDQQADDDVKNE---ATKLHLHAEYDLYGTVTDI 84
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ + S+ G D +++L+F +AK+S++E++ G+ S+H +E
Sbjct: 85 SPVKILKSRSGGD------ALLLSFRNAKLSLIEWNPETQGISTMSIHYYE--------- 129
Query: 173 GRESFARGPLVK----------VDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE---- 217
+E P V VDP RC +L +G++ I IL Q G LV DE
Sbjct: 130 -KEDITLSPWVPDLSQCDSHLTVDPSSRCA-LLNFGVRNIAILPFHQAGDDLVMDEYDPD 187
Query: 218 ------------------DTFGSGGGF--SARIESSHVINLRDLD--MKHVKDFIFVHGY 255
D+ + G +S V+ L LD + H F+H Y
Sbjct: 188 LDMDDLTDQEENKKPSHTDSKKAEGDLIHQTPYAASFVLPLTALDPTLIHPIGLTFLHEY 247
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ T A + + + S ++ + + S LP D ++A+P
Sbjct: 248 REPTFGILYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALP 307
Query: 316 SPIGGVLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
+P+GG L++G+N IH + A+A+N +A + + + +S + L+ + + N
Sbjct: 308 APVGGTLLIGSNEMIHIDQSGKASAVAVNEFAKQVSAFPMVDQSDLELRLEGSVVEVINN 367
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT--------IGNSLFF 424
+ LL+ TG+LVL+ DGR V + P+V D+ + +G+ F
Sbjct: 368 ESGDILLTLSTGELVLVHFKIDGRSVSGFVVFPI-PAVSGGDVVSAVASCAVALGSGKVF 426
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGE 482
+GS +S+L+ S S S + D E + + S A ++ VN
Sbjct: 427 IGSEDAESVLLDCYLPSAVSKKSRDYDRDHFDEEMNNEEDDDMYEDDLYSSAPKEAVN-- 484
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
+ G S+N ++F V D L+++GPL+ + G + D++A Q + + +
Sbjct: 485 KTVSNGRISDN-------YTFKVIDRLLSLGPLRAVAVGKPASRDSNAE--DAQQSVDDL 535
Query: 543 ELPGCKG-----------------------------IWTVYHKSSR-GHNADSSRMAAYD 572
EL G +W + +++ GHN DS
Sbjct: 536 ELAAAYGSGRGGGVALLQRTLHLDDVFTLGAESADSVWNITTSNTKSGHN-DSG------ 588
Query: 573 DEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQV 626
+E +Y+I++ T+V + E + D G TI L G RV+QV
Sbjct: 589 EENQSYVILTKANSPENEETLVYAVNERNLEPFNAPDVNPNGDPTIDIDVLAGNSRVVQV 648
Query: 627 FERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
RI D + M Q P E G E V S S AD Y+L+ D S+ LL
Sbjct: 649 LTGEVRIYDTNLGMAQ---IYPVWDED-EGDERFAV-SASFADHYLLIIRDDSSVLLLHS 703
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
D S + P + S +P LY D+
Sbjct: 704 DESGDLDELTKPETV--SSQPWLCGCLYTDR 732
>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
Length = 1212
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 150/591 (25%), Positives = 262/591 (44%), Gaps = 75/591 (12%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+RL G V + + Q A RD ++++F DAK++++E+DD L S+H FE
Sbjct: 22 HRLFGQVTGIQSV-QTLASQVDGRDRLLVSFRDAKLALMEWDDVYGDLNSISIHTFERAP 80
Query: 167 WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF 226
L + SF P + VDP RC +L+ + IL Q S L G +D +
Sbjct: 81 QL-VDGLPPSFV--PRLLVDPASRCAALLLPQDALAILPFVQEASEL-GADDPRDAALLD 136
Query: 227 SARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
A S +++ D +++V+D +F+ G+ +P++ +L+E ELTW G +S T +
Sbjct: 137 QAPYAPSFILSFSEDVDASIRNVRDCVFLPGFQKPMLAVLYEPELTWTGSLSRARLTTRV 196
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALAL 342
+++ T+ ++P+ ++ LP+D L+A P +GGVLVV + + + Q+A L++
Sbjct: 197 CFITLDLTVTKYPVTVTSEALPYDTLYLVACPDSLGGVLVVTPSALLHLDQTARLVGLSV 256
Query: 343 NNYAVSLDSSQELPRSSFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
+ + S LP ++ ++ +L ++ T+ + + LL + G ++ +GR V
Sbjct: 257 SRWTDFTSSELMLPNATATLGDCDLQSSVLTFTEANGGLLVLRDGRMLTFQCALEGRTVT 316
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
L L+ VL + G + F L + L++ + T + + L E +I A
Sbjct: 317 SLSLN----VVLVPERQ--GGASFV--QALPERLILCASFQDDTYLYAMNLLEAPTEIAA 368
Query: 460 D-APSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
P + L + + G+ + S + A V D L +GPL D
Sbjct: 369 STGPDQQSLEPDADVDADALDLYGDSFKPDVATSKQAQPA----GLDVLDVLPTLGPLND 424
Query: 518 FSYGLRINADASA---TGISKQSNYELVE----------LPGCKGIWTVYHKSSRGHNAD 564
+YG+ NA A + Q + ++E + IWTV N
Sbjct: 425 MTYGVVRNAHGKAHPHMVATMQHHLAVIEPRLRCDVVQNIAPAHAIWTV------SINGK 478
Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
+ A+D+E L+ SLE+ + T + +Q RTIA G+ + VI
Sbjct: 479 WLLLTAWDEE---CLVYSLESNS------------THFLSQHLQ-RTIACGS--TQAGVI 520
Query: 625 QVFERGARILD--GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
+V + A +LD G MT +F ++ + G SI D YV L
Sbjct: 521 RVTSKRAEVLDEHGRIMT---TFAECDANASYG-------DASIQDSYVAL 561
>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
10762]
Length = 1437
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 219/1037 (21%), Positives = 381/1037 (36%), Gaps = 207/1037 (19%)
Query: 52 IGP-VPNLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
IGP NLVV ++++++ V R+ + + + + R L L+ Y L
Sbjct: 22 IGPQADNLVVAKTSLLQVFEVKRISQAKDNGHHDHADAQSR----------LSLIGEYTL 71
Query: 110 HGNVESLAILS-----QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
G V +L+ ++ GGA +++ AF+DAK+S++E+D + + S+H +E
Sbjct: 72 SGTVTALSPITLPSSRTGGA-------ALVCAFKDAKLSLIEWDPEHYRISTISIHYYEG 124
Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------- 211
L G ++ VDP RC + Q+ IL Q G
Sbjct: 125 DNVLLPPFGAALSECESILTVDPGSRCAALKFGERQLAILPFRQQGDELADEAAEDADMA 184
Query: 212 --------GLVGDEDTFGSGGGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIE 257
G V + T + S +SS V+ L LD + H F+H Y E
Sbjct: 185 EAESEEQPGNVTLKRTSTTQALDSKDDITPYKSSFVLPLITLDPSLTHPVHLAFLHEYRE 244
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P IL + + + + ++ + + S LP D +K++A+P P
Sbjct: 245 PTFGILSAPQQPSLALLDERKDCLSYTVFTLDLEQRASTNLMSVSKLPSDLWKVIALPPP 304
Query: 318 IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
+GG L+VG N IH + A+A+N +A + S +++L+ L +
Sbjct: 305 VGGALLVGTNELIHIDQSGKTTAVAVNEFAKVASNFSMADHSDLNMKLEGCEIEMLDSST 364
Query: 377 --ALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSD----ITTIGNSLFFLGS 427
AL+ G L+ GR V L +S+ TN + + + ++ F+GS
Sbjct: 365 GNALIVLNDGSFATLSFKMLGRTVGGLTVSRVADTNGGNVNASAPSCVASMQQQKLFVGS 424
Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD---------------APSTKRLRRSSS 472
G S LV++ + T + G APS ++R++S
Sbjct: 425 EDGSSSLVRWAKDTPTLSRKRSHAQMLGQDAPMDDADDAEELDEDDLYAPSAVAVKRAAS 484
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RINAD--- 527
A+ A T++F + DSL ++ P+ + G R +
Sbjct: 485 ----------------VANAAAVDASTTYTFELEDSLNSLAPMNNVCLGRSPRTGKEKLE 528
Query: 528 -ASATGISKQS-----NYELV-------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
+ G K S N E++ ++ G K IW+V +S G S+ D
Sbjct: 529 LVAGIGRGKASSLAFMNREIIPNEIRSRDVAGAKDIWSVCARSREGDKVSSA------DT 582
Query: 575 YHAYLIISLEARTMVLETAD----LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
Y L + T + AD + E+ E+ D+ G T+ G L ++Q
Sbjct: 583 YDNLLFVFDGESTKTYKYADSAEGSIIELDET-DFEGDGETVCVGTLANGSCIVQCRRTE 641
Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENST---VLSVSIADPYVLLGMSDGSIRLLVGDP 687
R T D G S S E +++ S DPY+L+ D S+++L D
Sbjct: 642 IR-------TYDHQLGLSQIIPMSDDETDAELKIVATSFCDPYLLVIQDDSSVQILQVDK 694
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
+ +P+ + E LR+ WL+ + G L
Sbjct: 695 -------------QGDVEPLDAA--------ESDLREGK---WLTGSL-------YAGEL 723
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
G + + + G L++F +P V++ ++ L S+
Sbjct: 724 SDGQSAAFLLGQEGGLQVFSLPETKLVYSAPTL---------PFLPPVL---------SA 765
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
+ +G K + + VV+L + +RP+L ++ Y+ + +
Sbjct: 766 DAPQRRGGKVTLTEVLVVDLGAEGV----TRPYLIVRTAMDDLILYEPFHY--------- 812
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE----ETPHGAPCQRITIFKNISGH 923
S + + A+ +LRF + P + +T G P Q I G
Sbjct: 813 --------SATTLDARATGFTDLRFRKVPFTYLPKYDEGLDTADGRPAQLQPAV--IGGR 862
Query: 924 QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
+L G P + + L L + +F+ LH C GF V G LK QL
Sbjct: 863 NALYLPGGTPSFLVKEATSLPKVLGLRARGVRSFSPLHRAGCQQGFALVDGDGKLKEYQL 922
Query: 984 PSGSTYDNYWPVQKVVF 1000
P ++ W V+ +
Sbjct: 923 PGHVSFATGWSVRTLTL 939
>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
protein 1
gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
Length = 1401
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 167/723 (23%), Positives = 294/723 (40%), Gaps = 127/723 (17%)
Query: 57 NLVVTAANVIEIYV---VRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
NLVV +V++I+ V EG + S D + + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLNVSCSAEGEIIAAKSARP------DQLQSTKLILEREYSLSGTV 81
Query: 114 ESLA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
L + ++ G D +I+LAF +AK+S++E+D +G+ S+H +E +
Sbjct: 82 SDLCRVKLLKTKSGGD------AILLAFRNAKLSLVEWDPERYGISTISIHYYERDDITR 135
Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GD----------- 216
+ + G ++ VDP RC V +G++ + IL Q G LV GD
Sbjct: 136 SPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLVMGDYESDSQKQSHE 194
Query: 217 ---EDTFGS-----GGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
+D+ G+ G SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 195 HEMDDSAGNSKSKEGAVHQTPYASSFVLPLTALDSAILHPVSLAFLYEYREPTFGILYSQ 254
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + ++ + ++ S LP D +K++A+P P+GG L++G
Sbjct: 255 IATSNSLLHERKDAIFYTVFTLDLEQRASTMLLSVTRLPSDLFKVVALPPPVGGALLIGY 314
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + + +S ++ L+ L N LL+ +
Sbjct: 315 NELVHVDQAGKTNAVGVNEFSRQVSTFSMADQSELALRLEGCVVELLGNSSGDLLLALSS 374
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLV 435
G +VL+ DGR V + + + P +I ++G+ F GS +S+L+
Sbjct: 375 GTMVLVHFKLDGRSVSGISI-RPLPGHAGGNILKAAASASASLGSDKVFFGSEDAESVLL 433
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN-- 493
++ S + S + E IE D S D +D LY +A +
Sbjct: 434 GWSLSSSNARKS---RSESKRIEKDHEEGSDDSESEEDVYED-------DLYSAAPDTPA 483
Query: 494 -------TESAQKTFSFAVRDSLVNIGPLKDFSYG-------------------LRINAD 527
S ++ F V D L N PL+D + G L + A
Sbjct: 484 LGHRLSVAPSTFASYKFKVHDVLPNTAPLRDIALGQPAMPVEDTGSHLDNICSELELVAA 543
Query: 528 ASATG-----ISKQSNYELVE----LPGCKGIWT---VYHKSSRGHNADSSRMAAYDDEY 575
+ G + K+ +V+ + G+WT +++ + D + + +E+
Sbjct: 544 YGSNGNGGLVVMKRELEPVVKASLNVGPIHGVWTASIALGSAAKPMSGDQTNI----EEW 599
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFE 628
Y+I++ + +T+ E +++ ++ F +I G L R+RV+QV
Sbjct: 600 RQYVILT-KPQTIDKEESEVFIVDGLNLKPFKAPEFNPNNDISIQVGTLSNRKRVVQVLR 658
Query: 629 RGARILDGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
R D DL P E S+ LS S+ADPY+ + D ++ LL
Sbjct: 659 NEVRSYDS-----DLELAQIYPVWDE--DTSDERMALSASLADPYIAILRDDSTLLLLQA 711
Query: 686 DPS 688
D S
Sbjct: 712 DDS 714
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 54/121 (44%), Gaps = 16/121 (13%)
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG---------SRPCWCMVF 939
N R P D+ T + + + I +ISG+ F+ G SR C +
Sbjct: 861 NHVLPRIPPDSDTNISDKEPSNHRPLCILPDISGYSAVFMPGTSASFIFKTSRSC-PHIL 919
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVV 999
R R V L D FT + + GFIYV S+ +++ICQLP + YD W ++KV
Sbjct: 920 RLRGGVVRSLSD---FDFT---DPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVA 973
Query: 1000 F 1000
Sbjct: 974 I 974
>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
Length = 1452
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 225/989 (22%), Positives = 375/989 (37%), Gaps = 171/989 (17%)
Query: 91 VLMDGISAASLELVCHYRLHGNVESLAIL-----SQGGADNSRRRDSIILAFEDAKISVL 145
+L D L LV + G + LA L S GG ++++LA+ AK+ +
Sbjct: 94 LLRDRSQHTKLVLVAELPVAGTIIGLARLKLPHTSSGG-------EALLLAYRGAKMCLT 146
Query: 146 EFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMI 202
E++ L S+H +E E L+ G V + DP RC +
Sbjct: 147 EWNPRRAALETVSIHFYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNLA 203
Query: 203 ILKASQGG--------------------SGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
IL Q + L + D G G ++ S V+ L LD
Sbjct: 204 ILPFRQAEEDLEMEDWDEALDGPKPPKEASLATNGDANGDANGTQSQHSPSFVLRLPLLD 263
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ H F+H Y EP IL + T H T + L + + I S
Sbjct: 264 PTLLHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHLTYKVFTLDLKQ--RASTTILS 321
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA-----VSLDSSQE 354
LP D +++ +P+P+GG L+VGAN IH + +A+N A SL+ E
Sbjct: 322 VTGLPQDLSRVIPLPTPVGGALLVGANELIHIDQSGKANGVAVNPMARQMTSFSLNDQSE 381
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL 411
L ++ +E A +++ LL L +++ DGR V + L S+ N L
Sbjct: 382 L---NYRLEGCAIEPVSMESGELLLILNDASLAIVSFKIDGRTVSGISLVPVSQENGGNL 438
Query: 412 ----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S I+ IG S F+GS GDS+++ + + S +++ ++A+
Sbjct: 439 LKSHVSCISRIGKSSMFIGSEYGDSVVLGW-----SRKQSQEKRKKSRVLDAELALDVDD 493
Query: 468 RRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
D + G E + S + N + F ++DSL+ + P+ D + G
Sbjct: 494 IDLDDFDEDDDLYGTESTAAKPSLATNGVTKGGELIFRLQDSLLCLAPIHDVAPGKAVFP 553
Query: 522 -------LRINAD-----ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK- 556
LR A A G K N E+ E P +G WT+ K
Sbjct: 554 LDSEEVVLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWTMCVKK 613
Query: 557 ---SSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
+ G NA S + YD E + +I + ET+D+ +
Sbjct: 614 PLPKALGSNAVVS--SEYDSMELYDRFMIVAKVDLDGYETSDVYALTDAGFESLKDTEFE 671
Query: 607 -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
G T+ AG + + R+IQV + R DG ++Q L + +G+E V+S
Sbjct: 672 PAAGFTVMAGTMGKQMRIIQVLKSEVRCYDGDLGLSQILPM----MDEDTGAE-PRVVSA 726
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SIADPY+++ D SI + A I+S+ + + DKGP ++
Sbjct: 727 SIADPYLMVIRDDNSIFI---------------AKIDSNDE---LDEVEKDKGPLASIK- 767
Query: 725 TSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCY---ESGALEIFDVPNFNCVFTVDKF 780
W + + DG D+G ++ + +GAL I+D+ N +
Sbjct: 768 -----WQTGCLYADHDGHFQPKQPDEGSSPRILMFLMSTTGALHIYDLDNLS-------- 814
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L S S++ G KE + + V +L P+
Sbjct: 815 -------EPVYVAEGLT-STPPFLSANFTGRKAAAKETLTEILVADLG----DVVAKSPY 862
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
L + Y+ + P ++S S +L + S + + D
Sbjct: 863 LILRHDTDDLTLYEPVRYHEPNSSS-----APLSDTLFFKKSTNSTIAKSAPASDKEDDE 917
Query: 901 TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVL 960
T+++ P Q + N+ G+ FLSG P + + + + L + +
Sbjct: 918 TQQK--RFVPLQ---LCANVGGYSAVFLSGDSPSFILKSAKSIPRIVGLQGQGVQGMSTF 972
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTY 989
H C+ GFIY ++GI ++ QLP+ + Y
Sbjct: 973 HTEGCDRGFIYADTKGIARVSQLPTDTNY 1001
>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1444
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 224/998 (22%), Positives = 375/998 (37%), Gaps = 182/998 (18%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
GET +L D L LV + G V LA L ++ ++++LA+ AK+ +
Sbjct: 85 GET--LLLRDRAQNTKLVLVAEIPVAGTVIGLARLKLQNTESGG--EALLLAYRGAKMCL 140
Query: 145 LEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQM 201
E++ L S+H +E E L+ G V + DP RC +
Sbjct: 141 TEWNPQKAALDTVSIHYYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNL 197
Query: 202 IILKASQGGSGL-VGDEDTFGSG-----------GGFSARIESSH----VINLRDLD--M 243
IL Q L + D D G G ES H V+ L LD +
Sbjct: 198 AILPFRQAEEDLEMEDWDEALDGPKPAKEAALATNGDDHETESQHSPSFVLRLPLLDPTL 257
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
H F+H Y EP IL + T H T + L + + I S
Sbjct: 258 LHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHMTYKVFTLDLKQ--RASTTILSVTG 315
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
LP D +++ +P+P+GG L+VG N IH + +A+N A + S +S +
Sbjct: 316 LPQDLKRVIPLPTPVGGALLVGENELIHIDQSGKANGVAVNPMARQMTSFSLADQSELNY 375
Query: 363 ELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL----TS 413
L+ A +++ LL L +++ DGR V + L S+ N L S
Sbjct: 376 RLEGCAIEPISMESGELLLILNDASLAIISFKIDGRTVSGISLAAVSQENGGNLLKSRVS 435
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
I+ IG + F+GS GDS+++ + + S +++ ++ D L D
Sbjct: 436 CISRIGKASMFIGSESGDSVVLGW-----SRKQSQEKRKKSRALDTD------LALDVED 484
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLRIN 525
D E+ LYG+ S + +Q F ++D+L+ + P+ D + G +
Sbjct: 485 IDLDDDFDEDDDLYGTESAAAKPSQAGAGATKGGEPVFRLQDALLCLAPIHDVAPGKAVF 544
Query: 526 AD-----------------ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK 556
A A G K N E+ E P +G W + K
Sbjct: 545 PSDSEEAFLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWAMCVK 604
Query: 557 SSRGHNADSSRM--AAYD--DEYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
SS + + YD ++Y ++I++ + ET+D+ +
Sbjct: 605 KPVPKALGSSAVISSEYDSTEQYDRFMIVA-KVDLDGYETSDVYALTDAGFESLKDTEFE 663
Query: 607 -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
G T+ AG + + R++QV + R DG ++Q L + +G+E V+S
Sbjct: 664 PAAGFTVMAGTMGKQMRIVQVLKSEVRCYDGDLGLSQILPM----LDEDTGAE-PRVVSA 718
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SIADPY+++ D S+ + + + +E +K DKGP
Sbjct: 719 SIADPYLMIIRDDNSVFI---------AKIGSNDELEEVEK---------DKGP------ 754
Query: 725 TSTDAWLSTGVGEAIDGA--DGGPLDQGDIYSVVCYES--GALEIFDVPNFNCVFTVDKF 780
+ W + + DG P D +++ S GAL ++D+ N +
Sbjct: 755 LVSTKWQTGCLYTDYDGTFQAKKPDDNASPRTMMFLMSTAGALHMYDLDNLS-------- 806
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L S S++ G KE + + V +L PF
Sbjct: 807 -------EPVYVAEGLT-STPPFLSANFTGRKAAAKERLTEILVADLG----DVVSKSPF 854
Query: 841 LFAILTDGTILCYQAYLFEGPENTS---------KSDDPVSTSRSLSVSNVSASRLRNLR 891
L + Y+ ++ P ++S K + ++S S + + R
Sbjct: 855 LILRHDTDDLTLYEPVRYQEPNSSSPPLTDTLFFKKSANATIAKSASAFDKEEDETQQRR 914
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
F PL PC N+ G+ FLSG P + + + + L
Sbjct: 915 F--VPLQ-----------PC------GNVGGYSTVFLSGDSPSFVLKSAKSIPRIVGLQG 955
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
+ + H C+ GFIY ++GI ++CQLP+ + Y
Sbjct: 956 QGVQGMSTFHTAGCDRGFIYADTKGIARVCQLPTDTNY 993
>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 1342
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 215/975 (22%), Positives = 366/975 (37%), Gaps = 188/975 (19%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G V SLA DN+ D+II+AF DAK+S++E+D H + S+H
Sbjct: 46 LSLVAEYPLAGTVISLA--RTKPRDNASGGDAIIIAFRDAKLSLVEWDPENHRISTISLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD---- 216
+E + G ++ VDP RC + Q+ IL G L G+
Sbjct: 104 YYEGDNVITPPFGPTLAESESILTVDPSSRCAALKFGARQLAILPFRHFGDELAGEEEED 163
Query: 217 --------------EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
E T +G ++S V+ L LD + H F+H Y EP
Sbjct: 164 GFENEPMSAVSKRRESTHLNGEEEQTPYKASFVLPLTALDPTLSHTVHLAFLHEYREPTF 223
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
IL + + + ++ + + + LP +K+ +P PIGG
Sbjct: 224 GILSAPMEPSNALLEERKDVLTYTVYTLDLEQRASTNLITVPKLPSTLWKVKPLPLPIGG 283
Query: 321 VLVVGANTIHYHSQSASC-ALALNNYA-------VSLDSSQELPRSSFSVELDAAHATWL 372
L+VG N + + QS A A+N +A +S S L S+E + L
Sbjct: 284 ALLVGTNELVHVDQSGKANATAVNEFAKLESDFGMSDQSHLNLKLEDCSIETIDPKSGQL 343
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-------TNPSVLTSDITTIGNSLFFL 425
LL T G L ++ GR + ++++ T+ S S I + N F+
Sbjct: 344 -----LLVTSDGALAIIEFKLLGRSISAINVTPVTEDNGVTSLSAAPSCIANLANGSVFI 398
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD------IEADAPSTKRLRRSSSDALQDMV 479
GS G S L+ ++ + + G +A L ++ +A + V
Sbjct: 399 GSEDGASSLMGWSQPTAPLTRKRSHAQMLGKDGDEEDEDAIEEDDDDLYDAAPEAKKRAV 458
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD-----ASATGIS 534
+ EL S+ + F +RD L ++GP+ G + + A+ATG
Sbjct: 459 SDTELG----------SSNAAYQFEIRDHLQSLGPIHRMCVGRQGKSSDKLQLAAATG-R 507
Query: 535 KQS------NYELVELPGCKGIWTVYHKSSRGHNADSS---RMAAYDDEYHAYLIISLEA 585
KQS N ++V PG ++SR NA S+ R DE +L+
Sbjct: 508 KQSGRLTLLNRDVVPTPG---------RASRFENAKSAWAVRAHQAGDES------TLDN 552
Query: 586 RTMVLETADLLT-EVTESVDYFV--------------QGRTIAAGNLFGRRRVIQVFERG 630
+ V E A+ E++ + ++FV +G T+ L + ++Q ++
Sbjct: 553 KLFVFEGANTKAYEISSADEHFVEDRYPEHAKSEWESEGETLEVVALADGKIIVQFRKQE 612
Query: 631 ARILDGSY-MTQDLSFGPSNSESGSGSENS-TVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
R D + M Q L P E +EN ++ +++ DPYVL+ D SI++L
Sbjct: 613 VRTYDANLAMNQIL---PMEDE----AENELNIVHIAVCDPYVLVIRDDSSIQIL----- 660
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
SVQ + +P+ + +K WL+ + G L
Sbjct: 661 ----SVQG-----NELEPLEAEGSVAEK------------KWLTGSLY-------AGTLT 692
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI-VDTYMREALKDSETEINSSS 807
QG + G L F +P+ +F + I VD R A
Sbjct: 693 QGSAAVFLLNADGGLHAFALPDLQPLFAIPTLPHLPPVIAVDAAQRRA------------ 740
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
G +E + + V +L ++P+L ++ Y+ + + P+ + +
Sbjct: 741 ------GTRETLTEVLVSDLGQHGV----TQPYLVLRTAMDDVVLYEPFHY--PQTSGRK 788
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
S + L R R + FS P + + E+ P ++ I +
Sbjct: 789 ----SWHQDL--------RFRKVPFSHIPKYSESIAESQSARPPPLKSV--KIDTYSAIA 834
Query: 928 LSGSRPCWCM----VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
+ G+ PC + + L + + ++ V C +GF + + L+ QL
Sbjct: 835 IPGAPPCLLLKEPSTLPKVLEIRQSAELNRLSMLCPINRVGCENGFFMINADEELEEQQL 894
Query: 984 PSGSTYDNYWPVQKV 998
P + Y W V +V
Sbjct: 895 PLNTWYGTGWSVHQV 909
>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
Length = 1399
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 211/980 (21%), Positives = 361/980 (36%), Gaps = 143/980 (14%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GG 123
Y R ++ ES G V D + L LV L G V LA + GG
Sbjct: 65 YDHRANDDDGLESSFLGGESMLVRTDRTNLTKLVLVAELPLSGTVTGLAKVKTKHSKCGG 124
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPL 182
+++++A++ AK+ + +D L S+H +E E LH SF
Sbjct: 125 -------EALLIAYKAAKLCMAVWDPEKSNLETISIHYYEKEE-LHGAPWEVSFDEYTNY 176
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
++ DP RC + IL Q L D+ G + ES+ V N
Sbjct: 177 LEADPGSRCAAFQFGSRNLAILPFRQAEEDLEMDDWDEDLDGPRPVK-ESTTVANGDSDT 235
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
++ EP IL + H T + L + + I S
Sbjct: 236 LEPA----------EPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ--RASTTILSVT 283
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
+LP D +K++ +P+P+GG L++G N IH S +A+N+ A + S ++ +
Sbjct: 284 DLPRDLFKIIPLPAPVGGSLLIGENELIHVDQSGKSNGVAVNSMARQITSFSLTDQADLN 343
Query: 362 VELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSDI 415
+ L+ ++N LL G + ++T DGR V R+ + +++ S
Sbjct: 344 LRLEHCVIETLSIENGELLLVLNDGRIGIVTFQIDGRTVSGLTVRMVADENGGNLIKSRA 403
Query: 416 TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+T +G + +F+GS +GDS+++ +T G K D E
Sbjct: 404 STASKLGKNAYFVGSEVGDSVVLGWTRKMGQEKRR---KPRLIDAEIGLEMDDLDLEDED 460
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-LRINADASAT 531
D D+ E + + + N SF + D+L++I P+KD + G + + D+
Sbjct: 461 DEDDDLYGTESAAAKPAQALNGGGKTGELSFRIHDTLLSIAPIKDLTPGKVSFHPDSEEA 520
Query: 532 GISKQSNYEL----------------------------VELPGCKGIWTVYHK----SSR 559
+S+ +L E P + WT+ K +
Sbjct: 521 TLSQGVVSDLHLACVVGRGKAGSLAILNRNIQPKIIGRFEFPEARDFWTMSVKKPMPKAL 580
Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT------EVTESVDY-FVQGRTI 612
G N ++ Y+I++ + ET+D+ E + ++ G T+
Sbjct: 581 GGNVGMGNEYETFGQHDKYMIVA-KVDLDGYETSDVYALTGAGFETLKDTEFDPAAGFTV 639
Query: 613 AAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
AG + + R+IQV + R DG +TQ L + E+G+ V S SIADPY+
Sbjct: 640 EAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRVTSASIADPYL 694
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
LL D S+ L D + V+ A + K S C KG + ++D
Sbjct: 695 LLIRDDSSLMLAQIDSNNELEEVEKMDATLQNTKWHSGCLYADTKGA---FQPNASDKGA 751
Query: 732 STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHI-VD 789
T I + +GAL ++ +P+ + V+ + H+ D
Sbjct: 752 ET----------------EKIMMFLLSSTGALHVYALPDLSKPVYVAEGLCYVPPHLSAD 795
Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
+R L KEN+ + V +L P+L
Sbjct: 796 YTLRRGLA------------------KENLREILVADLG----DTTSQSPYLILRNQTDD 833
Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
+ Y+ P + S S +L+ S + L + D E P
Sbjct: 834 LTIYE------PLRHVRDGGETSLSATLTFKKTSNTTLATIPVETEQDDV----EQPRFV 883
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
P + NI+G+ FL G P + + + + L + + H C+ GF
Sbjct: 884 PLRPCA---NINGYSTVFLPGPSPSFVIKSSKSIPRVIGLQGLGVRGMSTFHTEGCDRGF 940
Query: 970 IYVTSQGILKICQLPSGSTY 989
IY +GI ++ QLP + +
Sbjct: 941 IYADDKGIARVTQLPPDTNF 960
>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
Length = 1394
Score = 117 bits (293), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 171/723 (23%), Positives = 290/723 (40%), Gaps = 78/723 (10%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV GD D G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + S ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ ++ S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-- 438
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
++ E D+ D T+ + + +L + S + F F
Sbjct: 439 QLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLNKDDFVF 498
Query: 504 AVRDSLVNIGPLKDFSYGL-----RINADASATGISKQSNYELVELPGCKGIWTVYHKSS 558
D L N+GP+ D + G N S++ S + + G G V +
Sbjct: 499 RSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQREL 558
Query: 559 RGHNADSSRMAAYDD-------------------EYHAYLIISL-----EARTMVLETAD 594
+ DS +M D Y YL+ S + +++V
Sbjct: 559 DPYVIDSMKMDNVDGVWSIQVGAPDSTNTRTSSRNYDKYLVFSKSTEPGKEQSVVYSVGG 618
Query: 595 LLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
E ++ ++ + T+ G L G RV+QV + R D + + P E
Sbjct: 619 SGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY--PIWDE-- 674
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S+ +V+S S A+PYVL+ D S+ LL D S V I SS + +S C LY
Sbjct: 675 DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHRWLSGC-LY 732
Query: 714 HDK 716
DK
Sbjct: 733 LDK 735
Score = 46.6 bits (109), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 19/101 (18%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-- 974
+ +I G++ F+SGS PC+ M +L ++ + + H C GF YV +
Sbjct: 866 YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASV 925
Query: 975 -----------------QGILKICQLPSGSTYDNYWPVQKV 998
Q ++++C+LP + +DN W +KV
Sbjct: 926 CVPKQYFVPWNKLILVIQNMVRMCRLPGNTRFDNSWVTRKV 966
>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
Silveira]
Length = 1387
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 177/726 (24%), Positives = 289/726 (39%), Gaps = 84/726 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV GD D G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + S ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ + S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437
Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
S L K E D + D VN LS S +N +
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL-----RINADASATGISKQSNYELVELPGCKGIWTVYH 555
F F D L N+GP+ D + G N S++ S + + G G V
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 556 KSSRGHNADSSRMAAYDD-------------------EYHAYLIISL-----EARTMVLE 591
+ + DS +M D Y YL+ S + +++V
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGAPDSTNTRTSSRNYDKYLVFSKSTEPGKEQSVVYS 615
Query: 592 TADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
E ++ ++ + T+ G L G RV+QV + R D + + P
Sbjct: 616 VGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY--PIWD 673
Query: 651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSC 710
E S+ +V+S S A+PYVL+ D S+ LL D S V I SS + +S C
Sbjct: 674 E--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHRWLSGC 730
Query: 711 TLYHDK 716
LY DK
Sbjct: 731 -LYLDK 735
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 44/82 (53%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ +I G++ F+SGS PC+ M +L ++ + + H C GF YV +
Sbjct: 878 YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASN 937
Query: 977 ILKICQLPSGSTYDNYWPVQKV 998
++++C+LP + +DN W +KV
Sbjct: 938 MVRMCRLPGNTRFDNSWVTRKV 959
>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 820
Score = 116 bits (290), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 74/247 (29%), Positives = 119/247 (48%), Gaps = 16/247 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 162 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 217
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 218 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLVYEAF----PHDSQLGQGNLK 267
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ R + R S+ + E R F++I G+ G F+ G
Sbjct: 268 VRFKKVPHNINF-REKKPRPSKKKAEGGAEEGPGARGRVARFRYFEDIYGYSGVFICGPS 326
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 327 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 386
Query: 992 YWPVQKV 998
WPV+K+
Sbjct: 387 PWPVRKI 393
>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1392
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 170/731 (23%), Positives = 296/731 (40%), Gaps = 151/731 (20%)
Query: 57 NLVVTAANVIEIY--VVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRL 109
NLVV ++++++ V V + KE S S + + +++++ Y L
Sbjct: 28 NLVVVRTSLLQVFSLVKIVSSQPQKEVPEPLSSQSSQPETKLVLEK----------EYPL 77
Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
G V L S+ N+R ++I++A +AK+S++E+D G+ S+H +E +
Sbjct: 78 SGTVTDL---SRVKILNTRSGGEAILIAVRNAKLSLIEWDPERRGISTISIHYYERDDLT 134
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDED 218
+ G ++ VDP RC V +G++ + IL Q G LV G+
Sbjct: 135 RSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELDGERP 193
Query: 219 TFGSGGGFSARIE-------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
+ SGGG A+IE SS V+ L LD + H F++ Y EP IL
Sbjct: 194 SQNSGGG--AQIEKRKEEPDHQTPYSSSFVLPLTALDPSLLHPISLAFLYEYREPTFGIL 251
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
+ + T + + + ++ + + S LP D +K++A+P P+GG L+
Sbjct: 252 YSQVATSTALLHERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALL 311
Query: 324 VGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
+G+N I H + A+ +N ++ + S +S + L+ L D LL+
Sbjct: 312 LGSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLA 371
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDS 432
+G++ L+ DGR V + + P+ DI T +G+ F+GS DS
Sbjct: 372 LASGNMALIKFKLDGRSVSGITVHSL-PAYAGGDILKSAASCSTCLGDGNVFIGSEDADS 430
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---RSSSDALQDM----VNGEELS 485
+L++++ S ST++ R + ++D L D+ E+
Sbjct: 431 VLLEWSHTSA--------------------STRKARLESKQTADGLDDLSDEDDQMEDDD 470
Query: 486 LYGSASNNTE---------SAQKTFSFAVRDSLVNIGPLKDFSYGL---RINADASATGI 533
LY SA + S + ++F + D L +IGPL+D + G N + AT
Sbjct: 471 LYSSAPGPIQVDNRMGTDSSTPEFYNFRLNDKLSSIGPLRDITLGKAFSNTNRKSQATTG 530
Query: 534 SKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADSSR 567
+ + ELV G +W+ RG
Sbjct: 531 TVAAELELVASQGSDRGGGLVVIKREIDPLTTMSLKVDDADAVWSASVTKRRG------- 583
Query: 568 MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGR 620
++ D+ Y++IS + E ++ +S+ F + T+ G+L G
Sbjct: 584 ASSTDNPSCQYVVISRSTDSE-QEVNEVFIVEEQSLKPFKAPEFNPNEDCTVDIGSLAGN 642
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSE---SGSGSENSTVLSVSIADPYVLLGMSD 677
R++QV R SY D+ G S S+ S S DPY+++ D
Sbjct: 643 TRLVQVLRNEVR----SY---DIDLGLSQIYPVWDEDTSDERVAASASFIDPYLVIIRDD 695
Query: 678 GSIRLLVGDPS 688
S+ LL D S
Sbjct: 696 SSVLLLQADES 706
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 8/90 (8%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLCDGSIVAFTVLHNVNC--NHG 968
+ I NISG F+ G+ + VFR + P +L G + +V+ ++G
Sbjct: 877 LRILPNISGFSTIFMPGASSSF--VFRTA-KSSPHIIRLRGGFTRWLSSFDSVDTGRDNG 933
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
FIYV SQ ++ CQLPS + +D W ++KV
Sbjct: 934 FIYVDSQNCVRACQLPSQTQFDYPWTLRKV 963
>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum Pd1]
gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum PHI26]
Length = 1271
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 174/755 (23%), Positives = 301/755 (39%), Gaps = 147/755 (19%)
Query: 57 NLVVTAANVIEIYVV------RVQEEGSK---ESKNSGETKRRVLMDGISAASLELVCHY 107
NL+V ++++I+ + ++Q+EGS+ + ETK L L Y
Sbjct: 28 NLIVIRTSLLQIFSLVKIVSSQLQKEGSEPHGSQFSQPETK------------LVLEKEY 75
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
L G V L+ + +N ++I++A +AK+S++E+D HG+ S+H +E +
Sbjct: 76 PLSGTVTDLSRVKI--LNNKSGGEAILIAVRNAKLSLIEWDPERHGISTISIHYYERDDL 133
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDE 217
+ G ++ VDP RC V +G++ + IL Q G LV G+
Sbjct: 134 TRSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELEGER 192
Query: 218 DTFGSGGGFSARIE-----------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
SGGG + SS V+ L LD + H F++ Y EP IL
Sbjct: 193 PIQNSGGGAEPKKSKEGPAYQTPYCSSFVLPLTALDPSLLHPISLAFLYEYREPTFGILF 252
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
+ T + + + ++ + + S LP D +K++A+P P+GG L++
Sbjct: 253 SQVATSTALLYERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALLL 312
Query: 325 GANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G+N I H + A+ +N ++ + S +S + L+ L D LL+
Sbjct: 313 GSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLAL 372
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSL 433
+GD+ L+ DGR V + + P+ D+ + +G+ F+GS DS+
Sbjct: 373 ASGDMALIKFKLDGRSVSGITIHLL-PAHAGGDMLKSAASCSSCLGDGNVFIGSEDADSV 431
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEELSL 486
L++++ S STK+ R S D E+ L
Sbjct: 432 LLEWSRSSA--------------------STKKARLESKQTADGFDDLEDDDDQMEDDDL 471
Query: 487 YGSASNNTESAQKT---------FSFAVRDSLVNIGPLKDFSYGLRIN---ADASATGIS 534
Y SA +T+ + ++F ++D L +IGPL+D + G + + AT +
Sbjct: 472 YSSAPGSTQVDNRMGTENLTTEFYNFRLKDCLPSIGPLRDITLGKVFSNTYREKQATCEA 531
Query: 535 KQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADSSRM 568
+ ELV G G+W+ K RG
Sbjct: 532 VSAELELVASQGSDRGGGLVVIKREIDPLTTMSLKIDDADGVWSASVKKRRG-------A 584
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGRR 621
++ D+ Y+++S + E ++ +++ F + T+ G+ G
Sbjct: 585 SSTDNPSRQYVVVSRSTDSEQ-ELNEVFVAEEQNLKPFRAPEFNPNEDCTVDIGSFAGDT 643
Query: 622 RVIQVFERGARILDGSYMTQDLS-FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
R++QV R D M LS P E S+ +S S DPY+++ D S+
Sbjct: 644 RLVQVLRNEVRSYD---MELGLSQIYPVWDE--DTSDERVAVSASFIDPYLMIIRDDSSV 698
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
LL D + V I SS+ S LY+D
Sbjct: 699 LLLQADENGDLDEVPLSTLIISSR--WRSGCLYYD 731
>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1376
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 186/745 (24%), Positives = 298/745 (40%), Gaps = 123/745 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V +V++++ + G+ S ++ + R ++ L L+ Y L G V L
Sbjct: 28 NLIVAKTSVLQVFSLVNVAYGASTSPSTDDKTR---VERQQYTRLVLLAEYDLPGTVTGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E LH
Sbjct: 85 GRVKT--LDSKSGGEALLVATRNAKLSLVEWDHERHGISTVSIHYYER-EDLHNSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD---EDTFG---------- 221
P L+ VDP RC +L +G+ + IL Q G LV D ED G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDDFDEDLRGEKPEDMDNAL 200
Query: 222 ---SGGGFSAR----IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ AR SS V+ L LD + H F++ Y EP IL+ T
Sbjct: 201 VESTAANDVARHKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHVATSFA 260
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHY 331
+ + + ++ + + + LP D + ++ +P PIGG L++G+N IH
Sbjct: 261 LLGERKDVVSYAVFTLDIQQRTSTTLVTVSRLPSDLWNVVPLPPPIGGSLLIGSNELIHV 320
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL---QNDVALLSTKTGDLVL 388
+ A+ +N +A +S + L+ L D+AL+ +G + +
Sbjct: 321 DQAGKTNAVGVNEFARQASEFSMADQSDLELRLEGCVIEQLGTESGDIALV-LASGRMAI 379
Query: 389 LTVVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS 441
+ DGR V +L ++ S+L + + ++G FLGS DS+LV +T S
Sbjct: 380 VRFKVDGRSVSGIFVQLVSTQAGGSILKARPSCSASLGRGKIFLGSEETDSVLVGWTRPS 439
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGSASNNT 494
S KRL+R SS D D + E LY + +N T
Sbjct: 440 Q--------------------SIKRLKRDSSGPRAGETDTDDDEDDIYEDDLYSTPTNQT 479
Query: 495 ESAQKT----------FSFAVRDSLVNIGPLKDFSYGLRINA-DASATGISKQS-NYELV 542
+ F F D L ++GP+KD + G D ++ SK S + ELV
Sbjct: 480 TVPKTVSQTNGLIKDEFVFRCHDRLWSLGPMKDITLGRTPGTRDQASKKTSKPSTDLELV 539
Query: 543 EL--PGCKGIWTVYHKSSRGHNADSSRMAAYDD-------------------EYHAYLII 581
G G T+ K + DS +M D Y YL+
Sbjct: 540 VTHGQGDAGGLTILRKELDPYIIDSMKMDNVDGVWSVQIAPSNTSNPSTTSRNYDKYLVF 599
Query: 582 SLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARIL 634
S ++R E + + T +D + T+ G L G RV+QV R
Sbjct: 600 S-KSRGHAKEQSVVYTVGGNGIDEMKAPEFNPNEDHTVDIGTLAGGTRVVQVLTSEVRSY 658
Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
D DL+ P E S+ +V S A+PY+L+ D S+ LL D S
Sbjct: 659 D-----TDLALAQIYPVWDE--DTSDELSVTGASFAEPYLLITRDDQSLLLLQPDSSGDL 711
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
V + +S K + C LY DK
Sbjct: 712 DEVNIDGLL-TSNKWLCGC-LYFDK 734
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 46/92 (50%), Gaps = 12/92 (13%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVFRE------RLRVHPQLCDGSIVAFTVLHNVNCN 966
R+ ++ G++ F+ GS PC+ M RL+ P + + + H C
Sbjct: 876 RLRAIPDLCGYKTMFMPGSNPCFIMKSSTSSPHVLRLKGEP------VSSLSSFHMPACE 929
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
GF YV ++ ++++C+LP + +DN W +K+
Sbjct: 930 KGFAYVDAKNMVRMCRLPGNTRFDNAWAARKI 961
>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
Length = 1387
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 172/731 (23%), Positives = 294/731 (40%), Gaps = 94/731 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSAPPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++++++ +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVSTRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE-----DTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV DE D G
Sbjct: 142 LRLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDEDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QAAVANDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + + ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LHDRKDIVSYAVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ ++ S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-Q 439
Query: 444 SMLSSGLK---EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
SM ++ + FG + D VN LS S +N +
Sbjct: 440 SMRKPKVESADDVFG--DHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
F F D L N+GP+ D + G R +AD + N
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
+ +++ G+W++ + DS+ Y YL+ S + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
++V E ++ ++ + T+ G L G RV+QV + R D + +
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
P E S+ +V+S S A+PYVL+ D S+ LL D S V I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725
Query: 706 PVSSCTLYHDK 716
+S C LY DK
Sbjct: 726 WLSGC-LYLDK 735
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 8/108 (7%)
Query: 891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
RF +P AY PH + + + +I G++ F+SGS PC+ M +L
Sbjct: 860 RFDPSP-KAYM----PHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLR 911
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
++ + + H C GF YV + ++++C+LPS + +DN W +KV
Sbjct: 912 GEAVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKV 959
>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex
subunit cft1 cpsf subunit, partial [Rhipicephalus
pulchellus]
Length = 726
Score = 113 bits (283), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 83/257 (32%), Positives = 117/257 (45%), Gaps = 46/257 (17%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGTGQG 814
V E+G LEI+ +P + F V F G+ +VD+ A +++E ++ S E
Sbjct: 73 VARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHESMPV- 131
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
+H + VV L ++ HSRP L A + D +L Y+A+ F T
Sbjct: 132 ----VHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF------------YETQ 169
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIFKNISG 922
R + LRF + D + RE + P ++ + F +ISG
Sbjct: 170 REGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPFSDISG 220
Query: 923 HQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G FL G RP W M R LR HP DG I F HNVNC GF++ QG L+I
Sbjct: 221 YSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRIS 280
Query: 982 QLPSGSTYDNYWPVQKV 998
LP+ TYD WPV+KV
Sbjct: 281 TLPTHLTYDAPWPVRKV 297
>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
Length = 1458
Score = 112 bits (281), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 200/917 (21%), Positives = 360/917 (39%), Gaps = 164/917 (17%)
Query: 57 NLVVTAANVIEIYVVRVQE-------EGSKESKN---------SGETKRRVLMDGISAAS 100
NLVV +N++ I+ VR + E +E K GE DG S
Sbjct: 40 NLVVARSNLLRIFEVREEPCAVPHGVEDERERKGGIRRGTEAVEGELAMDAQGDGFINVS 99
Query: 101 ----------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
L LV ++LHG V L+ + + A + D ++++F+DAKI++
Sbjct: 100 KGMAMKSDVEHPKTTRLYLVREHKLHGMVTGLSGV-RIIASLEDKLDRLLVSFKDAKIAL 158
Query: 145 LEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVK----VDPQGRCGGVLVYGL 199
LE+ D++H L S+H +E +P+ L PL K VDPQ RC + +
Sbjct: 159 LEWSDAVHDLVPVSIHTYERAPQLTSLT--------APLFKSQLRVDPQSRCAALGLPNH 210
Query: 200 QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYI 256
+ IL F S +++L + ++++V DF F+ G+
Sbjct: 211 ALAILP--------------FLDDAVSDVPYSPSFILDLAVSVNPNIRNVADFCFLPGFN 256
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+P + ++ E TW GR+ T + ++ +P+I S LP D+ L VP+
Sbjct: 257 KPTLAVMFEPLQTWMGRIGEYKDTVKLVIFTLDIKTSSYPIITSVDGLPMDSLGL--VPA 314
Query: 317 PIGGVLVVGANTIHYHSQSAS--CALALNNYA--VSLDSSQELPRSSFSVELDAAHATWL 372
GGV++ N++ Y QS+S A+ +N +A ++ LP ++ L+ + +
Sbjct: 315 -FGGVVITTPNSLIYIDQSSSRQIAVPVNGWASRITDLPLLPLPSPDLNLTLEGSKTVVV 373
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGS 427
+ G + + V+ DG+ V +L + K T PSV+ S
Sbjct: 374 DEKTLFVILANGIIYPIEVMADGKTVTKLQVGKPLAQATIPSVVES-------------- 419
Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR-----RSSSDALQDMVNGE 482
LGD L + +L + EE D E + + K + D + +
Sbjct: 420 -LGDGHLFVGSTVGVGVVLKTAWVEEEVDDEEEGTNAKVVEDDIDMDLYDDDDDLYGDSK 478
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG---- 532
+ + +T+ + ++RD+L GP+ ++ L D +ATG
Sbjct: 479 NKTQVTAEVKDTKKYRSVLHLSLRDTLPAYGPISSLTFSLATEGDKPVPELVTATGSGIL 538
Query: 533 ---------ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----- 578
+ ++ +++ + G +G+W++ + S SS A + HA
Sbjct: 539 GGFTLFQRDLPTRTKKKILAVGGTRGLWSLPIRQSVKKGGSSSSTTAIE---HAKTERDT 595
Query: 579 LIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
LI+S +A R TEV ++ V G T+ A F R ++ V
Sbjct: 596 LILSTDATPSPGVSRIATRAPPGGKTEV--NITTRVPGTTVGAAPFFQRTAILVVMTNSI 653
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSE------NSTVLSVSIADPYVLLGMSDGSIRLLVG 685
++L+ P +E + + + S SI DP+VL+ D S+ L +G
Sbjct: 654 KVLE-----------PDGTERQTIQDMDGKLLRPKIRSCSICDPFVLIIREDDSLGLFIG 702
Query: 686 DPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGA 742
+ + + + + E + K ++ C G +TS +T + + G+
Sbjct: 703 ETERGKIRRKDMSPMGEKTSKYLAGCFFTDTSGLFGQQFETSVPVEGATATLQNVVSGGS 762
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
G Q + ++ G +EI+ +P F+V S +VD++ + AL S
Sbjct: 763 TSGGKPQHTQWLLLVRPQGVMEIWTLPKLTLAFSVSAVPSLFNVLVDSHDKPAL--SVPN 820
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
+ G+ E + +V E R LF L +G + Y+A P
Sbjct: 821 PGDPPQRKPGEFDVEQVCVSRVGE-------DGRGRVCLFVFLRNGQLTIYEAL----PL 869
Query: 863 NTSKSDDPVSTSRSLSV 879
+T+ S S ++ V
Sbjct: 870 STTASQPAASVDGAMDV 886
>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
Length = 1287
Score = 112 bits (281), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 129/613 (21%), Positives = 261/613 (42%), Gaps = 62/613 (10%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
I + + +L+ ++ N + + I S D+ + +I+ + AK+S++ FD ++ ++
Sbjct: 42 IDSKNDKLILNHEFKLNGKIIGIKSIKLPDSQYDQLAILTSL--AKLSIVSFDHDLNTIQ 99
Query: 156 ITSMHCFESPEWLH-LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +ES + + + ES +K+DP + ++VY + L Q ++
Sbjct: 100 TNSLHYYESEFYTKSISKINES-----QLKIDPNNQTS-LVVYNDLLAFLPFKQDDDEII 153
Query: 215 GDEDTFGSGGGFSARIESSH--VI---NLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
D+ S IE H +I N + + ++ D F+H Y +P + ILH +E T
Sbjct: 154 DDDHHTQSNDQQQQNIELFHNSIILPANKLESTVSNIIDCDFLHSYRDPTLAILHNKEQT 213
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-T 328
WA +S K T LS+ I NLP+D + + +P PI G L++G N
Sbjct: 214 WASDLSIKKDTVNFVVLSLDLLNDSSTAILLVENLPYDLWFVKPLPDPINGTLLIGCNEI 273
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
IH + + + LN Y + + +S ++ L+ + L + L+ + G+
Sbjct: 274 IHIDNSGNTKGIGLNKYYQDITDFKLKDQSDLNIFLEHSKVEILNDKNILIIDQFGESYN 333
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L DG+ V+ L ++K + IT I F+G + DS+L+++
Sbjct: 334 LQFFIDGKSVKDLLITKFEKDLQIRSPISITNIDEQNIFIGCQSSDSILIKY-------- 385
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
LK+E + + P+ + D + N F+ +
Sbjct: 386 --EKLKQETNEAKPTTPAATKTNNDDDDEDLYEDEDLNNNNDDELIN--------FNLQI 435
Query: 506 RDSLVNIGPLKDFSYGLRINADASATGIS--KQSNYELVEL--PGCKGIWTVYHKSSRGH 561
+D L N GPL F+ G +IN ++ G++ Q++ +V G +G T++++S +
Sbjct: 436 KDKLFNAGPLSSFTLG-KINPNSLIQGLTNPNQNDVSIVGTSGEGKQGKLTLFNQSIQPK 494
Query: 562 NADSSRMAAYDDEY---HAYLIIS-LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
S + + + + YLI + L+ + + + +S D+ TI +
Sbjct: 495 IHSSLKFNNINKTWNILNKYLITTDLQNFKSEIFLINENFKNFQSFDFKNNNITINIDTI 554
Query: 618 FGRRRVIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
++R++Q+ + D ++ + + F +++ I DP++++
Sbjct: 555 QSQKRILQITSNNVYLFDLNFKKLLQINFDF--------------EIINGKIFDPFIIIT 600
Query: 675 MSDGSIRLLVGDP 687
S G +++ DP
Sbjct: 601 SSKGEVKIFEMDP 613
>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
Length = 1153
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 165/763 (21%), Positives = 300/763 (39%), Gaps = 135/763 (17%)
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDS 351
K I S LP+D ++++ + P+GG L+VG N IH + +A+N +A
Sbjct: 10 KASTTILSVGGLPYDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTG 69
Query: 352 SQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP- 408
L ++ + L+ L +N L+ +GD+ +L+ DGR V L + + +
Sbjct: 70 FSLLDQADLDLRLEGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAE 129
Query: 409 ---SVLT---SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
++LT S ++++G F+GS + DS+++ + SG + + E D
Sbjct: 130 LGGAILTGAASCVSSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVD-- 187
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
+ D + G+ ++ + +N T S KT ++F + DS+VNI P+ + ++
Sbjct: 188 -EAMFDEEDLEDDDDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAF 246
Query: 521 G---LRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
G L + D QS +LV +LP +GIW
Sbjct: 247 GEAALSLGKDEELKSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIW 306
Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
T+ K + +G + + D +Y +I+S +A + E+A D
Sbjct: 307 TMSAKRPAPKGLQVNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGF 366
Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E ++ G TI AG L RV+Q+ + R DG + Q L + E+G+
Sbjct: 367 EALTGTEFEPAAGSTIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA- 423
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
++S S ADP++LL D SI + D ++ I S K ++ C LY D
Sbjct: 424 --EPKIISASFADPFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD 480
Query: 716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
+D+ S GE ++ + GAL I+ +P+ +
Sbjct: 481 ------YSGAFSDSK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPV 520
Query: 776 TVDK---FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V + FV + A +++ TEI L
Sbjct: 521 YVAEGICFVPPVLSADYAARKSAARETLTEI-----------------------LVANLG 557
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ P+L ++ + Y+ + + S S L S + +++N
Sbjct: 558 DSVSQSPYLILRPSNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHL 605
Query: 893 SRTPLDAYTREETPHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
++ P + EE GA P + I+ N+ G+ F+ G P + + +
Sbjct: 606 TQAP--DVSAEEQVDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKV 660
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
L + + + H C+ GFIY +++GI ++ Q P +T+
Sbjct: 661 LSLQGTGVRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTF 703
>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. neoformans JEC21]
gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative
[Cryptococcus neoformans var. neoformans JEC21]
Length = 1431
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
NLVV A V+ ++ +R + E K ++ E ++ V M+ +
Sbjct: 48 NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107
Query: 97 --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
+ L L+ + L+G + LA ++ D +I++F+DAK+++LE+ S +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +E ++ +S+ PL++ DP R + + + +L Q S L
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
D G A S V++L D+ + K+++D +F+ G+ P + +L TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277
Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
R+ + K C+ I +S+ +PL+ S LP D+ L+A PS +GG+++V + I
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336
Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
+ Q +A+C A + SL S + S + L+ + ++ LL + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394
Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ +GR V++ LD P SD+T G+ F+GS GDS L +
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
S K+E +++ D + L +DA D E L+G A+ +
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495
Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
+ D L +G + D +G+ IN I+K+ +
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
EL +G+W + G + + A +++S E L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
+ G+T++A F R +++V +LD + + Q + G G +
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
+++ SI+DP+V++ +D S+ VGD TV+ + P E + ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712
>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 2479]
Length = 1358
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 228/968 (23%), Positives = 377/968 (38%), Gaps = 195/968 (20%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---------G 95
E+P + +G NLVV + ++ +R EE + + ++ MD
Sbjct: 39 EVPDVKVVG---NLVVAGGQDLRVFEIR--EESTPLPDDESAVPKQEDMDVGDSFFDSAP 93
Query: 96 ISAAS--------LELVCHYRLHGNVESLAILSQGGADNSRR-RDSIILAFEDAKISVLE 146
I A L L+ + LHG V LA L D+S D ++++FE AK S +
Sbjct: 94 IERAPVRYKTTRRLHLLTRHTLHGVVTGLAGLRT--IDSSVDGLDRLLVSFEHAKWSRGD 151
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+ S+H +E + + + + + P+++ DP R + + + +L
Sbjct: 152 -------IATVSLHTYERCQQM-INGNFQGYV--PMLRSDPLSRLAILTLPEDALAVLPI 201
Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
Q S L +D+ S ++K++KDF+F+ G+ P + +L
Sbjct: 202 VQEQSELDAMQDSVSSP------------------EIKNIKDFLFLPGFHSPTIALLFAP 243
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
TWAGR T + +I T+ +PLI S LP D+ L+A PS +GGV+VV
Sbjct: 244 MNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITSVTGLPSDSQYLVACPSEVGGVVVVT 303
Query: 326 ANTIHYHSQSAS-CALALN---NYAVSL--DSSQELPRSSFSVELDAAHATWLQNDVALL 379
A+ I + QS + ++N NY ++ DSS E S + LD +HA ++ + LL
Sbjct: 304 ASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE----SQKLALDNSHAQFVTENDMLL 359
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-SDITTIGNSLFFLGSRLGDSLLVQFT 438
+TG++ + DGR V + + + + +V S + G+ F+GS GDSLL
Sbjct: 360 VLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPSTLVPAGSDGIFVGSVEGDSLLAMVE 419
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE---ELSLYGSASNNTE 495
S +EE P TK+ D +++ G + S
Sbjct: 420 KARDQSA-----QEE--------PETKQQEMDVDDWDEEVATGPVTVSVKAQDVLSGIGR 466
Query: 496 SAQKTFSFAVRD-------SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
A F AV D LV IG S G +N I+K+ +E +L
Sbjct: 467 IADMEFGIAVTDLGTRTYPQLVCIG---GGSQGSTMNVFRRGIPITKRRLFE--QLRTAV 521
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
W + + + NA + ++ + + E T + + +V E + F +
Sbjct: 522 ATWFLPVERA---NAPKFKDIPESEQSTIAIAATQEGSTQIFALS--TRKVQERIAEFPE 576
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES-GSGSENS---TVLSV 664
IA G R R++ V +LD SN+ G+ E S +++
Sbjct: 577 P-AIATGTWLRRTRIVLVLPSQVLLLD------------SNANPVGTICEMSDAPPIVAA 623
Query: 665 SIADPYVLLGMSDGSIRLLVGD-----------PSTCTVSVQTPAAI------------- 700
SIADPYVL+ +DGS+ + VGD P + V A +
Sbjct: 624 SIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMPEGLALPVCQAAEVFTDTTGIYRTFEA 683
Query: 701 -----ESSKKPVSS----CTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
E KPV + H G E R + +S V + +G
Sbjct: 684 TQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQDSKPAISADVATTESAFNAA---RGT 739
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
+ + +SG L+I +P+F+ V + + D+ + D +T EEG
Sbjct: 740 QWIALLAQSGELQIRSLPDFDLVLQSNG-------VYDS--EPSFTDDQTGELPELEEG- 789
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ + M + + RP + + G + Y+A P T + D
Sbjct: 790 -----DEVSQMLFCPIGTRTL-----RPHVIVLHRSGRLNIYEAQ----PRFTVDARD-- 833
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLS 929
+ RSL+V R R + T L + T T P P F +I G G F++
Sbjct: 834 QSRRSLAV------RFRKV---HTQLLSVTPSSTVKPAAIP------FTDIEGLTGAFIT 878
Query: 930 GSRPCWCM 937
G RP W +
Sbjct: 879 GERPHWII 886
>gi|356527660|ref|XP_003532426.1| PREDICTED: disease resistance response protein 206-like [Glycine
max]
Length = 281
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 76/106 (71%), Gaps = 8/106 (7%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
MSFAAYKMM PTGI NC GF+THSR+D+VP +Q +++D E PS+ +G +PNL
Sbjct: 1 MSFAAYKMMQCPTGIDNCAVGFLTHSRSDFVP----LQPDDIDVEWPSRPCHHVGSLPNL 56
Query: 59 VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELV 104
+VT ANV+E+Y VR+QE+ S K + +++ L+DGI ASLELV
Sbjct: 57 IVTVANVLEVYAVRLQEDQSP--KAAIDSRSDTLLDGIVGASLELV 100
>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. grubii H99]
Length = 1431
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 160/726 (22%), Positives = 301/726 (41%), Gaps = 107/726 (14%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
+ P + IG NLVV A V+ ++ +R + E +K ++ E ++ V M+ +
Sbjct: 39 DTPDVKVIG---NLVVAGAEVLRVFEIREESVPIIEKAKLEEDVAEGEKDVQMEEVGDGF 95
Query: 97 --------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKI 142
+ L L+ + L+G V LA ++ D +I++F+DAK+
Sbjct: 96 FDDGHAERAPLKYQTTRRLHLLTQHELNGTVTGLAA-TRTLESTIDGLDRLIVSFKDAKM 154
Query: 143 SVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
++LE+ S + S+H +E ++ +S+ PL++ DP R + + +
Sbjct: 155 ALLEW--SRGDIATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLWRLAVLTLPEDSLA 209
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+L Q S L D G A S V++L D+ +K+++D +F+ G+ P +
Sbjct: 210 VLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFLPGFHSPTI 265
Query: 261 VILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+L TW+GR+ + K C+ I +S+ +PL+ S LP D+ L+A PS +
Sbjct: 266 ALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSEL 324
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQNDV 376
GG+++V + I + Q A A N S +S + +S S + L+ + ++
Sbjct: 325 GGIVIVTSTGIVHVDQGGRVAAACVNAWWSRITSLKCSTASVSQKLTLEGSRCVFVTPHD 384
Query: 377 ALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
LL + G + + +GR V++ LD P SD+T G+ F+GS GDS
Sbjct: 385 MLLVLQNGAVHQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSW 442
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
L + + K+E +++ D + L +DA D E+ +G A+
Sbjct: 443 LAKVNVVRQVVERAEKKKDEM-EVDWD----EDLYGDINDAALDEKAQEQ---FGPAA-- 492
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSNYELV--- 542
+ + D L +G + D +G+ + + +G S+ S + +
Sbjct: 493 -------ITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTFNVFRRG 545
Query: 543 ----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMV 589
EL G+W + G + + A +++S E R
Sbjct: 546 IPITKRRRFNELLNADGVWFLPIDRQTGQ-----KFKDIPEAERATMLLSSEGNATRVFA 600
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L + ++ + G+T++A F R ++ V +LD + G
Sbjct: 601 LSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN--------GKII 647
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
+ +++ SI+DP+V++ +D S+ VGD TV + P E +
Sbjct: 648 QTVCPRGDGPKIVNASISDPFVIIRRADDSVTFFVGDTVARTVG-EAPIVSEGESPVCQA 706
Query: 710 CTLYHD 715
++ D
Sbjct: 707 VEIFTD 712
>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
Length = 1269
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 202/935 (21%), Positives = 351/935 (37%), Gaps = 162/935 (17%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
A LEL+ Y L G V + + DN DS+ ++ + AK ++ ++ S +
Sbjct: 51 APRLELITEYYLDGTVTGVTRIKT--IDN-YDLDSLYISVKHAKAVIVAWNASSFTIDTK 107
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
S+H +E + L E V + +L +M L + G + D+
Sbjct: 108 SLHYYE--KGLVESNFFEPECSSVAVSDEANSFYTCLLFQNDRMAFLPIIEKG---LDDD 162
Query: 218 DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ SG F + S ++ LD +++V D F+H Y E M IL + + W G +
Sbjct: 163 EMPESGQVF----DPSFIVKASRLDKRIENVMDICFLHEYRETTMGILFQPKRAWVGMKN 218
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
T + +S+ K +I + LP DA K++ +P+P+GG L++ ANTI Y S
Sbjct: 219 ILKDTVSYAIVSVDVHQKNSTVIGTLNGLPVDAQKVIPLPAPLGGSLIICANTILYIDSS 278
Query: 336 ASCALALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQN--DVALLSTKTGDLVLLTV 391
AS + N +S + R S+ + L+ A ++Q + ALL T+ G L
Sbjct: 279 ASYTGVMVNNTHRQNSDLIVSRDQSTLDLRLEGAEVCFIQELGNTALLVTEDGQFFSLLF 338
Query: 392 VYDGRVVQRLDLSKTNPS--VLT--SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
DGR V L+L P +L+ S + + FLGSR GDSLLV++ G S
Sbjct: 339 NKDGRRVASLELRPIEPDNFILSQPSSVAAGPDGTIFLGSRAGDSLLVKWYHGEPESQPE 398
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVR 506
L D N + LYG + TE + + +
Sbjct: 399 ETL--------------------------DDGNESDDDLYGGDTAQTEDTTNRPLKLRLA 432
Query: 507 DSLVNIGPLKDFSYGLRINAD----ASATGISKQSNYELV--------------ELPGCK 548
D ++ +GP++ + G + + TG+ S ++ ++PG +
Sbjct: 433 DRMLGMGPMQSLALGKNRGSQGVEFVTTTGVGANSALAILTSALMPYKRKSLYKDMPGGQ 492
Query: 549 GIWTVYHK-SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
W+V + G A S D ++YL A V+E L T+ ++ +FV
Sbjct: 493 -FWSVPVRFEEEGEVAKSRTYVVSSDSENSYLYYVDAAG--VIEDVSLSTKKKKTKKHFV 549
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
T + ++QV I D S + +T + +
Sbjct: 550 SNVTTIFSSSMLDSALLQVCLETVNIYDAKI---------GQPHKYSLPQGTTAVEARVL 600
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
YVL+ +SDG +++L VS+ +++++ + + G +T
Sbjct: 601 GNYVLVLLSDGQVKILEA------VSINKRPFLKAAQVSIEPASESKAIG------IYAT 648
Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
D+ L+ G G P VVCY G+L + S I
Sbjct: 649 DSSLTFGAPSKKRTRQGSPAQDSRPVVVVCYADGSL------------LLQGLNSDDRLI 696
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
+D ++++ +E GQ +++V++A+ H +LT
Sbjct: 697 LDA----------SDLSGFIKEKDGQLYDA---PLELVDIALSPLGDDHILRDYLVLLTP 743
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
++ Y+ Y + LRF + L E TP
Sbjct: 744 QQLVVYEPYHYND----------------------------KLRFRKIFL-----ERTPT 770
Query: 908 GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD----GSIVAFTVLHNV 963
+R+T I+G ++G + + L P+L + VAFT
Sbjct: 771 INSDRRLTQVPLINGKHTLGVTGET---AYILVKTLHTSPRLIEFGETKGAVAFT----- 822
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ + F Y+T G + C+ + + WPV+ V
Sbjct: 823 SWDGKFAYLTQAGEVAECRFDPSFSLETNWPVKHV 857
>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
WM276]
gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative
[Cryptococcus gattii WM276]
Length = 1431
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 162/734 (22%), Positives = 302/734 (41%), Gaps = 123/734 (16%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
+ P + IG NLVV A + ++ +R + E K ++ E K+ V M+ +
Sbjct: 39 DTPDVKVIG---NLVVAGAEALRVFEIREESVPIIEKVKLEEDVAEGKKDVQMEEVGDGF 95
Query: 97 --------------SAASLELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDA 140
+ L L+ + L+G V LA + D D +I++F+DA
Sbjct: 96 FDDGHAERAPLKYQTTRRLYLLAQHELNGTVTGLAATRTLESAIDG---LDRLIVSFKDA 152
Query: 141 KISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ 200
K+++LE+ S + S+H +E ++ +S+ PL++ DP R + +
Sbjct: 153 KMALLEW--SRGDIATVSLHTYERCPQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDS 207
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEP 258
+ +L Q S L D G A S V++L D+ +K+++D +FV G+ P
Sbjct: 208 LAVLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFVPGFHSP 263
Query: 259 VMVILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+ +L TW+GR+ + K C+ I +S+ +PL+ S LP D+ L+A PS
Sbjct: 264 TIALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPS 322
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQN 374
+GG+++V + I + Q A A N S +S + +S S + L+ + ++
Sbjct: 323 ELGGIVLVTSTGIVHIDQGGRVAAACVNAWWSRITSLKCSMASVSQKLTLEGSRCVFVTP 382
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLD-LSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
LL + G + + +GR V ++ L K SD+ G+ F+GS GDS
Sbjct: 383 HDMLLILQNGAVHQVRFSMEGRAVGLIEVLDKGCVVPPPSDLIVTGDGAVFVGSAEGDSW 442
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
L + +R+ R+ + V+ +E LYG ++
Sbjct: 443 LAKVNV-----------------------VRQRVERAEEKKDEMEVDWDE-DLYGDINDA 478
Query: 494 T--ESAQKTF-----SFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSN 538
E AQ+ F + + D L +G + D +G+ + + +G S+ S
Sbjct: 479 ALDEKAQEQFGPAAITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNST 538
Query: 539 YELV-------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
+ + EL +G+W + G + + A +++S E
Sbjct: 539 FNVFRRGIPITKRRRFNELLNAEGVWFLSIDRQTGQ-----KFKDIPEAERATILLSSEG 593
Query: 586 ---RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQ 641
R L + ++ + G+T++A F R ++ V +LD + + Q
Sbjct: 594 NATRVFALSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNNGKIIQ 648
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
+ G G + +++ SI+DP+ ++ +D S+ VGD TV+ + P E
Sbjct: 649 TV------CPRGDGPK---IVNASISDPFAIIRRADDSVTFFVGDTVARTVA-EAPIVSE 698
Query: 702 SSKKPVSSCTLYHD 715
+ ++ D
Sbjct: 699 GESPVCQAVEVFTD 712
>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
owczarzaki ATCC 30864]
Length = 1725
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 135/261 (51%), Gaps = 23/261 (8%)
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKHHTCMISA 285
R+ S+ I L +L + HV D F+ GY EP + +L E +W GR + TC + A
Sbjct: 294 RLRPSYEIKLTELQRHIHHVIDIEFLTGYFEPTLALLFEPNAPSWTGRTVQRKDTCSMVA 353
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNN 344
LSI+T+ HP++WS LP ++ +++AVP P+ G ++V + I + SQS+ + ++LN
Sbjct: 354 LSINTSSHSHPVVWSVDKLPFNSMRVMAVPRPVCGTVIVTPDAILHLSQSSPTVGVSLNE 413
Query: 345 Y-AVSLDSSQELPR------SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV 397
++S + +P SS + +L + L T+ G++ + T++ +GR
Sbjct: 414 LSSMSTELRLGIPENKHPDGSSVVYNMQEGRCCFLTPETLLAVTEGGEMFVATLLTEGRT 473
Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLF-FLGSRLGDSLLVQF----TCGSGTSMLSSGLKE 452
V R+ + SVL +T++ N + F+GSR DS+L++ T + L+S +
Sbjct: 474 VVRIRIEPAGASVLPCCMTSLYNGQYCFIGSRASDSVLLRVMNNATAAADKRRLASAALD 533
Query: 453 EFGDIEADAPSTKRLRRSSSD 473
+F + KR R S ++
Sbjct: 534 DFS-------ANKRSRSSDTN 547
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 5/84 (5%)
Query: 920 ISGHQ---GFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ GHQ G F+ G RP W ++ R+ LR H L DGS+ AF+ +N C GF+Y T+
Sbjct: 1218 LGGHQLCSGVFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFVYFTT 1277
Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
QG L+ CQL + +DN WPV++V
Sbjct: 1278 QGTLRFCQLAPTTNHDNPWPVRRV 1301
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 120/253 (47%), Gaps = 30/253 (11%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHS--RADYVPQIPLIQTEELDSELPSKRGIGPVPNL-- 58
FA ++ H PT + +C T++ R V + L++ +D+ S G G L
Sbjct: 2 FAYFRQQHPPTAVEHCVEASFTNAAERQLVVARANLLEVYRIDAATAS--GSGWRSELSS 59
Query: 59 --VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAA---------SLELVCHY 107
+TA +++ R G + S + + + +A LELV +
Sbjct: 60 GSALTAQTAGAMHLGRAAGYGGNDGGRSDDAATEINTRSLHSAPATPPALQHKLELVASF 119
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
L GNVES+ + +RDS++LAF++AK++V+++D + L+ S+H +E
Sbjct: 120 NLSGNVESIGVARLAHC----KRDSLLLAFKEAKVAVVDYDPATLDLKTISLHMYED--- 172
Query: 168 LHLKRGRESFARG----PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
+ ++ GR++ A P+++VDP +C LVYG ++IIL Q + ++D + S
Sbjct: 173 IEMRGGRDATALQAVWPPVIRVDPMRQCAAFLVYGTKLIILPFRQESH--LDEDDDYQSA 230
Query: 224 GGFSARIESSHVI 236
+A + S I
Sbjct: 231 QAPAASVPPSAQI 243
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 96/192 (50%), Gaps = 25/192 (13%)
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET-ADLLTEVTE 601
EL G +G+W+V+ S + + +++ D H+ L+ S + T+V T + L ++ E
Sbjct: 739 ELTGGRGLWSVF---STALDPSLAALSSLDGASHSLLVASRDDSTLVFTTTGEELEQIAE 795
Query: 602 SVDYFVQGRTIAAGNLF---GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
S +F G TIA GN+F G+ ++ VF G R++DG + Q+L +S
Sbjct: 796 S-GFFTAGATIAIGNVFAANGKILIVDVFAHGIRLVDGVNLRQELLLAQLSSV------- 847
Query: 659 STVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
S ++ SIA+ VL +DG++ + GD S T AA +PV + +LY D+
Sbjct: 848 SEIIHASIAESSVLALHADGAVSFVQFTGDTQELVASTATVAA----GQPVVAVSLYADR 903
Query: 717 G----PEPWLRK 724
PE L++
Sbjct: 904 SGLFVPEAVLQR 915
>gi|76157351|gb|AAX28300.2| SJCHGC08809 protein [Schistosoma japonicum]
Length = 225
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 91/154 (59%), Gaps = 5/154 (3%)
Query: 242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
+ +V D F+HG+ EP +++L+E TWAGRVS + TC I ALS + + +P+IW
Sbjct: 52 KINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQ 111
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VSLDSSQELPR 357
+LP D ++ VP PIGGV+++ AN+I Y Q+ SC+L LN YA + Q++P
Sbjct: 112 ESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNCYAQISTNFPMRQDVP- 170
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
S + +D L L+ T++G+L LL++
Sbjct: 171 SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSL 204
>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
Length = 1398
Score = 107 bits (266), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 176/734 (23%), Positives = 287/734 (39%), Gaps = 96/734 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R + A L L Y + G + SL
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTAAQPDQKGRN---ERSQHAKLVLAAEYEVPGTITSL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ + + D+II++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVKISNSKSGG--DAIIVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141
Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIES-- 232
P + DP G C + +G+ + IL Q G LV D+ G SA + S
Sbjct: 142 LGSCPSSLTADPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGNDSADVVSDP 200
Query: 233 ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKSAPENTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ K + + LP D +K++ +P P+GG L++G N +H
Sbjct: 261 IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIIEQLGSGTGDILLILADGRMSILSF 380
Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
DGR V + L ++N S+ + T ++G + F GS GDS+L+ ++ S T
Sbjct: 381 KVDGRSVSGISLHFVAEQSNGSITIARPTCSASLGRNKLFCGSEEGDSILLGWSRPSSTI 440
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES------AQ 498
S K G E A D D + ++L AS E +
Sbjct: 441 KRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSANLASTRQEKQVVNGDSP 498
Query: 499 KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTV 553
F F D L ++GP +D + G + + S +EL +G V
Sbjct: 499 ADFIFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPEIAAPLELVAARGFGKSGGLAV 558
Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-------------MVLETADLLTEVT 600
+ + DS +M DD Y + I ++ ++ ++ +T D +
Sbjct: 559 LKREIDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAKTKD--DDKE 613
Query: 601 ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
ESV Y V + TI G L RV+QV R D Y
Sbjct: 614 ESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLAAGTRVVQVLRTEIRSYD--YNLGLAQ 671
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAIES 702
P E SE TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 672 IYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA--- 726
Query: 703 SKKPVSSCTLYHDK 716
S K +S C LY DK
Sbjct: 727 SGKWISGC-LYEDK 739
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 47/91 (51%), Gaps = 2/91 (2%)
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
C+R+ ++ G++ F+SG PC+ ++ R H G V + + H C GF
Sbjct: 882 CKRLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 940
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVVF 1000
YV ++++ +LPS + +D+ W +K+ F
Sbjct: 941 AYVDEDNVIRMSRLPSNTRFDSGWATRKIAF 971
>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
Length = 1300
Score = 106 bits (264), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 104/458 (22%), Positives = 200/458 (43%), Gaps = 45/458 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y+L G + S+ + + + D +++A + AKIS++ +D + H +R S+H
Sbjct: 51 LNLVDQYKLFGTITSIKPIR---TIENPKLDYLLVATQLAKISLVRWDHASHSIRTVSLH 107
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+E+ + + L+ V+P+ C V L + ++
Sbjct: 108 YYEN---VIQTSTFDKLNSAELI-VEPKNACLCVRYKNLLTFLPFTRLKTEEDEYADEED 163
Query: 221 GS-GGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G+ + +SS +IN ++LD + + D F+H Y +P + +L ++ WAG + +K
Sbjct: 164 GAVTNSYDGIYDSSFLINGQNLDSRIGTIVDADFLHNYRQPTVALLSSKDQVWAGNLFFK 223
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
LS+ K+ + +LP+D +L+++PSP+ G L+VGAN IH +
Sbjct: 224 KDNISYIVLSLDLNTKKSTTVLKIDDLPYDIDRLISLPSPLNGSLLVGANQLIHIDNGGI 283
Query: 337 SCALALNNYA--VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVY 393
+ +++N + + +S + S ++ L+ L N+ +L TG+ +LT
Sbjct: 284 TRKISVNPFTDLTTKNSKNYINYSHMNLRLENCSVVPLPNENKVLVILSTGEFYMLTFEI 343
Query: 394 DGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
DG+ ++RL S + N+L F+G++ G+S L+Q+
Sbjct: 344 DGKTIKRLTFEVVETSRYNGINVTFPGQFAALDNNLLFVGNKNGNSPLIQYKY------- 396
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
G KE+ ++ DA + D + S ++ F +
Sbjct: 397 -EGAKEK--AVKEDAKDEEDNDGDEELYEDDEEKVKSFS------------KEKLDFTLC 441
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
D L+N GP+ F++G N + I+ NY+ V +
Sbjct: 442 DELINHGPISAFTFGFYSNEKFKSNLIN--PNYQEVSI 477
>gi|302652141|ref|XP_003017930.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
gi|291181516|gb|EFE37285.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
Length = 844
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 183/741 (24%), Positives = 288/741 (38%), Gaps = 110/741 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTGTQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHL 170
+ NS+ D+I+++ DAK+S++E+D HG+ S+H +E E W+
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRDAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVP- 140
Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDEDT- 219
S + G + VDP G C + +G+ + IL Q G LV GD+ T
Sbjct: 141 --DLGSCSSG--LTVDPNGNC-AIFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTD 195
Query: 220 FGSGGGFSARIESSH--------VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
S SA +SH V+ + LD + H F+H Y EP IL+ +
Sbjct: 196 MVSDAQKSAPGNTSHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVAR 255
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
+ S ++ K + + LP D +K++ +P P+GG L++G N
Sbjct: 256 STSLTIDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNEL 315
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
+H + A+ +N +A + +S + L+ L + LL G +
Sbjct: 316 VHVDQAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRM 375
Query: 387 VLLTVVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
+L+ DGR V + L +K PS S +G + F GS GDS+L+
Sbjct: 376 SILSFKVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLL 431
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
++ S T+ S K G E A D D + ++L AS E
Sbjct: 432 GWSRPSSTTKRPS--KSVDGVDENGAADLSDEADQDDDGDDDDMYEDDLYSVNPASTRQE 489
Query: 496 S------AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
+ F+F D L ++GP +D + G + + S +EL +G
Sbjct: 490 KQVVNGDSPADFTFRAYDRLWSLGPYRDITLGKPSKSKSKDQQDSVPEIAAPLELVAARG 549
Query: 550 I-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-----------MVLETA 593
TV + + DS +M DD Y + I ++ ++ +L
Sbjct: 550 FGKSGGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAK 606
Query: 594 DLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+ ESV Y V + TI G L RV+QV R D
Sbjct: 607 SKGEDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD-- 664
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQ 695
Y P E SE TV+ S A+PY+L D S+ +L D + V VQ
Sbjct: 665 YNLGLAQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQ 722
Query: 696 TPAAIESSKKPVSSCTLYHDK 716
AA S K +S C LY DK
Sbjct: 723 GSAA---SGKWISGC-LYEDK 739
>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
98AG31]
Length = 1427
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 193/933 (20%), Positives = 350/933 (37%), Gaps = 171/933 (18%)
Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
V ++LHG V L ++ + D ++++F+DAKI++LE+ L S+H FE
Sbjct: 73 VLEHQLHGIVTGLQPITTIDT-HVDGLDRLLVSFKDAKITLLEWSHQQSDLVPISLHTFE 131
Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
+ F + ++ DPQ RC + + + +L Q + D +T S
Sbjct: 132 KLPQITQGDFPTIFDQ---LETDPQSRCAILKLPQSTIAVLPFFQENN---LDLETLFSN 185
Query: 224 GGFSA---RIES-----SHVINLRD------------------LDMKHVKDFIFVHGYIE 257
SA RI+S S +I+L +K + F F+ G+ +
Sbjct: 186 SNPSANNQRIQSFPYAPSFIIDLNQSQSFKSQTQTHSQTQTQQKSIKSIISFKFLPGFSQ 245
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + IL+ + TWAGR+ +C + +++ + +I+ NLP+ A+ ++A P
Sbjct: 246 PTLAILYTYQHTWAGRLENTTDSCSLIFITLDLSSNHFTIIFQIDNLPYHAHSIMACPKE 305
Query: 318 IGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP---------------RSSFSV 362
+GGVLV+ A+ I + QS+ N L + ++P V
Sbjct: 306 VGGVLVICADMILHIDQSSKLIGIATNGWSKLSTHLDVPTQQMVKIVTEDGQDQEERLKV 365
Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDITTIGNS 421
L+ + ++ D AL+ G + L + DGR + +L L K SV+ S I +
Sbjct: 366 RLENSKLVFVTIDRALMFLTDGQIFRLCLYQDGRTLIKLCLEKFPVVSVIPSVAVKISDH 425
Query: 422 LFFLGSRLGDSLLVQFTC------------------------GSGTSMLSSGLKEEFGDI 457
F+GS LGDS+++ G+ + + E +G
Sbjct: 426 SVFVGSMLGDSIVMGIEFEGEKEVEVVEEVEVEVEAEVVHQNGNEMEIDQAEEDEIYGKE 485
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
E D TK D + ++ + N + ++ S + DS+ GP++D
Sbjct: 486 EPDDKKTK-----DQDGIDSIIK----------ATNKKIHREIRSLRLHDSISGHGPIRD 530
Query: 518 FSYGLRINADASATGISKQSNYE-LVELPGCKGI-----WTVYHKS---SRGHNADSSRM 568
F+ +SK +E +E+ GC G T+++K + DS+
Sbjct: 531 FT-------------MSKIGGFEDSLEMVGCTGSGETGGLTIFYKEMPLMKRKKLDSTNE 577
Query: 569 A---------AYDDEYHA----YLIISLEARTMVLETADLLTEVTESVDY----FVQGRT 611
+ A++D + IS+ RT + E + D + T
Sbjct: 578 SMKITNLNSIAFNDPTGSPGCELAWISIHDRTKIFSMIKNPEEGNRTSDLKFMKTLNAST 637
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I F + +Q+ ++L + P +E ++ + ++ + Y+
Sbjct: 638 IYVAMFFDQTCFLQITSYEIKLLKVVGFGEVQVIRPIETE----NKKNKIIRAKVVQDYI 693
Query: 672 LLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
LL SD + L G + T+ +Q P KPV+ +L+ P + T+
Sbjct: 694 LLETSDHRVMLYKGQVDSLTIDRIQLPQL----SKPVTYASLFSAHLP-LYDHDDQTN-- 746
Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF-----VSGRT 785
G+G D P + V G L I +P VFTV +
Sbjct: 747 ---GIGLDNDEDAEKP------WLFVTDLGGVLHILSLPELEIVFTVKGIENLPDLLDED 797
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+ + A++ + + EE KEN + A +RP L+ L
Sbjct: 798 EDEEQQQQPAIEYEHEDGDVKMEEDEKVEPKENSSIQMIYGFVT---GAKVARPHLYVEL 854
Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF-SRTPLDAYTREE 904
+G + YQ + K DP ++ ++ +++ +F S P+ R
Sbjct: 855 NNGALAVYQISI----AYDRKPGDPSTSKPRRQALSIRLNKVLGYQFESSEPISNLDR-- 908
Query: 905 TPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
++ + K + G LSG P W +
Sbjct: 909 --------KVKVVKKNATFSGIHLSGLEPIWIV 933
>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
Length = 1370
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 181/736 (24%), Positives = 287/736 (38%), Gaps = 100/736 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E +
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVPD 141
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
+ + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 142 LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDLVSDA 200
Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ K + + LP D +K++ +P P+GG L++G N +H
Sbjct: 261 IDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + S + L+ L + LL G + +L+
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADHSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380
Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLLGWSRP 436
Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
S T+ S K G E A D D + ++L AS E
Sbjct: 437 SSTTKRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASTRQEKQVVN 494
Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
+ F+F D L ++GP +D + G + + S +EL +G
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQQDSVPEIAAPLELVAARGFGKSG 554
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
TV + + DS +M DD Y + I L+ + T + + D LL +
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVLDPKSKDTGLSRSYDKYLLLAKAKGED 611
Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
ESV Y V + TI G L RV+QV R D Y
Sbjct: 612 KEESVVYSVGSSGLDSIDTPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD--YNLGL 669
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
P E SE TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 670 AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726
Query: 701 ESSKKPVSSCTLYHDK 716
S K +S C LY DK
Sbjct: 727 --SGKWISGC-LYEDK 739
>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 8904]
Length = 1339
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 194/814 (23%), Positives = 316/814 (38%), Gaps = 160/814 (19%)
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+++ DP R + + + +L Q S L +D+ S
Sbjct: 157 PMLRSDPLSRLAILTLPEDALAVLPIVQEQSELDAMQDSVSSP----------------- 199
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK-QHPLIW 299
++K++KDF+F+ G+ P + +L TWAGR T + +I T+ +PLI
Sbjct: 200 -EIKNIKDFLFLPGFHSPTIALLFAPMNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLIT 258
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALN---NYAVSL--DSSQ 353
S LP D+ L+A PS +GGV+VV A+ I + QS + ++N NY ++ DSS
Sbjct: 259 SVTGLPSDSQYLVACPSEVGGVVVVTASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSY 318
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT- 412
E S + LD +HA ++ + LL +TG++ + DGR V + + + + +V
Sbjct: 319 E----SQKLALDNSHAQFVTENDMLLVLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPP 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
S + G+ F+GS GDSLL S +EE P TK+
Sbjct: 375 STLVPAGSDGIFVGSVEGDSLLAMVEKARDQSA-----QEE--------PETKQQEMDVD 421
Query: 473 DALQDMVNGE---ELSLYGSASNNTESAQKTFSFAVRD-------SLVNIGPLKDFSYGL 522
D +++ G + S A F AV D LV IG S G
Sbjct: 422 DWDEEVATGPVTVSVKAQDVLSGIGRIADMEFGIAVTDLGTRTYPQLVCIG---GGSQGS 478
Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
+N I+K+ +E +L W + + + NA + ++ + +
Sbjct: 479 TMNVFRRGIPITKRRLFE--QLRTAVATWFLPVERA---NAPKFKDIPESEQSTIAIAAT 533
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
E T + + +V E + F + IA G R R++ V +LD
Sbjct: 534 QEGSTQIFALS--TRKVQERIAEFPEP-AIATGTWLRRTRIVLVLPSQVLLLD------- 583
Query: 643 LSFGPSNSES-GSGSENS---TVLSVSIADPYVLLGMSDGSIRLLVGD-----------P 687
SN+ G+ E S +++ SIADPYVL+ +DGS+ + VGD P
Sbjct: 584 -----SNANPVGTICEMSDAPPIVAASIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMP 638
Query: 688 STCTVSVQTPAAI------------------ESSKKPVSS----CTLYHDKGPEPWLRKT 725
+ V A + E KPV + H G E R
Sbjct: 639 EGLALPVCQAAEVFTDTTGIYRTFEATQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQ 697
Query: 726 STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
+ +S V + +G + + +SG L+I +P+F+ V +
Sbjct: 698 DSKPAISADVATTESAFNAA---RGTQWIALLAQSGELQIRSLPDFDLVLQSNG------ 748
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+ D+ + D +T EEG + + M + + RP + +
Sbjct: 749 -VYDS--EPSFTDDQTGELPELEEG------DEVSQMLFCPIGTRTL-----RPHVIVLH 794
Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
G + Y+A P T + D + RSL+V R R + T L + T T
Sbjct: 795 RSGRLNIYEAQ----PRFTVDARD--QSRRSLAV------RFRKV---HTQLLSVTPSST 839
Query: 906 --PHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
P P F +I G G F++G RP W +
Sbjct: 840 VKPAAIP------FTDIEGLTGAFITGERPHWII 867
>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Ogataea parapolymorpha DL-1]
Length = 1309
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 107/464 (23%), Positives = 212/464 (45%), Gaps = 46/464 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+L+ YRL+G + ++ + ++ + D +I++ + AK+SV+++D +H + S+H
Sbjct: 51 LQLIGEYRLNGQIINI---DKFRSNENESLDYLIVSTKLAKLSVIKWDSQLHAISTVSLH 107
Query: 161 CFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII--LKASQGGSGLVGDE 217
+++ + L +++ ++ + + DP C + + L + K L D
Sbjct: 108 YYDTALDALTVEKLEKTSVQH---RTDPNSLCTCLRLNELFTFLPFYKEYLDEEELKDDA 164
Query: 218 DTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERE-LTWAGRV 274
+ S ++N L D+K++ D+ F+H Y +P M IL+ E +TWAG +
Sbjct: 165 EEAKDIKKRKKLFTESFILNASSLYPDIKNIVDYQFLHSYRDPTMAILYAPETMTWAGHL 224
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
T + LS+ K+ I NLP+D + + SP G L+VG+N IH +S
Sbjct: 225 PKAKDTLKVIVLSLDLENKKASAIMELTNLPYDVDYIYPLESPTNGFLLVGSNEIIHVNS 284
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
+ + N Y + + + +SS + L+ + ++ D L+ T++G+ L
Sbjct: 285 LGSVRGIYTNEYFTDISNLKLKDQSSLGLMLENSRVGLVKEDQVLIITESGEFYQLNFEK 344
Query: 394 DG-----RVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
G +Q+++ S N ++ + + ++ LFF+ + GDS L++ + SG
Sbjct: 345 IGGNSTITGLQKVETSNYKGIIVNHPIMITSVPSL--DLFFVCCQGGDSSLIRISSKSG- 401
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
+L KE+ GD + L + E+ + S+ N++ F
Sbjct: 402 -VLPQETKEQNGDTKETKDDDDWL-----------YDEEDQKSHKSSLVNSQ-------F 442
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
D+++N GPL DF+ G R++ + G+ + E V + C
Sbjct: 443 KKMDNILNCGPLVDFTLG-RVSIEQKIMGLPNPNYNEDVLVAAC 485
>gi|390358537|ref|XP_001201130.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 283
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 126/273 (46%), Gaps = 51/273 (18%)
Query: 3 FAAYKMMHWPTGIANC-GSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVT 61
+A Y+ +H PTG+ +C F + P ++ NLVV
Sbjct: 2 YAFYREIHPPTGVEHCVYCHFFS----------------------PDQQ------NLVVA 33
Query: 62 AANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ 121
+ + +Y + + K+S + LE + + G V S+ Q
Sbjct: 34 KGSELTVYSMITVDSNKPTDKDSKPKNK-----------LEEAATFHIFGKVMSM----Q 78
Query: 122 GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGP 181
RD+++L+F +AK+S++E+D ++H L+ SMH FE E K G P
Sbjct: 79 SAQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDET---KEGVYRNIFHP 135
Query: 182 LVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL 241
+VKVDP RC +L YG ++++L + GLV D D S + S+VI L ++
Sbjct: 136 VVKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEM 193
Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
D + +V D F+HGY EP ++IL+E TWAG
Sbjct: 194 DDPICNVLDIQFLHGYYEPTLLILYEPLRTWAG 226
>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 1348
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 161/745 (21%), Positives = 297/745 (39%), Gaps = 116/745 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + + R D A L LV Y++ G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSSLANHPDQKSRH---DRSQHAKLVLVAEYQVSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ + + D+I+++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 ERVKISNSKSGG--DAILVSSRNAKLSLIEWDPRNHGISTISIHYYEGEES-HMSPWVPD 141
Query: 177 FAR-GPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE----------------- 217
+ VDP G C + +G+ + IL Q G LV D+
Sbjct: 142 LGSCASNLTVDPNGNCA-IFNFGIHSLAILPFHQTGDDLVMDDYDSVLNGDSAADTINDT 200
Query: 218 --DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
T G S E S V+ L LD + H F+H Y EP IL+ +
Sbjct: 201 QKPTAGDSTVHSKPYEPSFVLPLAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + ++ + + + LP D +K++++P P+GG L++G N +H
Sbjct: 261 SIDRKDVVSYAIFTLDLQQRASTSLLTVSRLPSDMFKVVSLPPPVGGALLIGTNELVHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L +D LL G + +LT
Sbjct: 321 QAGKTNAVGVNEFARQASAFSMVDQSDLEMRLEDCVVEQLGSDAGEVLLILTDGRMAILT 380
Query: 391 VVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGT 443
DGR V + L ++ S++ + + +G S F GS GDS+L+ ++ S
Sbjct: 381 FKVDGRSVSGISLHYVAEQSGGSIIKARPSCSAGLGRSKLFCGSEEGDSILLGWSKPSSN 440
Query: 444 SMLSSGLKE---EFGDIEADAPSTKRLRR-----------SSSDALQD--MVNGEELSLY 487
+ + E E G E + + + LQ+ +VNG++ +
Sbjct: 441 TKKPTKANEDTNEDGTTEFSGEDEQDDDDDDIYEDDLYSANPAPTLQEKRVVNGDDTA-- 498
Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------LRINAD-----------ASA 530
F F + D L ++GP +D + G L+ D +A
Sbjct: 499 ------------DFVFKIHDRLWSLGPFRDITLGRPPKSKLKDKRDNVPSISASLELVAA 546
Query: 531 TGISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
G K + +++ G+W++ + A ++ +Y Y
Sbjct: 547 RGFGKSGGLAVLKREIDPFTIDSLKMDNVYGVWSIRVTDPKSKEASAT---GNSRDYDKY 603
Query: 579 LII-----SLEARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGAR 632
L++ S + ++V + + ++ ++ + TI G L RV+QV R
Sbjct: 604 LLLAKAKCSDKEESVVYSVGNSGLDSIDAPEFNPNEDCTIDIGTLAAGSRVVQVLRTEIR 663
Query: 633 ILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
D + +TQ ++ SE TV+ S A+PY+L D S+ +L D +
Sbjct: 664 SYDYNLGLTQIYPVWDEDT-----SEERTVVQASFAEPYLLAIRDDHSLLVLQADKTGDL 718
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
V+ + +S VS C LY D+
Sbjct: 719 DEVEI-QGLATSADWVSGC-LYEDR 741
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/98 (24%), Positives = 47/98 (47%), Gaps = 6/98 (6%)
Query: 904 ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLCDGSIVAFTVL 960
E H P + + +I G++ F+ G PC+ + + P +L ++ + +
Sbjct: 820 EGKHPFPRKPLRALSDICGYKTVFMPGQNPCFIL---KSAITQPHVLRLRGKAVQSLSGF 876
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
H C GF YV I+++ +LPS + +D+ W +K+
Sbjct: 877 HIAACERGFAYVDEDNIIRMSRLPSNTRFDSTWATRKI 914
>gi|348679451|gb|EGZ19267.1| hypothetical protein PHYSODRAFT_492468 [Phytophthora sojae]
Length = 736
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/427 (25%), Positives = 169/427 (39%), Gaps = 102/427 (23%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+L++ V D F+ GY+EP +++LHE + + GR++ T I+ +SI+
Sbjct: 261 LLRLRELEITGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 320
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D +KL+ +P+GGV+V+ AN Y +Q+ LA N L
Sbjct: 321 TRLHPKIWTVKNLPSDCFKLIPCRAPLGGVVVLSANAFLYFNQTQFHGLATN----VLRE 376
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL-DLSKTNPSV 410
+ + ++ L +L LL+ GD +L++ Y+ R V+ + KT
Sbjct: 377 QDDHEMAQLNIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDRSVRFWRSIKKT---- 432
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF------TCGSGTSMLSSG----LKEEFGDIEAD 460
F+GSR GDS+L + G S L +KEE E
Sbjct: 433 ------------LFVGSRSGDSVLYALDQKKLTSAGGEASKLQEDEEMLIKEEVVKEEVT 480
Query: 461 -----------------------APSTKRLRRSSSDALQDMVNGEELSLYGSASN----N 493
AP+ + S S + VNG S N
Sbjct: 481 AEVKAEPAEEEEEDEDDLFLCGAAPTKEEPTTSGS---TEAVNGTNGSAVKKEENGHAVE 537
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV----------- 542
ES + D L +IG + + NAD S + ELV
Sbjct: 538 EESGPYDYVLHQIDVLPSIGQITSIELSIENNAD------SNEKREELVISGGYEHSGAI 591
Query: 543 ---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
EL GC+ +WTV + R Y+AYLI+S+ RT
Sbjct: 592 SVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR------SYNAYLILSVAHRT 645
Query: 588 MVLETAD 594
MVL T +
Sbjct: 646 MVLRTGE 652
>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
Length = 1445
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 226/1051 (21%), Positives = 399/1051 (37%), Gaps = 207/1051 (19%)
Query: 57 NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
NL+V+ ++++I+ V+ + +K + N+GET R + D G+ +A
Sbjct: 28 NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSTSNAGETYDRRINDDDGLESAFLGGDGML 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
L LV Y +HG + LA + +SR +++++ A++S+L++D
Sbjct: 88 MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
HG+ S+H +E EW + S GPL ++ DPQ RC L +GL+ I
Sbjct: 145 EKHGVEDISIHFYEKEEW------QGSPMDGPLRQHATILQADPQSRCAA-LKFGLRKIA 197
Query: 204 L-------------KASQGGSGLVGDEDTFGSGGGFSARIES----------SHVINLRD 240
+ G E+ + + S S V+ L
Sbjct: 198 FLPFRQIDGDIDMDDWDEEVDGPRPQEEPPAAAAVHGSSSNSSSLAPVPYTPSFVLALPQ 257
Query: 241 LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
LD + H F F+H Y EP + I+ H T + + + L
Sbjct: 258 LDPEILHPVHFAFLHEYREPTLGIISSTNRRLKMEPQKDHFTFKVFTVDL--------LQ 309
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPR 357
+++N K++A+P P+GG L++G N IH + +A+N YA + +
Sbjct: 310 KASLN------KVIALPKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKMTKFPLADQ 363
Query: 358 SSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVL 411
S + L+ + +N LL T+ G++ ++T DGR V + + ++ VL
Sbjct: 364 SELKLRLEHCEVELMSPENGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVATENGGDVL 423
Query: 412 ---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ +T + + F G+ GDS ++ G + + K+ E+
Sbjct: 424 PFRAACLTKVTKNSMFYGTIGGDSKVI----GWSRQHVQTARKKARLLDESLDYDLDDDE 479
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
D D + GE ++ + F V DSL+++ P+ D +YG
Sbjct: 480 ADDDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMTYGKTAFFPG 539
Query: 522 ---------LRINAD-ASATGISKQSNYELV------------ELPGCKGIWT--VYHKS 557
+R D A G + + L+ + P +G WT V
Sbjct: 540 SEDAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFDFPEARGFWTTRVQKTI 599
Query: 558 SRGHNADS-SRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTESVDYF----- 606
++ D + +A +D +Y ++I++ + ET+D+ +
Sbjct: 600 AKSLQGDKGANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGAGFEALSGTEF 658
Query: 607 --VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLS 663
G TI AG + R+IQV R DG ++Q L + E+G+ V+S
Sbjct: 659 DPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETGA---EPRVIS 713
Query: 664 VSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK----GPE 719
SI DPY+LL D SI + + S K +S C LY D P
Sbjct: 714 ASIVDPYLLLLREDSSILVAQITNHNELEELDKEDETIVSTKWLSGC-LYKDSRGLFAPV 772
Query: 720 PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK 779
+ TST + + AI G+++ +C ++ +PN + V
Sbjct: 773 QTDKGTSTSESVFLFLLNAI----------GELHVRIC-------VYALPNLSKSIYV-- 813
Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
+G ++I S + ++ GT E + + V +L ++ H
Sbjct: 814 -AAGLSYI----------PSLLSADYTARRGTS---PETLTEILVADLGDSTSASAH--- 856
Query: 840 FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
L + + Y+ + G E K D + SL VS S L +++P++A
Sbjct: 857 -LILRHANDDMTIYEPFRIGGQEE--KED----LANSLFFKKVSNSHL-----AKSPVEA 904
Query: 900 YTREETPHGAPCQRITIFK---NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVA 956
E R+ + NI G+ FL G+ P + + + L +
Sbjct: 905 AEDEAVQE----NRVIPLRACDNIGGYSTVFLPGASPSFILKSSKSTPKVIGLQGLGVNG 960
Query: 957 FTVLHNVNCNHGFIYVTSQGILKICQLPSGS 987
+ H C GFIY S+G ++ Q P +
Sbjct: 961 MSSFHTEGCERGFIYADSKGCARVTQFPDAA 991
>gi|159470705|ref|XP_001693497.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283000|gb|EDP08751.1| predicted protein [Chlamydomonas reinhardtii]
Length = 461
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 97/164 (59%), Gaps = 7/164 (4%)
Query: 230 IESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+ +S+V+NL + + ++ V+D +F+HGY EPV+++LHE + TWAGR+ + TC ++A+S+
Sbjct: 128 VGNSYVLNLHKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWAGRLRERKDTCCLTAISV 187
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAV 347
S LK+H ++W A LP+D Y+LL +P LV+ + + SQ++ A ALN+ A+
Sbjct: 188 SLRLKRHTVLWRAAGLPYDCYRLLPLPQ-RPAALVLSPSLVMLTSQASQPQAAALNSTAL 246
Query: 348 SLDSSQEL----PRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
++ L R + SV A + ND A ++ LV
Sbjct: 247 PGEAPPPLVFDPAREAPSVTAARMAAEFALNDCAPALGRSAALV 290
>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
Length = 1204
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 159/355 (44%), Gaps = 40/355 (11%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
P S +G+ FLGS DS+L+ G S+ SSG K+
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKK 435
Score = 47.0 bits (110), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 3/101 (2%)
Query: 898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
D + EE P + I NISG F G P + + L G +
Sbjct: 678 DQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKGGYAQSL 734
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ C GFI + S+G++ +CQ+P G D W +Q++
Sbjct: 735 SKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQI 775
>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification
[Scheffersomyces stipitis CBS 6054]
gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification
[Scheffersomyces stipitis CBS 6054]
Length = 1341
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/415 (23%), Positives = 183/415 (44%), Gaps = 55/415 (13%)
Query: 55 VPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
V +LVV A +++I+ V VQ + S SK L+L+ ++LHG +
Sbjct: 26 VKHLVVGKATLLQIFEV-VQLKSSTPSK--------------PQHRLKLIDQFKLHGLIT 70
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
+ + + N D ++++ + AK SV+++D +H + S+H +E+
Sbjct: 71 DIKPIRTVESPNF---DYLLVSTKSAKFSVIKWDHHLHTISTVSLHYYENAIQ---NSTY 124
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGL----------QMIILKASQGGSGLVGDEDTFGSGG 224
E ++ L+ ++P G C + L ++ A +V E G
Sbjct: 125 EKLSKSELL-LEPYGSCSCLRFKNLLCFLPFETAEELDDDDADSENEDMVKSEKKEHENG 183
Query: 225 GFSARI--------ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
+ + ++S +I+ + LD + + D F+ Y EP IL +R+ WAG +
Sbjct: 184 TVNVPVTDQPGSFFDTSFLIDGQSLDSSIGSIIDMQFLFKYREPTFGILSQRQQAWAGNL 243
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
L++ T K + NLP+D +++ +PSP+ G L++G N IH +
Sbjct: 244 PKIKDNVQFCILTLDLTTKSTVSVLKIDNLPYDVDRIVPLPSPLNGCLLLGCNEIIHVDN 303
Query: 334 QSASCALALNNYAVSLDSSQEL--PRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLT 390
+A+N + + +S + ++ +++L+ L ND ALL TG+ L
Sbjct: 304 GGIVRRIAVNQFTSLITASTKAYQDQTHLNLKLEDCSVVALPNDHRALLVLSTGEFYYLN 363
Query: 391 VVYDGRVVQRLDLSKTNPSVLTSD--------ITTIGNSLFFLGSRLGDSLLVQF 437
DG+ +++ + + +L SD I T+ N+L F + G+S LVQF
Sbjct: 364 FEVDGKSIKKFTIESVD-KLLYSDIKLTFPGQIATLDNNLLFFANHNGNSPLVQF 417
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/164 (22%), Positives = 68/164 (41%), Gaps = 26/164 (15%)
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
H +L + G +L Y+ Y F+G N + ++L +
Sbjct: 786 HKEEYLTILTIGGEVLLYKLY-FDG-------------------ENYEFKKEKDLAITGA 825
Query: 896 PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
P +AY P G +R + F N++G+ F++G P + + Q
Sbjct: 826 PENAY-----PIGTAVERRLAYFPNLNGYTCIFVTGVTPYLILKSLHSIPRIYQFSKIPA 880
Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
V+ + H+ +G I++ +Q +ICQLP Y+N WP++ +
Sbjct: 881 VSISPFHDSKVANGLIFLDNQQNARICQLPLDFNYENTWPMKLI 924
>gi|9794906|gb|AAF98387.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 279
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 141/300 (47%), Gaps = 65/300 (21%)
Query: 425 LGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEFGDIEADAPSTKRLRRSSS 472
LGSRLG+SLL+ FT +++++ L++E ++E + +L + +
Sbjct: 1 LGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLE-EIFDVDQLEMAPT 59
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD----- 527
A + EEL +YGS + + + F F V DSL+N+ P+ G R+ +
Sbjct: 60 QAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERVEFEEDGVT 119
Query: 528 ---------------ASATGISKQS---------NYELV---ELPGCKGIWTVYHKSSRG 560
+ATG SK N +++ EL GC +WTV+
Sbjct: 120 LRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFD----- 174
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
D+++ ++ +D+ H ++++S T+VL+T + E+ E+ + V TI GNL +
Sbjct: 175 ---DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQ 229
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
R ++QV R R+L G+ + Q++ S V+ VSIADPYV L + +G +
Sbjct: 230 RFIVQVTTRHVRLLQGTRLIQNVPIDVG----------SPVVQVSIADPYVCLRVLNGQV 279
>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like
protein [Galdieria sulphuraria]
Length = 1454
Score = 96.7 bits (239), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 119/529 (22%), Positives = 200/529 (37%), Gaps = 91/529 (17%)
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
KVDP+ VL+ ++++ ++ D+ + + + +++LR L
Sbjct: 166 KVDPEHGLIAVLIRKKNLLLI----AKYPILSHRDSLSAECSSNKLLSDPVILDLRRLGH 221
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
F F+ GY P + +L E+ TW+G S + ++S + + K+ IW
Sbjct: 222 FETIHFCFMFGYSLPTLALLEEKTPTWSGSFSVTRDSRLVSVVQFDLSDKKMKRIWQVEE 281
Query: 304 LPHDAYKLLAVPS-PIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFS 361
LPH+ + + +VP GG LV G N I Y + L+ N+ S L
Sbjct: 282 LPHECFMVSSVPFLQGGGFLVFGWNIILYFRDGSFVDGLSCNDLGDVYLSKWSLRSQDAP 341
Query: 362 VELDA--------AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
+ LD +H T+++N V +L + G L + G + L + S
Sbjct: 342 ISLDGCEVVSEFDSHDTFMKNPVIIL--RDGAFFELCIPKKGG-DSVISLRYCKILIQPS 398
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
++ GN L FLGS + S L++ + T + D
Sbjct: 399 TVSYCGNGLIFLGSHVSPSALLEIIWKNSTEL-----------------------HPEDD 435
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
L+ S +G +SN + S RDSL IGP++D I + +
Sbjct: 436 ELE--------SFFGKSSNKNFVVETIDS---RDSLFCIGPIQDLEVFDNIIGSSRKMEL 484
Query: 534 SK---QSNYELV---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
NY V L C+ IW V + G S +
Sbjct: 485 IAAVGSRNYGAVIIFRRTVSPSLLTSIRLEDCQQIWNVLCQRKMGERNGSVPL------- 537
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL- 634
LI+S + T+VL +D + E+ +S + RT+ + R +IQVF+ G RIL
Sbjct: 538 ---LILSTQRNTIVLSVSDTIDELVDS-QFQTSSRTLWVSRVLHDRYIIQVFDEGLRILG 593
Query: 635 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+ + P + V + DPYV+L +S + +L
Sbjct: 594 NWDSLISLYELPPGD----------VVTQAFVCDPYVMLHLSSSYLVIL 632
>gi|301093651|ref|XP_002997671.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110061|gb|EEY68113.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 96.3 bits (238), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 101/387 (26%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 106 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 165
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 166 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 225
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
+ +L +V L +LQ LL+ G + +L++ Y+ + L
Sbjct: 226 TAQL-----NVVLYDCQFEYLQEKELLLTMPCGQVYVLSLPYEDTSSRGL---------- 270
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE----------ADA 461
G F+GSR GDS+L L + +EE D E A
Sbjct: 271 ---YGFGGKQTLFIGSRSGDSVLFVLD----KKKLVTATEEEPKDEEMPIKEVVIKQESA 323
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--------------------- 500
P K S A ++ + ++L LYG+A E A +
Sbjct: 324 PEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAP 378
Query: 501 ------FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV---------- 542
+ + +R D L +IG + G+ NAD S + ELV
Sbjct: 379 EQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGA 432
Query: 543 ----------------ELPGCKGIWTV 553
EL GC+ +WTV
Sbjct: 433 ISVLHNGLRPIVGTEAELNGCRAMWTV 459
>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum
NZE10]
Length = 1347
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 195/965 (20%), Positives = 360/965 (37%), Gaps = 175/965 (18%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G V +LA + D D+++LAF+DAK++++E+D H + S+H
Sbjct: 51 LVLVGEYSLSGTVTNLAQVKL--PDTKTAGDALLLAFKDAKLTLIEWDPENHRISTISIH 108
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+E + G ++ VDP RC + Q+ +L Q L +ED
Sbjct: 109 YYEGDNVVSQPFGPGLGECENILTVDPNWRCAALKFGTRQLAVLPFRQLDDELGVEEDGD 168
Query: 221 GSGGGFSAR-----------------IESSHVINLRDL--DMKHVKDFIFVHGYIEPVMV 261
+ + ++S V+ L L D+++ D F++GY E +
Sbjct: 169 AEPASTTLKRSESILQNVNGEVQQTPYKASFVLALSTLLEDIRYTVDLGFLYGYRESTLG 228
Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
IL + + + + + + LP+ +K++ +P+P+GG
Sbjct: 229 ILSSSLQPSSSLLDIRKDELEYRMFKLELEQGESTELQVVKQLPNSLWKVVPLPAPVGGA 288
Query: 322 LVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP-RSSFSVELDAAHATWL--QNDVA 377
L+VG N+ +H + ++A+N +A +L+S + + +S +++L+ L ++
Sbjct: 289 LLVGTNSFVHVDLNAKVNSVAVNEFA-ALESDRGMEDQSDLNLKLEGCSVEILDAESRQV 347
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLG 426
L+ + G L + GR +Q L DL KT PS + + ++ F+G
Sbjct: 348 LVVLRDGSLATIYFEQSGRSIQGLKVSRVREEHGGDLVKTAPSC----VARLDHNKVFVG 403
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
S G S LV+++ S LS K G + D E
Sbjct: 404 SEDGASSLVRWS--RSISTLSR--KRTHGQMLGQHGDEDDEEALEDDDDDLYDAAPETK- 458
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ATGISKQS--- 537
A++ T++ + SF ++D L ++GP+ D G A TG + S
Sbjct: 459 -KRATSTTDAFETPPSFQIQDVLHSLGPINDVCLGKSDGAQVDKLQMMLGTGRGRSSRIS 517
Query: 538 --NYELVELPG-------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
N ++V + K W V+ K + DD++H L+ + + +
Sbjct: 518 CLNRDIVPVSARKSTIGRAKSAWAVHAKRND-----------RDDDFHDNLLFAYDGQET 566
Query: 589 VLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFG 646
+ D + + + F +G TI L V+Q + R D ++Q +
Sbjct: 567 KIYDVDEVGYMERTAQEFEHEGETIDVQMLAKDTIVVQCRKSEIRTYDADLALSQIIPMV 626
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
++ E ++ +S DPY+L+ +D SI++L K
Sbjct: 627 DEETD-----EEYEIVYLSFCDPYLLVVRNDSSIQVL-----------------HVRGKE 664
Query: 707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVV-----CYESG 761
+ D + WL GG + G + V G
Sbjct: 665 IEPLEGEGDIAEKKWL---------------------GGSIHTGSLTKDVPALFLLSAQG 703
Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
+ +F +P+ V Y AL ++S + + G KE +
Sbjct: 704 TMHVFSLPSLEPV----------------YHAPALPHLPPVLSSDAPQRRA-GPKEALTE 746
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V EL ++ P+L A ++ Y+ + P + + +N
Sbjct: 747 LLVAELG----ASGVDTPYLVARTALDDLVLYEPFRHPEPAPSDQ-----------WYTN 791
Query: 882 VSASRLRNLRFSRTPL--DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
+ R R + + P +A +EE+ P + I ++ + + GS P ++
Sbjct: 792 L---RFRKVPVTYIPKYNEAIAQEESTRPLPLRSI----HVGDYDAVTIPGSPP--LLLV 842
Query: 940 RER------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
+E L V + +H +C GF V + G+L+ LP + Y W
Sbjct: 843 KEASSLPRVLEVRISNESNRVATLLPIHLDHCKKGFAAVNADGLLEEYHLPLSAWYGTGW 902
Query: 994 PVQKV 998
VQ+V
Sbjct: 903 SVQQV 907
>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum
CBS 127.97]
Length = 1267
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 133/579 (22%), Positives = 232/579 (40%), Gaps = 65/579 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVVKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVP 140
Query: 176 SFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------D 218
P + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 141 DLGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSD 199
Query: 219 TFGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 200 AQKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 259
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ S ++ + + + LP D +K++ +P P+GG L++G N +H
Sbjct: 260 TIDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVD 319
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILS 379
Query: 391 VVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 380 FKVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSR 435
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES--- 496
S T+ S K G E+ A D D + ++L AS E
Sbjct: 436 PSSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASIRQEKQVV 493
Query: 497 ---AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI--- 550
+ F+F D L ++GP +D + G + + S + +EL +G
Sbjct: 494 NGDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKS 553
Query: 551 --WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
TV + + DS +M DD Y + I ++ ++
Sbjct: 554 GGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKS 589
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 45/91 (49%), Gaps = 2/91 (2%)
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
C+ + ++ G++ F+SG PC+ ++ R H G V + + H C GF
Sbjct: 751 CKLLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 809
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVVF 1000
YV ++++ +LPS + +D+ W +K+
Sbjct: 810 AYVDEDNVIRMSRLPSNTRFDSGWATRKIAL 840
>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Pseudozyma antarctica T-34]
Length = 1543
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/406 (24%), Positives = 170/406 (41%), Gaps = 58/406 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
LV +++ IY V + + S T D +L + + L G V L
Sbjct: 46 QLVTARDDLLTIYDVYDRSSSQSAASTSNGTANGTAGDAKPRHTLIVTRRHSLFGTVTGL 105
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ +D R ++++F DAK+++LE++D+ L S+H +E L G
Sbjct: 106 QRVDTLASDKDARH-RLLVSFADAKLALLEWNDTTDDLETVSIHTYERAT--QLLNGTPP 162
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS--------- 227
R P + VDP RC +L+ + IL + + E F G GF
Sbjct: 163 LFR-PNLNVDPLSRCAALLLPHDALAILPFYRDNA-----EFDFDDGLGFDLANDALDAS 216
Query: 228 --------ARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
A +ES S V+ +R++D ++++KDF F+ G+ +P + +L + TW G
Sbjct: 217 DAAAMAAAAHMESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFDHSPTWTG 276
Query: 273 RVSWKHHTCMIS--ALSISTTL------------------KQHPLIWSAMNLPHDAYKLL 312
++ + + + L +S +L HP++ ++ LP+D +L
Sbjct: 277 LLTHRKDSFAVYLFTLDLSASLDGATLGSAAALLDDGNMRSAHPVVTTSSQLPYDCLYML 336
Query: 313 AVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV----ELDAAH 368
P +GGVLVV + I + QS + N S+ E P S V +L A+
Sbjct: 337 PCPQSLGGVLVVCMSAILHVDQSGRVVVTALNRWFKTTSAIE-PESVLDVPGLADLQASQ 395
Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+ + A+LS GDL L DGR V+ L + + SD
Sbjct: 396 LVFTTDTDAVLSLSNGDLYRLRCHMDGRSVEGFRLERIDQLTAGSD 441
>gi|301103688|ref|XP_002900930.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262101685|gb|EEY59737.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 613
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 159/383 (41%), Gaps = 93/383 (24%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 241 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 300
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 301 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 360
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
+ +L +V L +LQ LL+ +G + +L++ Y+ + L
Sbjct: 361 TVQL-----NVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSRGL---------- 405
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI------EADAPSTK 465
G F+GSR GDS+L + K+E I + AP K
Sbjct: 406 ---YGFGGKQTLFIGSRSGDSVLFVLDKKKLVTATEEEPKDEEMPIKEVVIKQESAPEIK 462
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------------------------- 500
S A ++ + ++L LYG+A E A +
Sbjct: 463 -----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAPEQDT 517
Query: 501 --FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV-------------- 542
+ + +R D L +IG + G+ NAD S + ELV
Sbjct: 518 GPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGAISVL 571
Query: 543 ------------ELPGCKGIWTV 553
EL GC+ +WTV
Sbjct: 572 HNGLRPIVGTEAELNGCRAMWTV 594
>gi|298715583|emb|CBJ28136.1| cleavage and polyadenylation specificity factor CG10110-PA
[Ectocarpus siliculosus]
Length = 1906
Score = 94.4 bits (233), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 77/303 (25%)
Query: 215 GDEDTFGSGGGFSARIESSHVINLR-------DLDMKHVKDFI----FVHGYIEPVMVIL 263
G+ED G G G +A+ + NL DL+ + FI F+ G+ EP + +L
Sbjct: 265 GEEDG-GLGNGATAKGDGGAGGNLAVSKPFTIDLEEAGITGFIKAAAFLEGFHEPALALL 323
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
+E T AGR++ K TC ++ LSI+ T + P+IW NLPHD++ L+ VPSPIGG+ V
Sbjct: 324 YEPIQTCAGRLASKRSTCRLALLSINLTQGRAPVIWQVENLPHDSWDLVPVPSPIGGLQV 383
Query: 324 VGANTIHYHSQS-ASCALALNNYA-VSLDSS-QELP------------------------ 356
+ N + + +QS LA+N YA ++D + E P
Sbjct: 384 ISTNAVMHVNQSEVRSILAVNGYARATVDPALLECPLRGGDSDWGWTSFRRSHPEREVVD 443
Query: 357 RSSFSV--ELDAAHATWLQNDVALLSTKTGD-----LVLLTVVY---------------- 393
SS+ V ELD +L LLS +TG+ L L TV
Sbjct: 444 LSSYDVCIELDVVRCAFLTPTSMLLSLRTGEVYALRLHLTTVTAAAADAAGCSRPPGGAA 503
Query: 394 ---DGRVVQR--LDLSKTNP-SVLT---------SDITTIGNSLFFLGSRLGDSLLVQFT 438
RVV + + + +P SVL + L F+GSR+GDSLLV ++
Sbjct: 504 FGTPNRVVGQSMRPVGRASPCSVLAVAASGGSGGDGGSGASKGLVFMGSRVGDSLLVDYS 563
Query: 439 CGS 441
S
Sbjct: 564 VAS 566
Score = 45.8 bits (107), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 49/221 (22%), Positives = 89/221 (40%), Gaps = 52/221 (23%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+ Y+ +H PTG+ + G +T + + +LVV
Sbjct: 7 YTCYRQLHPPTGVDHAVFGSVTAAGSR---------------------------DLVVAK 39
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLM------DGISAASLELVCHYRLHGNVESL 116
A+ +E+Y V + S + + R D S LEL + L GN+ +L
Sbjct: 40 ASTLELYRVHRDDHSSTAAAAAAAAARDTSNGDERDDDDASGYYLELAGTFPLAGNITAL 99
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A++ D ++++F AK++++ +D + L S+H F++ G ES
Sbjct: 100 AVIP----------DILVVSFGVAKMALVAYDSVLGRLETISIHNFDAGAIGPGAGGVES 149
Query: 177 -FARGPLVK--------VDPQGRCGGVLVYGLQMIILKASQ 208
+ +K DP GRC +V G Q+++L A +
Sbjct: 150 GYGLAAALKDRPRTISSSDPAGRCLAAVVAGCQLVVLPARR 190
>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 1450
Score = 94.0 bits (232), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 159/679 (23%), Positives = 266/679 (39%), Gaps = 112/679 (16%)
Query: 91 VLMDGISAASLELVCHYRLHGNVESLA------ILSQGGADNSRRRDSIILAFEDAKISV 144
V D S + L+ + L G V LA + GG S D +++AF+DAK+S+
Sbjct: 86 VRSDRASHTKIVLIAEFPLSGTVTGLARVKPPNVSKTGGG--SGVGDLLLIAFKDAKLSL 143
Query: 145 LEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGL 199
+ +D L S+H +E E W +F + DP RC +
Sbjct: 144 VAWDSERRSLETFSIHYYEQDELQGNPWECPLSDYANF-----LVADPGSRCAALKFGPR 198
Query: 200 QMIILKASQGGSGL-VGDEDTFGSGG------------GFSARIES-----SHVINLRDL 241
+ IL Q + +GD D G ++ IE S V+ L +L
Sbjct: 199 SLAILPFKQADEDIGMGDWDEALDGPRPAQSQSAAVAINGTSTIEDTPYSPSFVLRLPNL 258
Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
D + H F++ Y EP IL +T + + K H + ++ K I
Sbjct: 259 DPALLHPVHLAFLYEYREPTFGILSS-SITPSNCLDRKDH-LTYTVFTLDLQQKASTTIL 316
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
S LP D +++A+P+P+GG L+VGAN IH + +A+N + S S
Sbjct: 317 SVGGLPKDLTRVIALPAPVGGALLVGANELIHIDQSGKANGVAVNPFTKQCTSFGLADHS 376
Query: 359 SFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT 412
++ L+ L ++ L+ G L +T DGR V L + P ++L
Sbjct: 377 DLNLRLEGCTIEVLSAEHGELLVVLDDGRLATITFHIDGRTVSGLKVRIIPPEAGGNILP 436
Query: 413 SDITT---IGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ ++ IG + F GS GDS+++ + S S S +++ D++ D +
Sbjct: 437 TSVSCLSRIGRNAMFAGSERGDSIVIGWNRKSSQVSRKKSRVQDPDLDLDIDFDDLEDDE 496
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA 528
D D + ++ G AS ++ + F D L++I P++D +YG
Sbjct: 497 DDDDDLYGD--TEKTTTVAGLASG--QAKLEDLVFRCHDRLISIAPIRDMAYGKPPPPAE 552
Query: 529 SATGISK----QSNYELV--------------------------ELPGCKGIWTVY---- 554
TG QS +LV + P +G+WT+
Sbjct: 553 GETGSRNSTPIQSELQLVAVVGRDRASSLAIMNREMTPVSIGRFDFPEARGLWTLACQKP 612
Query: 555 -------HKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVTE 601
K ++ D YD +++++ E+ + + TA ++
Sbjct: 613 LPKVLQGEKGTKPVGGDFGVPVQYDK----FMVVAKEDDDNFESSNIYVLTAAGFEKLVG 668
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
+ G TI AG + ++IQV + R DG +TQ + P E + +T
Sbjct: 669 TEFEPAAGFTIEAGTMGNHTKIIQVLKSEVRCYDGDLGLTQII---PMLDEETNHEPRAT 725
Query: 661 VLSVSIADPYVLLGMSDGS 679
S SIADPY+L+ D S
Sbjct: 726 --SASIADPYLLIIRDDSS 742
Score = 46.2 bits (108), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 30/130 (23%), Positives = 57/130 (43%), Gaps = 7/130 (5%)
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-----GAPCQRITIFK--N 919
S+D ++ ++ S S LRF + P A + + AP +R+ + N
Sbjct: 871 SNDDLTIYEPFKIAESSQSLSGTLRFRKLPNPAVAKSQDTKVSDDAPAPMRRMPLRACGN 930
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
I+G+ FL G P + + + L + A + H C+ GFIY +G+ +
Sbjct: 931 IAGYSCVFLPGHSPSFLIKSSKSTPRVIGLQGPGVRAMSPFHTKGCDRGFIYADYEGVAR 990
Query: 980 ICQLPSGSTY 989
+ Q+P+ ++
Sbjct: 991 VAQIPNDCSF 1000
>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
Length = 1344
Score = 93.6 bits (231), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 105/502 (20%), Positives = 196/502 (39%), Gaps = 83/502 (16%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL- 116
LVV + +++++ + + SKE K L+LV ++LHG + L
Sbjct: 29 LVVGKSTLLQVFDIVQSNKKSKEYK------------------LKLVEQFKLHGLITDLK 70
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A+ + D D ++++ + AK+S++++D + + S+H +E+ E
Sbjct: 71 AVRTVENPD----LDYLLVSTKSAKMSLVKWDHHENSISTVSLHYYENSIQ---SSTYEK 123
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMII-------------LKASQGGSGLVGDEDTFGSG 223
L+ ++P C + L + + G SG G D +
Sbjct: 124 LTTTELI-MEPNNTCACLRFKNLLTFLPFEMPDEDDEEDGYENVDGASGSRGKHDNKATQ 182
Query: 224 GGFS-ARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
+ A SS VI+ ++LD + +V D F++ Y EP + I+ + TW G +
Sbjct: 183 QDENQALFYSSFVIDAQNLDSRIGNVIDMKFLYNYKEPTLAIISSKNHTWTGLLPLTKDN 242
Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSASCA 339
LS+ K + NLP D ++ +P P+ G L++G N IH +
Sbjct: 243 ISFIVLSLDLVTKTSTTVLKIDNLPFDIDTIVPLPKPLNGTLLIGCNEIIHVDHGGITRR 302
Query: 340 LALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGR 396
LA+N + S+ SS + R S +++L+ + ND L K GD + DG+
Sbjct: 303 LAVNQFTSSITSSIKNYRDQSELNLKLENCCVKPIPNDHRVFLILKNGDFYYINFAIDGK 362
Query: 397 VVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
++ L K N D+ + N+L F+ ++ G+S L++
Sbjct: 363 TIKNFYLEKVNSINQNEIGISYPEDVVHLDNNLMFICNKNGNSPLIELKF---------- 412
Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES----------AQK 499
+++ + + +QD NG ++
Sbjct: 413 ---------SESKDNQNAEQQKDTEMQDTENGTTDKNDNDDDDDIYEDDEDNEKVLIKNS 463
Query: 500 TFSFAVRDSLVNIGPLKDFSYG 521
F D L+N GP+ F++G
Sbjct: 464 VIEFTKHDELINNGPVSSFTFG 485
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/177 (20%), Positives = 75/177 (42%), Gaps = 27/177 (15%)
Query: 824 VVELAMQRWSAHHSRPFLFAILT-DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
+ + HS+ ILT G ++ Y+ + F+G N
Sbjct: 774 IKNIVFNELGDEHSKDEYLTILTIGGEVIIYKLF-FDG-------------------DNF 813
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVFRE 941
+ ++L+ + P +AY P G +R ++ N++G+ F++G P +
Sbjct: 814 KFIKEKDLKITGAPDNAY-----PLGTTLERRLVYVPNVNGYSSIFVTGIIPYFITKTVH 868
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ + V+F+ + N +GFIY+ + ++C++P Y+N WP++K+
Sbjct: 869 SVPRIFRFTKLPAVSFSSYSDSNIKNGFIYLDNSKNARMCEIPLDFNYENNWPIKKI 925
>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
Length = 1342
Score = 92.8 bits (229), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 103/499 (20%), Positives = 206/499 (41%), Gaps = 82/499 (16%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L+V A V++++ + E +++ K L+LV ++LHG + +
Sbjct: 29 LIVGKATVLQVFEIITTETKTQQYK------------------LKLVEQFKLHGLITDIK 70
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ +NS+ D ++++ + AK+S++++D ++ + S+H +E+ E
Sbjct: 71 AIRT--VENSQL-DYLLVSSKGAKMSLIKWDHHLNSISTVSLHYYENSIQ---SSTYEKL 124
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMII-----------------LKASQGGSGLVGDEDTF 220
LV V+P C + L + + S G +++
Sbjct: 125 TTTDLV-VEPNNNCTCLRFKNLLTFLPFETLDEEEEDDDDDEEMNGSSGSDKKATNKENG 183
Query: 221 GSGGG-FSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
S G S ESS +I+ R LD + + D F++ Y EP + I+ + WAG +
Sbjct: 184 NSNGEEVSELFESSFMIDGRTLDSRIGDIIDMQFLYNYREPTIAIIFSKAHAWAGNLPKV 243
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSA 336
LS+ K + NLP D K++ +P P+ G L++G N IH +
Sbjct: 244 KDNINFIVLSLDLVTKASTTVLKIDNLPFDIDKIIPLPQPLNGSLLMGCNEIIHVDNGGI 303
Query: 337 SCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVY 393
+ LALN + S+ +S + +S +++L+ + ND L+ GD +
Sbjct: 304 TRRLALNQFTSSITTSLKNYHDQSDLNLKLENCSVKPIPNDNKVLMILNNGDFYYINFKI 363
Query: 394 DGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
DG+ +++ D+ T P +I T+ N+L F+ ++ G++ L++ +
Sbjct: 364 DGKTIKKFFVEKVSDLNYDDIQLTYP----GEIATLDNNLMFISNKNGNNPLLELKYKNF 419
Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
++ +E +S+ L + ++L + + +
Sbjct: 420 EHVIVQENEE------------------NSNPLDNEDEEDDLYEEDEVNKKISINKSSIE 461
Query: 503 FAVRDSLVNIGPLKDFSYG 521
F D L+N GP+ +F+ G
Sbjct: 462 FIKHDELLNNGPISNFTLG 480
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/118 (22%), Positives = 53/118 (44%), Gaps = 4/118 (3%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
N + ++L + P +AY+ T +R+ F N++G F++G P +
Sbjct: 810 NFKLVKEKDLIITGAPDNAYSLGTTIE----RRLVYFPNVNGFTSIFVTGITPYYISKTT 865
Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+ + V+F + +G IY+ + +IC++P Y+N WP++K+
Sbjct: 866 HSVPRIFKFTKLPAVSFAPYSDDKIKNGLIYLDNSKNARICEIPVDFNYENNWPIKKI 923
>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
Length = 1388
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 105/444 (23%), Positives = 177/444 (39%), Gaps = 51/444 (11%)
Query: 93 MDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
+D L LV Y+L G V L + D +++A + K S++++D S +
Sbjct: 80 IDFSQNVKLSLVAEYKLDGLVTDLCKIR---TIEDSHHDYVLVATKGVKFSMIKWDQSSN 136
Query: 153 GLRITSMHCFESPEWLHLKRGRES-----FARGPLVKVDPQGRCGGVLVYGLQMIILKAS 207
+ S+H H K+ E+ F + DP C +L + + L
Sbjct: 137 SISTVSLH--------HYKKIVENSLIDKFNVDTKLIADPNNHCSCLLANEI-LFFLPFL 187
Query: 208 QGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
Q DE+ G ++ + DL ++K + D F+HGY EP + +L+
Sbjct: 188 QHEV----DEELDGKFVENKKLYSNTFLQFSNDLQPNIKTIIDIEFLHGYSEPTLAVLYT 243
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T G + T + S++ K I NLP+D ++L + SP+ G L++G
Sbjct: 244 SFPTCTGALPKAKDTVSLQVFSLNLQNKASTSIIEVNNLPYDTDRILPLSSPLNGCLLIG 303
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
AN IH +S + ++ N +A + + +S+ + L+ + ND +L T+ G
Sbjct: 304 ANQIIHLNSMGTAKGISCNLFAAKCSNFKLSDQSNLDLRLEKCVLGQVYNDKVILITEKG 363
Query: 385 DLVLLTVVYDGRV-----VQRLDLSKTNPSVLT--SDITTIGNSLFFLGSRLGDSLLVQF 437
+ G V +Q++ K VL+ + T I FF+G + DS+L
Sbjct: 364 AFYAFSFDIVGGVSSINEIQKIAAEKYQGLVLSLPTMFTNIDGKTFFIGCQGSDSVLF-- 421
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
G K D ++ + DAL E LY N
Sbjct: 422 -----------GSKARLNTQNVDVNGKSKV-ITEEDALY------EEDLYADDIQNVAQG 463
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYG 521
F DSL+NIGP+ +F+ G
Sbjct: 464 IDHIDFVKLDSLLNIGPITNFTTG 487
>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Sporisorium reilianum SRZ2]
Length = 1567
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 158/349 (45%), Gaps = 48/349 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + L G V L + Q A + RD ++++F+DAK+++LE++D L S+H
Sbjct: 92 LVLVRRHTLFGVVTGLQRV-QTLATDKDARDCLLVSFKDAKLALLEWNDLTDDLETVSIH 150
Query: 161 CFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-------KASQGGSG 212
+E +P+ L+ G + P++ VDP RC +L+ + +L
Sbjct: 151 TYERAPQLLN---GTPNLFH-PILNVDPLSRCAALLLPHDALAVLPFYRDAADFDFDLDD 206
Query: 213 LVGDEDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
+ + +A +E+ S V+ +R++D ++++KDF F+ G+ +P + +L
Sbjct: 207 RLDLAKDDAAAVAAAAEMETLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFS 266
Query: 266 RELTWAGRVSWKHHTCMI-------------------SALSISTTLKQHPLIWSAMNLPH 306
TW G ++ + T + AL T HP++ ++ LP+
Sbjct: 267 HTPTWTGLLAERKDTFSVYLFTLDLSASLDGTLSSAADALDDGTVRSAHPVVTTSTALPY 326
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSF 360
D +++ P +GGVLVV +++ + QS + ALN + A+ +S +LP
Sbjct: 327 DCLYMVSCPQTLGGVLVVCMSSVLHVDQSGRVVVTALNGWFKTISAIEPESVLDLPEIP- 385
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+L + + +L+ GDL DGR V+ L + + S
Sbjct: 386 --DLQGSQLVFTAETAGVLALVDGDLYRFRCQMDGRSVEGFRLERMDQS 432
>gi|402913617|ref|XP_003919276.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Papio anubis]
Length = 132
Score = 88.6 bits (218), Expect = 2e-14, Method: Composition-based stats.
Identities = 48/121 (39%), Positives = 72/121 (59%), Gaps = 12/121 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 18 LELAASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 73
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
FE PE L+ G P V+VDP GRC +LVYG ++++L ++ GLVG
Sbjct: 74 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVG 130
Query: 216 D 216
+
Sbjct: 131 E 131
>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
Length = 1597
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/399 (23%), Positives = 177/399 (44%), Gaps = 54/399 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRLHG 111
LV +V+ IY V Q S S+++ + S +L + ++ L G
Sbjct: 46 QLVTARDDVLTIYDVYGQPHASASTIPGISRHTATSSVSSNTSACSHKNLVISRNHTLFG 105
Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHL 170
V L + Q A + RD ++++F+DAK+++LE++D+I L S+H +E +P+ L+L
Sbjct: 106 AVTGLQRV-QTLASDKDNRDRLLVSFKDAKLALLEWNDAIDDLETISIHTYERAPQLLNL 164
Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-------DTFGSG 223
P++ VDP RC +L+ + IL + + D +
Sbjct: 165 A----PHLFHPILNVDPLSRCAALLLPHDSLAILPFYRDAADFDFDLDDHLEIAKDDVAA 220
Query: 224 GGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
+A ++S S V+ +R++D ++++K F F+ G+ +P + +L TW G +S
Sbjct: 221 VVAAADLQSLPYSPSFVLTMREVDPKIRNLKHFCFLPGFQKPTVAVLFSHNPTWTGLLSE 280
Query: 277 KHHTCMI--------------------SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+ T + AL T HP++ ++ LP+D ++A P
Sbjct: 281 RKDTFSVYLFTLDLSASLDGATFSSSAEALDDGTARSAHPVVTTSTPLPYDCLYMVACPQ 340
Query: 317 PIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSFSVELDAAHAT 370
+GGV+VV +++ + QS + ALN + A+ +S EL S +L +
Sbjct: 341 TLGGVIVVCMSSLLHVDQSGRVMVTALNQWFKTTSAIEPESILEL---SDIADLQGSQLV 397
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ +L+ G++ DGR V+ + L + S
Sbjct: 398 FTSKTQGVLTLVNGEIYRFRCQTDGRSVEGIRLERMQES 436
>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
Length = 1334
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/458 (21%), Positives = 181/458 (39%), Gaps = 64/458 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+LV ++L G V L L + + D II++ + AK S+++++ +H + S+H
Sbjct: 57 LKLVEQFKLQGTVTGLKPLR---TSENPQLDYIIVSTKYAKFSIIKWNHQLHSISTVSLH 113
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
+E+ E A L+ V+P L Y + L
Sbjct: 114 YYEN---CIQHSTFEKLAISDLI-VEPTYSSVSCLRYKNLLCFLPFEGVNDHDDDDDDDD 169
Query: 205 -----KASQG-GSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
+G + G + + G+ +SS +I+ L+ + V D F+H Y
Sbjct: 170 DDDDTDDEKGVAENVAGVDKSNGASNDNQPFYDSSFIIDAGTLESSVDSVLDLQFLHHYQ 229
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
E + IL + +WAG + +++ K +++ NLP+D +++ +
Sbjct: 230 ETTIAILSSKSNSWAGNLIKNKDNVQFQVMTLDIQSKSTLPVFTIDNLPYDIDRIIPLSK 289
Query: 317 PIGGVLVVGAN-TIHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATW 371
P+ G L++G N IH + + +A+N + S+ S Q+ + +E D +
Sbjct: 290 PLNGCLLLGCNEIIHVDNGGIAKRIAVNAFTSLITASVKSYQDESELNLKLE-DCSIVPI 348
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFF 424
++ LL TG+ L DG+ ++R+ L + ++ T+ N+L F
Sbjct: 349 PEDHRVLLILATGEFYFLNFELDGKSIKRIHLEAVEQKAYDAIKLTYSGEVATLDNNLLF 408
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+ GDS LV+ S + +E K D + GEE
Sbjct: 409 FANMNGDSPLVEIKYSSSAKV-----------VEKQVLDKKEEDSDEEDLYNEDEEGEEQ 457
Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
+ + F + DSL+N GP+ F+ GL
Sbjct: 458 KVMRKSH---------IEFKLHDSLINNGPVSSFTLGL 486
>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
Length = 1363
Score = 87.0 bits (214), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 110/220 (50%), Gaps = 15/220 (6%)
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
SS ++ LD K + D F+H Y +P + +L +++ TWAG + + S LS+
Sbjct: 224 SSFILEASALDNKIGDIIDLQFLHHYRQPTIAVLSQQKSTWAGLLPQTKDNVIFSVLSLD 283
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVS 348
+ + NLP+D K++A+PSP+ G L++G N IH + + +A+N Y
Sbjct: 284 MQTRLTTTVLQIENLPYDLEKIIALPSPLNGSLLIGCNELIHVDTGGITRRIAVNQYTED 343
Query: 349 LDSSQE--LPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVYDGRVVQRLDLSK 405
+ +S + ++S ++L+ + ND LL +TG++ + DG+ ++R+ + +
Sbjct: 344 ITASLKNYADQTSLDLKLEDCSILPIPNDNKLLMVLRTGEMYFIVFEVDGKTIKRMSVEE 403
Query: 406 TNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLVQF 437
PS S I ++ N+L FL R +S LV+
Sbjct: 404 I-PSETYSQIKLMDPSSFASLDNNLLFLTGRSSNSHLVEL 442
Score = 47.0 bits (110), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 57/124 (45%), Gaps = 16/124 (12%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVF 939
N + +L + P +AY+ HG +R I F ++SG ++G P M+
Sbjct: 829 NFQFVKQYDLPITGAPFNAYS-----HGTSIERRMIYFPDVSGTTCIMVTGVIPY--MIT 881
Query: 940 RERLRVHPQL-----CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
R R H Q+ IV+F +G IY+ ++ +I +LPS +YD WP
Sbjct: 882 RSR---HSQVKVFKFSKIPIVSFVPFSTDKIKNGLIYLDTKKNARIVELPSEFSYDYNWP 938
Query: 995 VQKV 998
++KV
Sbjct: 939 IRKV 942
>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Ustilago hordei]
Length = 1568
Score = 86.3 bits (212), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/358 (25%), Positives = 164/358 (45%), Gaps = 61/358 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L+ + L G V L + Q + + RD ++++F DAK+++LE++ + L S+H
Sbjct: 84 LVLIRKHSLFGTVTGLQRI-QTLSTSKDSRDRLLVSFTDAKLALLEWNHTTDDLETVSIH 142
Query: 161 CFE-SPEWL----HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS---- 211
+E +P+ L HL + P + +DP RC +L+ + IL + +
Sbjct: 143 TYERAPQLLNGIPHLFQ--------PNLNIDPLSRCAALLLPHDALAILPFYRDAAEFEF 194
Query: 212 --GLVGDEDTFGSGGGFSA-----RIES-----SHVINLRDLD--MKHVKDFIFVHGYIE 257
GL D + +G +A +IES S V+ +R++D ++++KDF F+ G+ +
Sbjct: 195 DHGLHLDLNLDFAGEDKAAMQAAVQIESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQK 254
Query: 258 PVMVILHERELTWAGRVSWKHHT----------------CMISALSIS----TTLKQHPL 297
P + +L T G ++ + M+ + S S T HP+
Sbjct: 255 PTVALLFAHSPTCTGLLAERKDNFSVYLFTLDLAASLDGAMLGSASYSFDDATLRSMHPV 314
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY-----AVSLDS 351
+ ++ +LP+D +L P +GGVLVV ++I + QS A ALN + A+ +S
Sbjct: 315 LTTSSSLPYDCLYMLPCPQTLGGVLVVCMSSILHVDQSGRVVATALNGWFNLVSAIQPES 374
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+LP + +L + + +L+ GD+ T DGR +Q L + S
Sbjct: 375 LLDLPEIA---DLQGSQLVFTAETEGVLTLVHGDVYTFTCQMDGRNIQGFRLERMQQS 429
>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 813
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/391 (25%), Positives = 156/391 (39%), Gaps = 64/391 (16%)
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD--- 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 10 SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKDLSG 69
Query: 716 ----KG----------------------PEPWLRKTSTDAWLSTGVGEAI------DGAD 743
KG EP ++ + L G A D A
Sbjct: 70 LFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMADLAK 129
Query: 744 GGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
D + VV +SG LEI+ +P+ V+ V+ +G + D
Sbjct: 130 QSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM 189
Query: 792 MREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTI 850
+ + E +S+ G Q ++ +S +EL++ + RP L + T +
Sbjct: 190 EFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVEL 245
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP 910
L YQ +F P+ K R L N+ + ++ D E+ P
Sbjct: 246 LIYQ--VFRYPKGHLKI-----RFRKLDQLNLLDQQPTHIELDEN--DEQEEIESYQMQP 296
Query: 911 --CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN +
Sbjct: 297 KYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPN 356
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
GF+Y + LKI LPS +YD+ WPV+KV
Sbjct: 357 GFLYFDTTYELKISVLPSYLSYDSVWPVRKV 387
>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
Length = 1307
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 127/651 (19%), Positives = 262/651 (40%), Gaps = 131/651 (20%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI-HGLRITSM 159
L L+ ++LHG + +A++ Q D ++++ AK+S++ FD S+ L S+
Sbjct: 47 LVLLHEFKLHGQITGMALVPQMEGP----LDCLVVSTGKAKLSLVRFDPSMPMCLETLSL 102
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG---LQMIILKASQGGSGLVGD 216
H +E+ ++ A+ +++DP+ RC VL++ L ++ L ++ D
Sbjct: 103 HYYEAE---FTRKNLIELAKTSKLRLDPERRC--VLLFNSDVLALLPLNINEEDE----D 153
Query: 217 EDTFGSGGGFSARIES---------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
++ + ++E+ S V+++ DL ++K+V D F++ + +P + +L++
Sbjct: 154 DNQEPTHQAKKRKVENGDARRLAKQSSVLHVSDLSAELKNVVDIQFLNSFSQPTLAVLYQ 213
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
L W+G M ++I+ K++ I+ LPHD + ++ + + ++VG
Sbjct: 214 PRLAWSGNDKVAGKGSM-RLMAITPHEKKNTTIYQVKELPHDVHTIIPLAN---SCVLVG 269
Query: 326 ANTIHY--HSQSASCALALNNYAVSLDSSQELPRSSFSVELD-----AAHATWLQNDVAL 378
N I ++ + + LN+++ S+++ SS V A+ ++ +
Sbjct: 270 VNEIVSVDNTGAIQSTIQLNSFSPKFTGSKQIDNSSLEVMFTEPIVWASAMVSKDREILI 329
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV--------LTSDITTIGNSL------FF 424
L D+ +T+ +GR++ L + P V L + I + + FF
Sbjct: 330 LMDHKADMYSITLQSEGRLLIDFTLVRL-PIVNDIFKDQNLPTCIVALSGGIRLKTCQFF 388
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+G GD+++V+ S+ L+ F +A DAL G++
Sbjct: 389 IGFSSGDAVVVK----------SNNLRSAFESQYREAIELPNDEDEDYDALY----GDDE 434
Query: 485 SLYGSASNNTESAQKTFSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYEL- 541
L ++N + + F + DSL+N+GP+ G + +A+ G+ + EL
Sbjct: 435 DLARPVNDNKATVETAVPFEIELMDSLINVGPITSICTGRVSSINATIEGLPNPNRNELA 494
Query: 542 ---------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
++ IW + + + + A D
Sbjct: 495 IVSTSGHDSGTYLNVMEPSVRPLVQQALKFTSVTKIWNLKIRKKDKYLVTTDSGAEKSDV 554
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
Y + A+ ++ VT T+ L G +R++QV + +
Sbjct: 555 YE------IGAKIASIKPKHFKRNVT----------TVEIAILGGGKRIVQVTTKAVYLF 598
Query: 635 DGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ + MT F V+ VSI DP++LL S G I++
Sbjct: 599 NLGFKKLMTISFDF--------------EVVHVSILDPFILLTNSKGEIKI 635
>gi|154421858|ref|XP_001583942.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
gi|121918186|gb|EAY22956.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
Length = 1297
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 49/368 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + G + + GG DSII+ + +K+ VL+ D+ L+ T H
Sbjct: 48 LRLVWEKKFWGEIFGVYRHKSGG-----EYDSIIVGCDTSKVIVLQVIDN--DLKETEYH 100
Query: 161 CFESPEWLHLKRGRES--------FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
F P + ++ DP G C +L+ + +L +
Sbjct: 101 EFNRPGPPEPDPPKPERPFDISTRLRNKTIMDADPTGTCLALLLAQNILYVLPLANK--- 157
Query: 213 LVGDEDTFGSGGGFSAR---IESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHE 265
+ E T +G + + I+ + ++ D K ++D +F+ GY P + I+HE
Sbjct: 158 -IKIESTEKAGDEYHSSWKVIKDAFAYDVHT-DFKSPLYRIRDMVFLDGYKNPTLAIIHE 215
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLI---------WSAMNLPHDAYKLLAVPS 316
TW+ R+ + T +S +S K+ LI W++ LPH+++ L+ VP
Sbjct: 216 LIPTWSVRLPLQKSTVAVSIVSPPLKKKETVLISASIDKVTMWTSRALPHNSFGLVHVPD 275
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYA-----VSLDSSQELPRSSFSVELDAAHATW 371
PIGG LV+ N I Y + ALALN A V +D + P EL + T
Sbjct: 276 PIGGFLVLSKNAIIYMDHTNIVALALNKLAYLDDEVPVDITANGPGCH---ELYSKVGTA 332
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVV-----QRLDLSKTNPSVLTSDITTIGNSLFFLG 426
+ LL+ L +LT+ Y+G V + +PS S T SL F+G
Sbjct: 333 IDKSHILLTVDQHYLSILTLHYNGVKVTNLSLNVNLNLEFHPSCFLSLNYTNNRSLVFMG 392
Query: 427 SRLGDSLL 434
S DS L
Sbjct: 393 STTHDSTL 400
>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC
6260]
Length = 1320
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 127/611 (20%), Positives = 237/611 (38%), Gaps = 68/611 (11%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L+ ++L+G V +L + +S D I++A + AK+S++ +D H + S+H
Sbjct: 52 LRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESLH 108
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-----G 215
+E E+ L+ V+P + + + L S G
Sbjct: 109 YYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDEG 164
Query: 216 DEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ GF + + SS +++ + L+ + + D F+H Y EP + IL + TW G
Sbjct: 165 ETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSSKAATWTG 224
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHY 331
+ ++I K + NLP D +L+ + P+ G L++G N IH
Sbjct: 225 LLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIHV 284
Query: 332 HSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVL 388
+ LA+N Y + +S + ++ ++ L+ L ND LLS TG L
Sbjct: 285 DNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLSTGSLFH 344
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
+ D + ++R L + +S D+T G F DS L+ +G S L
Sbjct: 345 INFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL- 397
Query: 448 SGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
+E + + + S+ +DM EEL +A Q +
Sbjct: 398 ---------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFKH 448
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT-------- 552
D L+N GP+ DF+ G A I+ N + G C I+
Sbjct: 449 DELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPIIR 508
Query: 553 ---VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+ + +R N ++ + DD I +E L++ D + +
Sbjct: 509 SSLTFSQVNRMWNINNKYLITSDDVNSKSEIFQIEKSYSRLKSKDFIND----------E 558
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TIA L + ++Q+ + + + + + +SF E + ++S ++ D
Sbjct: 559 MTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHDD 614
Query: 670 YVLLGMSDGSI 680
Y+++ + G +
Sbjct: 615 YLMIFFASGEV 625
>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici
IPO323]
gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici
IPO323]
Length = 1333
Score = 83.2 bits (204), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 133/320 (41%), Gaps = 32/320 (10%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ + L L+ Y L G V S+A + D ++I+LAF++AK+S++E+D H +
Sbjct: 45 AQSKLVLIGGYPLAGTVTSIARVKT--LDTRTGGEAILLAFKNAKLSLIEWDPENHRIST 102
Query: 157 TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL------------ 204
S+H +E + G ++ VDP RC + Q+ IL
Sbjct: 103 VSIHYYEGENVIAQPYGPSLGEYESILTVDPGSRCAALKFGARQLAILPFRQFGDELLGE 162
Query: 205 ------KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
A+ G + D G + S V+ L LD + H D F+H Y
Sbjct: 163 EEGEFENANDGTTSKKHDAMQNGEDEAEQTPYKQSFVLPLTTLDPALSHTIDLAFLHEYR 222
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
EP I+ + + ++ K + + NLP +K++ +PS
Sbjct: 223 EPTFGIISSAIEPSYALFDERKDILSYTVFTLDLEQKASTNLITVPNLPSTLWKVVPLPS 282
Query: 317 PIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND 375
PIGG L++G N IH + A A+N +A+ +S +++L+
Sbjct: 283 PIGGALLIGTNEFIHVDQSGKANATAVNEFAMKESDFGMADQSGLNLKLEGC-------S 335
Query: 376 VALLSTKTGDLVLLTVVYDG 395
V +L+ TG+ +L V+ DG
Sbjct: 336 VEILNASTGE--MLVVLRDG 353
>gi|159155577|gb|AAI54419.1| Cpsf1 protein [Danio rerio]
Length = 400
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 51/240 (21%)
Query: 483 ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINADAS---- 529
E+ +YGS A + T+ A T+SF V DS++NIGP S G + N +
Sbjct: 54 EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVV 111
Query: 530 -ATGISKQSNYELV------------ELPGCKGIWTVYHKSSR---------GHNADSSR 567
+G K ++ ELPGC +WTV + + G + + +
Sbjct: 112 VCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEK 171
Query: 568 MAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
D + H +LI+S E TM+L+T + E+ S + QG T+ AGN+ + +I
Sbjct: 172 REPTIEDDKKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYII 230
Query: 625 QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
QV G R+L+G L F P + S ++ S+ADPYV++ ++G + + V
Sbjct: 231 QVSPMGIRLLEG---VNQLHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 280
>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC
6260]
Length = 1320
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 127/612 (20%), Positives = 237/612 (38%), Gaps = 68/612 (11%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
L L+ ++L+G V +L + +S D I++A + AK+S++ +D H + S+
Sbjct: 51 KLRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESL 107
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV----- 214
H +E E+ L+ V+P + + + L S
Sbjct: 108 HYYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDE 163
Query: 215 GDEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
G+ GF + + SS +++ + L+ + + D F+H Y EP + IL + TW
Sbjct: 164 GETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSLKAATWT 223
Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIH 330
G + ++I K + NLP D +L+ + P+ G L++G N IH
Sbjct: 224 GLLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIH 283
Query: 331 YHSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
+ LA+N Y + +S + ++ ++ L+ L ND LLS TG L
Sbjct: 284 VDNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLLTGSLF 343
Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+ D + ++R L + +S D+T G F DS L+ +G S L
Sbjct: 344 HINFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL 397
Query: 447 SSGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
+E + + + S+ +DM EEL +A Q +
Sbjct: 398 ----------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFK 447
Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT------- 552
D L+N GP+ DF+ G A I+ N + G C I+
Sbjct: 448 HDELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPII 507
Query: 553 ----VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+ + +R N ++ + DD I +E L++ D + +
Sbjct: 508 RSSLTFSQVNRMWNINNKYLITSDDVNLKSEIFQIEKSYSRLKSKDFIND---------- 557
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
TIA L + ++Q+ + + + + + +SF E + ++S ++ D
Sbjct: 558 EMTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHD 613
Query: 669 PYVLLGMSDGSI 680
Y+++ + G +
Sbjct: 614 DYLMIFFASGEV 625
>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1627
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 52/103 (50%), Gaps = 16/103 (15%)
Query: 912 QRITIFKNISGHQGFFLSG-SRPCWCMVFRERLRVHPQ---------------LCDGSIV 955
+RI F NI +G F+SG S P W + R+HP I
Sbjct: 1018 RRIIPFSNIGNKRGIFVSGVSTPIWIFSEKNFPRIHPMKQQQQTTSSSSSSSSSSKRPIT 1077
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
FT HN+NC HGFIY G+L IC+LP G+ Y+N WP++K+
Sbjct: 1078 TFTTFHNINCKHGFIYFDHTGMLCICRLPDGTNYENEWPIRKL 1120
>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
Length = 1092
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 128/601 (21%), Positives = 236/601 (39%), Gaps = 127/601 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+++ ++ L+ V ++G + ++ + + GA+ R+S+ + E K ++E+D
Sbjct: 37 RLVIYTLTPEGLQPVLDTGIYGRIAAIELYTVAGAE----RESLYILTERLKFCIVEYDS 92
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY-GLQMIIL 204
S L +M + +S R GP+ +DP+ R G L+Y GL +I
Sbjct: 93 STGELITKAMGDVQ-----------DSVGRPVDGGPIAHIDPERRMIGFLLYDGLFKVIP 141
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
++ G F+ R+E V++++ F++GY +P +V+L
Sbjct: 142 IDTRNGQ----------LREAFNIRLEELQVLDVQ-----------FLYGYAQPTIVLL- 179
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGV 321
++ M + +++ I WS + A ++ VP+PIGG
Sbjct: 180 -----------YQDPKEMRHLKTYQVSIRDKDFIAGPWSQTGVEIGATMIIPVPTPIGGC 228
Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
+++G TI Y + + + +D + + R+ ++ D LL
Sbjct: 229 ILLGEQTISYLNGDKG-----DTKTIHMDMT--VIRAWGKIDEDGRR--------YLLGD 273
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
G L +L + +DG V L L + IT + + + F+GS GDS L++
Sbjct: 274 HLGQLYVLVLEFDGNKVLGLKLDTLGETSSAKTITYLDSGVVFIGSCFGDSQLIRL---- 329
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
K S+ + L+ N + + + +
Sbjct: 330 --------------------HPDKDENDSNIEVLESFTNLGPIQDFCVVDLERQGQGQVV 369
Query: 502 SFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH 561
+ + G LKD S LR+ + GI++Q+ VELPG KG+W++
Sbjct: 370 TCS--------GTLKDGS--LRVVRN--GIGINEQAA---VELPGIKGLWSLRE------ 408
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
+ D +Y YLI S T VLE AD TE + +TI N+ G
Sbjct: 409 --------SIDAQYDKYLIQSFVNETRVLEIADEELSETEIDGFDHNAQTIFCSNVLG-D 459
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSE--SGSGSENSTVLSVSIADPYVLLGMSDGS 679
++Q+ E R++ + P N E + +G V+ S + L +S+G
Sbjct: 460 CLLQITEVSLRLVSTKSKQLLKEWFPPNGERITVAGGNVQQVVLTSGKRTLIYLDVSNGD 519
Query: 680 I 680
+
Sbjct: 520 V 520
>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 1349
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 144/680 (21%), Positives = 266/680 (39%), Gaps = 93/680 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV ++++++ G K + N G ++ VL V Y L G V S+
Sbjct: 28 NLVVAKTSLLQVF-------GVKAAGNDGGNEKLVL-----------VGEYSLAGTVTSI 69
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + D ++++L+F+DAK+S++E+D + + S+H +E + G
Sbjct: 70 ARVKT--LDTKSGGEAVLLSFKDAKLSLVEWDPENYRISTISLHFYEGDNVISAPFGPPL 127
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQ---------------GGSGLVGDEDT-- 219
++ VDP RC + Q+ IL Q L + T
Sbjct: 128 ADCDSILTVDPSSRCAALKFGARQLAILPFRQFGDELAGEEEEGEFDADHALATSKRTES 187
Query: 220 --FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+G ++S + L LD + H F+H Y EP IL +
Sbjct: 188 VPHANGDTEHTPYKASFTLALTALDPSVSHAVHLAFLHEYREPTFGILSATVEPSYSLLE 247
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
+ + L++ + + S LP ++++ +P P+GG L++G N + + QS
Sbjct: 248 ERKDILTYTVLTLDLEQRASTNLISVPKLPSTLWEVVPLPLPVGGALLLGTNELVHVDQS 307
Query: 336 ASC-ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVV 392
A A+N +A +S +++L+ L + L+ T G L +L+
Sbjct: 308 GKANATAVNEFAKLESDFGMADQSHLNLKLEDCRVEVLDSKTGELLIVTNDGSLAILSFQ 367
Query: 393 YDGRVVQRLDLSKTNPSVLTSDITT-------IGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
GR + L++ + ++ I T + S F+GS G S L+ ++ TS
Sbjct: 368 MHGRSISALNVKRATSENGSTTIHTAPSCMARLEGSKIFIGSEDGASSLLGWS--RPTSA 425
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
L+ K + + D E S + T +AQ TFS +
Sbjct: 426 LNR--KRSHAQMLDKEADDEDEEMEEDDDDLYDAAPEPKKRASSETAVTSTAQYTFS--I 481
Query: 506 RDSLVNIGPLKDFSYG--------LRINADASATGISKQS--NYELV-------ELPGCK 548
D L++ GP+ + G L I A A S+ + + ++V +L +
Sbjct: 482 IDELLSTGPIHEVCLGRSGPWKDRLEIAAGAGRKQASRLTLMHRDIVPTVRRKCKLGAAR 541
Query: 549 GIWTVYHKSSRGHNADSSRMA-AYD-DEYHAYLIISL--EARTMVLETADLLTEVTESVD 604
W + K + + +D D+ Y I S + + +A E++D
Sbjct: 542 ATWALRPKQRNAALPEYDNLLFVFDGDDTKVYDIPSQDEDGSSYTERSAPEFESAGETLD 601
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARI-LDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
T+A G + + R ++ A++ LD P E E+ +++
Sbjct: 602 M----ATVADGTIVVQTRRTELRTYNAKLGLD--------QIIPMTDE--ETDEDLSIVH 647
Query: 664 VSIADPYVLLGMSDGSIRLL 683
++++DPYVL+ D S+++L
Sbjct: 648 IAVSDPYVLVIRGDNSVQVL 667
>gi|224135035|ref|XP_002321967.1| predicted protein [Populus trichocarpa]
gi|222868963|gb|EEF06094.1| predicted protein [Populus trichocarpa]
Length = 60
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/48 (75%), Positives = 41/48 (85%)
Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
+ DPSTC VSV TP+A +SSKK VS+CTLYHDKGPEP LRKTS +AWL
Sbjct: 1 MTDPSTCMVSVNTPSAFQSSKKSVSACTLYHDKGPEPLLRKTSPNAWL 48
>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
Length = 1331
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 140/656 (21%), Positives = 271/656 (41%), Gaps = 124/656 (18%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L ++ G + LA++ Q + D ++L AKISV+ +D++ + + S+H
Sbjct: 48 LILTHEFKFEGRITDLAVVPQKDSP----LDCLLLCTSIAKISVVRYDEASNSIETLSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
+E R A+ ++VDP RC L++ +I L Q S
Sbjct: 104 YYEDS---FKDRSILELAKESTMRVDPGKRCA--LLFNNDVIALLPLQTTSLNDGEEEDE 158
Query: 212 ---GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
D+ + G +A S + N ++L DM +V D F+ + P + ++ E
Sbjct: 159 DMDDERPDKRQKNNKGRITA---PSAIFNAKELHQDMNNVIDVTFLRNFTRPTLAVIFEN 215
Query: 267 ELTWAGR-------VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+ WAG V++ T +++ ST +K +I + L D + ++ + +
Sbjct: 216 KPVWAGTSQVLPLPVTYMAFTLEVTSNEQSTDIKS-TVIATVKELSWDFHTMIPIAN--- 271
Query: 320 GVLVVGANTIHY--HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHA-TWLQND 375
G ++VG+N + Y ++ S + LN+YA ++ ++ + RS + L W +D
Sbjct: 272 GCIIVGSNEMAYIDNTGSLQSIIFLNSYANKNMKKARIVDRSKSKILLHKPTTYNWSVSD 331
Query: 376 VALLSTKTGDLVLLT----------VVYDGRVVQRLDL-----------SKTNPSVLTSD 414
++TG+ +L+ + Y+GR++ + D+ + +N + ++
Sbjct: 332 ---QKSETGETLLIMDHQAAFYYIQLEYEGRLLTKFDIINLPIVNDTLKNNSNATCISRL 388
Query: 415 ITTI-GNSL-FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+T+ GN + F+G R GD+ +++ + L + ++ E +P + +
Sbjct: 389 NSTLSGNYVDLFVGFRSGDASVLRL------NNLKAAIESRDEHKEITSPPENDIEKFED 442
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADAS 529
+D + EE S N E +T F V SL NI P+ + G + D
Sbjct: 443 ---EDDLYSEEASDADKEKENKEVVVETVLPFDIEVLSSLRNIAPITSLTPGKICSVDKF 499
Query: 530 ATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-EARTM 588
G+S + E V L G T G + +M+ + A IS+ + +
Sbjct: 500 VEGLSNPNRNE-VSLVATSGNGT-------GSHLTEIQMSVRPEVQLALKFISITQMWNL 551
Query: 589 VLETADLLTEVTES---------VD----YFVQGR-----TIAAGNLFGR-RRVIQVFER 629
++ D T+S +D + +GR T + ++FG +R++QV
Sbjct: 552 KIKNKDKYLITTDSNKNKSDIYLIDKNFALYKEGRFRRDATTVSISMFGSDKRIVQVTTN 611
Query: 630 GARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ D ++ T F V+ VS+ DPY+L+ +S G I++
Sbjct: 612 HLYLYDTNFKRLTTMKFEF--------------EVVHVSVMDPYILITVSRGDIKV 653
>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 1351
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 29/318 (9%)
Query: 222 SGGGFSAR--IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
+G F R +SS +I+ LD + V D F+H Y EP + +L + WAG +
Sbjct: 198 NGNSFEPRQFYDSSFIIDATTLDSTVGTVIDMQFLHNYREPTIGVLSSKSEVWAGNLLKS 257
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
L++ K ++ NLP++ +++ +PSP+ GV++VG N IH +
Sbjct: 258 KDNIQFQVLTLDLNSKSTVSVFKIDNLPYEIDRVIPLPSPLNGVILVGCNELIHVDNGGV 317
Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTV 391
+A+N + S+ S Q+ +S +++L+ + + ND LL KTG+ +
Sbjct: 318 MKRIAVNKFTGLTTASIKSFQD--QSDLNLKLEDSTIVPIPNDHRVLLVLKTGEFYYINF 375
Query: 392 VYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
DG+ ++R+ + + + ++ + +L F + G+S LVQ S
Sbjct: 376 ELDGKSIKRVHIDVIDKKLYEKVKLTYPGEVAVLDKNLLFFANSSGNSPLVQVKYRDSLS 435
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
G E D E + ++ E+ +L ++ F
Sbjct: 436 DAKIGAPIEESDEEDETQKADEDDDEDDLYKEEEEEEEQKNL----------SKTHIEFV 485
Query: 505 VRDSLVNIGPLKDFSYGL 522
D L+N GP F+ G+
Sbjct: 486 YHDELINNGPSSSFTLGV 503
>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
Length = 1420
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/236 (23%), Positives = 110/236 (46%), Gaps = 17/236 (7%)
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+EDT G+ +SS +I+ LD + V D F+H Y EP + +L ++ WAG
Sbjct: 197 EEDTNGTNKESHLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGN 256
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ L++ K ++ NLP++ +++ +PSP+ G L+VG N IH
Sbjct: 257 LIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIVPLPSPLNGTLLVGCNELIHVD 316
Query: 333 SQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
+ +A+N + S+ S Q+ +S +++L+ + +D LL +TG+
Sbjct: 317 NGGVLKRIAVNKFTRLITASIKSFQD--QSDLNLKLENCSIVPIPDDHRVLLILQTGEFY 374
Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ DG+ ++R+ + + ++ + ++ F+ + G+S L+Q
Sbjct: 375 FINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAVLDKNMLFIANSNGNSPLIQ 430
>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
Length = 1420
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 104/221 (47%), Gaps = 17/221 (7%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I+ LD + V D F+H Y EP + +L ++ WAG + L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY-- 345
LK ++ NLP++ +++ +PSP+ G L+VG N IH + +A+N +
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336
Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
S S Q+ +S +++L+ + +D LL +TG+ + DG+ ++R+
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394
Query: 403 LSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ + ++ + ++ F+ + G+S L+Q
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 435
>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
Length = 1300
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 136/639 (21%), Positives = 257/639 (40%), Gaps = 111/639 (17%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
A L L ++L G + + +L Q G S + IL+ +K+S++ FD L
Sbjct: 45 AQKLVLAYEWKLAGKIIDMQLLPQIG---SPLKMLAILS-SKSKVSLVRFDPVAESLETL 100
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD 216
S+H + ++++L S ++ VDP RC +LV+ ++ IL + D
Sbjct: 101 SLHYYHD-KFVNL--STSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEID 155
Query: 217 EDTFGSGGGFSARIESSHVINLRDLDM---------KHVKDFIFVHGYIEPVMVILHERE 267
ED G + R++ + I + M KHV D +++ + +P + IL++
Sbjct: 156 EDENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPV 215
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
L W G +T LS+ ++ +I +LP+D + L VP G VL +G N
Sbjct: 216 LAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTL--VPLKRGYVL-IGVN 272
Query: 328 TIHYHSQSA---SCALALNNYAVSLDSSQELPRSSFSVELDAA----HATWLQNDVALLS 380
+ Y S S SC + LN +A S +++ S ++ L + + ++D+ +L
Sbjct: 273 ELLYISASGALQSC-IRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILI 331
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL-----GD---- 431
+ + + +G ++ + D + I N + F SRL GD
Sbjct: 332 DENCRMYNIITESEGNLLTKFDCVQ----------VPIVNEI-FKNSRLPLSVCGDLNLE 380
Query: 432 --SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
+L+ F G + LK F + ++L + D E +LYG
Sbjct: 381 TGRVLIGFLSGDAMFLQLKNLKVAFA-------AKRQLVETVDDDDD-----EYSALYGE 428
Query: 490 ASNNTES----AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
+ NNT + Q+ F ++ DS+ NIGPL + G + + + + + E +
Sbjct: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEF-SIV 487
Query: 546 GCKGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
G+ T H + + H + + + ++ + ++ + L T D E +
Sbjct: 488 ATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWN----LKIKGKDKYLVTTDADKEKS 543
Query: 601 E------------SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDL 643
+ + D+ RTI + +R++QV G + D + +T D+
Sbjct: 544 DVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI 603
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
++ I DPY+L + G+I++
Sbjct: 604 E----------------IVHACIIDPYILFTDARGNIKI 626
>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
Length = 1137
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 109/467 (23%), Positives = 182/467 (38%), Gaps = 108/467 (23%)
Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
R+S R + +DP+GR G+ +Y ++ G L +DTF
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---KDTF--------- 154
Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
N+R LD V D F+HGY +P + +L+E H L
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYTKPTICVLYED-------YKAARHIKTYHILLKE 199
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
+ P WS N+ A L+ VP+P+GGVL+V TI YH+ S A+ + + + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPVGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ + S F LL+ + G L ++ + + G+ V + L +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVTGVHLEVLGET 300
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
+ S ++ + N + F+GS GDS L++ ++E G
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDENG-------------- 336
Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S + L VN + + + + + T S A +D G L+ G+ IN
Sbjct: 337 SYIEVLDTYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
ASA ELPG KG+W + AA D+Y +S E R
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKYLLQSYVS-EIRI 427
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + D + E + + F +T+ N++G +QV E R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDVW-LQVTESEVRLI 471
>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
Length = 836
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 83/326 (25%), Positives = 141/326 (43%), Gaps = 50/326 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+S+L +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSILRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTNPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ GF ++S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIPL------------------GFD---KTSYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL---KQHPLIWSAMNL 304
I + GY P + LH + TW GR V+ T I LS+ + KQ + +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIVLSLEPDVIHEKQDIVATVSYQF 228
Query: 305 PHDAYKLLAVPS--PIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF 360
+ Y L + P G+L++ N+I Y S ++ S L + V + + P SSF
Sbjct: 229 SYVPYNALQIVDCYPTNGILILTINSIIYLSTTSFESFILPFGKFFV-IPKNNNRPLSSF 287
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSVL 411
+ T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 288 QI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYKY 344
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 345 HPTTFTISSNHLFIGSTVHDSYIYNY 370
>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
Length = 1349
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 117/507 (23%), Positives = 199/507 (39%), Gaps = 75/507 (14%)
Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S V+ L LD + H F F+H Y EP + I+ H T + ++
Sbjct: 197 SFVLALPQLDPEILHPVHFAFLHEYREPTLGIISSSNRRLKMEPQMDHFTFKV--FTVDL 254
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
K I + NLP K++A+ P+GG L++G N IH + +A+N YA +
Sbjct: 255 LQKASTAILTVSNLPQSLKKVVALSKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKM 314
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL---- 403
+S + L+ + D LL T+ G++ ++T DGR V + +
Sbjct: 315 TKFPLADQSELKLRLEHCEVELMSPDNGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVA 374
Query: 404 SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
++ +L + +T + + F G+ GDS ++ + S + ++ K D +
Sbjct: 375 TENGGDILPFRAACLTKVSKNSMFYGTIGGDSQVIGW---SRQHVQTARKKARLLD---E 428
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ D D + GE ++ + F V DSL+++ P+ D +Y
Sbjct: 429 SLDYDLDEDELDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMAY 488
Query: 521 G----------------LRINAD-ASATGISKQSNYELV------------ELPGCKGIW 551
G +R D A G + + L+ E P +G W
Sbjct: 489 GKTAFFPGSEEAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFEFPEARGFW 548
Query: 552 TV-----YHKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTE 601
T KS +G + +A +D +Y ++I++ + ET+D+
Sbjct: 549 TTRVQKTIAKSLQGEKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGA 605
Query: 602 SVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
+ G TI AG + R+IQV R DG ++Q L + E+G
Sbjct: 606 GFEALSGTEFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETG 663
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSI 680
+ V+S SI DPY+LL D SI
Sbjct: 664 A---EPRVISASIVDPYLLLLREDSSI 687
Score = 42.7 bits (99), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 30/138 (21%)
Query: 57 NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
NL+V+ ++++I+ V+ + +K S +GET R + D G+ +A
Sbjct: 28 NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSSSKAGETYDRRINDDDGLESAFLGGDGML 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
L LV Y +HG + LA + +SR +++++ A++S+L++D
Sbjct: 88 MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144
Query: 150 SIHGLRITSMHCFESPEW 167
HG+ S+H +E EW
Sbjct: 145 EKHGVEDVSIHFYEKEEW 162
>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae
DBVPG#7215]
gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae
DBVPG#7215]
Length = 1318
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 135/646 (20%), Positives = 266/646 (41%), Gaps = 105/646 (16%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ L L ++L G+V S+A++ Q G++ +++ K+S+L+FD L
Sbjct: 44 AKGQLVLSYEWKLSGHVHSMALIPQPGSE----LYCLVILTGCGKLSILKFDHMSQSLDT 99
Query: 157 TSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGV----LVYGLQMIILKASQGGS 211
S+H +E + L L + P + VD RC V + L + + K +
Sbjct: 100 LSLHYYEDKFKELSLLE----ISNTPSLIVDRSFRCLLVRNNDCIAILPLNVTKEEEEEE 155
Query: 212 GLVGDEDTFGSGGGFSAR------------IESSHVINLRDL--DMKHVKDFIFVHGYIE 257
++ +GG FS + + SS ++ L D+K+V D F+HG+ +
Sbjct: 156 EDNEKDEDRSNGGRFSFKRHKLNGGSVKQFVNSSTIMPASHLHSDIKNVLDVQFLHGFNK 215
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + IL++ L W+G + T + LS+ ++ +I LP+D + L+ + +
Sbjct: 216 PTLAILYQPILAWSGNEKLRSQTVKVIILSLDFEDEKSTVINIIQGLPNDLHTLIPLSN- 274
Query: 318 IGGVLVVGANTIHYHSQSASC--ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ-- 373
+VVG N + Y + + ++LN+++ ++ +++ SS + +
Sbjct: 275 --ASIVVGVNELIYIDNTGALQGTVSLNSFSKTVLNTKVKDNSSLQAFFNRPVCQYTTIS 332
Query: 374 --NDVALLSTKTGDLVLLTVVYDGRVVQ-----RL----DLSKTN--PSVLTSDITTIGN 420
D+ LL + + + + +GR+V RL D+ K N P+ + D+
Sbjct: 333 KGKDIMLLMDEKSQMYNVIIESEGRLVTAFNCVRLPIVNDIFKNNHLPTCICGDVDLETG 392
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
+L F+G + GD++ V+ +S+ S G E +EAD +
Sbjct: 393 NL-FIGFKSGDAMRVRLN-NLRSSLASKGNVVE--TMEADEDYDE--------------- 433
Query: 481 GEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
LYG ++ + T F D+L+NIGPL + G + + + ++ +
Sbjct: 434 -----LYGGSTEVEKKNMDTETPFDIETLDNLINIGPLTSLAVGKVSSIEPTIAKLTNPN 488
Query: 538 NYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD------------DEYHAYLIISLEA 585
EL + G T H + + + A + YL+ + +
Sbjct: 489 RCEL-SIVATSGNSTGSHLTVFENTIVPTVEKALKFISVTQIWNLKIKDKDKYLVTTDSS 547
Query: 586 RTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQ 641
++ + + D + +S D+ T++ +R++QV +G + D ++ MT
Sbjct: 548 QSKSDIYSIDRDFKPFKSFDFKKNDTTVSTAVTGAGKRIVQVTSKGVYLFDINFKRMMTM 607
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
+ F V+ V I DP++LL S G I++ +P
Sbjct: 608 NFDF--------------EVVHVCINDPFLLLTNSKGDIKIYELEP 639
>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1423
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 110/238 (46%), Gaps = 19/238 (7%)
Query: 216 DEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
+ED G+ R+ +SS +I+ LD + V D F+H Y EP + +L ++ WA
Sbjct: 203 EEDKNGTTTNQEPRLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWA 262
Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
G + L++ K ++ NLP++ +++ +PSP+ G L+VG N IH
Sbjct: 263 GNLIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIH 322
Query: 331 YHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGD 385
+ +A+N + S S Q+ +S +++L+ + +D LL +TG+
Sbjct: 323 VDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGE 380
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ DG+ ++R+ + + ++ + ++ F+ + G+S L+Q
Sbjct: 381 FYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 438
>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
[Cyanidioschyzon merolae strain 10D]
Length = 1880
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 112/499 (22%), Positives = 194/499 (38%), Gaps = 128/499 (25%)
Query: 243 MKHVK--DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI----STTLKQHP 296
+ HV+ D F+ G P MV+L+E TWAGRV ++C ++A+ + + + P
Sbjct: 345 LGHVRILDCCFLTGTALPTMVMLYEERPTWAGRVEAVSNSCALAAIVLPPLPAGAAGEEP 404
Query: 297 LI-WSAMNLPHDAYKLLAVPS------PIGGVLVVGANTIHYHSQSASCALAL--NNYA- 346
L+ W LP DA K++ +PS G+L++ AN + + + +L N++
Sbjct: 405 LVAWRIQGLPFDAEKVVPLPSVEWDRAAEQGLLLIAANVLFWIRGNGQIGASLSGNHFGD 464
Query: 347 --VSLDSSQELP---------------RSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
+ LD Q LP R+S + A ++ L G++ L
Sbjct: 465 TFMELDGCQ-LPGALYGGTDSDIISRCRTSQVLHFRGACIAPVRLHRYGLFLADGNVYQL 523
Query: 390 TVVYDGRVVQRLDL------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ D RL+ S+ P+ L D + L F+ + LG S+L + T
Sbjct: 524 ALHADAEYPLRLEALRVRGESRLAPAPL--DAKLLSRDLLFVAAHLGSSVLYRMT----- 576
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
P +R R S+++ G+ N + + +
Sbjct: 577 ---------------QVHPHGRRTRTSAAE-------------NGTLHKNATTKEAQWEL 608
Query: 504 AVRDSLVNIGPLKDF---------SYGLRINADA--SATGISKQS------------NYE 540
RD++ +GP+ D G ++ +ATG QS ++
Sbjct: 609 QQRDTIFQLGPIVDLVVIPPRYSPPAGTLLDPGEILAATGHQHQSCLARCTYQVQTREWQ 668
Query: 541 LVELPGCKGIWTVY--HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
+ GC+ +W++Y H + H + A+ + + L+ R + AD T
Sbjct: 669 RIPSAGCRRVWSLYADHDGTGMHQEEQ----AFLLLSLSKSSVILDIRRGFEQAAD--TR 722
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQDL---SFGPSNSES 652
V + TIAAGNL RR + QV G R+LD + +D+ + P + S
Sbjct: 723 V------LLPSPTIAAGNLAQRRLIAQVHRTGIRLLDANLDVVYEEDMLLAALEPGTAVS 776
Query: 653 GSGSENSTVLSVSIADPYV 671
G+ S+ DPY+
Sbjct: 777 GA----------SVVDPYI 785
>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
Length = 1150
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 110/467 (23%), Positives = 183/467 (39%), Gaps = 108/467 (23%)
Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
R+S R + +DP+GR G+ +Y ++ G L DTF
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---RDTF--------- 154
Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
N+R LD V D F+HGY +P + +L+E + A V H L
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYNKPTICVLYE-DYKAARHVKTYH------ILLKE 199
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
+ P WS N+ A L+ VP+P GGVL+V TI YH+ S A+ + + + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPTGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ + S F LL+ + G L ++ + + G+ V + L +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVSGVHLEVLGET 300
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
+ S ++ + N + F+GS GDS L++ ++E G
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDETG-------------- 336
Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S + L VN + + + + + T S A +D G L+ G+ IN
Sbjct: 337 SYIEVLDSYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
ASA ELPG KG+W + AA D++ +S E R
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKFLLQSYVS-EVRI 427
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + D + E + + F +T+ N++G +QV E R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDYW-LQVTESEVRLI 471
>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
NRRL Y-27907]
Length = 1348
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 100/500 (20%), Positives = 196/500 (39%), Gaps = 77/500 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L+V N+++I+ + ++ S +K L+++ ++L+G + L
Sbjct: 29 LIVAKGNLLQIFEPVLIKQQSTPTK--------------PKYKLQIIGQFKLNGLITDLH 74
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR--E 175
L +N D +I++ + AK S+++++ +H + S+H +E H R E
Sbjct: 75 PLRT--VENPHL-DYLIVSTKYAKFSIIKWNHHLHTISTVSLHYYE-----HAIRNSTFE 126
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE----------------DT 219
L+ V+P L + + L + D+ D
Sbjct: 127 KLGISELI-VEPTFNSCSCLRFKNLLCFLPFAVSDEEEEEDDEEDMDLDNKKEKKEKLDI 185
Query: 220 FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G + +SS +I+ + LD ++ V D F+H Y EP + IL + WAG +
Sbjct: 186 NGKPADAVSFYDSSFIIDAQTLDSSIETVVDIQFMHNYREPTIAILSSKSNVWAGNLLKV 245
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSA 336
+++ K ++ NLP++ +++ +PSP+ G L++G N I H +
Sbjct: 246 KDNVSFQVMTLDLVSKSTVSVFKIDNLPYEIDRIIPLPSPLNGCLLLGCNEIFHVDNGGI 305
Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
+A+N++ S S Q+ S S+E D + L+ TG +
Sbjct: 306 IKRIAVNSFTSLVTASTKSYQDQTDLSLSLE-DCCIIPIPGDHRVLMVLTTGQFFYINFE 364
Query: 393 YDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
DG+ ++++ + + ++ + ++ + ++L F + G+S LVQF
Sbjct: 365 LDGKAIKKVHIDTVDQALYSQIKLCYPGEVAVLDHNLLFFANENGNSPLVQF-------- 416
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ----KTF 501
+ D+ D + + +E LY N E Q
Sbjct: 417 -------RYTDV--DQKRITQEAAKEEKKEEKDDEEDEDDLYMDEENEEEQKQIISNSPI 467
Query: 502 SFAVRDSLVNIGPLKDFSYG 521
F D L+N GP+ F+ G
Sbjct: 468 EFIHHDELINNGPISSFTLG 487
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 33/164 (20%), Positives = 67/164 (40%), Gaps = 26/164 (15%)
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
H +L + G ++ Y+ Y F+G N + ++LR +
Sbjct: 783 HKEEYLTILTIGGEVIMYKLY-FDG-------------------ENYIFKKEKDLRITGA 822
Query: 896 PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
P +AY P G +R + F N++G+ F++G P M + Q
Sbjct: 823 PENAY-----PLGTTIERRLVYFPNLNGYTSIFVTGIIPYLIMKPMHSIPRIFQFSKIPA 877
Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
++ + + +G I++ + +IC+L TY+ WP++++
Sbjct: 878 LSISAFSDSKIKNGLIFLDNSKNARICELSLDFTYEFNWPMRQI 921
>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1120
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 127/549 (23%), Positives = 224/549 (40%), Gaps = 113/549 (20%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
+ A L+ V ++G + ++++ G D R + L E +VL +D++ L+
Sbjct: 58 LHAEGLKPVLDVPINGRIATMSLCQTGSGDGKAR---LYLTTERYGFTVLSYDEANEELK 114
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLV 214
+ + GR + G + VD R G+ +Y GL +I +GG
Sbjct: 115 TEAFGDVQD------NIGRPA-DDGQIGIVDDTCRAIGLRLYDGLFKVIPCDEKGG---- 163
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
++ + I L +L V+D F+HG +P + +L+ R+ A V
Sbjct: 164 ---------------VKEAFNIRLEEL---RVEDIKFLHGTPKPTIAVLY-RDTKDA--V 202
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
K + I ++ P W+ +L + K++ VP+PIGGV+V+G I Y
Sbjct: 203 HIKTYEIGIREKEFVSS----P--WAQNDLEGGSNKIIPVPAPIGGVVVLGQEIIVY--- 253
Query: 335 SASCALALNNYAVSLD---SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
LN + D + +P + A LL G L LL +
Sbjct: 254 -------LNKFEDDADVFLKAINIPNIPDRTNITCYGAIDPDGSRYLLGDADGMLYLLVI 306
Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
++DG+ V+ L + + + + S ++ + N + F+GS GDS L++ L
Sbjct: 307 LHDGKRVRELKIERLGDTSIASTLSYLDNGVVFVGSTYGDSQLIK-------------LH 353
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGE--ELSLYGSASNNTESAQKTFSFAVRDSL 509
E I+ D T L +V+ +L +G T S
Sbjct: 354 AEKTSIDKDGNPTYVQILEEFTNLGPIVDFAFVDLERHGQGQVVTCS------------- 400
Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMA 569
G LKD S LR+ + GI +Q+ +++LPG KG++++ ++D S+M
Sbjct: 401 ---GALKDGS--LRVVRN--GIGIDEQA---VIQLPGVKGLFSL-------RDSDDSQM- 442
Query: 570 AYDDEYHAYLIISLEARTMVL----ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
YL+++ T +L + D L E TE + + +T+ GN+ G +Q
Sbjct: 443 ------DKYLVVTFINETRILGFVGDEGDTLDE-TEIAGFDAEAQTLCCGNMQG-NVFLQ 494
Query: 626 VFERGARIL 634
V RG R++
Sbjct: 495 VTHRGVRLV 503
>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
90-125]
gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
Length = 1327
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/375 (22%), Positives = 157/375 (41%), Gaps = 57/375 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+LV ++L G V L L + D ++++ + AK S++ ++ +H + S+H
Sbjct: 57 LKLVEQFKLQGTVSGLKALRTSECPH---LDYVVVSTKYAKFSIIRWNHQLHNISTVSLH 113
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
+E+ E A L V+P L Y + L
Sbjct: 114 YYEN---CIQHSTFEKLAISDLT-VEPTYSSVSCLRYKNLLCFLPFEGVHEEDDEDDTDD 169
Query: 205 ----KASQGGS----GLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHG 254
+GGS GL + F ++S +I+ LD + V D F+H
Sbjct: 170 EDIDNDKKGGSITKNGLSYENQPF---------YDASFIIDAGILDSTIDTVLDVQFLHN 220
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y EP + IL + +WAG + +++ K +++ NLP+D +++ +
Sbjct: 221 YQEPTIAILSAKSNSWAGNLIKNKDNVQFQVMTLDVQSKSTLPVFNIDNLPYDIDRVIPL 280
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHA 369
P+P+ G L++G N IH + + +A+N + S+ S Q+ S +++L+
Sbjct: 281 PNPLNGCLLIGCNELIHVDNGGIAKRIAVNAFTSLITASVKSYQD--ESDLNLKLENCAI 338
Query: 370 TWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNS 421
+ +D LL TG+ L DG+ ++++ L + + S + ++ +
Sbjct: 339 VPIPDDHRVLLILATGEFYYLNFDLDGKSIKKIHLELVDQKMYDSIRLTYPGQVASLDKN 398
Query: 422 LFFLGSRLGDSLLVQ 436
L F + GDS LV+
Sbjct: 399 LLFFANLNGDSSLVE 413
>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
Length = 1327
Score = 73.2 bits (178), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 130/638 (20%), Positives = 269/638 (42%), Gaps = 86/638 (13%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
+A L L ++ HG + LA++ Q + D ++L AK+S+++FD + +
Sbjct: 45 SAKLFLTNEFKFHGKITDLALIPQVNSS----LDCLLLCTSIAKVSIVKFDPLSNSIETA 100
Query: 158 SMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG-LQMIILKASQGGSGLV 214
S+H +E + L+ ++S+ R +DP RC +L L ++ +A+
Sbjct: 101 SLHYYEDKFRDLSLLEIAQQSYFR-----LDPSKRCAIILNNDVLALLPFRAA------T 149
Query: 215 GDEDTFGSGGGFSARIES--------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILH 264
D++ + R+++ S + ++L ++++V D F++ + +P + IL
Sbjct: 150 DDDEEADAENNDVKRMKTSSDKVTYPSKIFVAKELHSEIRNVIDVQFLNNFSKPTIAILF 209
Query: 265 ERELTWAG--RVSWKHHTCMISALSISTTLKQHPL----IWSAMNLPHDAYKLLAVPSPI 318
E L WAG +++ + + MI L IS+T I L D + L+ + +
Sbjct: 210 EPTLIWAGNRQLNPQPISYMIFTLEISSTDNTTKFGATTIGKLTGLSWDFHSLVPISN-- 267
Query: 319 GGVLVVGANTIHYHSQSASC--ALALNNYA-VSLDSSQELPRSSFSVELDAAHA-TW--- 371
G ++VGAN + + S + + LN+++ +L + + S + + L + A W
Sbjct: 268 -GCMIVGANELAFADNSGALQSVILLNSFSDRNLRQGRIIDNSKYEILLPQSIARCWSPP 326
Query: 372 ----LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFF 424
+ ++ LL ++ + + +GR++ + D+ K N ++ + T + L
Sbjct: 327 TSDKVNDETLLLMDANSNVYYVQLESEGRLLIKFDIIKLPIVNDTLKNNQGCTCMSRLNS 386
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-SSDALQDMVNGEE 483
S LL+ F G + + LK + ++ + S D +D + +E
Sbjct: 387 RSSNNNMDLLMGFKSGDALVVRLNNLKSAAESRDEHKIFSEAMESSFDKDEDEDNLYSDE 446
Query: 484 LSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGI--SKQSN 538
S G A +N E +T F + ++ NIGP+ + G + + G+ ++
Sbjct: 447 ASDAGKADDNKEVIVETVTPFDIELLSTIKNIGPITSLAVGKVCSVEKYVKGLLNPNRNE 506
Query: 539 YELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL- 595
Y +V G T S R + + + ++ + ++ R L T D
Sbjct: 507 YSMVATSGNGSGSHLTEIQGSVRPTVEVALKFISVTQIWN----LKIKNRDKYLVTTDSN 562
Query: 596 -----LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
+ E+ + +GR T+ G +R++QV + D ++ + L+
Sbjct: 563 KAKSDIYEIDNNFALHKEGRFRRDATTVCISMFGGDKRIVQVTTNNLILYDTNF--RRLT 620
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ E V+ VS+ DPY+L+ +S G I++
Sbjct: 621 TMKFDYE---------VVHVSVMDPYILITVSRGDIKI 649
>gi|430810873|emb|CCJ31593.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 301
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 128/292 (43%), Gaps = 41/292 (14%)
Query: 245 HVKDFIFV-HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
H+ D F+ + Y EP + IL+ T G + ++ T +A I++
Sbjct: 6 HIVDLWFIFYDYREPTLAILYSAFQTSTGLLPYRQDTMTSTA------------IYTVDK 53
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFSV 362
LP+D + +L +P+PIGG L++G N + Y Q+A A+++N++A + ++
Sbjct: 54 LPYDLFSVLPLPNPIGGTLLIGNNELVYVDQAARVKAVSVNSFARKCTHLDFIEDYDLNL 113
Query: 363 ELDAAHATWL-----QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV----LTS 413
L+ A +L Q LL + G V + DGRVV L + + SV L S
Sbjct: 114 RLNGAVGVYLELLDDQPGAVLLVIEDGRFVQVGFKLDGRVVSSLSVKILDQSVKNDFLKS 173
Query: 414 D---ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ I + N F+GS++ +S+L+++ S + E + +
Sbjct: 174 EASCIVLLNNEQLFIGSKVSNSVLLEWKRQSEIA-------------EKLLSEPRVIFDE 220
Query: 471 SSDALQDMVNGEELSLYGSASN-NTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ L D+ GE+ + ++S F + D+L + GP+ D + G
Sbjct: 221 DREVLNDLY-GEDFDIVDTSSILQRNGVFGDIQFRLFDTLYSCGPIVDMTIG 271
>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
Length = 1148
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 114/511 (22%), Positives = 192/511 (37%), Gaps = 115/511 (22%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I L + + VL +D ++ + + + R E G +DP G
Sbjct: 74 QDWIFLVTQRFQFCVLAYDTTLQQIITKANGSLRDT----IGRNSEILTNG---NIDPDG 126
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDF 249
R G+ +Y ++ L F+ R++ LR LD+K
Sbjct: 127 RLIGMNIYEGYFKVIPIDNHSKSL---------KAAFNIRLD-----ELRILDIK----- 167
Query: 250 IFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAY 309
F++GY +P + +L+E H L + P WS N+ A
Sbjct: 168 -FLYGYNKPTICVLYED-------FKAARHVKTYFILLKEKDFAEGP--WSQSNVEAGAN 217
Query: 310 KLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
L+ VP P GGVL++ TI YH+ + A+ + N + + + S F
Sbjct: 218 LLIPVPMPYGGVLIISNQTIVYHNGTYFHAIPMQNTMIQVYGAVGDDGSRF--------- 268
Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
LL+ + G L ++ + +G+ V + L + + S ++ + N + F+GS
Sbjct: 269 --------LLADQYGALHVVALQTEGKEVLDVYLEVLGQTSIASCVSYLDNGVVFVGSTF 320
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
GDS LV+ ++E G S + L VN + +
Sbjct: 321 GDSQLVKLNSK----------RDESG--------------SYIEVLDSYVNIGPIIDFCV 356
Query: 490 ASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
+ + + T S A +D G L+ G+ IN ASA ELPG
Sbjct: 357 MDLDRQGQGQIVTCSGADKD-----GSLRVIRNGIGINEQASA------------ELPGI 399
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDY 605
KG+W + + EY YL+ S E R M + +D + EV ++
Sbjct: 400 KGMWALRESLA--------------SEYDKYLVQSYLNEIRIMTIGDSDEMEEV--EIEA 443
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
F+ +T+ N+ +QV E RI+D
Sbjct: 444 FLDAKTLYCRNV-NEDGWLQVTETEVRIIDA 473
>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
Length = 836
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTVNSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
Length = 1388
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 148/725 (20%), Positives = 286/725 (39%), Gaps = 145/725 (20%)
Query: 58 LVVTAANVIEIYVVR-VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
L+V N++ IY + + S S + ET + A L L+ ++L+G V+ +
Sbjct: 29 LLVIRTNILSIYHLETILSPRSNTSSSQLETIEDATVTTSKQAKLFLINEFKLNGKVQDI 88
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + G NS + I+L+ AK+S+L FD SI+ S+H +E S
Sbjct: 89 ASIPLG---NSSSLECILLSTGTAKLSILNFDPSINSFETLSLHYYEEK---FKDISLVS 142
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL--------------KASQGGSGLVGDEDTFGS 222
A+ +++DP RC +L++ ++ L + ++ + + S
Sbjct: 143 LAKKSQLRMDPLNRC--LLMFNNDVMALLPLHSNNEDEEEEEEDENEEDEVLDNYEANLS 200
Query: 223 GGGFSARIE--------SSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ RI+ S + N+ L D+K++ D F++ + +P + +L++ LTWAG
Sbjct: 201 KTSPNKRIKYNNNQFEGKSKIFNINKLHEDVKNISDIQFLNNFNKPTIAVLYQPTLTWAG 260
Query: 273 RVSWKHHTC--MISALSI----STTLKQHP-----------LIWSAMNLPHDAYKLLAVP 315
V MI L I ST H +I L D +K++ +
Sbjct: 261 NVQLNPLPTHFMIFTLDILSENSTNNANHTTENNNNDLNLIIIAKLKELAWDWFKIIPIS 320
Query: 316 SPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQELPRSSFSVELD------- 365
+ G +V+G N I Y + + LN++A +L ++ + S F + +
Sbjct: 321 N---GCVVIGNNEIAYIDNTGVLQSIILLNSFADKNLKKTRIIDESKFQIFFNENVTHVW 377
Query: 366 ----AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-----------SKTNPSV 410
+ + T ++ LL +L + + +GR++ + D+ NP+
Sbjct: 378 SPSTSKNKTTEDDETLLLMDAQSNLYYVRLEAEGRLLTKFDIINLPIVNDVLRENCNPTC 437
Query: 411 LTSDITTIGNSL--FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
++ + NS F+G GDSL+V+ + LK + + S + +
Sbjct: 438 ISRLDSNATNSTMDLFIGFLSGDSLVVRL----------NNLKSAIDTRDEHSESNEHTQ 487
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQ------------KTFSFAVRDSLVNIGPLK 516
+ D +E +LY + E A+ + F SL NIGP+
Sbjct: 488 LNGFDE------EDEDNLYSDDEVDVEDARSKRDMETIIHTVQPFDIEYLTSLKNIGPIT 541
Query: 517 DFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH-NADSSRMAAYDDEY 575
+ G + D + G+ + E I T S+ H N + ++
Sbjct: 542 SLTVGKVSSLDLNVKGLQNPNKNEF-------SIVTTSGNSTGSHLNVIQQTVQPIVEKA 594
Query: 576 HAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR-----TIAAGNLF 618
++ ++ ++ + L T D + ++ + +GR T +F
Sbjct: 595 LKFISVTQIWNLKIKNKDKYLVTTDSTKSKSDIYDIDNNFSLHKEGRLRRDATTVYIAMF 654
Query: 619 GR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
G +RV+Q+ + D ++ + L+ + E V+ VS+ DPY+L+ +S
Sbjct: 655 GDGKRVVQITTNHLYLFDTNF--RRLTAIKFDFE---------VVHVSVMDPYILITVSR 703
Query: 678 GSIRL 682
G I++
Sbjct: 704 GDIKI 708
>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
histolytica HM-1:IMSS]
gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica HM-1:IMSS]
Length = 1150
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
>gi|68471462|ref|XP_720279.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
gi|46442139|gb|EAL01431.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
Length = 423
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 93/199 (46%), Gaps = 15/199 (7%)
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP + +L ++ WAG + L++ LK ++ NLP++ +
Sbjct: 3 FLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTLDLNLKSTISVFKIDNLPYEIDR 62
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELD 365
++ +PSP+ G L+VG N IH + +A+N + S S Q+ +S +++L+
Sbjct: 63 VIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLE 120
Query: 366 AAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITT 417
+ +D LL +TG+ + DG+ ++R+ + + ++
Sbjct: 121 NCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAI 180
Query: 418 IGNSLFFLGSRLGDSLLVQ 436
+ ++ F+ + G+S L+Q
Sbjct: 181 LDKNMLFIANSNGNSPLIQ 199
>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1380
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/328 (21%), Positives = 139/328 (42%), Gaps = 60/328 (18%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I +LD + + D F+H Y +P + +L R +WAG + + +S+
Sbjct: 223 DSSFIIEAGNLDSSIDTIIDLQFLHNYRDPTIALLSSRSHSWAGSLLKSKDNVHLEVMSL 282
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNY-- 345
K I+ NLP++ +++ + +P+ G L+VG N I H + + +++N++
Sbjct: 283 DLLTKLSTSIFKIENLPYEVDRIVPLSAPLNGCLLVGCNEIMHVDNGGIAKRISVNDFTS 342
Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
S+ S+Q+ +S+ ++L+ + +D L+ T+ G DG+ ++R+
Sbjct: 343 LTTASVKSNQD--QSNLGLKLENCSVVQIPDDHRVLIVTEQGSFYFANFELDGKSIKRVF 400
Query: 403 LSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
+ + ++ +I + +L F+ + GDS LVQ +K
Sbjct: 401 IDVVDKNMYDKIKFTFPGEIAVLSKNLLFMSNLNGDSPLVQ-------------VKYRNS 447
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA----------------QK 499
I D T+R+ + G E + +SN + QK
Sbjct: 448 KILEDTRGTRRVEKGK---------GAEKNKNNVSSNEVDDDDDDDDDLYKEEEEEEQQK 498
Query: 500 TFS-----FAVRDSLVNIGPLKDFSYGL 522
S F ++D L+N P+ F+ GL
Sbjct: 499 VLSKSHIEFILQDRLINNSPISTFTLGL 526
>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1090
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 126/557 (22%), Positives = 220/557 (39%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A+ L+ + ++G + +L + G +D + ++FE K VL++D
Sbjct: 39 RIEIHLLTASGLQPMLDVPIYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
GL +T S + GR + G + VDP R G+ +Y GL +I ++
Sbjct: 95 ET-GLLVTRAMGDVSD-----RIGRPT-DNGQIGIVDPDCRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCAKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P W NL + A L+ VP P+GG +++G T
Sbjct: 186 D-------ARHVKTYEVQLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y++ S A+ + + ++ V+ D + LLS G L L
Sbjct: 237 IVYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + +D V L++ + S ++ + N + F+GS GDS L++
Sbjct: 280 LVISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ +ADA + S + L+ VN G + L Q T S A +
Sbjct: 329 -------NHQADA------KNSYVEVLESYVNLGPIVDLCVVDLERQGQGQVVTCSGAFK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++ SS
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSLRASSS-------- 410
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
D Y +L++S E R + + T D L E TE + + +T+ N +++
Sbjct: 411 ------DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCYNAV-HDQLV 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D Q
Sbjct: 463 QVTAGSLRLVDAKTRRQ 479
>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica KU27]
Length = 836
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
C-169]
Length = 1101
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 125/551 (22%), Positives = 206/551 (37%), Gaps = 113/551 (20%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ V ++G V ++ + G +D + L+ E K VLE+D
Sbjct: 44 RIEIHTLTPEGLKGVADVAIYGRVATMELFRPVG----ESKDLLFLSTERYKFCVLEYDS 99
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
L + E + GR G + VDP G +MI L G
Sbjct: 100 ETGELVTRANGDIED------QVGRPC-DNGQIGIVDP----------GCRMIGLHLYDG 142
Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
++ +D F+ RI+ +VI D IF+ G +P + +L++
Sbjct: 143 LFKVIPIDDKGQLHEAFNMRIDELNVI-----------DMIFLEGCAKPTIAVLYQDN-- 189
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
H + L + P W NL A +++AVP P+GG LVVG + I
Sbjct: 190 -----KDARHIKTYEVVLKEKDLTEGP--WRQSNLDAGASRVIAVPEPLGGALVVGESVI 242
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVL 388
Y Q Q + + + AH ++ LL G+L L
Sbjct: 243 AYMGQ-----------------GQAMKCTPIKATIIRAHGRVDEDGSRYLLGDYVGNLYL 285
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + +DG V L + + S +T + N + F+GS GDS LV+
Sbjct: 286 LVLQHDGEHVAGLKVEPLGRTSAPSTLTYLDNGVVFVGSSGGDSQLVRL----------- 334
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDS 508
P T + + + L+ M N + + + + +
Sbjct: 335 ----------HPTPVTPQEPSNFVEVLETMTNLGPIIDFVVVDLERQGQGQVVMCS---- 380
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
G + D S LRI + G+ +Q+ VELPG KG+W + +S M
Sbjct: 381 ----GIMADGS--LRIVRN--GIGMIEQAT---VELPGIKGMWALR----------ASHM 419
Query: 569 AAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
A+D +L+IS E R + + D L E E + +T+ GN ++QV
Sbjct: 420 DAFD----TFLVISFVGETRILAINADDELDE-AELPGFSADAQTLCCGNTVS-DHLVQV 473
Query: 627 FERGARILDGS 637
R++D S
Sbjct: 474 AGADVRLVDAS 484
>gi|385304555|gb|EIF48567.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
[Dekkera bruxellensis AWRI1499]
Length = 353
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 129/293 (44%), Gaps = 39/293 (13%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERE-LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
+K++ D+ F++ Y EP + IL+ E L+WAG + + LS++ + I
Sbjct: 69 VKNIMDYQFLYSYREPTIAILYAPEGLSWAGYLXKLKDNMKVVVLSLNLDTHKADSIMVL 128
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSF 360
NLP+D + +PSPI G L++G+N I H +S + + N Y + S
Sbjct: 129 PNLPYDLNSIYPLPSPINGFLLIGSNEILHVNSLGSIKGVYTNKYFPETSDMKLRDESDL 188
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG------RVVQRLDLSKTNPSVLTSD 414
++E + +++ +D LL ++ G +L+ G ++++ + + N SV ++
Sbjct: 189 NLECEGCSVSFVGDDQVLLISQIGKFYVLSFNESGGISNLNKIIEIPEANYCNVSV--NN 246
Query: 415 ITTIGN----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ I N + FL + DS+L+ + + P+ + +S
Sbjct: 247 VLQITNIEDCNSAFLCCQGSDSILLHWN--------------------YNVPTRGTVSKS 286
Query: 471 SSDALQDMVNGEELSLY--GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
++ ++ E+ LY S + + +F D LVN GP DF+ G
Sbjct: 287 NAGIEKE---DEDSWLYHEDETSQTSNRPLTSCTFTXIDKLVNCGPTSDFTIG 336
>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
Length = 1089
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 123/557 (22%), Positives = 217/557 (38%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++A L+ + ++G + +L + G +D + ++ E K VL++D
Sbjct: 39 RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + VDP+ R G+ +Y GL +I ++
Sbjct: 95 ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WSQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y+S SA A+ + + ++ V+ D + LLS TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + S ++ + N + ++GS GDS L++
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNA--------- 330
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ D+ R S + L+ VN G + L Q T S A +
Sbjct: 331 ---------QVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
A D + +L++S E R + + D L E TE + + +T+ N ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D + Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479
>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
Length = 1065
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 90/387 (23%), Positives = 149/387 (38%), Gaps = 49/387 (12%)
Query: 349 LDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
+D + P F + LD T + + LL + G L L +V D V+ L+L
Sbjct: 1 MDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLVTDATNSVKSLELK 60
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+V+ +T+ F+GSRL DS+ + T ++ AP
Sbjct: 61 FQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------------LDESAPKK 107
Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+L + +A +D E+ LYG +SA++ + V D L+N+GP K + G
Sbjct: 108 IKLN-TELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDKLLNVGPCKKITGG 162
Query: 522 LRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSSRMAAY-------- 571
+ K ++LV G G ++ +S R SS +
Sbjct: 163 CPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSSSIEGVVQYWAVGR 222
Query: 572 -DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
+D+ H Y I S E T+ LET + L E+ E+ + TIAAG L +QV
Sbjct: 223 REDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGELADGGLAVQVTTSS 281
Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPS 688
++ Q + V S SI DPY+ + +G + + L P
Sbjct: 282 LVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQNGRLLMYELTSHPH 331
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ + P++S ++Y D
Sbjct: 332 VHLKEIDISKRLRHETSPITSLSIYRD 358
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 104/239 (43%), Gaps = 29/239 (12%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
+ E+G + I+ +P + V+ V K +H+ D + D E + S++
Sbjct: 445 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMT 500
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
T + E + ++EL M + RP LF +L D T+ Y+ + + N
Sbjct: 501 DTFAAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGH 551
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
+ L + V+ R+ RF LD E+ A + + F+ I G
Sbjct: 552 LAVRFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 605
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLP 984
F+ S PC + R+HP DG I++FT +N C +GFIY+T + ++++ +LP
Sbjct: 606 FICSSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLP 664
>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
Length = 1138
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 117/519 (22%), Positives = 194/519 (37%), Gaps = 126/519 (24%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D+DT S R+E
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DKDT-NELKATSLRMEE------ 161
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
HV+D F++G P ++++H+ ++ +H I IS K+ I
Sbjct: 162 -----MHVQDVEFLYGTTHPTLIVIHQD-------INGRH----IKTHEISLKDKEFTKI 205
Query: 299 -WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAVSL 349
W N+ +A L+AVP P+GG +V+G +I YH + A+A +N YA
Sbjct: 206 AWKQDNVETEATMLIAVPMPLGGAIVIGQESIVYHDGDSYVAVAPAIIKQSTINCYA--- 262
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLS 404
+D+ +L ++A G+L ++ + + V+ + +
Sbjct: 263 -------------RIDSKGLRYLLGNMA------GNLFMMFLETEENAKGQTTVRDIKVE 303
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+ IT + N + F+GSR GDS LV+ +G + L E F ++ AP
Sbjct: 304 LLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVMLMETFTNL---APIV 360
Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
L+ G+ ++ GS G L+ G+ I
Sbjct: 361 DMCVVD----LERQGQGQMITCSGSFKE--------------------GSLRIIRNGIGI 396
Query: 525 NADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
A ++LPG KG+W + R+ D Y LI+S
Sbjct: 397 QEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLILSFV 431
Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQD 642
T VL + E TE +T N+ +++QV AR++ D M +
Sbjct: 432 GHTRVLMLSGDEVEETEIAGILGDQQTFYCANV-SHGQILQVTPSSARLISCDNKAMICE 490
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
P N G N+T + + A + + DG +
Sbjct: 491 WK-PPDNKRIGVVGANTTQIVCASAQDVYYVEIGDGKLE 528
>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
Length = 1149
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 105/481 (21%), Positives = 180/481 (37%), Gaps = 110/481 (22%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G L +DP+ R G+ +Y I+ + S L N+R
Sbjct: 119 GILAVIDPKARVIGLRLYDGLFKIIPLEKDNSELKAS--------------------NIR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG P ++++H+ V+ +H + IS K+ +
Sbjct: 159 -IDELQVHDVEFLHGCANPTLILIHQD-------VNGRH----VKTHEISLREKEFVKVP 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VPSP+GG +++G I YH +A + +S
Sbjct: 207 WRQDNVETEASMIIPVPSPLGGAIIIGQENILYHDGITPVVVA----------PAVIKQS 256
Query: 359 SFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVL 411
+ ++D +L D+A G L +L + D R VV+ L +
Sbjct: 257 TIVCYAKVDPGGLRYLLGDMA------GHLFMLFLEVDNRGDGNDVVKDLKVELLGEIAT 310
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
IT + N + F+GSRLGDS LV+ T S + E F ++ AP L
Sbjct: 311 PECITYLDNGVLFIGSRLGDSQLVKLTTKPNESGSYVTVMESFTNL---API---LDMCV 364
Query: 472 SDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
D L+ G+ ++ G+ G L+ G+ I AS
Sbjct: 365 VD-LERQGQGQLVTCSGAFKE--------------------GSLRIIRNGIGIQEHAS-- 401
Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
++LPG KG+W + A D Y L+++ +T VL
Sbjct: 402 ----------IDLPGIKGMWAL--------------QVASDGRYDNTLVLAFVGQTRVLS 437
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
E T+ + +T GN+ +++Q+ AR++ T + P + +
Sbjct: 438 LNGEEVEETDIAGFASDQQTFFCGNVI-HEQIVQITPISARLISAQNKTLLAEWKPPSDK 496
Query: 652 S 652
+
Sbjct: 497 N 497
>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
Length = 1175
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/362 (21%), Positives = 150/362 (41%), Gaps = 59/362 (16%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
+IL F+ A++SV+ ++ + + S+HCFE PE ++ + P + +D +GRC
Sbjct: 74 LILLFKQARLSVMRYNTETNRFVVHSLHCFEYPELRIREKCTPTAYDDPRMFIDKKGRCI 133
Query: 193 GVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFV 252
+L Y + ++ GS SS+ ++L + + D I +
Sbjct: 134 SLLCYDRLLWVIP--------------LGSN-------RSSYRVDLEKFGVSRIVDVISL 172
Query: 253 HGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL--KQHPLIWSAMN----LP 305
GY P + LH TW R V+ T I+ ++++ + ++ + +N LP
Sbjct: 173 SGYETPTLAFLHMTVPTWDARTVNTGEATNEIAIINVNPGVVGEEEQECANVVNRISRLP 232
Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE----------L 355
++ K++ P+ G+L++ + ++ Y S ++S + L + + + L
Sbjct: 233 YNTLKMVEC-YPLPGILLLASVSVLYISTTSSESFIL-PFGTYFNPPEVWKGVVPFLKLL 290
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
P ++L + QN + L T GD + + +VQ + LS P +
Sbjct: 291 PMKIRIIQLVKSIHQLSQN-LYLTFTDKGDSYYIHLNCVEGIVQEIVLSNA-PYKFIPNT 348
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT---------------CGSGTSMLSSGLKEEFGDIEAD 460
++ + FLGS DS L +T CG + L+E G +E D
Sbjct: 349 VSLYDDYIFLGSVFHDSYLFNYTICEYGKGDIKPFGIHCGDAVRI--KNLQERSGQMEED 406
Query: 461 AP 462
P
Sbjct: 407 YP 408
>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1283
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 113/541 (20%), Positives = 202/541 (37%), Gaps = 106/541 (19%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYELVELPG----------- 546
+ F + D L N+GP++D + G + D S +N ELV G
Sbjct: 376 YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQPVSSILANLELVTTQGYGKAGGLAILR 435
Query: 547 ---------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
G +VY K + + S Y YL++S + +
Sbjct: 436 REIDPFVIDSLMIKDTDGARSVYVKDPKLPSQSGSLPLNPGSNYDHYLLLSKSKGLDKEK 495
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLS 644
++V + E T++ ++ + RTI G L RV+QV + R D G + Q
Sbjct: 496 SVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLASGTRVVQVLKGEVRSYDSGLGLAQIFP 555
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
+ SE +V+ S ADPYVL+ D SI LL D S +T I S+
Sbjct: 556 VWDEDM-----SEEKSVVHTSFADPYVLIIRDDQSILLLQADDSGDLDEAETDGIINSTT 610
Query: 705 KPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGD-IYSVVCYESGA 762
S +LY DK + +G P + Q D + +
Sbjct: 611 --WISGSLYQDKY-------------------RSFKSHEGPPNMKQSDNVLLFLLSSESK 649
Query: 763 LEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
L +F +PN VFT + D +I S+ +E I
Sbjct: 650 LYVFHLPNAREPVFTTESI-----------------DLLPQILSTEPPPRRVTYRETITE 692
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L + P+L ++ ++ Y+ Y + ST R S
Sbjct: 693 LLVADLG----DSVSRSPYLILRSSNSDLILYEPYHYTS-----------STERQFS--G 735
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAPCQRIT----IFKNISGHQGFFLSGSRPCWCM 937
+ ++ N F ++ ++ + H A C I+ + ++ G++ F+ G+ PC+ +
Sbjct: 736 LRFVKIANHHFPKSHSESNAGK---HPANCTAISKPLRVLGDVCGYRTVFMPGNSPCFII 792
Query: 938 VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
+ L ++ + + + C GF+YV + ++++C+ P + +D W +K
Sbjct: 793 KSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARK 852
Query: 998 V 998
+
Sbjct: 853 I 853
Score = 47.8 bits (112), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 84/180 (46%), Gaps = 21/180 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V +++++ + GS + +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKTTLLQVFNLVNVVYGSGPGQPDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D H + TS+H +E + +++ +
Sbjct: 81 GRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIHYYERDD-VNISPWTPN 137
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIESSH 234
A P + VDP RC VL +G + + IL Q G LV D+ F + +E H
Sbjct: 138 LASCPSYLTVDPNSRC-AVLNFGKKNLAILPFHQVGDDLVMDD--------FDSDVEEQH 188
>gi|242089089|ref|XP_002440377.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
gi|241945662|gb|EES18807.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
Length = 1783
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 108/464 (23%), Positives = 189/464 (40%), Gaps = 112/464 (24%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D + +A E K VL++D L +M + GR + G + +DP
Sbjct: 75 QDFLFIATERYKFCVLQWDAEKSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDC 127
Query: 190 RCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
R G+ +Y GL +I ++G F+ R+E V++++
Sbjct: 128 RLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQVLDIK--------- 167
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F+HG ++P +V+L++ +H AL + P WS N+ + A
Sbjct: 168 --FLHGCVKPTIVVLYQ------DNKDVRHVKTYEVALK-DKDFVEGP--WSQNNVDNGA 216
Query: 309 YKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAH 368
L+ VP+P+GGV+++G I Y + +++ ++ Q + R+ V+ D +
Sbjct: 217 GLLIPVPAPLGGVIIIGEEQIVYCNANSTFK--------AIPIKQSIIRAYGRVDPDGSR 268
Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSR 428
LL TG L LL + ++ V L + + + S I+ + N + ++GSR
Sbjct: 269 Y--------LLGDNTGILHLLVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSR 320
Query: 429 LGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
GDS LV+ +++ADA S + L+ VN + +
Sbjct: 321 FGDSQLVKL------------------NLQADASG------SFVEILERYVNLGPIVDFC 356
Query: 489 SASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
+ + + T S A +D G L+ G+ IN AS VEL G
Sbjct: 357 VVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQG 399
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
KG+W++ KSS ++D Y YL++S + T L
Sbjct: 400 IKGLWSL--KSS------------FNDPYDMYLVVSFISETRFL 429
>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS
8797]
Length = 1345
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 147/706 (20%), Positives = 284/706 (40%), Gaps = 146/706 (20%)
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
T A+ +E+ +VR + SG+ L L ++L + LA++
Sbjct: 22 TTADYVELLIVRTNLLSIYKVTESGK--------------LLLTHEFKLQARITDLALV- 66
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF--- 177
G +N+ + ++L + K+S+++F+ + L S+H +E K SF
Sbjct: 67 -GSVENTGL-NYLLLGIGNCKLSIVKFNSLNNSLETISLHYYEE------KFKANSFIEL 118
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA--------R 229
A+ +++DPQ RC +L ++IL SQ E+ +
Sbjct: 119 AKKTELRIDPQNRCA-LLFNNDNIVILPFSQQQEEEDYGEEEEEEDNYNMEDGPNVKKLK 177
Query: 230 IES--------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH- 278
+ES S + + + LD +++V D F+ + P + IL++ +LTWAG +
Sbjct: 178 LESASTNLTLPSIITDSKKLDSTIENVVDIQFLRNFSRPTLGILYQPKLTWAGNLQLNPL 237
Query: 279 -HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
++ +L+I+ + + +I LP D++ L +P+ G VL+ G+N + Y +
Sbjct: 238 PTKFLVISLNIAVSELEGTVITKLEGLPWDSHTL--IPTWNGCVLL-GSNEVSYIDNTGV 294
Query: 337 -SCALALNNYA-VSLDSSQELPRSSFSVEL--DAAHATWLQ--------NDVALLSTKTG 384
A+ LN+YA SL + + + + L D + W +++ LL ++
Sbjct: 295 LQSAIFLNSYADASLRKVRVVDHTDQQITLNKDLVKSLWSAPTKESGGADEILLLMDESS 354
Query: 385 DLVLLTVVYDGRVVQRLDL-----------SKTNPSVLT--SDITTIGNSLFFLGSRLGD 431
+L + + ++GR++ + D+ +P+ +T + N F+G + GD
Sbjct: 355 NLYYIQLEFEGRLMTKFDMINLPIVNDIFVHNLHPTCITRIDESKHNININLFIGFQTGD 414
Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---LYG 488
SL+V+ +I + + +++SS++ V E+ LYG
Sbjct: 415 SLVVRL-----------------NNIRSAIETRHEYKQTSSESGLGKVEDEDEDEDDLYG 457
Query: 489 -------SASNNTESAQ----KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+AS N ++A + F + L NIGP+ G + G+ +
Sbjct: 458 DDGAHDKNASVNNDNAVVHTVQPFDIEMMSCLRNIGPVTSLVIGEASSVQPVIKGLPNPN 517
Query: 538 NYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--------EARTMV 589
E + C + G N +++ + A IS+ + R
Sbjct: 518 KGEYSLVATCG--------NGTGSNLMVGQISVQPEVELALKFISVTQIWNLKVKNRDKY 569
Query: 590 LETADL------LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILDGS 637
L T D + E+ + + QGR T+ G +R++QV + D +
Sbjct: 570 LITTDSTKTKSDIYEIENNFALYKQGRLRRDATTVYISMFGGEKRIVQVTTNHLYLYDTN 629
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+ + L N E V+ VS+ DPY+L+ +S G I +
Sbjct: 630 F--RRLFLNKFNYE---------VVHVSVMDPYLLITLSRGDIMIF 664
>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName:
Full=UV-damaged DNA-binding protein 1
gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
Length = 1095
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 119/540 (22%), Positives = 204/540 (37%), Gaps = 143/540 (26%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL++ IEI+++ Q G+ L+ + ++G + +L
Sbjct: 31 NLIIAKCTRIEIHLLTPQ--------------------GLQCICLQPMLDVPIYGRIATL 70
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G +D + +A E K VL++D + +M + GR +
Sbjct: 71 ELFRPHG----ETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSD------RIGRPT 120
Query: 177 FARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
G + +DP R G+ +Y GL +I ++G F+ R+E V
Sbjct: 121 -DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQV 168
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
++++ F++G +P +V+L++ +H + +LK
Sbjct: 169 LDIK-----------FLYGCPKPTIVVLYQ------DNKDARH------VKTYEVSLKDK 205
Query: 296 PLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS 352
I W+ NL + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 206 DFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR--------- 256
Query: 353 QELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT 412
+ R+ V+ D + LL G L LL + ++ V L + + +
Sbjct: 257 PSITRAYGRVDADGSR--------YLLGDHNGLLHLLVITHEKEKVTGLKIELLGETSIA 308
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
S I+ + N+ F+GS GDS LV+ P TK S
Sbjct: 309 STISYLDNAFVFIGSSYGDSQLVKLNL---------------------QPDTK---GSYV 344
Query: 473 DALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ L+ VN + + + + T S A +D G L+ G+ IN AS
Sbjct: 345 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQAS- 398
Query: 531 TGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
VEL G KG+W++ +A DD Y +L++S + T VL
Sbjct: 399 -----------VELQGIKGMWSL--------------RSATDDPYDTFLVVSFISETRVL 433
>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
Length = 1090
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + ++ +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G ++P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y + +++ ++ Q + R+ V+ D + LL G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N + ++GSR GDS LV+
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++AD S + L+ VN + + + + + T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++D Y YL++S + T L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429
>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
Length = 1090
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + ++ +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G ++P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y + +++ ++ Q + R+ V+ D + LL G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N + ++GSR GDS LV+
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++AD S + L+ VN + + + + + T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++D Y YL++S + T L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429
>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
Length = 1089
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + ++ +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G ++P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y + +++ ++ Q + R+ V+ D + LL G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N + ++GSR GDS LV+
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++AD S + L+ VN + + + + + T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++D Y YL++S + T L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429
>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1090
Score = 67.4 bits (163), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G + P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCLRPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS NL + A L+ VP+P+GGV+++G TI Y + +++
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFK------ 247
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
++ Q + R+ V+ D + LL TG L LL + + V L +
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNTGILHLLVLTQERERVTGLKIEH 297
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N + ++GSR GDS LV+ +++ADA
Sbjct: 298 LGETSVASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADATG-- 337
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
S + L+ VN + + + + + T S A +D G ++ G+
Sbjct: 338 ----SFVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSIRVVRNGIG 388
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ KSS ++D Y +L++S
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDTFLVVSF 422
Query: 584 EARTMVL 590
+ T L
Sbjct: 423 ISETRFL 429
>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
Length = 1089
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 122/557 (21%), Positives = 216/557 (38%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++A L+ + ++G + +L + G +D + ++ E K VL++D
Sbjct: 39 RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + VDP+ R G+ +Y GL +I ++
Sbjct: 95 ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P W NL + A L+ VP+P+GGV+++G T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WLQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y+S SA A+ + + ++ V+ D + LLS TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + S ++ + N + ++GS GDS L++
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNA--------- 330
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ D+ R S + L+ VN G + L Q T S A +
Sbjct: 331 ---------QVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
A D + +L++S E R + + D L E TE + + +T+ N ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D + Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479
>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
pigmentation protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
Length = 1090
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 113/507 (22%), Positives = 196/507 (38%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
+ +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EASEVITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK I W+ NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S SA A+ + + R+ V+ D + LL G
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L LL + ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL------ 330
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
P TK S + L+ VN + + + + T S
Sbjct: 331 ---------------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
+A DD Y +L++S + T VL
Sbjct: 406 ----RSATDDPYDTFLVVSFISETRVL 428
>gi|159470707|ref|XP_001693498.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283001|gb|EDP08752.1| predicted protein [Chlamydomonas reinhardtii]
Length = 366
Score = 67.4 bits (163), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 42/77 (54%), Gaps = 1/77 (1%)
Query: 923 HQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKIC 981
H G F++G+RP W + R L H +G + A T HNVNC GFI S +G LK+C
Sbjct: 166 HSGVFVAGARPLWLVAGRGGLAAHAMWSEGPVAALTPFHNVNCPLGFITACSARGQLKVC 225
Query: 982 QLPSGSTYDNYWPVQKV 998
LP + D W ++V
Sbjct: 226 CLPPHTRLDGAWATRRV 242
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 47/89 (52%), Gaps = 6/89 (6%)
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL------SFGPSNSESGSGSE 657
+Y TIAAGNLF ++Q G R+L+G + QDL + G + S G
Sbjct: 4 EYITDQPTIAAGNLFHNAVIVQACPGGVRLLEGMSLVQDLPLSELQALGGVAAASRPGVA 63
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
T+ + +ADPYVL+ +S+G+ LL D
Sbjct: 64 PPTITHMQVADPYVLVSLSNGTACLLEAD 92
>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
Length = 1088
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 112/513 (21%), Positives = 203/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATLELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK+ + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKEKDFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
+G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHSGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDATG------SYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1089
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 122/556 (21%), Positives = 214/556 (38%), Gaps = 119/556 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A+ L+ + L+G + +L + G +D + ++FE K VL++D
Sbjct: 39 RIEIHLLTASGLQSMLDVPLYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
G IT S + GR + G + VDP R G+ +Y ++
Sbjct: 95 ET-GSPITRAMGDVSD-----RTGRPT-DNGQIGIVDPDCRLIGLHLYDGMFKVIPIDNK 147
Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
G F+ R+E V++++ F++G P + +L++
Sbjct: 148 GQ----------LKEAFNIRLEELQVLDIK-----------FLYGCANPTIAVLYQDNKD 186
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
H + P W NL + A L+ VP P+GG +++G TI
Sbjct: 187 -------ARHVKTYEVNLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQTI 237
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
Y++ S A+ + + ++ V+ D + LLS G L LL
Sbjct: 238 VYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYLL 280
Query: 390 TVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
+ +D V L++ + S ++ + N + F+GS GDS L++
Sbjct: 281 VISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL------------ 328
Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRD 507
+ +AD ++ S + L+ VN G + L Q T S A +D
Sbjct: 329 ------NHQAD------VKGSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAFKD 376
Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
G L+ G+ IN AS VEL G KG+W++ SS
Sbjct: 377 -----GSLRIVRNGIGINEQAS------------VELQGIKGMWSLRASSS--------- 410
Query: 568 MAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
D Y +L++S E R + + T D L E TE + + +T+ N +++Q
Sbjct: 411 -----DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCHNAV-HDQLVQ 463
Query: 626 VFERGARILDGSYMTQ 641
V R+++ Q
Sbjct: 464 VTAGSLRLVNAKTRKQ 479
>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1678
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 133/328 (40%), Gaps = 69/328 (21%)
Query: 251 FVHGYIEPVMVILHE--RELTWAGRVSWKHHTC-----MISALSISTTLKQHPLIWSAMN 303
F+ GY+EPV+V+LH W+GR+ + ++ALSIS + ++WS +
Sbjct: 243 FLSGYLEPVLVLLHSDVEGPVWSGRLGRERGVAGAPPLFVTALSISVVHGRTAVLWSQV- 301
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL------DSSQELP 356
+ DA K+L+ G LVVGANT + +A+N +A S + Q P
Sbjct: 302 VSADATKILSFGKT--GCLVVGANTLVILEIGKVQQVIAMNGWARSTCPAALQTALQANP 359
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLD-------------- 402
+++LD TWL A+++ +TG L +L D V L
Sbjct: 360 VVKLAIQLDGCCVTWLSEHSAIMALRTGQLYVLQRTDDRWAVMPLGQTLGAVGEVAHLAS 419
Query: 403 --------LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE- 453
L K + +G + F GSR GDSL + + +M + +K E
Sbjct: 420 LPIGGLRWLEKMKMDENKASEMQMG--VLFAGSRTGDSLFLGYAL-EIVTMPWAAIKSEG 476
Query: 454 --FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS----------ASNNTESAQ--- 498
F + E S ++ L ++ EE +LYG+ S E+A
Sbjct: 477 QTFINFEGSELSKVATTAPIANGLDRILQLEEEALYGTDRSTPLHIVRDSEEEETADIPS 536
Query: 499 -----KTFSFAV------RDSLVNIGPL 515
+ +F V D LVN+GPL
Sbjct: 537 DAKRLRPVAFTVVRTIVPLDVLVNLGPL 564
>gi|147779836|emb|CAN63685.1| hypothetical protein VITISV_020449 [Vitis vinifera]
Length = 64
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/47 (74%), Positives = 42/47 (89%)
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
+PRSSFSVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGR+ L
Sbjct: 1 MPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLXYDGRLFTDL 47
>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1074
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 112/482 (23%), Positives = 196/482 (40%), Gaps = 105/482 (21%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G + +DPQ R G+ +Y GL +I F + G +N+
Sbjct: 118 GQIGVIDPQNRLIGLSLYDGLFKVI---------------PFDNKGNLK------EALNI 156
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
R L V D F++G P +V+LH+ +H AL ++
Sbjct: 157 R-LQEFLVLDIKFLYGCARPTVVVLHQ------DNKDSRHVKTYEVALEDKDFVEGS--- 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
WS NL + A+ L +P P+GGV+++G +TI Y S + AL++ Q + R+
Sbjct: 207 WSQSNLDNSAH--LLIPVPLGGVIIIGEHTIVYCSATTFKALSIK---------QSIIRA 255
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
V+ D + + N TG L L+ + ++ V L + + S I+ +
Sbjct: 256 VGRVDPDGSRYLYGDN--------TGALHLIVITHEWGRVTDLKTHYMGETSIASTISYL 307
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
+ L ++GSR GDS L++ +I+ADA + S + L+
Sbjct: 308 DSGLVYIGSRFGDSQLIKL------------------NIQADASA------SFVEILEQF 343
Query: 479 VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN 538
+N + + + + + G KD S I A + I+ Q++
Sbjct: 344 MNTGPIVDFCVVDTERRGQGQVITCS--------GAYKDGS----IRAVRNGVVITDQAS 391
Query: 539 YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
VEL G KG+W++ KSS D+ + + +E H +L +++E LE D+
Sbjct: 392 ---VELRGMKGLWSM--KSSLNDPYDTFLVVTFINETH-FLAMNMENE---LEEVDIKGF 442
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGSGSE 657
+E+ +T+A G+ ++IQV R R++ S D F P+ +
Sbjct: 443 DSET-------QTLACGSAI-HNQLIQVTSRSVRLVSSVSLELLDQWFAPARFSVNVAAA 494
Query: 658 NS 659
N+
Sbjct: 495 NA 496
>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
Length = 1089
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 151/367 (41%), Gaps = 83/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F+HG +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLHGCAKPTIVVLYQ------DNKDVRHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS N+ + A L+ VP+P+GGV+++G I Y + +++
Sbjct: 197 LK-DKDFVEGP--WSQNNVDNGAGLLIPVPAPLGGVIIIGEEQIVYCNANSTFK------ 247
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
++ Q + R+ V+ D + LL TG L LL + ++ V L +
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSRY--------LLGDNTGILHLLVLTHERERVTGLKIEY 297
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N + ++GSR GDS LV+ +++ADA
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADASG-- 337
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
S + L+ VN + + + + + T S A +D G L+ G+
Sbjct: 338 ----SFVEILERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ KSS +D + YL++S
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------INDPFDMYLVVSF 422
Query: 584 EARTMVL 590
+ T L
Sbjct: 423 ISETRFL 429
>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName:
Full=UV-damaged DNA-binding protein 1b; Short=DDB1b
gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
Length = 1088
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + ++ + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA + S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
Length = 1088
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + ++ + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA + S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine
max]
Length = 1089
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 110/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETAELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL L + P WS NL + A L+ VP P+ GVL++G T
Sbjct: 183 ---DNKDARHVKTYEVALKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + ++ V+ D + LL TG L L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLLSL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NLQPDA------KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|449488592|ref|XP_004158102.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Cucumis sativus]
Length = 570
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 108/504 (21%), Positives = 194/504 (38%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + + G + +DP R G+ +Y GL +I
Sbjct: 95 ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVI----- 142
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
F + + N+R L+ V D F++G P +V+L++
Sbjct: 143 ----------------PFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSRPTIVVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + + P WS NL + A L+ VP P+ GV+++G T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + R+ V+ D + LL G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS LV+
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
Length = 1138
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+ S L +N LLS K + LL + T
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
P IT + N + F+GSR GDS LV+ +G + + E F ++ AP
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L+ G+ ++ GS G L+ G+ I
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
A ++LPG KG+W + R+ D Y L++S T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
+L + E TE + +T N+ FG ++IQV AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480
>gi|157128866|ref|XP_001655232.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882187|gb|EAT46412.1| AAEL002407-PA [Aedes aegypti]
Length = 980
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+ S L +N LLS K + LL + T
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
P IT + N + F+GSR GDS LV+ +G + + E F ++ AP
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L+ G+ ++ GS G L+ G+ I
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
A ++LPG KG+W + R+ D Y L++S T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
+L + E TE + +T N+ FG ++IQV AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480
>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine
max]
Length = 1089
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 108/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETGELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP P+ GVL++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + ++ V+ D + LL TG + L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLVSL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L ++++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 280 LVIIHEKEKVTGLKIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|119580419|gb|EAW60015.1| hCG2010549, isoform CRA_a [Homo sapiens]
Length = 323
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 38/56 (67%)
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
LR+HP +G + +F + HNVNC GF+Y QG L+I LP+ +YD+ WPV+K+
Sbjct: 184 LRLHPVGINGPVNSFALFHNVNCPRGFLYFNRQGKLRISVLPAYLSYDSPWPVRKI 239
>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
Length = 1128
Score = 63.2 bits (152), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 102/460 (22%), Positives = 176/460 (38%), Gaps = 107/460 (23%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G L +DPQ R G+ +Y I+ + + L S R+E +N+
Sbjct: 120 GILAVIDPQARVIGLRLYDGLFKIIPLDKDSTEL----------KAASLRLEE---LNVY 166
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
DL+ F+HG P ++++H+ ++ +H I I+ K+ I
Sbjct: 167 DLE--------FLHGCSNPTLILIHQD-------LNGRH----IKTHEINLRDKEFMKIP 207
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSP+GG +V+G +I YH + A+A ++
Sbjct: 208 WKQDNVETEASILIPVPSPLGGAIVIGQESIVYHDGQSYVAVAPPQIKTPINC------- 260
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----VVQRLDLSKTNPSVLTSD 414
+D +L D+A G L +L + R V+ L + +
Sbjct: 261 --YCRVDVRGLRYLLGDIA------GRLFMLLLELSERDGTASVRDLKVELLGDIPIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + N + F+GSRLGDS LV+ + DA + + + +
Sbjct: 313 MTYLDNGVVFVGSRLGDSALVRLAA-----------------VRDDASQYVQPMETFT-S 354
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L N + F +G L+ G+ I AS
Sbjct: 355 LAPIVDMCVVDLERQGQNQLITCSGAF---------KMGSLRIIRNGIGIQEQAS----- 400
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + + G +H L++S +T VL
Sbjct: 401 -------IDLPGIKGMWAL----TLGQGP-----------HHDTLVLSFVGQTRVLTLNG 438
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQV + G R++
Sbjct: 439 EEVEETEIKGFVSDRQTFFTGNVC-HDQLIQVTDEGIRLI 477
>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
Length = 1140
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 114/267 (42%), Gaps = 51/267 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+++Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L + G V+ + + K +
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
IT + N ++G+R GDS LV+ + S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339
>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
Length = 1093
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 107/504 (21%), Positives = 197/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSRPTIVVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + + P WS NL + A L+ VP P+ GV+++G T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + R+ V+ D + LL G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS LV+
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
Length = 1140
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 113/267 (42%), Gaps = 51/267 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+++Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELSVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L + G V+ + + K +
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
IT + N ++G+R GDS LV+ + S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339
>gi|167998730|ref|XP_001752071.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697169|gb|EDQ83506.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 172
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 8/96 (8%)
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
E+ A+ ++ A RR S+ + +L F + R +HCFE PE+ +L R
Sbjct: 28 ETAALRTEAAAPGIHRRPSLTMRLR----IILAFTEC----RCLLIHCFEYPEYQYLNRS 79
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
RE FA V+ D GRC VL+Y Q++ LKA G
Sbjct: 80 RERFAMDLSVRADLVGRCASVLIYNSQLVTLKAGHG 115
>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
Length = 1102
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/370 (22%), Positives = 150/370 (40%), Gaps = 90/370 (24%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P + +L++ +H
Sbjct: 158 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQ------DNKDARH------V 204
Query: 286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
+ +LK + WS NL + A L+ VPSP+ GVL++G TI Y S +A A+ +
Sbjct: 205 KTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 264
Query: 343 NNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
+ ++ V+LD + LL G + LL + ++ V L
Sbjct: 265 R---------PSITKAYGRVDLDGSR--------YLLGDHAGLIHLLVITHEKEKVTGLK 307
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
+ + + S I+ + N++ F+GS GDS L++ +++ DA
Sbjct: 308 IELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL------------------NLQPDA- 348
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSY 520
+ S + L+ VN + + + + T S A +D G L+
Sbjct: 349 -----KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRN 398
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
G+ IN AS VEL G KG+W++ KSS D+ + +L+
Sbjct: 399 GIGINEQAS------------VELQGIKGMWSL--KSS------------IDEAFDTFLV 432
Query: 581 ISLEARTMVL 590
+S + T +L
Sbjct: 433 VSFISETRIL 442
>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
Length = 1350
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 144/669 (21%), Positives = 270/669 (40%), Gaps = 133/669 (19%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L ++ G + +A+L + A D ++L AKIS+++FD + + S+H
Sbjct: 48 LFLTNEFKFDGRITDIALLPRQDA----ALDYLLLCTAVAKISIVKFDLESNSIETVSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC------GGVLVYGLQMIILKASQGGSGLV 214
+E ++ L R +++DP RC + V M + G
Sbjct: 104 YYED-KFKDLSLAE--LTRESKLRLDPASRCLVLFNEDNIAVLPFVMKEDEEDDDEEGEE 160
Query: 215 GDEDTFGSG-GGFSARIE-----SSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
DEDT+ F A I S +++ + + D++++ D F++ Y +P + IL++
Sbjct: 161 EDEDTYEPRIKRFRANINGRVTFPSTILSAKTIHEDIQNIIDIEFLNNYSKPTVAILYQP 220
Query: 267 ELTWAGRVSWKH--HTCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
+LTW G + +I L +T T H +I LP D ++L+ V + G
Sbjct: 221 KLTWVGNLQLHPLPTKLLIVTLECNTNGFETSLSHIVIARLNELPWDWHRLIPVTN---G 277
Query: 321 VLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQND 375
+++VG N + Y + + LN++A + L +S S E + + +++
Sbjct: 278 IVIVGINELAYVDNTGVLQTVILLNSFA-----DRNLKKSRIIDHSKEESVFNHSAMKH- 331
Query: 376 VALLSTKTGD---------------LVLLTVVYDGRVVQRLDLSK--------------T 406
+ +L T G+ L + ++ +GR++ + D+ K T
Sbjct: 332 ICILKTTDGNEDDADLLLLMDDRSNLYYVQMISEGRLMTQFDIIKLPIINNIFINNLNPT 391
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
+ S L S + + LFF G + GD+ F C + ++E D+ D PS
Sbjct: 392 SISRLDSSSSRVNLDLFF-GFQSGDA----FVCRLNNIKSAVETRKEHKDV-LDYPS--- 442
Query: 467 LRRSSSDALQDMVNGEEL----SLYGSASNNTESAQ-------------KTFSFAVRDSL 509
++D + +G +L LY + +T+ A + F A+ SL
Sbjct: 443 ----NADEYDE--DGADLYGDDDLYSDEATSTQRANSKENGRSNMIETVEPFDIALLSSL 496
Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADS 565
NIGPL + G D + G+S +N EL + G T S R +
Sbjct: 497 NNIGPLTSLTSGKVSAVDQNNKGLSNPNNNELSIVATSGNGTGSHLTAVLPSVRPEIELA 556
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADL------LTEVTESVDYFVQGR-----TIAA 614
+ + ++ + + + L T D + E+ + +GR T +
Sbjct: 557 LKFISITQIWN----LKFKGKDKFLVTTDSTKSKSDIYEIDNNFALHREGRLRRDATTVS 612
Query: 615 GNLFGR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
+FG +R++QV +LD ++ + + V+ VS+ DPY+L+
Sbjct: 613 IAMFGSDKRIVQVTTNHLYLLDTTF-----------RRLNTIKFDYEVVHVSVMDPYILI 661
Query: 674 GMSDGSIRL 682
+S G I++
Sbjct: 662 TVSRGDIKV 670
>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine
max]
Length = 1068
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 85/367 (23%), Positives = 148/367 (40%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L L + P WS NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 176 LKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL TG L LL + ++ V L +
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLLSLLVITHEKEKVTGLKIEP 275
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ +++ DA
Sbjct: 276 LGETSIASTISYLDNAFVYIGSSYGDSQLIKL------------------NLQPDA---- 313
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 314 --KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400
Query: 584 EARTMVL 590
+ T +L
Sbjct: 401 ISETRIL 407
>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
Length = 1481
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 103/231 (44%), Gaps = 41/231 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMISALSISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ T ++ + IS ++
Sbjct: 268 IRYVRDLQFIGSSGEPLLAILCERQPTWAGRVKLVEWRTKVVESNTLTMHVTWVQISASM 327
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQSASCALALNNY- 345
HP LI +P++ +L V + GV+ G N I + + N+
Sbjct: 328 TAHPKLLLIGEVEGVPYNVTHMLPVEPFSQTMSGVVCFGTNVIMHITTKRGYGAYFNDTG 387
Query: 346 ----------AVS----------LDSSQELPRSSFSVELDAAHATW--LQNDVALLSTKT 383
AVS LD S L R + S+ AA + + +++ +L+
Sbjct: 388 REECINSKFSAVSFGKAVWSDPQLDKSSALARVNMSLANCAATSMVGKMGDELQVLALLE 447
Query: 384 GDLVLLTV--VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
D V++T+ V G V+ + ++ S ++ IG L FLGS +GDS
Sbjct: 448 EDGVVITLHFVARGSSVEEVRITMLGSGCYCSSVSRIGRQLVFLGSTVGDS 498
>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
Length = 1088
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 109/507 (21%), Positives = 193/507 (38%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++ ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEINLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+HG +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLHGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + LK I WS NL + A L+ VP P GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S + A+ + + ++ V+ D + LL G
Sbjct: 234 EETIVYCSANVFRAIPIR---------PSITKAYGRVDADGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L LL + ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA T + L VN + + + + T S
Sbjct: 329 ----------NLHPDAKGT------YVEVLDRYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
+ DD + +L++S + T +L
Sbjct: 406 ----RSLTDDPFDTFLVVSFISETRIL 428
>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
Length = 1140
Score = 60.8 bits (146), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 88/420 (20%), Positives = 160/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D ++V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MDEQNVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + Y + + S
Sbjct: 211 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTT--------YVTVVPPIIKQSTIS 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L + D + V++ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDLA------GHLFMLFLEQDKKADGSMVIKDLKVELLGEVSIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS L++ + E F ++ AP + +
Sbjct: 317 ITYLDNGVIFIGSRLGDSQLIKLNTKPDENGSYCSTMETFTNL---AP----IVDMAVVD 369
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L+ G+ ++ G+ G L+ G+ I AS
Sbjct: 370 LERQGQGQIVTCSGAFKE--------------------GSLRIIRNGIGIQEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + S N L++S +T +L
Sbjct: 405 -------IDLPGIKGMWALKVDSVNFDNT---------------LVLSFVGQTRILMLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E TE + +T GN+ +IQ+ AR++ + + P N + S
Sbjct: 443 EEVEETEIPGFVADEQTFHTGNV-TNDVIIQITPTSARLISNKSSSVISEWEPDNKRTIS 501
>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine
max]
Length = 1068
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 148/367 (40%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 176 LK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL TG + LL ++++ V L +
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLVSLLVIIHEKEKVTGLKIEP 275
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ +++ DA
Sbjct: 276 LGETSIASTISYLDNAFVYVGSSYGDSQLIKL------------------NLQPDA---- 313
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 314 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400
Query: 584 EARTMVL 590
+ T +L
Sbjct: 401 ISETRIL 407
>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
Length = 1140
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 113/264 (42%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLITVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D+ + LL G L +L + G V+ + + + +
Sbjct: 260 CYA-RVDSKGFRY------LLGNMDGQLYMLFLGTSETSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336
>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
Length = 1140
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 65/263 (24%), Positives = 114/263 (43%), Gaps = 49/263 (18%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L +LR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATSLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
+D V D F+HG + P ++++H+ +H C L +K L W
Sbjct: 159 -MDELIVYDVEFLHGCLNPTVIVIHKDN-------DGRHVKCHEINLRDKEFMK---LAW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 208 KQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRV--VQRLDLSKTNPSVLTSDI 415
++ +D+ + LL G L +L + + G+V V+ + + + + I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGINETGKVPTVKDIKVEQLGEISIPECI 313
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
T + N ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336
>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1138
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 105/473 (22%), Positives = 175/473 (36%), Gaps = 129/473 (27%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEMHVQDVEFLYGTAHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYA- 262
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+D+ + LL G L ++ + + +L +
Sbjct: 263 ---------------RVDSRGFRY------LLGNMIGHLFMMFLETEENTRGQLTVKDIK 301
Query: 408 PSVL-----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
+L IT + N + F+GSR GDS LV+ + S + E F ++ AP
Sbjct: 302 VELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAAASGAYVTVMETFTNL---AP 358
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
L+ G+ ++ GS G L+ G+
Sbjct: 359 IIDMCIVD----LERQGQGQMITCSGSYKE--------------------GSLRIIRNGI 394
Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
I A ++LPG KG+W + R+ D Y L++S
Sbjct: 395 GIQEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLS 429
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
T +L + E TE + +T N+ FG ++IQV AR++
Sbjct: 430 FVGHTRILMLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPMTARLI 480
>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
Length = 1171
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 79/349 (22%), Positives = 145/349 (41%), Gaps = 60/349 (17%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + L+G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQSILDVPLYGRIATLELFRPHG----ETQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L SM + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELVTRSMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS +L + A L+ VP P+ GVL++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNSLDNGADLLIPVPPPLCGVLIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S + A+ + + ++ V+ D + LL TG L L
Sbjct: 237 IVYCSANGFKAIPIR---------AAITKAYGRVDPDGSRY--------LLGDHTGLLSL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL 328
>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
Length = 1140
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCMNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
Length = 1135
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/418 (22%), Positives = 166/418 (39%), Gaps = 94/418 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
A+ + N+R L+ V D F+ G + + +L++ + + +H I
Sbjct: 143 MDAKGQLKDAFNIR-LEELEVLDIQFLSGCPKATIAVLYQDQR------NARH----IKT 191
Query: 286 LSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
+IST K+ W+ +N+ H+A +L+ VP+P GGVL++G TI YHS A + + N
Sbjct: 192 YTISTRDKEFDTGPWAQLNVEHNASELIPVPAPFGGVLILGHQTICYHSGKAFITIPIQN 251
Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL--VLLTVVYDGRVVQR 400
+ A+ W+ D + L+S +G L V+LT V+
Sbjct: 252 TRM------------------CAYG-WVDADGSRLLVSDHSGGLHVVILTPDATNTAVET 292
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ + S I+ + N + F+GS GDS L++ + E D
Sbjct: 293 AHIEALGETSCASSISYLDNGVVFIGSASGDSQLIKL------------------NPEKD 334
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
A T + D L +++ + T S +D G L+
Sbjct: 335 AQGTYIQVLETYDNLGPILD----MCVADLDRQGQGQAVTCSGCSKD-----GSLRIIRN 385
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
G+ IN A+ +EL G KG+W++ R N + + YL+
Sbjct: 386 GIGINEHAA------------IELAGIKGMWSL-----RPSNTNHDK----------YLV 418
Query: 581 ISLEARTMVL---ETADLLTEVTE-SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + T VL E D ++ E + F +G T+ G G +QV +RG ++
Sbjct: 419 QAFISETRVLAFEEDEDGDHQLAEGEIAGFQEGCTLFCG-CVGGNMAVQVTKRGVVLI 475
>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
Length = 1140
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
Length = 1160
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 101/231 (43%), Gaps = 45/231 (19%)
Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW--------AGRVSW 276
F+ R+E VI+++ F+HG P ++++H+ L +S+
Sbjct: 154 AFNIRLEELTVIDIQ-----------FLHGCTTPTLILIHQANLNCYHLMTLCITNLLSF 202
Query: 277 KH--HTCMISALSISTTLKQ-HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
K H + IS K+ W N+ +A L+AVP P GG L++G +I YH
Sbjct: 203 KQDQHGRHVKTYEISLRDKEFQKGPWKQDNVETEACMLIAVPEPFGGALIIGQESITYHK 262
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV 391
+A + +S+ + ++DA + +L D+ G L +L +
Sbjct: 263 GDNFIPIA----------PPAIKQSTLTCYGKVDANGSRYLLGDMM------GRLFMLML 306
Query: 392 VYDGRV-----VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ ++ V+ L + + + IT + N++ ++GSRLGDS LV+
Sbjct: 307 EKEEKMDSTVTVKDLKVELLGETTIAECITYLDNAVVYIGSRLGDSQLVKL 357
>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=Protein piccolo
gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
melanogaster]
gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
Length = 1140
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
Length = 1033
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 146/362 (40%), Gaps = 85/362 (23%)
Query: 231 ESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
E+S +I L+ V D F++G +P +V+L++ +H AL
Sbjct: 95 ETSELIT--RLEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVALK-DK 145
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
+ P W+ NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 146 DFGEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR------- 196
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV 410
+ R+ V+ D + LL G L LL + ++ V L + +
Sbjct: 197 --PSITRAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETS 246
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ S I+ + N++ ++GS GDS LV+ +++ DA + S
Sbjct: 247 IASTISYLDNAVVYIGSSYGDSQLVKL------------------NLQPDA------KGS 282
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADA 528
+ L+ VN + + + + T S A +D G L+ G+ IN A
Sbjct: 283 YVEVLESYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQA 337
Query: 529 SATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
S VEL G KG+W++ ++ DD + +L++S + T
Sbjct: 338 S------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETR 371
Query: 589 VL 590
+L
Sbjct: 372 IL 373
>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi marinkellei]
Length = 1436
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 105/260 (40%), Gaps = 54/260 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
NN + D E + F V L A T + N+ + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVSWYDPKMETSTALFKVNLSLASCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S IT IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQDIRIAILGTGCYCSGITRIGDQIVFLGSAFGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIE 458
C + M S + F IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499
>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
Length = 1140
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 112/264 (42%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMEKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELMVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D+ + LL G L +L + G V+ + + + +
Sbjct: 260 CYA-RVDSKGLRY------LLGNMHGQLYMLFLGTSESSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336
>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
Length = 1141
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 90/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501
>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
Length = 1140
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 107/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVMAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTSETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
Length = 1114
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 84/197 (42%), Gaps = 22/197 (11%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
L+ +V D F+HG P + +L+E H TL+ P WS
Sbjct: 157 LEELNVVDVKFMHGCATPTICVLYED-------TKEARHVKTYEVDVKEKTLRDGP--WS 207
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
++ + ++ VP+P+GG +VVG + I Y ++ + ++
Sbjct: 208 QSDVEGGSSLIIPVPAPLGGAIVVGESVIVYLNKDGG-------------NGAGGAIATK 254
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
SV + A LLS TG L LL +V+D R V L L + + S ++ + N
Sbjct: 255 SVNVMAHGVVDADGSRYLLSDSTGMLHLLVLVHDRRRVHALKLESLGQTSIASTLSYLDN 314
Query: 421 SLFFLGSRLGDSLLVQF 437
+ ++GS GDS LV+
Sbjct: 315 GVVYVGSAYGDSQLVRL 331
>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
mellifera]
Length = 1141
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKADGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501
>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Apis florea]
Length = 1141
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKTDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501
>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
Length = 1138
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYIAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + F+GSRLGDS L++
Sbjct: 312 ECITYLDNGVIFVGSRLGDSQLIKL 336
>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1820
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/410 (22%), Positives = 145/410 (35%), Gaps = 141/410 (34%)
Query: 246 VKDFIFVHGYIEPVMVILHERE-----LTWAGRVSWKHHTCM------------------ 282
+ D F+ GYIEP +++LH WAGR+ +
Sbjct: 398 IVDIAFLSGYIEPTLLVLHSNPKRGGGRAWAGRLGRTEEVPLSNNGGSGESKDDYGEDID 457
Query: 283 -----------------------ISALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPSPI 318
++A+S++ ++ ++WS ++ LP DA+KL VP P
Sbjct: 458 LEGGDAAKKGPDLVSTGTKYGLSLTAISLAIHQRRSVVLWSLLDALPADAWKL--VPHPS 515
Query: 319 GGVLVVGANTIHYHSQSA--SCALALNNY------------------AVSLDSSQELPRS 358
GV+V G NT Y S SCALA N + AV L+ + P
Sbjct: 516 DGVIVWGVNTAVYVSMGGKISCALAANGFAKIGCPIGLIPPSGRIGSAVYLEPNPS-PLP 574
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV--------------------------- 391
+++LD A ++ DVA++ G L L +
Sbjct: 575 MLALQLDGARVGFVTEDVAIVCLGNGSLYSLELHRAKSMVSPSMFLSMSPLGHRVGGLGV 634
Query: 392 -------------------VYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRL 429
+ D V+ D +K + SV I + G L F GSR+
Sbjct: 635 ASCLSVLAMACHSNSVGHFLVDNEGVKDEDHAKETISKESVSGPKIRSRG--LIFAGSRM 692
Query: 430 GDSLLVQFT---------------CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SD 473
GD L+ F+ G+G L E+ + P+ K+L++ S
Sbjct: 693 GDCSLLAFSLNVPIHLVITDVDSETGAGKRKLGGSRPEQLSSMP--EPAQKQLKKEEISP 750
Query: 474 ALQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ D +GEE + S + + + + DSL +GPL YG
Sbjct: 751 SRTDSEDGEEDIVCAMSSPRRSVRTLSMFRTVSALDSLTGLGPLGQGCYG 800
>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
cruzi]
gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
NN + D E + F V L A+ T + N+ + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
C + M S + + F IE+
Sbjct: 480 CIAKVDMFHSDVAKRFQIIES 500
>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
Length = 1140
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 109/264 (41%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H GR H I+ K+ I
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHRDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPI GV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPICGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L T G V+ + + + +
Sbjct: 260 CYA-RIDEKGLRY------LLGNMDGQLYMLFLGTTETSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++GSR GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGSRHGDSQLVRLS 336
>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
Length = 1088
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 109/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPMYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S +A A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ VN + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T VL
Sbjct: 409 -------IDEAFDTFLVVSFISETRVL 428
>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
Length = 1140
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 107/263 (40%), Gaps = 49/263 (18%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
+D V D F+HG P ++++H+ GR H + I W
Sbjct: 159 -MDELTVYDVEFLHGCQNPTVIVIHKDN---DGRHVKSHEINLRDKEFIKVA-------W 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A L+ VPS IGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 208 KQDNVETEATMLIPVPSSIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDI 415
++ +D+ + LL G L +L T G V+ + + + + I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGTTETSKGTTVKDIKVEQLGEISIPECI 313
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
T + N ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336
>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
queenslandica]
Length = 1142
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 104/447 (23%), Positives = 165/447 (36%), Gaps = 92/447 (20%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L DL ++ D F+HG P + + E GRV + IS K+
Sbjct: 156 IRLEDL---YITDIQFLHGTENPTIAYISEEPSVATGRV--------LKTFVISQRDKEL 204
Query: 296 -PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
P W + A L +VPSP G++VVGA+++ Y N+ + ++D
Sbjct: 205 LPGPWKPNTIEGQASLLCSVPSPYNGLIVVGADSVAY----------FNDTSHTVDPIV- 253
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV------VQRLDLSKTNP 408
+ S S H+ +L D G L+ L + + + + + L
Sbjct: 254 IKESVISCIEPLDHSRYLLGDFR------GRLLTLFLEFSEEMESGMTNIVNMKLEVLGE 307
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ ++ + N + F+GS GDS LV+ LSS E G I
Sbjct: 308 ISIPHTLSYLDNGVVFVGSTKGDSQLVK---------LSSSPLENGGYI----------- 347
Query: 469 RSSSDALQDMVN-GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
D L+ M N G L + S Q L G L+ G+ IN
Sbjct: 348 ----DVLESMTNIGPILDM----SVVDLDKQGRDVLVCCSGLGKDGALRIVKSGIGINEA 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
AS ++LPG KGIW++ + A +DE ++++ +T
Sbjct: 400 AS------------IDLPGIKGIWSL-------------KCAGREDELDDTVVLTFVGQT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
M L A E TE +T N+ G +IQ+ + R++D M + P
Sbjct: 435 MALRLAGEEVEETELPALVTDQQTFYCSNVTG-NAIIQITTKSVRLMDDKAMELICDWSP 493
Query: 648 SNSE--SGSGSENSTVLSVSIADPYVL 672
+ S + +S V+ D Y L
Sbjct: 494 PDGRGISTAACNSSQVMVAVGCDLYYL 520
>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1456
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 130/334 (38%), Gaps = 77/334 (23%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D FV EP++ +L ER TWAGRV W+ + LS IS L
Sbjct: 254 LRYVRDLQFVGSSGEPLLGVLCERRPTWAGRVKLVEWRTKAVDTNTLSMQVAWVQISGAL 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHYHSQSASCALALNNYA 346
HP L+ ++P++ ++ V S GV+ G NT+ + + + N+
Sbjct: 314 TTHPKLLLVGEVDSVPYNVTHMIPVESSSQTPSGVICFGINTVMHITTKRGYGVYFNSTG 373
Query: 347 V---------------------SLDSSQELPRSSFSVELDAAHATWLQN------DVALL 379
+ L+SS L R +FS L AT + +
Sbjct: 374 MEECGSNKSSAMSYGKMSWCDAKLESSTALFRVNFS--LANCTATIFSPRSSDSLQILAV 431
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
S + G + +L + G V + +S S +T I ++LFFLGS V F+C
Sbjct: 432 SEEDGVVAVLEFLSQGANVHDIQISVLASGCYCSSLTPISDNLFFLGSA------VSFSC 485
Query: 440 GSGTSMLSSGLKEEFGDIE--------------------ADAPSTKRLRRSSSDALQDMV 479
+ + +SG +F +E AD S R +S+S L+D
Sbjct: 486 IASITPTNSGAIGKFKVVESIEAIGSIRDVDVVDCSNDAADCISGPRGNQSNSSWLEDTP 545
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIG 513
E A N T S A R +++++
Sbjct: 546 FAE------LAGNTTLDPMPNLSVAQRRAIMDLA 573
>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
Length = 1138
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 95/205 (46%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 211 RQDNVEREATMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 260
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DG-RVVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + DG +VV+ L + +
Sbjct: 261 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKNPDGTQVVKDLKVELLGEISIP 314
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + F+GSRLGDS L++
Sbjct: 315 ECITYLDNGVIFVGSRLGDSQLIKL 339
>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1156
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 90/401 (22%), Positives = 154/401 (38%), Gaps = 81/401 (20%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + +LK + W
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P G L++G +I YH+ + Y V + L R S V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259
Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
++DA + +L D+A G L +L + + ++ V+ L L +
Sbjct: 260 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 313
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + ++GSRLGDS L++ + E F ++
Sbjct: 314 ITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFTNL---------------GP 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 359 IVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS----- 401
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + + R E L++S +T VL +
Sbjct: 402 -------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQTRVLMLSG 454
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
E TE + +T GN+ +++IQV R++D
Sbjct: 455 EEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 494
>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1067
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 100/453 (22%), Positives = 178/453 (39%), Gaps = 108/453 (23%)
Query: 147 FDDSIHGLRITSMHCF----ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
D I+G RI ++ F E+ ++L + R F +++ DP+ +
Sbjct: 54 LDVPIYG-RIATLELFRPHGEAQDFLFIATERYKFC---VLQWDPES----------SEL 99
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I +A S +G G F + + N+R L+ V D F+ G +P + +
Sbjct: 100 ITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAV 158
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS +L + A L+ VP P+
Sbjct: 159 LYQ------DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLC 206
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S SA A+ + + ++ V++D + LL
Sbjct: 207 GVLIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLL 249
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 250 GDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-- 307
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
++ DA + S + L+ +N + + + +
Sbjct: 308 ----------------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQ 345
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 346 VVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KS 386
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 387 S------------IDEAFDTFLVVSFISETRIL 407
>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
Length = 1068
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 88/391 (22%), Positives = 153/391 (39%), Gaps = 84/391 (21%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMV 261
+I +A S +G G F + + N+R L+ V D F++G +P +V
Sbjct: 99 VITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIV 157
Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
+L++ +H AL + P W+ NL + A L+ VP P+ GV
Sbjct: 158 VLYQ------DNKDARHVKTYEVALK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGV 208
Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
L++G TI Y S SA A+ + + ++ V+ D + LL
Sbjct: 209 LIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDADGSR--------YLLGD 251
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
G L LL + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 252 HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKI---- 307
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK-- 499
++ DA + S + L+ VN + + + +
Sbjct: 308 --------------HLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 347
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSR 559
T S A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 348 TCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL------ 384
Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 385 --------RSSTDDPHDTFLVVSFISETRIL 407
>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
Length = 1361
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 132/674 (19%), Positives = 265/674 (39%), Gaps = 122/674 (18%)
Query: 96 ISAASLELVCHYRLHGNVESLAIL---SQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
I + L L+ ++L G + +A++ S G N ++L+ AK+S+L +++
Sbjct: 43 IRSGRLYLMEEHKLSGRINDVALIPKHSNGSNGNGINLSYLLLSTGVAKLSLLMYNNMTS 102
Query: 153 GLRITSMHC----FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
+ S+H FES L L AR ++++P G +++ ++ +
Sbjct: 103 SIETISLHFYEDKFESATMLDL-------ARNSQLRIEPNGNYA--MLFNNDVLAILPFY 153
Query: 209 GGSGLVGDED----------------TFGSGGGFSARIESSH---VINLRDL--DMKHVK 247
G DED F G + + +H +IN +L +K++K
Sbjct: 154 TGINEDEDEDYINNDKSKINDNSKKSLFKRKKGKTQNNKVTHPSIIINCSELGPQIKNIK 213
Query: 248 DFIFVHGYIEPVMVILHERELTWAGR---VSWKHHTCMIS---ALSISTTLKQHPLIWSA 301
D F+ G+ + + +L++ +L W G V + +IS SI T +I
Sbjct: 214 DIQFLCGFTKSTIGVLYQPQLAWCGNSQLVPLPTNYAIISLDMKFSIDATTFDKAIISEI 273
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVS-LDSSQELPRS 358
LP D + + + G L++G N I + + L LN+Y+ L + + +S
Sbjct: 274 SQLPSDWH---TIAPTLSGSLILGVNEIAFLDNTGVLQSILTLNSYSDKVLPKVRVIDKS 330
Query: 359 SFSVELDAAHATWL----QNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLS------ 404
S V + L +N+ + LL + G + + + +GR++ + +++
Sbjct: 331 SHEVFFNTGSKFALIPSNENERSVENILLFDENGCIFNVDLKSEGRLLTQFNITKLPLGE 390
Query: 405 -----KTNP---SVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFG 455
K+NP S++ +D + F+G + GD+ +++ S + +++
Sbjct: 391 DVLSQKSNPSSVSIIWAD-GRLDTYTIFIGFQSGDATMLKLNHLHSAIEVEEPTFMKDYV 449
Query: 456 DIEADAPSTKRLRRSS-------SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
+ +A A SD D VN + +G+ SN +AQ+
Sbjct: 450 NKQASAAYNNEDDDDDDDDFNLYSDEENDQVNNKNDRTFGTNESNEPFTAQELM------ 503
Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
L NIGP+ G + + + G+ + E+ + T + NA +
Sbjct: 504 ELRNIGPINSMCVGKVSSIEDNVKGLPNPNKQEI------SIVCTSGYGDGSHLNAILAS 557
Query: 568 MAAYDDEYHAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR----- 610
+ ++ ++ I+ ++ + L T D + E+ + QGR
Sbjct: 558 VQPRVEKALKFISITKIWNLHIKGKDKFLITTDSTQSQSNIYEIDNNFSQHKQGRLRRDA 617
Query: 611 -TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TI + +R++QV + D ++ + + V+ VS+ DP
Sbjct: 618 TTIHIATIGDNKRIVQVTTNHLYLYDLTF-----------RRFSTIKFDYEVVHVSVMDP 666
Query: 670 YVLLGMSDGSIRLL 683
YVL+ +S G I++
Sbjct: 667 YVLITLSRGDIKVF 680
>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 61/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
NN + D E + F V L A+ T + N+ + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I +G+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTDCYCSGIARLGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
C + M S + + F IE+
Sbjct: 480 CIAKVDMFHSDVAKRFRIIES 500
>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp.
lyrata]
gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp.
lyrata]
Length = 1088
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S +A A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ +N + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428
>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
Length = 1136
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEINLREKEFSKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLELEKKPDGTQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336
>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
Length = 1515
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 111/266 (41%), Gaps = 37/266 (13%)
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD--EDTFGSGGGFSARIESSHVINLRD 240
V VDP C V +Y + I+ + G L D E + ++ RIE +++
Sbjct: 101 VLVDPGANCVVVRLYHGLLRIIPLNGIGEKLTTDSLEVNQYAANTYNVRIEEGNIV---- 156
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
D F+HGY P +++E EL H L+ L
Sbjct: 157 -------DMAFLHGYTLPTFAMIYEDELVL--------HMKTYEISGREPALRNVQLTLD 201
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
++ D+ L+ VP P GGV++VG N I+YH++ ++ Y +SQ L ++
Sbjct: 202 SIE--PDSKLLIPVPKPFGGVILVGDNIIYYHTKDGP---HISQYIPQAKASQVLCYAAV 256
Query: 361 SVEL----DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQ-RLDLSKTNPSVL 411
+ D A ++ + +A T +G+ +L + V R+ R++L +
Sbjct: 257 DAQRYLLGDMAGRLYMVHLLAEDHTPSGNGLLGSTSSAAVPSARIGSIRIEL--LGETAT 314
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+G LGDS L++
Sbjct: 315 PESIAYVDNGVVFIGCTLGDSQLIRL 340
>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
Length = 1141
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 88/420 (20%), Positives = 160/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSR GDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRFGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +++ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIIDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501
>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
Length = 1140
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 92/207 (44%), Gaps = 33/207 (15%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
NLR +D +V D F+HG + P ++++H+ GR H I+ K+
Sbjct: 156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFM 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
I W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 204 KIAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA------- 249
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSV 410
+F +A N + LL G L +L + G V+ + + +
Sbjct: 250 -PLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEIS 308
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ IT + N ++G+R GDS LV+
Sbjct: 309 IPECITYLDNGFLYIGARHGDSQLVRL 335
>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
Length = 1089
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P W+ NL + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL G L LL + ++ V L +
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ ++ DA
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421
Query: 584 EARTMVL 590
+ T +L
Sbjct: 422 ISETRIL 428
>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
Length = 1043
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P W+ NL + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL G L LL + ++ V L +
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ ++ DA
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421
Query: 584 EARTMVL 590
+ T +L
Sbjct: 422 ISETRIL 428
>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName:
Full=UV-damaged DNA-binding protein 1a; Short=DDB1a
gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1088
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDP 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS +L + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S SA A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGM 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ +N + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428
>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
Length = 1158
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 89/395 (22%), Positives = 149/395 (37%), Gaps = 83/395 (21%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-WSAMNL 304
++D F++G P +VI+H+ H + IS K+ W N+
Sbjct: 164 IQDIAFLYGCANPTVVIIHQ-----------DAHGRHVKTREISLRDKEFAKTSWKQDNV 212
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
+A LL VP P GG L++G +I YH+ NY + + ++
Sbjct: 213 ETEAAMLLPVPEPYGGALIIGQESITYHNG--------QNYVTIAPPIIKQSTVTCYGKV 264
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDITTIG 419
D + +L D+A G L +L + DG V V+ + + + +T +
Sbjct: 265 DPNGSRYLLGDLA------GHLFMLVLEKEEKMDGTVTVRDIKIELLGEVSIPECLTYLD 318
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
N + F+GSR GDS LV+ + + E F ++ AP + DM
Sbjct: 319 NGVVFIGSRFGDSQLVKLNVTPDDNNSYVTVMETFTNL---AP------------IVDMT 363
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
+ T S A ++ G L+ G+ I+ AS
Sbjct: 364 -------IVDLDRQGQGQLVTCSGAYKE-----GSLRIIRNGIGIHEQAS---------- 401
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
++LPG KGIW + SS + D + +++S +T VL E
Sbjct: 402 --IDLPGIKGIWALKMGSSGNPSVDDT------------VVLSFVGQTRVLMLNGEEMEE 447
Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
TE +T GN+ G+ V+Q+ R++
Sbjct: 448 TEIPGLTADQQTFFCGNV-GKDSVLQITTGSVRLI 481
>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
Length = 1138
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFAKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGSQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336
>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
Length = 1148
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 94/403 (23%), Positives = 155/403 (38%), Gaps = 95/403 (23%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + IS LK + W
Sbjct: 166 VQDMEFLHGCKTPTIVLLHQD--------SQARH---MKTYEIS--LKDKEFVKGPWKQD 212
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P +G +I YH+ + + L R S V
Sbjct: 213 HVESEATIVIAVPEPFCDARCIGQESITYHNGDQDVVI-----------TPHLIRQSTIV 261
Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
++DA + +L D+A G L +L + + ++ V+ L L +
Sbjct: 262 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 315
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + N + ++GSRLGDS L++ L+S E+ +E T
Sbjct: 316 MTYLDNGVVYVGSRLGDSQLIK---------LNSERNEQGSYVEVMEVFTN--------- 357
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + F G L+ G+ I+ AS
Sbjct: 358 LGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS----- 403
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KGIW + N DSSR L++S +T VL +
Sbjct: 404 -------IDLPGIKGIWPLR------VNTDSSR--------DNTLVLSFVGQTRVLMLSG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
E TE + + +T GN+ ++IQV R++DG
Sbjct: 443 EEVEETELAGFDISQQTFFCGNV-RNNQLIQVTAAAVRLVDGK 484
>gi|390342012|ref|XP_793599.3| PREDICTED: uncharacterized protein LOC588842 [Strongylocentrotus
purpuratus]
Length = 1161
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 111/281 (39%), Gaps = 46/281 (16%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GP+ +DP+ R G+ +Y I+ + L F+ R+E +VI+++
Sbjct: 50 GPIGIIDPECRMIGLRLYDGLFKIIPLDRDNKEL----------KAFNIRLEELNVIDVQ 99
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
F++G +P +V LH+ GR H + P W
Sbjct: 100 -----------FLYGCHQPTIVFLHQDP---HGR-----HVKTYEVNLREKEFNRGP--W 138
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++AVP P GG L++G +I YH A+A + S+
Sbjct: 139 KQDNVETEATMVIAVPQPYGGALIIGQESITYHKGDNYVAIA----------PPTIKNST 188
Query: 360 FSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDIT 416
LD + +L D L L+ DG V+ L L + + +T
Sbjct: 189 LVCYGRLDNNGSRYLLGD--LTGRLFLLLLDKEESMDGAATVKDLKLEFLGETSIAECLT 246
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 247 YLDNGVVFIGSRLGDSQLVRLNTESDESGSYVTMMETFTNL 287
>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
Length = 1340
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 83/386 (21%), Positives = 165/386 (42%), Gaps = 64/386 (16%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + L+ + +A++ Q + S +++A AKIS++ FD + L S+H
Sbjct: 48 LNLVEEFNLNAKITDIALIPQEKSPLS----CLVIASGVAKISIVRFDAVTNSLETLSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
+E + A+ ++VDP R +L++ I L G+
Sbjct: 104 YYEDKLS---DISLVTLAKTSKLRVDPMNR--ALLLFNNDSIALLPLFSGNHEDEDEDDE 158
Query: 212 ----GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
+ E T + S + ++++L ++++V D F++ + +P + +L++
Sbjct: 159 EDDYDVTRGEVTTKRSKKNEKHVGQSKIFHVKELHQELQNVLDIQFLNDFTKPTLAVLYQ 218
Query: 266 RELTWAGRVSWKHH--TCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+LTW G + MI L++ T T +I + +L D ++LL +
Sbjct: 219 PKLTWVGNTELNPQPTSFMIFTLNLRTNELETAFDVVIIATLHDLSWDWFQLLPISR--- 275
Query: 320 GVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSFSVELDA---AHATW--- 371
G +V+G N + Y + + LN++A D S + R EL+ T+
Sbjct: 276 GCVVMGNNEMAYIDNTGVLQSIIHLNSFA---DKSLQRARIIDETELEVFFNEKVTYFWS 332
Query: 372 -------LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----------TNPS-VLT 412
+ ++ L+ + +L + + +GR++ + DL K +NP+ V
Sbjct: 333 ASTDKKNIDDETLLIIDASANLYYVRLEAEGRLLTKFDLIKLPIVNDALKDTSNPTCVAR 392
Query: 413 SDITTIGNSL-FFLGSRLGDSLLVQF 437
D + +S+ F+G GDSL+V+
Sbjct: 393 VDPNSSNSSMDLFIGYLSGDSLVVRL 418
>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
protein 1
gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
Length = 1305
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 120/624 (19%), Positives = 248/624 (39%), Gaps = 124/624 (19%)
Query: 140 AKISVLEFDDSIHGLRITSMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
++S++ FD L S+H +++ E L G P ++ +P RC +LV+
Sbjct: 82 GRVSIVRFDAENQTLETESLHYYDAKFEELSALTVGA-----APRLEQEPAARC--LLVH 134
Query: 198 GLQMIILKASQGGSGLV-------------GDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+ + +G D G G S + +SH+ + D+K
Sbjct: 135 NGDCLAVLPLRGHEEEGEEAEEEEEHPAKRARTDADGRLVGASTVMPASHLHS----DIK 190
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
+VKD F+ G + + +L++ +L+W G T LS+ ++ +I L
Sbjct: 191 NVKDMRFLRGLNKSAVGVLYQPQLSWCGNEKLTRQTMKFIILSLDLDDEKSTVINMLQGL 250
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC--ALALNNYAVS-----------LDS 351
P+ + ++ + + G ++ G N + Y + + A++LN ++ S L +
Sbjct: 251 PNTLHTIIPLSN---GCVLAGVNELLYVDNTGALQGAISLNAFSNSGLNTRIQDNSKLQA 307
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL---------D 402
E P F+ + + D+ LL + + + + +GR++ +
Sbjct: 308 FFEQPLCYFATQSNG-------RDILLLMDEKARMYNVIIEAEGRLLTTFNCVQLPIVNE 360
Query: 403 LSKTN--PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ K N P+ + ++ SL F+G + GD++ V+ + L S L+
Sbjct: 361 IFKRNMMPTSICGNMNLETGSL-FIGFQSGDAMHVRL------NNLKSSLEH-------- 405
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGP 514
+ + S+ L+ + + + LYG NN E +K F D L+NIGP
Sbjct: 406 -------KGTVSETLE--TDEDYMELYG---NNAEKEKKNLETESPFDIECLDRLLNIGP 453
Query: 515 LKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADSSRMAA 570
+ + G + + + ++ + EL + G T+ + + + +
Sbjct: 454 VTSLAVGKASSIEHTVAKLANPNKDELSIVATSGNGTGSHLTILENTIVPTVQQALKFIS 513
Query: 571 YDDEYH-------AYLIISLEARTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
++ YL+ + ++T + + D + ++ D+ T++ G +R
Sbjct: 514 VTQIWNLKIKGKDKYLVTTDSSQTRSDIYSIDRDFKPFKAADFRKNDTTVSTAVTGGGKR 573
Query: 623 VIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
++QV +G + D ++ MT + F V+ V I DP++LL S G
Sbjct: 574 IVQVTSKGVHLFDINFKRMMTMNFDF--------------EVVHVCIKDPFLLLTNSKGD 619
Query: 680 IRLLVGDPSTCTVSVQT--PAAIE 701
I++ +P V+T P A++
Sbjct: 620 IKIYELEPKHKKKFVKTVLPDALK 643
>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 1123
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 75/318 (23%), Positives = 127/318 (39%), Gaps = 69/318 (21%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
N+R L+ V D F+ GY P +++L++ H L + P
Sbjct: 208 NIR-LEELQVFDIKFLRGYDRPTILVLYQD-------TKETRHVKTYQVLLKEKEFAEGP 259
Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
W+ N+ A L+ V P+GGVL+VG TI YHS SA ++A+ +
Sbjct: 260 --WAQNNVEGGASLLIPVLMPLGGVLIVGEQTITYHSGSAFRSVAMRPAII--------- 308
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
+SV + + LL+ G+L+ + + +D + V + + + + + S +
Sbjct: 309 -KCYSV---------IDTNRFLLADSEGNLLSVLLTHDRQDKVTAIKIDRLGVTSILSCL 358
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
T + N + F GS+ GDS L++ ++E G S L A+
Sbjct: 359 TYLDNGVVFGGSQFGDSQLLRLATE----------RDETGSFVRVLESFSNLGPICDMAV 408
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
D+ + T S A +D G L+ G+ GI +
Sbjct: 409 VDL------------ERQGQCQVVTCSGAFKD-----GSLRVVRNGV---------GIEE 442
Query: 536 QSNYELVELPGCKGIWTV 553
Q+ +ELPG KGIW++
Sbjct: 443 QAT---IELPGIKGIWSL 457
>gi|71413926|ref|XP_809084.1| cleavage and polyadenylation specificity factor-like protein
[Trypanosoma cruzi strain CL Brener]
gi|70873410|gb|EAN87233.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma cruzi]
Length = 499
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 54/260 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIDSSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
NN + D E + F V L A+ T + N+ + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFMVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIE 458
C + M S + F IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499
>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG VV + L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479
>gi|190345965|gb|EDK37945.2| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
6260]
Length = 1206
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 123/611 (20%), Positives = 236/611 (38%), Gaps = 102/611 (16%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
+ ++ +CH ++ G ++++ + +GG++ D +++ + ++S+LEFD
Sbjct: 55 SGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD------- 103
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
+ F + H K G G + VDPQ R + ++ KA + L
Sbjct: 104 ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL---- 159
Query: 218 DTFGSGGGFSARIES----SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
S+ +ES + I + LD GY P++ + E +A
Sbjct: 160 -------ELSSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHY 199
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANT 328
+ + S L++ + L + A + +P + L+ +P+PIGGV+V G++
Sbjct: 200 DASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSF 259
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I YH+ + L L S S +P ++V H N LL + GD
Sbjct: 260 IFYHNPTIDQQLYLP--IPSRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFR 312
Query: 389 LTVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSM 445
+ + Y D V L + + ++ I F D +L Q G S
Sbjct: 313 VLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSY 372
Query: 446 LSSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
+SS D+ + + + R + AL +++ G+ +ES + +
Sbjct: 373 ISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMI 432
Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
S LK ++G+ I+ LV P +V+
Sbjct: 433 TGHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF---------- 463
Query: 565 SSRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
++R++A + + Y++IS A +T+VL +++ EV +S FV + G +
Sbjct: 464 TTRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKS 520
Query: 623 VIQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
+IQ++ G R + +G + + P T++S S VL+G+S+
Sbjct: 521 LIQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSN 572
Query: 678 GSIRLLVGDPS 688
+ DP+
Sbjct: 573 RELCYFEIDPT 583
>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1181
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 92/420 (21%), Positives = 156/420 (37%), Gaps = 94/420 (22%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + +LK + W
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHMK-----TYEVSLKDKEFVKGPWKQD 210
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P G L++G +I YH+ + Y V + L R S V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259
Query: 363 ---ELDAAHATWLQNDVA-------------------LLSTKTGDLVLLTVVYDGRV--- 397
++DA + +L D+A LL G L +L + + ++
Sbjct: 260 CYGKVDANGSRYLLGDMAGRLFMLLLEREDKMDGTXYLLGDMAGRLFMLLLEREDKMDGT 319
Query: 398 --VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
V+ L L + IT + N + ++GSRLGDS L++ + E F
Sbjct: 320 TTVKDLKLEFLGEITIAECITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFT 379
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
++ + DM + T S A ++ G L
Sbjct: 380 NL---------------GPIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSL 412
Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
+ G+ I+ AS ++LPG KG+W + + R E
Sbjct: 413 RIIRNGIGIHEHAS------------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAER 460
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
L++S +T VL + E TE + +T GN+ +++IQV R++D
Sbjct: 461 DNTLVLSFVRQTRVLMLSGEEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 519
>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG VV + L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479
>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
Length = 1157
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 95/212 (44%), Gaps = 40/212 (18%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
N+R L+ HV D F++G P +V +++ H + I+ L+ H
Sbjct: 155 NIR-LEELHVVDIQFLYGCANPTIVFIYQDP-----------HGRHVKTYEIN--LRDHE 200
Query: 297 LI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
W N+ +A +++AVP+P+GG L++G +I YH S A+A
Sbjct: 201 FAKGPWKQDNVEVEACRVIAVPNPLGGALIIGQESITYHKGSNYHAIA----------PP 250
Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKT 406
L +SS + ++D + +L D+ G L +L + + DG V+ L L
Sbjct: 251 ALKQSSLTCHGKIDTNGSRYLLGDM------NGRLYMLLLERQELIDGTYEVKDLKLEML 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+ + + + N + F+GS LGDS L + +
Sbjct: 305 GETSIAHCLVYLDNGVVFIGSMLGDSQLAKLS 336
>gi|19114492|ref|NP_593580.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe 972h-]
gi|46395602|sp|O13807.1|DDB1_SCHPO RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|2330717|emb|CAB11219.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe]
Length = 1072
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 95/492 (19%), Positives = 190/492 (38%), Gaps = 97/492 (19%)
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RES GPL+ VDP R + VY + I+ + + + FS RI+
Sbjct: 111 RES-QSGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V+ D ++ P + +L++ + ++K ++
Sbjct: 170 NVV-----------DIAMLYNSSRPSLAVLYKDSKSIVHLSTYK------------INVR 206
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
+ + + + HD + +PS GGV V G ++Y S+ + L Y
Sbjct: 207 EQEIDEDDV-VCHDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY-------- 257
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P ++FS + T L + + +++ ++G L ++ V ++L K S + S
Sbjct: 258 --PITAFSPSISNDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIAS 314
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ + ++ F+GS +S+L+Q PS + +
Sbjct: 315 CLIALPDNHLFVGSHFNNSVLLQL------------------------PSITK-NNHKLE 349
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
LQ+ VN +S + + T S+ T S A +D G L+ + I
Sbjct: 350 ILQNFVNIAPISDFIIDDDQTGSSIITCSGAYKD-----GTLRIIRNSINI--------- 395
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
N L+E+ G K ++V S A YD+ + +L + E R +++
Sbjct: 396 ---ENVALIEMEGIKDFFSV------------SFRANYDN--YIFLSLICETRAIIVSPE 438
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
+ + + D + TI ++G +++Q+ + R+ DG + +S P + G
Sbjct: 439 GVF---SANHDLSCEESTIFVSTIYGNSQILQITTKEIRLFDGKKLHSWIS--PMSITCG 493
Query: 654 SGSENSTVLSVS 665
S ++ ++V+
Sbjct: 494 SSFADNVCVAVA 505
>gi|325189950|emb|CCA24429.1| splicing factor putative [Albugo laibachii Nc14]
Length = 1644
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 130/316 (41%), Gaps = 47/316 (14%)
Query: 299 WSAMNLPHDAYKLLAVPS----PIGGVLVVGANTIHYHSQS---ASCALALNNYAVSLDS 351
WS + +P A KL+AVP P GGVLV+ I Y +++ SC+ L + +
Sbjct: 660 WSQV-VPRSANKLVAVPGGNDGP-GGVLVIAQGLIQYQNENHPPLSCSFPLRSTG-GPNP 716
Query: 352 SQELPRSSFSVELDAAHATWLQNDV--ALLSTKTGDLVLLTVVYDGRVVQRLDLS--KTN 407
Q+ + + + + + AT Q D+ L+ ++ GDL +++ Y G VQ+L + T
Sbjct: 717 VQDERKQGYPMMI-VSTATHKQRDLFFVLMQSEWGDLFKISLEYAGSSVQKLRIQYFDTI 775
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
P L IT G L F S + L QF + + D E + PS
Sbjct: 776 PVALALCITKTG--LLFAASEFSNHYLFQFLSIGEDDDAAQCVSAAENDQEPEIPSFSVR 833
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
+ + + ++ + ++ E + ++ + N L+ +GL +
Sbjct: 834 KLKNLAMISNIPSISPITQLLVDDFANEQTPQLYALCGQG---NRSSLRILRHGLPVMEM 890
Query: 528 ASATGISKQSNYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR 586
A++ LPG K +W + ++ D Y+++S E
Sbjct: 891 AASA------------LPGVAKAVWCLKE--------------SFTDTCDKYIVVSFEDA 924
Query: 587 TMVLETADLLTEVTES 602
T+VLE D + E+T+S
Sbjct: 925 TLVLEIGDTVEEITDS 940
>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
Length = 986
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 31/203 (15%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFSKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSP+ G +++G +I YH N+Y + + +
Sbjct: 208 RQDNVEREAMMVIPVPSPMCGAIIIGQESILYHDG--------NSYVAVVPPIIKQSTIT 259
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV-VQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L + DG + V+ L + +
Sbjct: 260 CYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTLSVKDLKVELLGEISIPEC 313
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 314 ITYLDNGVIYVGSRLGDSQLIKL 336
>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
Length = 1148
Score = 52.8 bits (125), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 60/276 (21%), Positives = 116/276 (42%), Gaps = 61/276 (22%)
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
G++S G + +DP+ R G+ +Y + I+ + S L S R+E
Sbjct: 113 GKQS-ETGIIAVIDPEARVIGLRLYDGLLKIIPLGKDNSELKAS----------SIRMEE 161
Query: 233 SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
V +D F+HG P ++++H+ ++ +H + IS
Sbjct: 162 VEV-----------QDLNFLHGCQNPTIILIHQD-------INGRH----VKTHEISLRD 199
Query: 293 KQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---LNNYAVS 348
K+ + W N+ DA ++ VP P+ G +++G +I YH+ + A+A +N ++
Sbjct: 200 KEFVKMPWKQDNVEPDASIVIPVPEPLCGAIIIGQESILYHNGAGYVAVAPPVINQSTIT 259
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-------VQRL 401
+ ++D+ + +L D+A G L +L + + ++ L
Sbjct: 260 CYT-----------QVDSNGSRYLLGDMA------GHLFMLLLETEEKIDGTPCVKENGL 302
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ + IT + N + F+GSR GDS LV+
Sbjct: 303 KVELLGEISIPEAITYLDNGVLFIGSRCGDSQLVKL 338
>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
occidentalis]
Length = 1154
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 100/431 (23%), Positives = 167/431 (38%), Gaps = 88/431 (20%)
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+PV+ I++E + T +H + AL L + P W NL +A L+ V
Sbjct: 178 DPVLAIVYEEQQT-------RHMKTHVIALR-DKELMKGP--WGQRNLDLEADMLIPVED 227
Query: 317 PIGGVLVVGANTIHYH-SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND 375
GV++VG TI YH Q C + SF + + N+
Sbjct: 228 TETGVIIVGGETIVYHYGQDYICI-----------------QPSFLRTTKISCYCRIDNN 270
Query: 376 --VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
V +L G L +LT+ + + V L + ++ + N + F+GSRLGDS
Sbjct: 271 RLVFILGGICGRLFILTLRRENKKVVSHSLDLLGSVSIPECLSYLDNGVVFVGSRLGDSQ 330
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSASN 492
L++ + A P + L ++ A+ DM+ +L G
Sbjct: 331 LIR--------------------MHAQEPFIEVLESYTNLGAILDMI-VVDLEKQGQDQL 369
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWT 552
T S Q G L+ G+ I+ A VEL G KGIW
Sbjct: 370 ITCSGQGA-----------CGSLRIIRNGIGIHELAC------------VELSGIKGIWA 406
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--TADLLTEVTESVDYFVQGR 610
+ R + A DD L++S +T V + + L +VT + + +
Sbjct: 407 L-----RMNTAQLEEDTPTDDT----LVLSFVGQTRVFNCSSTEELEQVTLPAAFDIDSQ 457
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQ-DLSFGPSNSESGSGSENSTVLSVSIADP 669
T A N+ G +VIQV ++ ++ + T+ D F P + N +++++ +
Sbjct: 458 TFCARNVLG-NQVIQVTDKRVNLISVTSKTRVDQWFPPEGEIITQCACNDVQVALALKNV 516
Query: 670 YVLLGMSDGSI 680
V L + DGS+
Sbjct: 517 LVYLEIRDGSL 527
>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
Length = 1286
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 127/604 (21%), Positives = 225/604 (37%), Gaps = 114/604 (18%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++ +L+F +
Sbjct: 41 LLRADKQGKLNLIASKDVFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF I +S N D K V + + G
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPIFASIEQNYEMYD-KQVTNTNEIDGCTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L E +L ++ +++H LP D+ L +P P
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDSSAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE + S+ A H
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSALHRI 295
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + ++ +V+ + + + + I + + F+ + G
Sbjct: 296 R-KFFFILIQSEFGDLYKIEMDHEDGIVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
+ QF+ G + K G A TK+L ++ L D V L +
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411
Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
+ N S Q +L GP L+ +GL I A EL
Sbjct: 412 ILDAKNANSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN------------EL 452
Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG K IWT+ + NA +Y Y+I+S E T++LE + + EV +S+
Sbjct: 453 PGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDSL 498
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ T N+ +IQV + G R ++G + + + P N + + + NST +
Sbjct: 499 --LLTNVTTIHVNILYDNTLIQVHDTGIRHINGKVVHEWVP--PKNKQIKAATSNSTQIV 554
Query: 664 VSIA 667
+S++
Sbjct: 555 ISLS 558
>gi|302406266|ref|XP_003000969.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
gi|261360227|gb|EEY22655.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
Length = 1059
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 132/578 (22%), Positives = 217/578 (37%), Gaps = 141/578 (24%)
Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
V + + G + S+A G++ +D +ILA + +I+++E+ + + + F+
Sbjct: 59 VLSHDVFGIIRSMAAFRIAGSN----KDYLILATDSGRIAIIEY--------LPAQNRFQ 106
Query: 164 SPEWLHL----KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-----MIILKASQGGSGLV 214
LHL K G G + DP+GR L+ L+ ++ + SQ
Sbjct: 107 R---LHLETFGKSGIRRVVPGEFLACDPKGRA--CLIASLEKNKLVYVLNRNSQA----- 156
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
E T S A HV+++ LD+ GY PV L E + T A +
Sbjct: 157 --ELTISSP--LEAHKPGVHVLSMVALDV----------GYANPVFAAL-ETDYTEADQD 201
Query: 275 SWKHHTCMISALSISTTLKQHPL------IWSAMNLPHDAYKLLAVPSPIG-----GVLV 323
+AL + T L + L + + P D L P G GVLV
Sbjct: 202 PTGQ-----AALDVETQLVYYELDLGLNHVVRKWSEPVDNTASLLFQVPGGNDGPSGVLV 256
Query: 324 VGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA----L 378
G I Y HS + + + + E P S+ H L+ L
Sbjct: 257 CGEENITYRHSNQEAFRVPVPRRR----GATEDPSRKRSIVAGVMHK--LKGSAGAFFFL 310
Query: 379 LSTKTGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGD 431
L T+ GDL +T+ DG V+RL + + + S + + + ++ S+ G+
Sbjct: 311 LQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILKSGFLYVASQFGN 370
Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSAS 491
QF E+ GD + L SS D D E +
Sbjct: 371 YQFYQF--------------EKLGD------DDEELEFSSDDFPTDPKQSYEAVFF---- 406
Query: 492 NNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV----- 542
++ + A+ +S+ ++ PL D DA +A G +S + ++
Sbjct: 407 ----HPRELENLALVESIDSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLE 462
Query: 543 -------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
ELPG +WT+ K SRG D+Y AY+++S T+VL +
Sbjct: 463 VNEIVASELPGIPSAVWTL--KLSRG------------DQYDAYIVLSFTNATLVLSIGE 508
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EV +S F+ A L G +IQV +G R
Sbjct: 509 TVEEVNDS--GFLTSVPTLAAQLLGGEGLIQVHPKGIR 544
>gi|428180158|gb|EKX49026.1| hypothetical protein GUITHDRAFT_68305 [Guillardia theta CCMP2712]
Length = 1202
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 110/552 (19%), Positives = 202/552 (36%), Gaps = 97/552 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
++ +C G + S+A G++ +D ++L + +ISVLEF + +
Sbjct: 49 IQSICQMECFGLIRSMASFRLPGSN----KDYLVLGADSGRISVLEFSKERNQFERVHLE 104
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+ K G G + DP+GR + Q ++ V + D
Sbjct: 105 TYG-------KSGCRRIVPGQFLASDPKGRAVMISAIEKQKLVY---------VFNRDA- 147
Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSW 276
S+++ S + H G+ P+ L + + G+ S
Sbjct: 148 ------SSKLTISSPLEAHKASTIHFSIVGVDVGFDNPIFAALEMDYSDADADETGQ-SA 200
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP-----IGGVLVVGANTIHY 331
+ +++ + L + + P DA + +P P GVLV N I Y
Sbjct: 201 EEFNKVLTFYELDLGLNH---VVRKASEPIDAASNMLIPVPGDTDGPSGVLVCAENKIAY 257
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
+AL + Q L + +S H LL ++ GDL LT+
Sbjct: 258 KKPDHEDVVALIPRRQGMPLDQPLLITGYS------HLKQKDGFFFLLQSEIGDLYRLTL 311
Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSS 448
Y+ V ++++ + + IT + F+ S G+ L QF G+ M+
Sbjct: 312 TYNDEEVSEINITYFDTVPVAQSITILKTGFLFVASEFGNHALYQFLSIKGSDESDMMPV 371
Query: 449 GLKEEFGDIE----ADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
++ E IE A P L ++L +++ L L G E + ++
Sbjct: 372 EVEIEGETIEIPHFAPRPLKNLLLVDEMESLSPILDMRVLDLAG------EETPQIYA-- 423
Query: 505 VRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHKSSRG 560
L GP L+ +GL + + + ELP +WTV +G
Sbjct: 424 ----LCGKGPRSTLRTLRHGLAV------------AEMAVSELPSNPLAVWTV-----KG 462
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
+ D++ Y++++ T+VL D + EVT+S + +T++ +L G
Sbjct: 463 SSKDAA---------DKYIVVTFANATIVLSIGDTVEEVTDS-GFLATNKTLSV-SLLGD 511
Query: 621 RRVIQVFERGAR 632
++QV G R
Sbjct: 512 DSLLQVHPNGLR 523
>gi|146420838|ref|XP_001486372.1| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
6260]
Length = 1206
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 120/610 (19%), Positives = 237/610 (38%), Gaps = 96/610 (15%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
+ + ++ +CH ++ G ++++ + +GG++ D +++ + ++S+LEFD
Sbjct: 53 LESGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD----- 103
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
+ F + H K G G + VDPQ R + ++ KA + L
Sbjct: 104 --ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL-- 159
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ +++ I + LD GY P++ + E +A +
Sbjct: 160 -----ELLSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHYDA 201
Query: 276 WKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
+ S L++ + L + A + +P + L+ +P+PIGGV+V G++ I
Sbjct: 202 SLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSFIF 261
Query: 331 YHSQSASCALALNNYAVSLDS-SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
YH+ + L L + L + S +P ++V H N LL + GD +
Sbjct: 262 YHNPTIDQQLYL---PIPLRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFRV 313
Query: 390 TVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSML 446
+ Y D V L + + ++ I F D +L Q G S +
Sbjct: 314 LIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSYI 373
Query: 447 SSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
SS D+ + + + R + AL +++ G+ +ES + +
Sbjct: 374 SSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMIT 433
Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADS 565
S LK ++G+ I+ LV P +V+ +
Sbjct: 434 GHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF----------T 464
Query: 566 SRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+R++A + + Y++IS A +T+VL +++ EV +S FV + G + +
Sbjct: 465 TRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKSL 521
Query: 624 IQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
IQ++ G R + +G + + P T++S S VL+G+S+
Sbjct: 522 IQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSNR 573
Query: 679 SIRLLVGDPS 688
+ DP+
Sbjct: 574 ELCYFEIDPT 583
>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
Length = 1329
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 152/355 (42%), Gaps = 66/355 (18%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R + +SA L+ V ++ G + + + + G + + +++ ++++E+D+
Sbjct: 205 RFEVHSVSAEGLQYVTEGKMFGRIGAAKLFTPKGENKAL----MVIVTLKQDVAIVEYDN 260
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
RI ++ E GR + + G L+ V P G G+ + + ++
Sbjct: 261 G----RIKTLASRNISE----NFGRPA-SNGILLSVHPDGEVIGLRIMSSTFKCITWNRA 311
Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
S L S++ +N + H+ DF+F+HG+ PV+ +++
Sbjct: 312 TSKL------------------STYSLNY---SLTHLSDFVFLHGFQFPVIALIY----- 345
Query: 270 WAGRVSWKHH-TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
G + +H TC IS + P WS ++ +A+ L+AVP P+ GV+VVG ++
Sbjct: 346 --GDLVGRHVITCRISL--DEQEFENGP--WSRGHIEWEAHTLIAVPPPLCGVIVVGCSS 399
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
+ Y +N +S S L +S + DAA L G L L
Sbjct: 400 LLY---------IRDNSTISTVSPPFLSKSIVNC-YDAAP----DGLTYFLGQLDGTLSL 445
Query: 389 LTVVYDGRVVQRLDLSKTNPSVL--TSDITTIG----NSLFFLGSRLGDSLLVQF 437
L + + ++ LS+ ++L TS ++ SL F+GSR+ DS L++
Sbjct: 446 LKLDIETDAEGKVTLSRMRATILGVTSPPDSLSYMHKESLLFVGSRIADSKLLRL 500
>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei gambiense DAL972]
Length = 1452
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ + LS IS T
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
P L+ +P++ +L V S + GV+ G NTI +
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373
Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
S+S++ + N+ L+SS L R + S+ A ++D +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
G +++L + G + + ++ S IT I L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487
>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 1452
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ + LS IS T
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
P L+ +P++ +L V S + GV+ G NTI +
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373
Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
S+S++ + N+ L+SS L R + S+ A ++D +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
G +++L + G + + ++ S IT I L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487
>gi|402595041|gb|EJW88967.1| hypothetical protein WUBG_00126 [Wuchereria bancrofti]
Length = 621
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 82/363 (22%), Positives = 135/363 (37%), Gaps = 104/363 (28%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A +++VP P+GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEANIVISVPEPVGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY---------DGRVVQRLDLSKTNP 408
+ +D +L D+A G+L +L + +V+ + +
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLELGKDQEQDENSAVIVRDMKVESLGE 310
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ + + + N + F+GSR GDS +L
Sbjct: 311 TCIAECMCYLDNGVCFIGSRFGDS---------------------------------QLI 337
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA 528
R S++ A T ++ DS N+ P++D + +R N
Sbjct: 338 RLSTEP---------------------RADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQ 375
Query: 529 ---SATGISKQSNY----------EL--VELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 573
+ +G K EL VEL G K ++T+ +RG D
Sbjct: 376 QILTCSGAYKDGTIRIIRNGIGIEELASVELKGIKNMFTL---RTRG------------D 420
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
E+ YLI+S ++ T VL E TE + V G T+ AG LF + ++QV +
Sbjct: 421 EFDDYLILSFDSETHVLFINGEELEDTEITGFAVDGATLWAGCLFHSKTILQVTHGEVIL 480
Query: 634 LDG 636
+DG
Sbjct: 481 IDG 483
>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
Length = 1103
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 77/358 (21%), Positives = 151/358 (42%), Gaps = 60/358 (16%)
Query: 82 KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAK 141
KN R+ +G+S+ V + G ++++++ G ++D + + ED
Sbjct: 35 KNQYLQVNRLSEEGVSS-----VVEFEAPGRIDTMSLFRPSG----EKQDLLFITIEDTF 85
Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQ 200
++ D I L S+ + P GR S G + +DP R + +Y GL
Sbjct: 86 FTLGFIDGKIETLSSGSI---DDP------VGRRS-ESGSITTIDPLCRAVALSIYEGLL 135
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
II ++ F F+ R+E +VI++ L+ K P
Sbjct: 136 KII--------PFENNKHQFKEA--FNVRLEELNVIDIAFLESLGSK------SKSGPTF 179
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
+L++ + H ++ +++ L + +N+ H A L+ VP+P+GG
Sbjct: 180 ALLYQDHV-------GSRHVKTYEVKTLDKDMEESSL--NQLNVDHGANILIPVPAPLGG 230
Query: 321 VLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
V+ VG + Y ++S N++V+ ++ + S+ +LD + W D
Sbjct: 231 VICVGEAQVSYINESN------KNHSVASPANSRMAIRSYG-KLD--NTRWFLGD----- 276
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
++G L LL++ V L L + + ++S I+ + N F+GS GDS +++ +
Sbjct: 277 -QSGQLYLLSLQVSDSEVTGLTLKELGVTSISSCISYLDNGYVFIGSNYGDSQVIRIS 333
>gi|401883281|gb|EJT47496.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
Length = 1216
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 130/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
G T+ +L S L+ +C G V ++A G +D I+L+ + ++S+
Sbjct: 34 GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89
Query: 145 LEFDDSIHGLRITSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
+E L I+ FES + ++ K G G + VDP+GR G V L
Sbjct: 90 IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
I+ + ++G A + V+N+ D GY P+
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183
Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
L EL + A R + KH T L ++ +++ WS P D
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233
Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
L V P GGVLV + + H +++ + ++
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+SS + + AA ++ LL ++ GDL T+ ++G V+ L + + + +
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
+ + + F+ S GD L QF S T GL EE P
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
L + +L +++ + ++L G+AS+ + ++ R GP + +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458
Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
GL IN LVE LPG +WT+ + DDEY +
Sbjct: 459 GLDINV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
Y+++S T+VL + + EV ++ + G T+A L G ++QV G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543
>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 1 [Ciona intestinalis]
Length = 1150
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 92/438 (21%), Positives = 167/438 (38%), Gaps = 109/438 (24%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F+ RIE VI+ + F+HGY P +VI+++ +H I
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---- 341
+ + W N+ +A ++ VP P+ G +++G +I YH+ +A
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIAPPQI 253
Query: 342 ---LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYD 394
+N YA +D + +L D+A G L +L + + D
Sbjct: 254 KDTINCYA----------------PVDKDGSRYLLGDLA------GHLFILLLESDEMMD 291
Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
G V+ L + + I+ + N + ++GSRLGDS L++
Sbjct: 292 GTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIR----------------- 334
Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVN 511
+ D+ R + S L N G + + + Q T S A ++
Sbjct: 335 ---LPTDSSMEGRPKPSLISVLDTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKE---- 387
Query: 512 IGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
G L+ G+ I AS ++LPG KG+W + D+SR +Y
Sbjct: 388 -GSLRIIRNGIGIQEHAS------------IDLPGIKGLWPL-------RVFDTSR--SY 425
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
D L+IS + +L+ + E T+ + + +T N+ +++Q+ E+
Sbjct: 426 DT-----LVISFVGHSRILQLSGEEVEETDLPGFDDESQTFYCSNVC-HNQLVQITEKSI 479
Query: 632 RILDGSYMTQDLSFGPSN 649
R++ + Q + P N
Sbjct: 480 RLISHTERRQVHEWKPKN 497
>gi|348667612|gb|EGZ07437.1| hypothetical protein PHYSODRAFT_565381 [Phytophthora sojae]
Length = 1197
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 116/301 (38%), Gaps = 76/301 (25%)
Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV- 376
VLV+G NT+ Y ++ +CA+ Q PR V + AT Q D+
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAIP---------RRQGEPRDIVIV----SAATHKQRDLF 293
Query: 377 -ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
LL ++ GDL +++ Y G V+ + + + + S + L F S + L
Sbjct: 294 FVLLQSELGDLYKISLDYSGNAVEEIKIQFFDTVPVASSMCITKTGLLFCASEFSNHYLF 353
Query: 436 QF-TCGSG------------TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
QF + G G + LS+ + +++ A S L + + D+ N +
Sbjct: 354 QFLSIGEGDDTAKCSSLAMDPTELSTFPLRKLTNLQL-ASSMPSLSPVTQLLVDDLANEQ 412
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
+Y N+ S+ L+ +GL I A++
Sbjct: 413 TPQMYALCGNSNRSS-----------------LRVLRHGLPITEMAASA----------- 444
Query: 543 ELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
LPG K +W + +Y D Y Y+++S E T+VLE + + EVT+
Sbjct: 445 -LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETVEEVTQ 489
Query: 602 S 602
S
Sbjct: 490 S 490
>gi|406698009|gb|EKD01256.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
Length = 1216
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 129/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
G T+ +L S L+ +C G V ++A G +D I+L+ + ++S+
Sbjct: 34 GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89
Query: 145 LEFDDSIHGLRITSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
+E L I+ FES + ++ K G G + VDP+GR G V L
Sbjct: 90 IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
I+ + ++G A + V+N+ D GY P+
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183
Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
L EL + A R + KH T L ++ +++ WS P D
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233
Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
L V P GGVLV + + H +++ + ++
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+SS + + AA ++ LL ++ GDL T+ ++G V+ L + + + +
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
+ + + F+ S GD L QF S T GL EE P
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
L + +L +++ + ++L G+AS+ + ++ R GP + +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458
Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
GL +N LVE LPG +WT+ + DDEY +
Sbjct: 459 GLDVNV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
Y+++S T+VL + + EV ++ + G T+A L G ++QV G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543
>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
Length = 1230
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 127/607 (20%), Positives = 220/607 (36%), Gaps = 120/607 (19%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++++L+F +
Sbjct: 41 LLRADKQGKLNLIASKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLTILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF + +S N LD K V + + Y
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEALD-KQVTNTSEIDSYTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L W + H + P DA L +P P
Sbjct: 209 TLLSL------WEMDLGLNH-------------------VIRKYTFPIDASAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE S L
Sbjct: 244 GQQGPSGVIVCCDNFLVYKKVDHADVYCA-----YPRRLETGQEKNLSIVCSTLHRIRKF 298
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + ++ VV+ + + + + I + + F+ + G
Sbjct: 299 FF----ILIQSELGDLYKIEMEHEDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFTCGSG----TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--L 484
+ QF+ G G +M +S K G A TK+L ++ L D V L
Sbjct: 355 NHFFYQFS-GIGDEDNEAMCTS--KHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPIL 408
Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYEL 541
+ + N S Q +L GP L+ +GL I A
Sbjct: 409 DMKVIDAKNASSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN----------- 450
Query: 542 VELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
ELPG K IWT+ + NA +Y Y+I+S E T++LE + + EV
Sbjct: 451 -ELPGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVV 495
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
+S+ + T N+ +IQV + G R ++G + + + P N + + + N
Sbjct: 496 DSL--LLTNVTTIHVNILYDNSLIQVHDAGIRHINGKVIHEWVP--PKNKQIKAATSNCA 551
Query: 661 VLSVSIA 667
+ +S++
Sbjct: 552 QIVISLS 558
>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
Length = 1159
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 96/461 (20%), Positives = 172/461 (37%), Gaps = 116/461 (25%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E HVI+++
Sbjct: 143 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 187
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
F++G P + +++ GR H + P W N+
Sbjct: 188 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 231
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
+A ++AVPSP GG +++G +I YH+ A+A + + S V
Sbjct: 232 EAEASMVIAVPSPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 280
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+D + +L D+ G L +L + DG V ++ L + + + +T
Sbjct: 281 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 334
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ N + F+GSRLGDS LV+ S G+ +++ G I D R+
Sbjct: 335 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 393
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ T S A ++ G L+ G+ I+ AS
Sbjct: 394 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 420
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +R E L++S +T VL
Sbjct: 421 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 458
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + + +T GN+ +++IQ+ R++
Sbjct: 459 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 498
>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis
niloticus]
Length = 1140
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 74/341 (21%), Positives = 129/341 (37%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479
>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
Length = 1140
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 74/341 (21%), Positives = 130/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVTVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PILDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G +D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRESDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479
>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
Length = 1209
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 150/376 (39%), Gaps = 83/376 (22%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ + + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIDHEDEEVKTMKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 -TCGSGTSML--SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN-- 492
G + SS + G + + R R + L D +N + + +N
Sbjct: 368 QKLGDDDDEIEYSSVSYPDNGMADPIPQAYFRPRPLENLVLADELNSFDPIVDAKVTNLL 427
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
NT++ Q F+ R + + L+ +GL + S+ ELPG +W
Sbjct: 428 NTDTPQ-IFAACGRGARSSFRMLR---HGLDVEETVSS------------ELPGIPNAVW 471
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV K+ DD+Y AY+I+S T+VL + + EV+++ + T
Sbjct: 472 TVKLKA--------------DDQYDAYIILSFVNGTLVLSIGETIEEVSDT-GFLSSSPT 516
Query: 612 IAAGNLFGRRRVIQVFERGAR------------------ILDGS---------------- 637
IA + G ++QV+ G R I+ +
Sbjct: 517 IAVQQI-GEDSLLQVYPHGIRHVLSDRRVNEWRCPQHTTIVAATTNSRQVAIALSSAQLV 575
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDPST 689
Y DL G N S S VL++SIA+ PY+ +G D ++R++ DP T
Sbjct: 576 YFELDLE-GQLNEYQDRKSLGSGVLAMSIAEVPEGRQRTPYLAVGCEDQTVRIISLDPDT 634
Query: 690 C--TVSVQTPAAIESS 703
+S+Q A SS
Sbjct: 635 TLENISLQALTAPPSS 650
>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1223
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 113/262 (43%), Gaps = 35/262 (13%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GP+ VDP+ R G+ +Y ++ Q G FS R+E V +++
Sbjct: 130 GPIGAVDPECRMYGLHLYDGLFKVIPMDQTGQ----------LREAFSVRLEELQVFDVK 179
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
F+ G +P + +L++ T GR + C+ +P W
Sbjct: 180 -----------FLAGTPKPTIAVLYQD--TKEGRHIKTYEVCLKDK-------DFNPGPW 219
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
+ ++ + L+AVP+P+GGV+VVG I Y ++ + + + ++
Sbjct: 220 AQNDVESGSRFLIAVPAPLGGVVVVGEKVIAYLNKETTHGVGDGGGGGGGGGGGMIVKA- 278
Query: 360 FSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
+++ DA T+ D LLS G L LL +++D V+ L L + + S +
Sbjct: 279 IAMQSDATIMTYGAVDKDGSRYLLSDSAGRLHLLVLMHDKTRVRALKLESLGQTSIASSL 338
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
+ + N + ++GS GDS LV+
Sbjct: 339 SYLDNGVVYVGSAYGDSQLVRL 360
>gi|68531971|ref|XP_723667.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23478038|gb|EAA15232.1| Drosophila melanogaster CG13900 gene product [Plasmodium yoelii
yoelii]
Length = 1235
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 121/295 (41%), Gaps = 43/295 (14%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
+ S +M +S G A T++L+ D + + ++ + + ++N
Sbjct: 362 SGIGNDSNDAMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 419
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWT 552
R SL + +GL I A+ ELPG + IWT
Sbjct: 420 LPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 461
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
V +S EY Y+I+S E T++LE + + EV +S+ + T
Sbjct: 462 VKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 505
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
NL IQV++ G R ++G + + + P N + + + N + + VS++
Sbjct: 506 IHINLLYDNSFIQVYDTGIRHINGKIVQEWIP--PKNKQINAATSNGSQIVVSLS 558
>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
Length = 1129
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 12/144 (8%)
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P +W N+ +A ++ +P P GGV+VVG I YH + N Y+
Sbjct: 200 PPLWKQDNIEAEACMVIPIPQPYGGVIVVGHEAISYHKDA-------NAYSAIAPPLIHQ 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVL-LTVVYDGRV-VQRLDLSKTNPSVLTS 413
+ S ++D +L D LS + L+L L V DG V+ L + + +
Sbjct: 253 SQISCYGKIDRDGQRYLLGD---LSGRIFMLLLDLDVATDGTASVKDLKVELLGETSIPE 309
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
+ + N + F+GSR GDS LV+
Sbjct: 310 CVVYLDNGVVFIGSRFGDSQLVRL 333
>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
Length = 1135
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 130/628 (20%), Positives = 229/628 (36%), Gaps = 177/628 (28%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++ + L+ V + L+G + + + + ++D + + E +LE+ D
Sbjct: 43 RIEVNLSTQTGLKPVTEFNLYGRIAVIEVFRY----KNEKKDCLFILTESCYACILEYVD 98
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
G IT + ++ S ++ G VDP+ RC + +Y + I+ +
Sbjct: 99 ---GKIITRA-------YGDMRDKNYSVSQSGMHACVDPEARCIALRLYDGVLKIINLNS 148
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
L E RIE V+ D F+H +P + +L++
Sbjct: 149 SSKHLTSAEQ----------RIEEILVV-----------DMCFLHTANKPTLALLYDDN- 186
Query: 269 TWAGRVSWKHHTCMISALSIS---TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
S +H + + L S ++ + P + + D ++AVP P+ G+L++G
Sbjct: 187 ------SSRHLSTIAITLDNSGSGASIHKGP--FRHTQVEQDTILIVAVPEPLAGILLLG 238
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
I YH ++ N V+ T + L G+
Sbjct: 239 HVNITYHDSKNRSTCSIENI----------------VKRTIECVTPIDKHRYLCGDSNGE 282
Query: 386 LVLLTVVYDGRVV----QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
L LL + Y+ + RL + L + ++ I N + F+GS GDS L++
Sbjct: 283 LFLLLLDYNENRIPEERMRLATKYLGRTTLPNTLSYIDNYVVFVGSTFGDSELIRI---- 338
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
E D NN S Q
Sbjct: 339 -----------EVSD-----------------------------------NN--SGQHFT 350
Query: 502 SFAVRDSLVNIGPLKDFSY--------GLRINADASATGISKQ--------SNYELVELP 545
S D+L GP+KD G + A TG S + Y ++L
Sbjct: 351 SLHQYDNL---GPIKDMCIVDFEKQGQGQLVTASGVGTGGSLRIIRNGVGIHEYASIDLE 407
Query: 546 GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMV--LETADLLTEVTESV 603
G KG+W + + SS S++ + L++S +T+ LE D +TEV E +
Sbjct: 408 GVKGLWALKYLSS------STKQDS--------LLLSFVGQTIFLRLEGQD-VTEV-EEI 451
Query: 604 DYFVQG-RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS----EN 658
F G +T+ AGN+ ++ +Q+ E+ R++ ES GS EN
Sbjct: 452 PGFTNGEQTMYAGNV-TDQQFLQITEKQVRLI--------------ADESLKGSWEPEEN 496
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGD 686
+ + S+ VLLG+ +I L + D
Sbjct: 497 TQINLCSVNKNQVLLGVGSTAIYLEIND 524
>gi|268536658|ref|XP_002633464.1| C. briggsae CBR-DDB-1 protein [Caenorhabditis briggsae]
Length = 1134
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 87/357 (24%), Positives = 148/357 (41%), Gaps = 82/357 (22%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP+P+GGV+V+GAN+ Y + + + Y+ SL L + F+ +
Sbjct: 210 DSQVLIPVPAPVGGVIVLGANSALYKASDVNGDVV--PYSCSL-----LKNTIFTCHGIV 262
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV-VYDGR---VVQRLDLSKTNPSVLTSDITTIGN 420
DA+ D LL+ G L++L + + +GR V+ + + + + + + N
Sbjct: 263 DAS------GDRFLLADTDGRLLMLLLNIGEGRSGTTVKEMRIEYLGETSVADSVNYVDN 316
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
+ F+GSRLGDS L++ S L+ +++ ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTAPNGGSYSVVLET----------------YTNTGPIRDMVL 360
Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
E ++ + T S A +D G L+ G+ I AS
Sbjct: 361 VE---------SDGQPQLVTCSGADKD-----GSLRVIRNGIGIEELAS----------- 395
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
V+L G++ + +S+ D+ + + DE H I E LE LL T
Sbjct: 396 -VDLAKVIGMFPIRLRST----TDNFVIVSLPDETHVLKITGEE-----LEDVQLLEIET 445
Query: 601 ESVDYFVQGRTIAAGNLFG---RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ A +LFG ++QV E R + S+ Q + P+N ES S
Sbjct: 446 ERT-------TMYASSLFGPDDSELILQVTEEEIRFM--SFQKQVKIWRPTNGESVS 493
>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 2 [Ciona intestinalis]
Length = 1142
Score = 48.9 bits (115), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 91/217 (41%), Gaps = 40/217 (18%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F+ RIE VI+ + F+HGY P +VI+++ +H I
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
+ + W N+ +A ++ VP P+ G +++G +I YH+ +A
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIA---- 249
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDG-RVVQR 400
L Q+ V+ D + +L D+A G L +L + + DG V+
Sbjct: 250 --PLCFFQDTINCYAPVDKDGSR--YLLGDLA------GHLFILLLESDEMMDGTNTVRD 299
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L + + I+ + N + ++GSRLGDS L++
Sbjct: 300 LKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRL 336
>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
falciparum 3D7]
gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
falciparum 3D7]
Length = 1329
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 80/379 (21%), Positives = 147/379 (38%), Gaps = 64/379 (16%)
Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQEL 355
LP D L +P P G GVL+ N + Y + CA Y L+ Q+
Sbjct: 261 LPIDITAHLLIPLPGGQQGPSGVLICCENFLVYKKVDHEDIYCA-----YPRRLEIGQDK 315
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
S + + L+ ++ GDL + V ++ +V+ + + + + I
Sbjct: 316 NISIICWTMHRIKKFFF----ILIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSI 371
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+ + + F+ + G+ QF+ G G A T +L+
Sbjct: 372 SVLKSGSLFVAAEFGNHYFYQFSGIGDDNKQFMCTSNHPLGKNAIIAFKTNKLKNL---Y 428
Query: 475 LQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADAS 529
L D + L + + NT + Q ++ R GP L+ +GL I A
Sbjct: 429 LVDQIYSLSPILDMKIIDAKNTHTPQ-IYTLCGR------GPRSSLRILQHGLSIEELAD 481
Query: 530 ATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
ELPG K IWT+ + EY Y+++S E T+
Sbjct: 482 N------------ELPGKPKYIWTIKKDNL--------------SEYDGYIVVSFEGNTL 515
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
+LE + + EV++++ + T N+ IQV++ G R ++G + + ++ P
Sbjct: 516 ILEIGESVEEVSDTL--LLNNVTTLHINILYDNSFIQVYDTGIRHINGKVVQEWVA--PK 571
Query: 649 NSESGSGSENSTVLSVSIA 667
N + + S NS+ + +S++
Sbjct: 572 NKQIKAASSNSSQIVISLS 590
>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
strain H]
Length = 1276
Score = 48.5 bits (114), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 123/604 (20%), Positives = 222/604 (36%), Gaps = 114/604 (18%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++ +L+F +
Sbjct: 41 LLRADKQGKLNLIVSKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDNK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF + +S N D K V + +
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEMYD-KQVTNTTEIDACTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L E +L ++ +++H LP D L +P P
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDMSAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE + S+ H
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSTVHRI 295
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + + VV+ + + + + I + + F+ + G
Sbjct: 296 R-KFFFILIQSEYGDLYKIEMDHQDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
+ QF+ G + K G A TK+L ++ L D V L +
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411
Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
+ N S Q ++ R GP L+ +GL I A EL
Sbjct: 412 ILDAKNANSPQ-IYALCGR------GPRSSLRILQHGLSIEELADN------------EL 452
Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG K IWT+ + NA +Y Y+I+S E T++LE + + EV +++
Sbjct: 453 PGRPKYIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDTL 498
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ T N+ +IQV + G R ++G + + + P N + + + N+T +
Sbjct: 499 --LLTNVTTIHVNILYDNSLIQVHDTGIRHINGKVINEWVP--PKNKQVKAATSNATQIV 554
Query: 664 VSIA 667
+S++
Sbjct: 555 ISLS 558
>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1156
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 58/267 (21%), Positives = 109/267 (40%), Gaps = 59/267 (22%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G + +DP R G+ +Y GL II D G + R+E + +
Sbjct: 122 GAMAVIDPSARVIGLKLYDGLFKII------------PLDKEGELKAYCLRMEE---VEV 166
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH-PL 297
+D+D F++G P ++I+H+ + GR I A +S K+
Sbjct: 167 QDID--------FLYGCANPTIIIIHQDTM---GR--------HIKAKELSIKDKEFVKT 207
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
W N+ +A ++ VP P+ G +++G ++ YH+ S+ A+ S + +
Sbjct: 208 PWKQENVETEASMIIPVPEPLCGAIIIGRESVLYHNGSSFIAI----------SPPVIKQ 257
Query: 358 SSFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL---- 411
S+ +D +L D+A G L +L + Y+ + +L
Sbjct: 258 STIVCYARIDPEGTRYLLGDMA------GHLFMLLLNYEKNPDGTFKIKDPKVDLLGEIS 311
Query: 412 -TSDITTIGNSLFFLGSRLGDSLLVQF 437
+T + N + ++ SR+GDS L++
Sbjct: 312 IPESLTYLDNKIIYVASRVGDSQLIKL 338
>gi|68075683|ref|XP_679761.1| splicing factor 3b, subunit 3, 130kD [Plasmodium berghei strain
ANKA]
gi|56500578|emb|CAH95367.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium berghei]
Length = 1216
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 62/297 (20%), Positives = 120/297 (40%), Gaps = 48/297 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
+ S SM +S G A T++L+ L D + +
Sbjct: 362 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNL---YLVDQIYSLPIVDMKILDAKN 416
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ + ++ R GP L+ +GL I A+ ELPG + I
Sbjct: 417 SNIPQIYALCGR------GPRSSLRILQHGLSIEELANN------------ELPGKPRYI 458
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WT+ +S EY Y+I+S E T++LE + + EV +S+ +
Sbjct: 459 WTIKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNV 502
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
T NL IQV++ G R ++G + + + P N + + + N + + +S++
Sbjct: 503 TTIHINLLYDNSFIQVYDTGIRHINGKIVQEWVP--PKNKQINAATSNGSQIVISLS 557
>gi|17541566|ref|NP_502299.1| Protein DDB-1 [Caenorhabditis elegans]
gi|74965443|sp|Q21554.2|DDB1_CAEEL RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|5824558|emb|CAA92824.2| Protein DDB-1 [Caenorhabditis elegans]
Length = 1134
Score = 48.1 bits (113), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 96/396 (24%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP IGGV+V+G+N++ Y + Y SL L ++F+ +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262
Query: 365 DAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
DA+ +L LS G L++L T G V+ + + + + I I N
Sbjct: 263 DASGERFL------LSDTDGRLLMLLLNVTESQSGYTVKEMRIDYLGETSIADSINYIDN 316
Query: 421 SLFFLGSRLGDSLLVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
+ F+GSRLGDS L++ T +G S S + E + +I ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTEPNGGSY--SVILETYSNI---------------GPIRDMV 359
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
E ++ + T + A +D G L+ G+ I+ AS
Sbjct: 360 MVE---------SDGQPQLVTCTGADKD-----GSLRVIRNGIGIDELAS---------- 395
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
V+L G GI+ + S NAD+ Y+I+SL T VL+ E
Sbjct: 396 --VDLAGVVGIFPIRLDS----NADN------------YVIVSLSDETHVLQITGEELED 437
Query: 600 TESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
+ ++ TI A LFG ++Q E+ R++ S +++ + P+N E S
Sbjct: 438 VKLLEINTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK--FWEPTNGEIISK- 494
Query: 657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
+SV+ A+ ++L D ++ LL TC V
Sbjct: 495 -----VSVNAANGQIVLAARD-TVYLL-----TCIV 519
>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
Length = 1152
Score = 48.1 bits (113), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 111/268 (41%), Gaps = 56/268 (20%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E +VI+++
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNREL----------KAFNIRLEELNVIDVK----- 168
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL-IWSAMN 303
F++G P +V +++ H + IS K+ W N
Sbjct: 169 ------FLYGCQVPTVVFVYQ-----------DPHGRHVKTYEISVRDKEFSKGPWKQDN 211
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV- 362
+ +A ++AVP P G L++G +I YH+ A+A + +S+
Sbjct: 212 VETEASMVIAVPEPFCGSLIIGQESITYHNGDKYVAVA----------PPAIKQSTLICH 261
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+DA + +L D++ G L +L + + DG V V+ L + + + +T
Sbjct: 262 GRVDANGSRYLLGDMS------GRLFMLLLEKEELIDGSVTVKDLKVELLGETSIAECLT 315
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
+ N + +LGSRLGDS L++ + S
Sbjct: 316 YLDNGVVYLGSRLGDSQLIKLNVDADDS 343
>gi|47230701|emb|CAF99894.1| unnamed protein product [Tetraodon nigroviridis]
Length = 953
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 71/341 (20%), Positives = 130/341 (38%), Gaps = 64/341 (18%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 68 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 123
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 124 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 173
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S L +++++ + +
Sbjct: 174 CLTYLDNGVVFVGSRLGDSQLVKVRVTHSLSEL---------NVDSNDQGSFVTVMETFT 224
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
L +V+ + L + F G L+ G+ I+ AS
Sbjct: 225 NLGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS---- 271
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + ++ R E L++S +T VL +
Sbjct: 272 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 309
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQ+ R++
Sbjct: 310 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 349
>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
Length = 1140
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 73/341 (21%), Positives = 127/341 (37%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGD LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDPQLVKLNVDSNDQGSFVTVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 479
>gi|390357128|ref|XP_001198237.2| PREDICTED: splicing factor 3B subunit 3-like [Strongylocentrotus
purpuratus]
Length = 949
Score = 47.4 bits (111), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 100/259 (38%), Gaps = 39/259 (15%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + + + + + + + F+ S G+ L Q
Sbjct: 34 LAQTEQGDIFKITLETDDDMVTEIRMKYFDTVPVATSMNVLKTGFLFIASEYGNHYLYQI 93
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS E GD AP T R L+++ E LS S
Sbjct: 94 AHLGDDDDEPEFSSATPLEEGDTFFFAPRTLR-------NLEEVDQLESLSPILSCQIAD 146
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTV 553
+++ T V ++ +GL + S + ELPG +WTV
Sbjct: 147 LASEDTPQLYVACGRGPRSSMRVLRHGLEV------------SEMAVSELPGNPNAVWTV 194
Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
KS DDEY AY+I+S T+VL + + EVT+S F+
Sbjct: 195 KKKS--------------DDEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTL 238
Query: 614 AGNLFGRRRVIQVFERGAR 632
+ +L G ++Q++ G R
Sbjct: 239 SSSLIGDDALLQIYPDGIR 257
>gi|400597418|gb|EJP65151.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1212
Score = 47.4 bits (111), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 143/614 (23%), Positives = 217/614 (35%), Gaps = 133/614 (21%)
Query: 64 NVIEIYVVRVQEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
NV++ V Q G+KE SG + D + L+ H + G + S+A+
Sbjct: 19 NVVQ--AVLGQFAGTKEQLIITGSGSQLTILRPDPAQGKVIPLLSH-DIFGVLRSIAVFR 75
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G+ +D IILA + +I+VLE+ S + M F K G G
Sbjct: 76 LAGSS----KDYIILATDSGRITVLEYLPSPNRFSRLHMETFG-------KTGIRRVVPG 124
Query: 181 PLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
+ DP+GR V L ++ + SQ E T S A VI
Sbjct: 125 EYLACDPKGRACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIA 175
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLK 293
L LD+ GY PV L E + G + T ++ + L
Sbjct: 176 LTALDV----------GYANPVFAALEIDYTEVDQDNTGEALSEVETHLVY-YELDLGLN 224
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAV 347
WS P D L P G GVLV G + Y HS + + +
Sbjct: 225 HVVRKWSD---PVDPTASLLFQVPGGNDGPSGVLVCGEENVTYRHSNQDALRVPIPRRR- 280
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR----- 396
+ E P ++ H L+ LL T GDL +T+ V D
Sbjct: 281 ---GATEDPSRKRNIVAGVMHK--LKGSAGAFFFLLQTDDGDLFKITIDMVEDEEGAPTG 335
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
VQR+ + + + + + + + ++ S+ G+ QF E+ GD
Sbjct: 336 EVQRMKIKYFDTVPVATSLCILKSGFLYVASQFGNYAFYQF--------------EKLGD 381
Query: 457 IEADAPSTKRLRRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
L SS D D + E + Y + N A+ DS+ + PL
Sbjct: 382 ------DDDELEFSSDDFPVDPLAAYEPVYFYPRPAEN---------LALVDSIPAMNPL 426
Query: 516 KDFSYGLRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSS 558
D DA S G +S + + ELPG +WT+ S
Sbjct: 427 LDCKVANLTGEDAPQIYSICGNGARSTFRTIKHGLEVNEIVASELPGVPSAVWTLKLNS- 485
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
D++Y Y+++S T+VL + + EV++S + TIAA L
Sbjct: 486 -------------DEQYDTYIVLSFTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLL 530
Query: 619 GRRRVIQVFERGAR 632
G +IQV RG R
Sbjct: 531 GTDGLIQVHPRGIR 544
>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
Length = 1134
Score = 47.4 bits (111), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 49/200 (24%), Positives = 91/200 (45%), Gaps = 29/200 (14%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
L+ V D F++G+ EP + +++E + R + + +A + + P W+
Sbjct: 158 LEELQVLDVKFLYGFTEPTIALIYE---SGQNRYLKTYEISLQNA-----DIHRQP--WN 207
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
+ +A+ +L VP P G++V+GA +I Y+ S L+ SL R +
Sbjct: 208 IGKVEEEAFMILPVPPPSCGMVVIGAGSISYYKGQDS----LHITPASLKD-----RITC 258
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----GRVVQRLDLSKTNPSVLTSDIT 416
+D+ +L D + G L +L +V + G V+ L L + + S IT
Sbjct: 259 FGRVDSNGCRYLLGDYS------GRLFMLILVQEHSQSGIKVKDLCLEYLGETSIPSCIT 312
Query: 417 TIGNSLFFLGSRLGDSLLVQ 436
+ N+ ++GS GDS L++
Sbjct: 313 YLDNAFAYIGSSCGDSQLIK 332
>gi|341884150|gb|EGT40085.1| CBN-DDB-1 protein [Caenorhabditis brenneri]
Length = 1134
Score = 47.0 bits (110), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 143/354 (40%), Gaps = 82/354 (23%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VPSPI GV+V+G +++ Y S N+ V SS L + F+ +
Sbjct: 210 DSSMLIPVPSPISGVVVLGTHSLLYKSSE-------NDGEVVPYSSPLLENTIFTSHSIV 262
Query: 365 DAAHATWLQNDV--ALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGN 420
D ++ +D LL ++LL V + G V+ + + + + I I N
Sbjct: 263 DPTGERFIVSDTDGRLL------MLLLNAVENQSGLSVKEIRIDLLGDTSVAESINYIDN 316
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
+ F+GSR GDS L++ S S L + + ++DM+
Sbjct: 317 GVVFIGSRFGDSQLIRLLSEKTNSSYISVLDTYY----------------NIGPIRDMIM 360
Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
E ++ + T S A +D G L+ G+ I A+
Sbjct: 361 VE---------SDGQPQLVTCSGAEKD-----GSLRVIRNGIGIEELAT----------- 395
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
V+LPG GI+ + SS AD+ Y+I+SL T VL+ E
Sbjct: 396 -VDLPGVVGIFPIRLDSS----ADN------------YVIVSLVEETHVLQITGEELEDV 438
Query: 601 ESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSE 651
+ + T+ AG LFG V+QV ER R++ +++ + P+N E
Sbjct: 439 QFLQIDTALPTMFAGTLFGPNDSGLVVQVTERQVRLMSNGGLSK--FWEPANGE 490
>gi|68060004|ref|XP_671977.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56488645|emb|CAI04030.1| hypothetical protein PB301494.00.0 [Plasmodium berghei]
Length = 346
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 57/269 (21%), Positives = 108/269 (40%), Gaps = 41/269 (15%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 90 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 149
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
+ S SM +S G A T++L+ D + + ++ + + ++N
Sbjct: 150 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 207
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG-CKGIWT 552
R SL + +GL I A+ ELPG + IWT
Sbjct: 208 IPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 249
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
+ +S EY Y+I+S E T++LE + + EV +S+ + T
Sbjct: 250 IKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 293
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQ 641
NL IQV++ G R ++G + +
Sbjct: 294 IHINLLYDNSFIQVYDTGIRHINGKIVQE 322
>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
Length = 1104
Score = 46.6 bits (109), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 94/242 (38%), Gaps = 55/242 (22%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL + G + LL + +DG V L + S + + + L F+GSR GDS LV+
Sbjct: 281 LLGNRQGGMQLLVLAHDGSRVSGLRTEPLGYTCAPSCLAYLDSGLTFVGSRSGDSQLVRI 340
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ + P T S +L +V+ + L
Sbjct: 341 SAQP-----------------VNQPPTYLELVDSFPSLAPIVDFVVMDL---------ER 374
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
Q + + + G L+ G+ IN A+ VELPG KG+W++
Sbjct: 375 QGQGQLVMCSGIDSDGSLRVVRNGIGINRQAT------------VELPGIKGVWSL---- 418
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
R H YDDEY YL+++ E R + L T + L E E + +T+ G
Sbjct: 419 -RSH---------YDDEYDKYLLLTFVGETRLLALNTEEELDE-AELPGFDSGSQTLWCG 467
Query: 616 NL 617
N+
Sbjct: 468 NM 469
>gi|328869269|gb|EGG17647.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1194
Score = 46.6 bits (109), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 114/564 (20%), Positives = 210/564 (37%), Gaps = 111/564 (19%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+ V + G + S+A G +D +I+ + ++ +LE++ S +
Sbjct: 49 LDHVLYSEAFGVIRSIAPFRLTGGS----KDYLIVGSDSGRVVILEYNPSKNVFEKVHQE 104
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDE 217
F + G G + DP+GR G + L I+ + SQ +
Sbjct: 105 TFG-------RSGCRRIVPGQYISTDPKGRAFMIGAIEKQKLVYILNRDSQAKLSI---- 153
Query: 218 DTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM--VILHERELTWAGRVS 275
A + V ++ +D+ G+ P+ + + E T V
Sbjct: 154 -----SSPLEAHKAHTIVFSMCGVDV----------GFENPIFATISVDYSEETNIEDVE 198
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGGVLVVGANTIHY 331
H+T +++ + L WS + A +++VP P GGVLV ++Y
Sbjct: 199 ETHNTKVLTFYELDLGLNNVVRKWSE-EVDRSANLVVSVPGGSDGP-GGVLVCAQGRVYY 256
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQNDVA--LLSTKTGD 385
+ + D S +PR + E + +HA+ Q D+ L+ ++ GD
Sbjct: 257 RNIGHA------------DISVSIPRRNGMTEEKSLMIVSHASHKQRDMFFFLVQSEYGD 304
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L +T+ Y G +V + ++ + + IT + N F+ S GD L F
Sbjct: 305 LYKITLDYSGEMVSGMQIAYFDTFPTANCITMLKNGFLFVASEFGDHGLYLFK------- 357
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
S GL DAP+ + + + L L + S S F V
Sbjct: 358 -SLGLD--------DAPTASSAGNTEMVFFEPVFEPRNLVLTATIS----SLSPIVDFKV 404
Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------ELPGC-KGIWT 552
D L G + ++ +G+S+++N ++ +LPG GIWT
Sbjct: 405 AD-LAQEGTPQMYAL----------SGVSERANLRVLRHGLPITQMVDSQLPGTPAGIWT 453
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE----SVDYFVQ 608
+ + N + + Y+++S T+VL + + EV + S +
Sbjct: 454 IPQSLTTMRNPQYQGIGTVESPADRYIVVSFVGSTLVLGVGETVEEVQDSGILSTTTTIL 513
Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
R++ A NL ++Q+F +G R
Sbjct: 514 IRSMGA-NL---DSIVQIFAQGIR 533
>gi|301110252|ref|XP_002904206.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
gi|262096332|gb|EEY54384.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
Length = 1197
Score = 46.6 bits (109), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 68/306 (22%), Positives = 114/306 (37%), Gaps = 86/306 (28%)
Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVE--LDAAHATWLQND 375
VLV+G NT+ Y ++ +CA+ PR + + AT Q D
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAI---------------PRREGEHRDIIIVSAATHKQRD 291
Query: 376 V--ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ LL ++ GDL +++ Y G VV+ + + + + S + L F S +
Sbjct: 292 LFFVLLQSELGDLYKISLDYSGNVVEEIKIQFFDTIPVASSMCITKTGLLFCASEFSNHY 351
Query: 434 LVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA---------------LQD 477
L QF + G G K ++ ST LR+ ++ A + D
Sbjct: 352 LFQFLSIGEG----DDAAKCSSLAMDPTEFSTFPLRKLTNLALASSSASLSPVTQLLVDD 407
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ N + +Y NN S+ L+ +GL I A++
Sbjct: 408 LANEQTPQMYALCGNNNRSS-----------------LRVLRHGLPITEMAASA------ 444
Query: 538 NYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
LPG K +W + +Y D Y Y+++S E T+VLE + +
Sbjct: 445 ------LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETV 484
Query: 597 TEVTES 602
EV +S
Sbjct: 485 EEVAQS 490
>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
Length = 1170
Score = 46.6 bits (109), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 81/340 (23%), Positives = 136/340 (40%), Gaps = 48/340 (14%)
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
+P+DA L VP IGGVLV GAN I Y L N ++ L + ++S +
Sbjct: 229 VPNDANYLAPVPGHIGGVLVCGANWIMYDK--------LGNESILLPLLRRKDQTSVIIS 280
Query: 364 LDAAHATWLQND--VALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSDITTIG 419
HA +N LL GDL L + YD +++ ++++ + + ++
Sbjct: 281 -HVTHALKKKNYGFFILLQNDLGDLFRLIIDYDSNRELIKDIEITYFDTIPVCYNLNIFK 339
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
N L F LL QF L EE E D K ++ + ++
Sbjct: 340 NGLCFANCINRSQLLYQF----------EKLGEEIS--EEDIRINKTVQMDNIQLTKEKY 387
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
E L G + ++ S + DS++N L S ++ T +
Sbjct: 388 --FEFKLKGLDNLALIDVVESLS-PITDSILNDDTLVTLSTKSKLKTIVHGTPTTTLVES 444
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII--SLEARTMVLETADLLT 597
+L P I+T + A DDE YL+I +L +T+VL +++
Sbjct: 445 QLPIKP--TNIFTT-----------KTSANAVDDE---YLVITSTLSFKTLVLSLGEVIE 488
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
EV +S FV + A G+ ++Q++ G R ++G+
Sbjct: 489 EVNDS--EFVLDQPTVAVQQVGKSSIVQIYSNGLRHINGN 526
>gi|312076588|ref|XP_003140928.1| xeroderma Pigmentosum Group E Complementing protein [Loa loa]
Length = 516
Score = 46.6 bits (109), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A ++ VP P GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
+ +D +L D+A G+L +L + + G+ V+ + +
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
+ + + + N + F+GSR GDS L++ + GT +S L + + ++ AP
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362
Query: 465 KRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
++DM NG++ L S + + + + + L +
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
VEL G K ++T+ + D E+ YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+S ++ T VL E T+ + V G T+ AG LF ++QV ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483
>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
Length = 1459
Score = 46.6 bits (109), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 58/116 (50%), Gaps = 17/116 (14%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC----F 162
++ G + + ++ Q G++ D ++L +AKIS+++FD+ ++ L+ S+H F
Sbjct: 54 FKFSGKITDIVLIPQRGSE----LDCLLLVTPNAKISIIKFDEELNTLKTISLHYYTDEF 109
Query: 163 ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
E L L AR ++V+P+ +C VL++ + I + + DED
Sbjct: 110 EKLSMLQL-------ARTSQLRVEPKKKC--VLLFNTESIAILPFTQQFNIDNDED 156
>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
Length = 1249
Score = 46.2 bits (108), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 99/241 (41%), Gaps = 47/241 (19%)
Query: 95 GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
I L+ +C + + ++SL + G+ ++D +++ + K+++L++D + L
Sbjct: 52 NIDTGKLDKICVHNVFSVIQSLEKVRLTGS----QKDYLVVTSDSGKLAILQYDTGRNRL 107
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ--MIILKASQGGSG 212
+ F+ P H K G GP + DPQ R +L+ L+ +I K G
Sbjct: 108 ----VTVFQEP---HSKTGFRRNTPGPYLLTDPQNR--AILIGALERNKLIYKVHSDDKG 158
Query: 213 LVGDEDTFGSGGGFSARIESS--HVINLR--DLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G S+ +ES H I L LD GY PV V +
Sbjct: 159 ----------GMQISSPLESQIRHTITLAMCALDT----------GYENPVFVAIEAEYG 198
Query: 269 TWAGRV----SWKHHTCMISALSISTTLKQHPLIWSAMN--LPHDAYKLLAVPSPIGGVL 322
+ S H T + ++ + L ++ +N LP A L+ +PSP+GGVL
Sbjct: 199 ALDSKEYSIDSQAHQTLLFTSYELDQGLNH--VVRRVVNNKLPISATHLIPLPSPVGGVL 256
Query: 323 V 323
+
Sbjct: 257 I 257
>gi|392593521|gb|EIW82846.1| hypothetical protein CONPUDRAFT_81012 [Coniophora puteana
RWD-64-598 SS2]
Length = 1213
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 87/384 (22%), Positives = 146/384 (38%), Gaps = 97/384 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ +D V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIDHDEDEVKSLKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
S TS S G+ E F + + R R + AL D + + L
Sbjct: 368 QKLGDDDDEPEFSSTSFPSFGMAESFIPLPH---AHFRPRGLDNLALADEIESLDPILDA 424
Query: 489 SASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
N N+++ Q F+ R S L+ +GL + S+ ELP
Sbjct: 425 KVMNILPNSDTPQ-IFTACGRGSRSTFRMLR---HGLEVEESVSS------------ELP 468
Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
G +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 469 GIPNAVWTTKRTE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT-G 513
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGA------------RILDGS--------------- 637
+ T+A + G ++QV +G R+ G
Sbjct: 514 FLSSAPTLAVQQI-GSDALLQVHPQGIRHVLSDRRVNEWRVPQGKTIVCATTNKRQVVVA 572
Query: 638 -------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
Y DL G N + STVL++S+ + PY+ +G D ++R+
Sbjct: 573 LSSAELVYFELDLD-GQLNEYQDWKAMGSTVLALSVGEVPEGRQRTPYLAVGCEDQTVRI 631
Query: 683 LVGDPSTC--TVSVQT----PAAI 700
+ DP + T+S+Q P+AI
Sbjct: 632 ISLDPESTLETISLQALTAPPSAI 655
>gi|195586770|ref|XP_002083143.1| GD13507 [Drosophila simulans]
gi|194195152|gb|EDX08728.1| GD13507 [Drosophila simulans]
Length = 1227
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G+SD ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLSDNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|195169735|ref|XP_002025674.1| GL20829 [Drosophila persimilis]
gi|194109167|gb|EDW31210.1| GL20829 [Drosophila persimilis]
Length = 1225
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + S + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP T L+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG-CKGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|125977518|ref|XP_001352792.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
gi|54641542|gb|EAL30292.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
Length = 1228
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + S + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP T L+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|346971485|gb|EGY14937.1| pre-mRNA-splicing factor RSE1 [Verticillium dahliae VdLs.17]
Length = 1230
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 121/543 (22%), Positives = 201/543 (37%), Gaps = 125/543 (23%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
+ILA + +I+++E+ + + + + F K G G + DP+GR
Sbjct: 102 LILATDSGRIAIIEYLPAQNRFQRLHLETFG-------KSGIRRVVPGEFLACDPKGRA- 153
Query: 193 GVLVYGLQ-----MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
L+ L+ ++ + SQ E T S A HV+++ LD+
Sbjct: 154 -CLIASLEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVHVLSMVALDV---- 199
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL------IWSA 301
GY PV L E + T A + +AL + T L + L +
Sbjct: 200 ------GYANPVFAAL-ETDYTEADQDPTGQ-----AALDVETQLVYYELDLGLNHVVRK 247
Query: 302 MNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLDSSQ 353
+ P D L P G GVLV G I Y HS + + + A S +
Sbjct: 248 WSEPVDNTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQEAFRVPVPRRRGATEDPSRK 307
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDLSKT 406
+ +L + + LL T+ GDL +T+ DG V+RL +
Sbjct: 308 RCIVAGVMHKLKGSAGAFF----FLLQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYF 363
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
+ + S + + + ++ S+ G+ QF E+ GD +
Sbjct: 364 DTIPVASSLCILKSGFLYVASQFGNYQFYQF--------------EKLGD------DDEE 403
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
L SS D D E + ++ + A+ +S+ ++ PL D
Sbjct: 404 LEFSSDDFPTDPKQSYEAVFF--------HPRELENLALVESIDSMNPLIDCKVANLTGE 455
Query: 527 DA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSRMA 569
DA +A G +S + ++ ELPG +WT+ K SRG
Sbjct: 456 DAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAVWTL--KLSRG--------- 504
Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
D+Y AY+++S T+VL + + EV +S F+ A L G +IQV +
Sbjct: 505 ---DQYDAYIVLSFTNATLVLSIGETVEEVNDS--GFLTSVPTLAAQLLGGEGLIQVHPK 559
Query: 630 GAR 632
G R
Sbjct: 560 GIR 562
>gi|426192113|gb|EKV42051.1| hypothetical protein AGABI2DRAFT_229642 [Agaricus bisporus var.
bisporus H97]
Length = 1213
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
S TS SSG+ E +A P + R + AL D + + +
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423
Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
N N+++ Q F+ R + + L+ +GL + S+ +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467
Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
+ T+A + G ++QV G R I+ +
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKIIVAATTNKRQVVV 571
Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
Y DL G N + STVL++SI D PY+ +G D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630
Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
++ DP + T+S+Q P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655
>gi|380490733|emb|CCF35810.1| pre-mRNA-splicing factor rse-1 [Colletotrichum higginsianum]
Length = 1212
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 137/603 (22%), Positives = 226/603 (37%), Gaps = 129/603 (21%)
Query: 74 QEEGSKESK--NSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
Q G+KE + ++ +L S + V + + G + S+A G++ +D
Sbjct: 27 QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITVLSHDIFGIIRSMAAFRLAGSN----KD 82
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVDP 187
+ILA + +I+++E+ I + + F+ LHL K G G + DP
Sbjct: 83 YLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACDP 131
Query: 188 QGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+GR V L ++ + SQ E T S A V+++ LD+
Sbjct: 132 KGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV- 181
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLIW 299
GY PV L E E T A G + + T ++ + L W
Sbjct: 182 ---------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRKW 230
Query: 300 SAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHY-HSQSASCALALNNY--AVSLDSSQ 353
S P A L VP GVLV G I Y HS + + + A S +
Sbjct: 231 SESVDP-TASMLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRK 289
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDLSKT 406
S +L + + LL T+ GDL T+ DG V+RL +
Sbjct: 290 RHAVSGVMHKLKGSAGAFF----FLLQTEDGDLFKATLDMVEDTDGNPTGEVKRLKIKYF 345
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
+ ++S + + + + S+ G+ QF E+ GD + +
Sbjct: 346 DTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDDE------ 385
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
L SS D D G + + + + A+ +S+ ++ PL D
Sbjct: 386 LEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANLTGE 437
Query: 527 DA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSRMA 569
DA +A G +S + ++ ELPG +WT+ K +RG
Sbjct: 438 DAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG--------- 486
Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
D+Y AY+++S T+VL + + EV++S F+ A L G +IQV +
Sbjct: 487 ---DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQVHPK 541
Query: 630 GAR 632
G R
Sbjct: 542 GIR 544
>gi|409075182|gb|EKM75565.1| hypothetical protein AGABI1DRAFT_64324 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1213
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
S TS SSG+ E +A P + R + AL D + + +
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423
Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
N N+++ Q F+ R + + L+ +GL + S+ +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467
Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
+ T+A + G ++QV G R I+ +
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKTIVAATTNKRQVVV 571
Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
Y DL G N + STVL++SI D PY+ +G D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630
Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
++ DP + T+S+Q P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655
>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
Length = 1145
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A ++ VP P GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
+ +D +L D+A G+L +L + + G+ V+ + +
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
+ + + + N + F+GSR GDS L++ + GT +S L + + ++ AP
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362
Query: 465 KRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
++DM NG++ L S + + + + + L +
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
VEL G K ++T+ + D E+ YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+S ++ T VL E T+ + V G T+ AG LF ++QV ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483
>gi|430813298|emb|CCJ29330.1| unnamed protein product [Pneumocystis jirovecii]
Length = 1197
Score = 45.8 bits (107), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 118/544 (21%), Positives = 206/544 (37%), Gaps = 92/544 (16%)
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
+HG + +L G + +D +I+ + +I++LE+ + +
Sbjct: 65 VHGIIRTLVGFRLAGTN----KDHLIVGSDSGRITILEYKPDSNAFSKVHQETYG----- 115
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
K G G + VDP+GR + ++ ++ + A
Sbjct: 116 --KSGVRRVVPGQYLAVDPKGRATMIASIEKNKLVYVLNRDSA------TNLTISSPLEA 167
Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMIS 284
S V +L +D+ GY PV L E E +G+ +++ +++
Sbjct: 168 HKSCSLVFHLIGMDV----------GYENPVFAALEVDYTEAESDPSGK-AYREIQKVLT 216
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASC 338
+ L WS P D L V P G G LV +I Y H +
Sbjct: 217 YYELDLGLNHVVRKWSD---PVDRKANLLVTVPGGSDGPSGALVCTEGSIFYKHKGKKTH 273
Query: 339 ALALNNYAVSLDSSQ--ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
+ + SL++SQ ++ SS ++ A LQN+ GDL +T+ +
Sbjct: 274 RIPIPTRIGSLENSQKKQIIVSSVVHKMRGAFFFLLQNE-------DGDLFKVTIDSNDG 326
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF-TCGSGTSMLS-SGLKEEF 454
V+ L + + +++ ++ + + F+ S G+ L QF G + + S +
Sbjct: 327 EVESLKIKYFDTVPVSTGLSILKSGFLFVASEYGNHHLYQFEKLGDDNNEIEFSSVDFPV 386
Query: 455 GDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNI 512
D+ E PS R R + L D +N + N T E A + ++ R
Sbjct: 387 LDLNEGYEPSYFRPRSLENLLLVDDLNSMNPLMDSKILNLTDEDAPQIYALCGR------ 440
Query: 513 GPLKDFS---YGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRM 568
GP F YGL +N + A+G LPG +WT SS
Sbjct: 441 GPRSTFRTLRYGLEVN-EIVASG-----------LPGSPTAVWTTKLTSS---------- 478
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
D+Y AY+++S T+VL + + EV+++ + T+A L G +IQV
Sbjct: 479 ----DQYDAYIVLSFVNGTLVLSIGETVEEVSDT-GFLSSSPTLAVQQL-GDDALIQVHP 532
Query: 629 RGAR 632
+G R
Sbjct: 533 KGIR 536
>gi|308477185|ref|XP_003100807.1| CRE-DDB-1 protein [Caenorhabditis remanei]
gi|308264619|gb|EFP08572.1| CRE-DDB-1 protein [Caenorhabditis remanei]
Length = 1154
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 145/354 (40%), Gaps = 82/354 (23%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
DA L+ VP+PI GVLV+ AN+I Y S N V +S L + F+ +
Sbjct: 210 DASVLIPVPAPISGVLVLAANSILYKSSDV-------NGDVVPYASPLLDNTVFTCHGLV 262
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNS 421
D + ++ +D T+ L+L+ + +GR V+ + + + + I I
Sbjct: 263 DPSGERFILSD-----TEGRLLMLILNIGEGRSGITVKDMRIEYLGETSIADSINYIDAG 317
Query: 422 LFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
+ F+GSRLGDS L++ SG S S + E + +I ++DM+
Sbjct: 318 VVFVGSRLGDSQLIRLMPTPSGGSY--SVVLETYSNI---------------GPIRDMIM 360
Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
E ++ ++ T S A +D G L+ G+ I AS
Sbjct: 361 VE---------SDGQAQLVTCSGAEKD-----GSLRVIRNGIGIEELAS----------- 395
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
VEL G GI+ + S+ + Y+I+SL T VL+ E
Sbjct: 396 -VELAGVIGIFPIRLNSTTDN----------------YVIVSLAEETHVLQINGEELEDV 438
Query: 601 ESVDYFVQGRTIAAGNLFG---RRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
+ + + TI A +FG ++QV E+ R + S +++ + P N E
Sbjct: 439 QLLQICTEMPTIFASTIFGPDNSEVLLQVTEKHVRFMAFSGLSK--IWEPPNGE 490
>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
Length = 1357
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 70/365 (19%), Positives = 153/365 (41%), Gaps = 71/365 (19%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S L L ++L+G V +A++ Q + + D +I+ AK+S++ F+ + L
Sbjct: 45 STNKLHLNYEFKLNGRVSDIALIKQVDS----KLDYLIILTATAKLSLVNFNVFTNSLET 100
Query: 157 TSMHCFESPEWLH--LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +E + LK +ES R +D C VL++ I + +
Sbjct: 101 ISLHYYEDKFRQNSILKLAKESKLR-----IDQAKNC--VLLFNNDNIAILPISSTTDEF 153
Query: 215 GDED-----------------TFGSGGGFSARIESSHVINLRDLDM----KHVKDFIFVH 253
DED F S +I +S +I L+ ++ +++ D F+
Sbjct: 154 EDEDLGQESSAKTVKRGNMSIKFPSQSQKKNKITNSSII-LKSTELNSKIQNIIDIQFLS 212
Query: 254 GYIEPVMVILHERELTWAGR-----VSWKHHTCMISAL-------------SISTTLKQH 295
+ +P + +L++ +L W G + ++ ++ L S++ L +
Sbjct: 213 NFSKPTLSVLYQPKLAWIGNSNLVTLPTQYMILTLNILERENIKSQENGENSLNQDLIET 272
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--HSQSASCALALNNY--AVSLDS 351
+I LP++ + ++ + + G +VG+N I Y H+ + +N + +L
Sbjct: 273 TIIGQVSELPYELHTIIPLNN---GSTLVGSNEIIYIDHTGVLQSLIIINQFQDKETLKK 329
Query: 352 SQELPRSSFSVELDA------AHATWLQNDV-----ALLSTKTGDLVLLTVVYDGRVVQR 400
+ + +S ++ L+ A + N+V L+ + ++ L+ + +GR++
Sbjct: 330 GRVIDKSKQNIILNKPIKFINAGSRVESNNVDDKNNVLIFDENNNIYLVNITLEGRLLIN 389
Query: 401 LDLSK 405
D++K
Sbjct: 390 FDINK 394
>gi|301124447|ref|XP_002909707.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106897|gb|EEY64949.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 328
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 36/83 (43%), Gaps = 19/83 (22%)
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGS-------------------IVAFTVLHNVNC 965
G F G+ P W + R P S +++FT H+ +C
Sbjct: 3 GAFFRGAHPMWILGDRGHASFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSC 62
Query: 966 NHGFIYVTSQGILKICQLPSGST 988
+GFIY S+G L++C+LPS T
Sbjct: 63 PNGFIYFHSRGALRVCELPSSKT 85
>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
putative [Candida dubliniensis CD36]
gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
Length = 1187
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 162/395 (41%), Gaps = 62/395 (15%)
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+K+ P ++ LP D ++ +P IGG+LV G+N Y L+ + L
Sbjct: 218 VKKKPASLNSDPLPDDVNYMIPLPGHIGGMLVCGSNWCFYD--------KLDGPRIYLPL 269
Query: 352 SQELPRSSFSVELD-AAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLS--KT 406
+ ++ S+ ++ H +N LL GDL LTV Y D ++ + ++ T
Sbjct: 270 PRRDGQTQESIIVNHVTHVLKKKNFFILLQNTLGDLFKLTVDYDFDKETIKNISITYFDT 329
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
P L+ +I N F+ D LL QF E+ GD A+
Sbjct: 330 IPPALSLNI--FKNGFLFVNVLNNDKLLYQF--------------EKLGDDLAE----NE 369
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RI 524
L +SSD Y S N + TF D+L I L+ S + RI
Sbjct: 370 LVINSSD-------------YDSLDNVRGTDTTTFKLKGLDNLALIDVLETLSPIIDSRI 416
Query: 525 NADASATGISKQSNYELVE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII- 581
N D+ +S S + + +P + + + + + +DE YL+I
Sbjct: 417 N-DSKLVTLSSHSYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDE---YLVIS 472
Query: 582 -SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG---ARILDGS 637
SL ++T+VL +++ +V +S FV ++ + G V+QV+ G R ++G
Sbjct: 473 SSLSSKTLVLSIGEVVEDVEDS--EFVLDQSTISVQQVGIASVVQVYSNGIKHIRTVNGK 530
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
T D F P+ S N+ + +++++ V+
Sbjct: 531 KKTTDW-FPPAGITITHASTNNQQVLIALSNLNVV 564
>gi|449459948|ref|XP_004147708.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
gi|449513493|ref|XP_004164340.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
Length = 1214
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 126/324 (38%), Gaps = 57/324 (17%)
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVLV N + Y +Q A+ + +LP + + AA LL
Sbjct: 246 GVLVCAENFVIYKNQGHPDVRAV------IPRRADLPAERGVLIVSAAMHKQKTMFFFLL 299
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
T+ GD+ +T+ ++ V+ L + + +T+ + + + F S G+ L QF
Sbjct: 300 QTEYGDIFKVTLEHNNDSVKELKIKYFDTIPVTASMCVLKSGFLFAASEFGNHSLYQFQA 359
Query: 440 -GSGTSMLSSG-----LKEEFGDIEADAPSTKRLRR-SSSDALQDMVNGEELSLYGSASN 492
G + SS +E F + K L R ++L +++ + ++L+
Sbjct: 360 IGEDADVESSSATLMETEEGFQPVFFQPRRLKNLMRIDQVESLMPIMDMKIINLF----- 414
Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
E + F+ R GP L+ GL I S + ELPG
Sbjct: 415 -EEETPQIFTLCGR------GPRSSLRILRPGLAI------------SEMAVSELPGVPS 455
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+WTV +DE+ AY+++S T+VL + + EV++S F+
Sbjct: 456 AVWTVKKN--------------INDEFDAYIVVSFANATLVLSIGETVEEVSDS--GFLD 499
Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
A +L G ++QV G R
Sbjct: 500 TTPSLAVSLIGDDSLMQVHPNGIR 523
>gi|353232348|emb|CCD79703.1| putative dna repair protein xp-E [Schistosoma mansoni]
Length = 1329
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S + R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP V +Y GL IIL G + G+ + +IE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
D F++GY P +++E EL H L+ L ++
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
D+ L+ VP P GGV++VG N I YH++ ++ Y +SQ L ++ +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271
Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
D A + L D+ A + T + L+ V G + L P
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+GS LGDS L++
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350
>gi|256088964|ref|XP_002580590.1| DNA repair protein xp-E [Schistosoma mansoni]
Length = 1329
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S + R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP V +Y GL IIL G + G+ + +IE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
D F++GY P +++E EL H L+ L ++
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
D+ L+ VP P GGV++VG N I YH++ ++ Y +SQ L ++ +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271
Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
D A + L D+ A + T + L+ V G + L P
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+GS LGDS L++
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350
>gi|194864680|ref|XP_001971056.1| GG14635 [Drosophila erecta]
gi|190652839|gb|EDV50082.1| GG14635 [Drosophila erecta]
Length = 1227
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPSGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|239613967|gb|EEQ90954.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ER-3]
gi|327353314|gb|EGE82171.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ATCC
18188]
Length = 1199
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
W Q D LL+ G L L ++ D VQ L + S + +G + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400
Query: 428 RLGDSLLVQFTCGS 441
GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414
>gi|24654874|ref|NP_728546.1| CG13900, isoform A [Drosophila melanogaster]
gi|23092721|gb|AAF47416.2| CG13900, isoform A [Drosophila melanogaster]
gi|60678131|gb|AAX33572.1| LD01809p [Drosophila melanogaster]
gi|220950356|gb|ACL87721.1| CG13900-PA [synthetic construct]
gi|289803030|gb|ADD20765.1| FI04459p [Drosophila melanogaster]
Length = 1227
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|195336406|ref|XP_002034829.1| GM14250 [Drosophila sechellia]
gi|194127922|gb|EDW49965.1| GM14250 [Drosophila sechellia]
Length = 1227
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|60677959|gb|AAX33486.1| RE01065p [Drosophila melanogaster]
Length = 1227
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|346327528|gb|EGX97124.1| pre-mRNA splicing factor RSE1 [Cordyceps militaris CM01]
Length = 1206
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 146/656 (22%), Positives = 239/656 (36%), Gaps = 132/656 (20%)
Query: 64 NVIEIYVVRVQEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
NV++ V Q G+KE SG + D + L+ H + G + S+A+
Sbjct: 13 NVVQ--AVLGQFAGTKEQLIITGSGSQLTLLRPDPAQGKVIALLSH-DIFGILRSIAVFR 69
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G++ +D IILA + +I++LE+ + M F K G G
Sbjct: 70 LAGSN----KDYIILATDSGRITILEYLPGPNRFNRLHMETFG-------KSGIRRVVPG 118
Query: 181 PLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
+ DP+GR V L ++ + SQ E T S A VI
Sbjct: 119 EYLACDPKGRACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIA 169
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+ LD+ GY PV L + + I+ ++S Q L
Sbjct: 170 MVALDV----------GYANPVFAALE---------IEYTEVDQDITGEALSEVETQ--L 208
Query: 298 IWSAMNL-----------PHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCAL 340
++ ++L P D L P G GVLV G I Y HS + +
Sbjct: 209 VYYELDLGLNHVVRKWSDPVDPTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQDALRV 268
Query: 341 ALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYD 394
+ + E P ++ H L+ LL + GDL +T+ V D
Sbjct: 269 PIPRRR----GATEDPSRKRNIVAGVMHK--LKGSAGAFFFLLQSDDGDLFKITIDMVED 322
Query: 395 GR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
VQR+ + + + + + + + ++ S+ G+ QF
Sbjct: 323 EEGAPTGEVQRMKIKYFDTVPVATSLCILKSGFLYVASQFGNYAFYQFEKLGDDDDEVEF 382
Query: 450 LKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVR 506
E+F + A P R + + AL D + L +N T E A + F+
Sbjct: 383 SSEDFPVDPLAAYEPVYFYPRLAENLALVDSIPAMNPLLDCKVANLTGEDAPQIFTICGN 442
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADS 565
+ LK +GL +N ++ ELPG +WT+ S
Sbjct: 443 GARSTFRTLK---HGLEVNEIVAS------------ELPGVPSAVWTLKLNS-------- 479
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
D++Y AY+++S T+VL + + EV++S + TIAA L G +IQ
Sbjct: 480 ------DEQYDAYIVLSFTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLLGTDGLIQ 531
Query: 626 VFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
V RG R I +G N S ++ ++++ S V + +S G I
Sbjct: 532 VHPRGIRHIRNG------------NVNEWSAPQHRSIVAASTNSHQVAIALSSGEI 575
>gi|261193401|ref|XP_002623106.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
gi|239588711|gb|EEQ71354.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
Length = 1168
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
W Q D LL+ G L L ++ D VQ L + S + +G + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400
Query: 428 RLGDSLLVQFTCGS 441
GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414
>gi|194749950|ref|XP_001957397.1| GF24063 [Drosophila ananassae]
gi|190624679|gb|EDV40203.1| GF24063 [Drosophila ananassae]
Length = 1228
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|195490209|ref|XP_002093045.1| GE20993 [Drosophila yakuba]
gi|194179146|gb|EDW92757.1| GE20993 [Drosophila yakuba]
Length = 1227
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|240275059|gb|EER38574.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H143]
Length = 1134
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 301 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 342
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D VQ +LDL P S + +G + F+
Sbjct: 343 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 400
Query: 426 GSRLGDSLLVQFTCGS 441
GS GDS L++ T GS
Sbjct: 401 GSHQGDSELIRITEGS 416
>gi|325094412|gb|EGC47722.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H88]
Length = 1201
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 301 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 342
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D VQ +LDL P S + +G + F+
Sbjct: 343 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 400
Query: 426 GSRLGDSLLVQFTCGS 441
GS GDS L++ T GS
Sbjct: 401 GSHQGDSELIRITEGS 416
>gi|226291941|gb|EEH47369.1| DNA damage-binding protein 1a [Paracoccidioides brasiliensis Pb18]
Length = 1209
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 74/174 (42%), Gaps = 38/174 (21%)
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
+W+ C I+ LK+ L A L+ VP+P+GG+LV+G +I Y
Sbjct: 279 AWQDTGC-IAVFKALDLLKEE--------LEMGASFLIPVPAPLGGLLVLGETSIRYLD- 328
Query: 335 SASCALALNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLT 390
D++ E S+ LD A W Q D LL+ G L L
Sbjct: 329 ---------------DATNE----CISLPLDEATIFVAWEQVDGQRWLLADDYGRLFFLM 369
Query: 391 VVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
++ D VQ +LDL P S + +G + F+GS GDS L++ T GS
Sbjct: 370 LILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVTFIGSHQGDSQLIRITEGS 421
>gi|225558618|gb|EEH06902.1| DNA damage-binding protein 1a [Ajellomyces capsulatus G186AR]
Length = 1201
Score = 44.7 bits (104), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 301 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 342
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D VQ +LDL P S + +G + F+
Sbjct: 343 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 400
Query: 426 GSRLGDSLLVQFTCGS 441
GS GDS L++ T GS
Sbjct: 401 GSHQGDSELIRITEGS 416
>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
Length = 1282
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 102/208 (49%), Gaps = 28/208 (13%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
N+R L+ V+D F+HG +P + +L+ R++ A V K + + ++
Sbjct: 376 EAFNIR-LEELRVEDIQFLHGTAKPTIAVLY-RDMKEA--VHIKTYEIGVREKEFVSS-- 429
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
W+ +L + K++ VP+P+GGV+V+G TI Y ++++ ++ V L +
Sbjct: 430 ----PWAQNDLEGGSSKIIPVPAPVGGVVVLGEETIVYLNKTS------DDTDVFLKAIN 479
Query: 354 ELPRSSF----SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
RSS +++ D + LL G L LL +V+DG+ V L + + +
Sbjct: 480 IPERSSIVCYGAIDPDGSRY--------LLGDHDGTLYLLVLVHDGKRVNELKIERLGET 531
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ S ++ + N + F+GS GDS L++
Sbjct: 532 SIPSTVSYLDNGVVFVGSAYGDSQLIKL 559
>gi|225680146|gb|EEH18430.1| DNA damage-binding protein [Paracoccidioides brasiliensis Pb03]
Length = 1138
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+LV+G +I Y D++ E S+ LD A
Sbjct: 318 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 357
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
W Q D LL+ G L L ++ D VQ +LDL P S + +G +
Sbjct: 358 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 415
Query: 424 FLGSRLGDSLLVQFTCGS 441
F+GS GDS L++ T GS
Sbjct: 416 FIGSHQGDSQLIRITEGS 433
>gi|310793065|gb|EFQ28526.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1212
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 137/606 (22%), Positives = 225/606 (37%), Gaps = 135/606 (22%)
Query: 74 QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
Q G+KE + R+ + D + L+ H + G + S+A G++ +
Sbjct: 27 QFSGTKEQNIITASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
D +ILA + +I+++E+ I + + F+ LHL K G G + D
Sbjct: 82 DYLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130
Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
P+GR V L ++ + SQ E T S A V+++ LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
GY PV L E E T A G + + T ++ + L
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229
Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLD 350
WS P D L P G GVLV G I Y HS + + + A
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDL 403
S + S +L + + L+ T+ GDL T+ DG V+RL +
Sbjct: 287 SRKRHVVSGVMHKLKGSAGAFF----FLIQTEDGDLFKATIDMVEDADGNPTGEVKRLKI 342
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
+ ++S + + + + S+ G+ QF E+ GD
Sbjct: 343 KYFDTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGD------D 382
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR 523
+ L SS D D G + + + + A+ +S+ ++ PL D
Sbjct: 383 DEELEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANL 434
Query: 524 INADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSS 566
DA +A G +S + ++ ELPG +WT+ K +RG
Sbjct: 435 TGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG------ 486
Query: 567 RMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
D+Y AY+++S T+VL + + EV++S F+ A L G +IQV
Sbjct: 487 ------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQV 538
Query: 627 FERGAR 632
+G R
Sbjct: 539 HPKGIR 544
>gi|295667673|ref|XP_002794386.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
gi|226286492|gb|EEH42058.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
Length = 1195
Score = 44.7 bits (104), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+LV+G +I Y D++ E S+ LD A
Sbjct: 292 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 331
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
W Q D LL+ G L L ++ D VQ +LDL P S + +G +
Sbjct: 332 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 389
Query: 424 FLGSRLGDSLLVQFTCGS 441
F+GS GDS L++ T GS
Sbjct: 390 FIGSHQGDSQLIRITEGS 407
>gi|358366432|dbj|GAA83053.1| UV-damaged DNA binding protein [Aspergillus kawachii IFO 4308]
Length = 1643
Score = 44.7 bits (104), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 106/264 (40%), Gaps = 29/264 (10%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP GR + VY + ++ Q S G + SG E R +D
Sbjct: 62 IDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTTAR-IDEL 120
Query: 245 HVKDFIFVHGYI-EPVMVILHE-RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
V+ F+H P + +L+E + +V H++ S+ ++ L +
Sbjct: 121 FVRSSAFLHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATSSTGADAAFEES-LDGFSQ 179
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y V DS++ + R
Sbjct: 180 ELDLGASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP---- 220
Query: 363 ELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITT 417
LD A W Q D LL+ G L L +V D VQ L + S +
Sbjct: 221 -LDEATIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIY 279
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS +++ GS
Sbjct: 280 LGGGVIFVGSHQGDSQVLRIGNGS 303
>gi|170589359|ref|XP_001899441.1| Xeroderma Pigmentosum Group E Complementing protein [Brugia malayi]
gi|158593654|gb|EDP32249.1| Xeroderma Pigmentosum Group E Complementing protein, putative
[Brugia malayi]
Length = 521
Score = 44.7 bits (104), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 67/155 (43%), Gaps = 31/155 (20%)
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADA---SATGISKQSNY----------EL-- 541
A T ++ DS N+ P++D + +R N + +G K EL
Sbjct: 345 ADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS 403
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
VEL G K ++T+ + DDE+ YLI+S ++ T VL E TE
Sbjct: 404 VELKGIKNMFTLRTR---------------DDEFDDYLILSFDSETHVLLINGEELEDTE 448
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+ V G T+ AG LF + ++QV ++DG
Sbjct: 449 ITGFTVDGATLWAGCLFHSKTILQVTHGEVILIDG 483
>gi|91092128|ref|XP_972649.1| PREDICTED: similar to AGAP005549-PA [Tribolium castaneum]
gi|270004662|gb|EFA01110.1| hypothetical protein TcasGA2_TC010322 [Tribolium castaneum]
Length = 1219
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 73/320 (22%), Positives = 124/320 (38%), Gaps = 59/320 (18%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + S + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVASAMCVLKTGFLFVTSEFGNHYLYQI 361
Query: 438 T-CGSGTSML--SSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
G L SS + E GD AP + R + ++L +++ L G
Sbjct: 362 AHLGDDDDELEFSSAMPLEEGDTFFFAPRSLRNLVLVDEMESLSPILSCRVADLAG---- 417
Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
E + + R GP L+ +GL + S + ELPG
Sbjct: 418 --EDTPQLYMLCGR------GPRSSLRVLRHGLEV------------SEMAVSELPGNPN 457
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+WTV +S DDEY AY+I+S T+VL + + EVT+S F+
Sbjct: 458 AVWTVKRRS--------------DDEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLG 501
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
+ + ++QV+ G R ++ D N G + T++ +I
Sbjct: 502 TTPTLSCSALSDDALVQVYPGGIR-----HICSDKRV---NEWKAPGKK--TIVKCAINQ 551
Query: 669 PYVLLGMSDGSIRLLVGDPS 688
V++ +S G + DP+
Sbjct: 552 RQVVIALSGGELAYFEMDPT 571
>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1144
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 21/142 (14%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
NL A L+ VP P+GG+L++G I Y ++N ++L +
Sbjct: 239 NLELGAEILVPVPLPLGGILILGEKCIKYVD-------TISNETITL-----------PL 280
Query: 363 ELDAAHATW--LQNDVALLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIG 419
E + W L N LL+ G L L +V D V+ + + S + +G
Sbjct: 281 EYNTVFVAWEQLDNQRWLLADDYGRLFFLMLVLDSANAVRTWKVDLLGETSRASVLVHLG 340
Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
+ FLGS GDS +++ T GS
Sbjct: 341 GGVVFLGSHQGDSHVIRITEGS 362
>gi|429859776|gb|ELA34542.1| pre-mRNA-splicing factor rse1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1212
Score = 44.3 bits (103), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 138/607 (22%), Positives = 226/607 (37%), Gaps = 137/607 (22%)
Query: 74 QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
Q G+KE + R+ + D + L+ H + G + S+A G++ +
Sbjct: 27 QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
D +ILA + +I+++E+ I + + F+ LHL K G G + D
Sbjct: 82 DYLILATDSGRITIVEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130
Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
P+GR V L ++ + +Q E T S A V+++ LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNAQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
GY PV L E E T A G + + T ++ + L
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229
Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLD 350
WS P D L P G GVLV G I Y HS + + + A
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL--VLLTVVYDGR-----VVQRLDL 403
S + S +L + + LL T+ GDL ++ +V D V+RL +
Sbjct: 287 SRKRHIVSGVMHKLKGSAGAFF----FLLQTEDGDLFKAVIDMVEDADGNPTGEVKRLKI 342
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
+ ++S + + + + S+ G+ QF E+ GD + +
Sbjct: 343 KYFDTVPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDEEK-- 386
Query: 464 TKRLRRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
SS D D G + + Y N A+ +S+ ++ PL D
Sbjct: 387 ----EFSSDDFPADPKAGYDAVYFYPRPLEN---------LALVESIDSMNPLLDCKVAN 433
Query: 523 RINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADS 565
DA +A G +S + ++ ELPG +WT+ K SRG
Sbjct: 434 LTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLSRG----- 486
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
D+Y AY+++S T+VL + + EV++S F+ A L G +IQ
Sbjct: 487 -------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQ 537
Query: 626 VFERGAR 632
V +G R
Sbjct: 538 VHPKGIR 544
>gi|384490729|gb|EIE81951.1| hypothetical protein RO3G_06656 [Rhizopus delemar RA 99-880]
Length = 967
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 72/154 (46%), Gaps = 17/154 (11%)
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
S + + + L+ VP P+GG+LV+G I Y L N +S+D ++ ++
Sbjct: 198 STIKVEASTHALVPVPEPLGGLLVIGEYIITYFD-----PLTNTNRELSIDPAR---VTA 249
Query: 360 FSVELDAAHATWLQND-----VALLSTKTGDLVLLTVVYDGRVV---QRLDLSKTNPSV- 410
+ D ++ L ++ V + T +V L+ + G+V Q ++ +P V
Sbjct: 250 WEFMKDESNRYLLGDEEGYLYVFSIETSHNKVVNLSSTFIGQVPSFNQNIESKANHPQVS 309
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
S I +GN +F++GS GDS L+Q G S
Sbjct: 310 RPSCIVDLGNLMFYIGSTHGDSCLIQLIKGQEKS 343
>gi|195428692|ref|XP_002062402.1| GK16677 [Drosophila willistoni]
gi|194158487|gb|EDW73388.1| GK16677 [Drosophila willistoni]
Length = 1273
Score = 43.9 bits (102), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 347 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 406
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 407 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIVTSQ 456
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG-CKGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 457 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 504
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 505 WTVKKR--------------VDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 549
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 550 TLCCAAL-GDDALVQVYPDGIR 570
>gi|157873900|ref|XP_001685450.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania major strain Friedlin]
gi|68128522|emb|CAJ08654.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania major strain Friedlin]
Length = 1541
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 81/185 (43%), Gaps = 37/185 (20%)
Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
GGG S + V + R D+K +++D FV EP++ L E++ TWAGRV
Sbjct: 282 GGGTSLLLRVGTVTHWRLQDVKSALRNIRDVQFVQSAGEPLLAFLFEKQPTWAGRVKLLE 341
Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGV 321
W+ TC I ++++ + H L S ++ LP+D + +P+ P
Sbjct: 342 WRSKTVESHMLTCSIEWMKVTLANSATPHMLSLSEVDGLPYDVTSMTPLPAFQDLPSAVF 401
Query: 322 LVVGANTIHYHSQSA----------SCALALNNYAVSLD------SSQELPRSSFSVELD 365
V +H ++S A +L + AVSL+ +SQ L V L+
Sbjct: 402 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLEAVQWRSASQALSTDLVKVNLN 461
Query: 366 AAHAT 370
A+AT
Sbjct: 462 FANAT 466
>gi|226480826|emb|CAX73510.1| glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum]
Length = 332
Score = 43.9 bits (102), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 34/212 (16%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S S E R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVAS----GSVE----DRSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFG-SGGGFSARIESSHVINLRDLD 242
+DP V +Y GL IIL G DT + +S RIE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIGDKIGTDFLDTNQWTVNTYSVRIEEGNIV------ 177
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
D F++GY P +++E EL + +++ + + ++ TL
Sbjct: 178 -----DMAFIYGYSLPTFAMIYEDELVLHMK-TYEIYGREPALRNVQLTLD--------- 222
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
++ D+ L+ VP P GGV++VG N I YH++
Sbjct: 223 SIEPDSKLLIPVPKPYGGVILVGDNIICYHTK 254
>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1256
Score = 43.9 bits (102), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 75/338 (22%)
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQN 374
GGVLV + I Y +Q + + +PR S L +H++ Q
Sbjct: 256 GGVLVASEDYIVYRNQDHA------------EVRSRIPRRYGSDPNKGVLIISHSSHKQK 303
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
+ L+ ++ GDL +T+ Y G V ++++ + VL + +T + N F S GD
Sbjct: 304 GMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDH 363
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL----RRSSSDALQDMVNGEELSLYG 488
L F S G +EE G + L R S ++++ N E S
Sbjct: 364 TLYFFK--------SIGDEEEEGQAKRLEDKDGHLWFTPRNSCGTKMEELKNLEPTSHLS 415
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGP-------------LKDFSYGLRINADASATGISK 535
S S F V D + P LK +GL + +A
Sbjct: 416 SLS-------PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTAN---- 464
Query: 536 QSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
LPG GIWTV +S NA D+ Y+++S T VL D
Sbjct: 465 --------LPGVPSGIWTVPKSTS--PNA--------IDQTDKYIVVSFVGTTSVLSVGD 506
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ E ES ++ T G +IQVF G R
Sbjct: 507 TIQENHES--GILETTTTLLVKSMGDDAIIQVFPTGFR 542
>gi|402077250|gb|EJT72599.1| pre-mRNA-splicing factor RSE1 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1216
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 60/279 (21%), Positives = 110/279 (39%), Gaps = 68/279 (24%)
Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
LL T+ GDL +T+ + D VQRL + + ++S++ + + F+ S G
Sbjct: 310 LLQTEDGDLFKVTIDMLEDAEGNTTGEVQRLKIKYFDTIPVSSNLCILKSGFLFVASEFG 369
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+ QF E+ GD + L SS + D E + +
Sbjct: 370 NHHFYQF--------------EKLGD------DDEELEFSSENFPSDPAEPYEPAYF--- 406
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV---- 542
+ T + A+ +S+ ++ PL D + DA + +G +S + ++
Sbjct: 407 -----YPRPTENLALVESVESMNPLMDLKVANLTDEDAPQIYTVSGNGARSTFRMLKHGL 461
Query: 543 --------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
+LPG +WT A DD+Y +Y+++S T+VL
Sbjct: 462 EVNEIVASQLPGTPSAVWTT--------------KIARDDQYDSYIVLSFTNGTLVLSIG 507
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ + EV+++ F+ + A G ++QV RG R
Sbjct: 508 ETVEEVSDT--GFLSSVSTLAVQQLGEDGLVQVHPRGIR 544
>gi|213405251|ref|XP_002173397.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
yFS275]
gi|212001444|gb|EEB07104.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
yFS275]
Length = 1166
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 121/561 (21%), Positives = 202/561 (36%), Gaps = 99/561 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+ L+ +G V ++A L G ++D ++L + + ++LE+D + L
Sbjct: 56 MNLMISQNCYGIVRNIAPLRLTGF----KKDYLVLTSDSGRFTILEYDIGKNKLVSVYQE 111
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGLQ------MII---L 204
F K G G + +D +GR V LVY L + I L
Sbjct: 112 AFG-------KSGIRRIVPGEYLALDAKGRAAMVASTEKNKLVYVLNRDSEANLTISSPL 164
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
+A + G+ D G G+ I ++ + DLD + +
Sbjct: 165 EAHKAGTICF---DLVGLDTGYENPIFAALEVEYSDLDHDPLGEL--------------- 206
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGG 320
+KH +++ + L WS + + AYKL+ VP P G
Sbjct: 207 -----------YKHSEKVLTYYELDLGLNHVVKRWSKV-VDRSAYKLIRVPGGNDGP-SG 253
Query: 321 VLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
V+V+ I Y H Q S + + ++ LP + + A + LL
Sbjct: 254 VIVISTGWISYRHLQRQSHFVPIPTRETKATTNTALP-----IIVSAVMHKMRDSFFYLL 308
Query: 380 STKTGDLVLLTVVYDGRV-VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
GDL+ LT+ D V+ L + + + + + + L F G G+ L QF
Sbjct: 309 QNSDGDLLKLTMELDDHSQVKELRIKYFDTIPFAAILNILKSGLLFAGCEGGNHHLYQFE 368
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ 498
S+ + EF +K + + L + N L S T++
Sbjct: 369 -----SLAIDDDEPEFSSANFSEEQSKHSPKKLTYKLHPLQNISLLDEIPSLFPLTDAIV 423
Query: 499 KTFSFAVRDSLVNI-GPLKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHK 556
S L + G K+ S L + SAT + L ELPG IWTV K
Sbjct: 424 TRTSTDANSQLYTLCGRHKEASLRL-LKRGVSATEVV------LSELPGAPIAIWTVKQK 476
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
+D Y Y+++S T+VL + + EV +S T+
Sbjct: 477 --------------LNDPYDKYMVLSFTNGTLVLSIGETVEEVLDS-GLLSSVSTLNVRQ 521
Query: 617 LFGRRRVIQVFERGARILDGS 637
L GR V+Q+ +G R + +
Sbjct: 522 L-GRSSVVQIHSKGIRCISAN 541
>gi|195126264|ref|XP_002007593.1| GI12293 [Drosophila mojavensis]
gi|193919202|gb|EDW18069.1| GI12293 [Drosophila mojavensis]
Length = 1227
Score = 43.5 bits (101), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|195012560|ref|XP_001983703.1| GH16029 [Drosophila grimshawi]
gi|193897185|gb|EDV96051.1| GH16029 [Drosophila grimshawi]
Length = 1228
Score = 43.5 bits (101), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|195376606|ref|XP_002047087.1| GJ13230 [Drosophila virilis]
gi|194154245|gb|EDW69429.1| GJ13230 [Drosophila virilis]
Length = 1229
Score = 43.5 bits (101), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|302423344|ref|XP_003009502.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
gi|261352648|gb|EEY15076.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
Length = 1119
Score = 43.5 bits (101), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 60/136 (44%), Gaps = 13/136 (9%)
Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVS----LDSSQELPRSSFSV 362
L +P P L+ V ++ YH + + A A L V+ L L + S
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280
Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
LD +W + DV LL+ G+L LLT+ DG +V L L + S + +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340
Query: 419 GNSLFFLGSRLGDSLL 434
GN + FLGS GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356
>gi|350629921|gb|EHA18294.1| damage-specific DNA binding protein [Aspergillus niger ATCC 1015]
Length = 1140
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 284
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 285 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 344
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 345 IFVGSHQGDSQVLRIGNGS 363
>gi|317031116|ref|XP_001392900.2| UV-damaged DNA binding protein [Aspergillus niger CBS 513.88]
Length = 1124
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347
>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1158
Score = 43.1 bits (100), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 63/136 (46%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 258 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 299
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D VQ +LDL P S + +G + F+
Sbjct: 300 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 357
Query: 426 GSRLGDSLLVQFTCGS 441
GS GD L++ T GS
Sbjct: 358 GSHQGDPELIRITEGS 373
>gi|198420618|ref|XP_002125906.1| PREDICTED: similar to Splicing factor 3B subunit 3
(Spliceosome-associated protein 130) (SAP 130)
(Pre-mRNA-splicing factor SF3b 130 kDa subunit)
(SF3b130) (STAF130) [Ciona intestinalis]
Length = 1216
Score = 43.1 bits (100), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 100/463 (21%), Positives = 168/463 (36%), Gaps = 115/463 (24%)
Query: 304 LPHDAYKLLAVPS----PIGGVLVVGANTIHYHS--QSASCALALNNYAVSLDSSQELPR 357
L A L++VP P GGVLV N I Y + + LD P
Sbjct: 228 LEERANHLISVPGGNDGP-GGVLVCAENYITYKNFGDQPDIRTPIPRRRNDLDD----PE 282
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
V A H T L+ T+ GD+ +T+ D +V + L + ++ +
Sbjct: 283 RGMIVVCSATHKTK-SMFFFLIQTEQGDIFKVTLETDEDMVTEIRLKYFDTVPVSMAMCV 341
Query: 418 IGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+ F+ + +G+ L Q + SS + E GD AP A
Sbjct: 342 LRTGFLFVAAEMGNHCLYQIAHLGDDDDETEFSSAMPLEEGDTFFYAPR----------A 391
Query: 475 LQDMVNGEELS---------LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
L+++V +EL + A+ +T T R SL + +GL +
Sbjct: 392 LRNLVLVDELDSLSPIMTCLISDLANEDTPQLYVTCGRGPRSSL------RVLRHGLEV- 444
Query: 526 ADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
S + ELPG +WTV K ++E+ +Y+I+S
Sbjct: 445 -----------SEMAVSELPGNPNAVWTVKIKE--------------EEEFDSYIIVSFV 479
Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS------- 637
T+VL + + EVT+S F+ + +L G ++QV+ G R +
Sbjct: 480 NATLVLSIGETVEEVTDS--GFLGTTPTLSCSLLGENALVQVYPDGIRHIRADKRVNEWK 537
Query: 638 ---------------------------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD-- 668
Y D S G N + NS V+ + ++
Sbjct: 538 TPGKKTILRCAVNQRQVVIALTGGELVYFEMDQS-GQLNEYTERKEMNSEVVCMDLSKVP 596
Query: 669 ------PYVLLGMSDGSIRLLVGDPSTC--TVSVQT-PAAIES 702
++ +G++D ++R++ DP+ C +S+Q PA ES
Sbjct: 597 PTEQRTRFLAVGLADNTVRIISLDPTDCLQPLSMQALPATPES 639
>gi|134077422|emb|CAK45676.1| unnamed protein product [Aspergillus niger]
Length = 1133
Score = 43.1 bits (100), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347
>gi|154320780|ref|XP_001559706.1| hypothetical protein BC1G_01862 [Botryotinia fuckeliana B05.10]
Length = 238
Score = 43.1 bits (100), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 45/182 (24%), Positives = 73/182 (40%), Gaps = 36/182 (19%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
NLVV +++++I+ + + E+ S +K+ RV D G+
Sbjct: 28 NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87
Query: 98 -------AASLELVCHYRLHGNVESL----AILSQGGADNSRRRDSIILAFEDAKISVLE 146
L LV Y L G V SL I S+ G + +I++ F+DAK+S++E
Sbjct: 88 QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQ 208
Q
Sbjct: 202 KQ 203
>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
Length = 1116
Score = 42.7 bits (99), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 41/196 (20%)
Query: 504 AVRDSLVNIGPLKDF----------------SYGLR---INADASATGISKQSNYELVEL 544
V D+ N+GP+ DF S G + + + GI++Q++ ++L
Sbjct: 335 TVLDTFANLGPIPDFCLVDIEKQGQNQIVACSGGFKEGSLRVIRNGIGITEQAS---IDL 391
Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
PG K IW++ S R YLI+S + T VLE E TE
Sbjct: 392 PGIKAIWSLARGSDR------------------YLILSFISSTKVLEFQGEDIEETEIAG 433
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
+ +Q T+ GN+ ++++Q+ G ++D + PS+ S + +
Sbjct: 434 FDLQSPTLYCGNV-ADKQILQISTSGIYLVDHETNLNYDVWKPSSGSINLASHQGNQILI 492
Query: 665 SIADPYVLLGMSDGSI 680
S + + D I
Sbjct: 493 SFGKTLIYFEIKDQKI 508
>gi|328700785|ref|XP_001945395.2| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1072
Score = 42.7 bits (99), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 41/202 (20%), Positives = 88/202 (43%), Gaps = 38/202 (18%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
++ +++D F++G+ P ++I++E +A+ + +K+
Sbjct: 188 MEETNIQDIGFLYGFTNPTIIIIYE------------------NAMGRTIKIKKIIDSKK 229
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
++ +A ++ VPSP+ G +++G N+I YH + SC + LP
Sbjct: 230 YKSIEKEASMVIPVPSPLCGAIIIGENSIFYH--NGSCNII------------RLPIRQ- 274
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSDI 415
+E+ L+ LL +G L++L + Y+ + V L L + +
Sbjct: 275 KIEIVCYTRVDLEGTRYLLGDHSGCLLMLFLKYEKTLNGKFKVTDLYLRYFGEISIPISL 334
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
T + N + ++ S+ GDS L++
Sbjct: 335 TYLDNKVIYVASKFGDSQLIKL 356
>gi|346970653|gb|EGY14105.1| hypothetical protein VDAG_00787 [Verticillium dahliae VdLs.17]
Length = 1160
Score = 42.7 bits (99), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 60/136 (44%), Gaps = 13/136 (9%)
Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVS----LDSSQELPRSSFSV 362
L +P P L+ V ++ YH + + A A L V+ L L + S
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280
Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
LD +W + DV LL+ G+L LLT+ DG +V L L + S + +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340
Query: 419 GNSLFFLGSRLGDSLL 434
GN + FLGS GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356
>gi|389602597|ref|XP_001567507.2| cleavage and polyadenylation specificity factor-like protein
[Leishmania braziliensis MHOM/BR/75/M2904]
gi|322505515|emb|CAM42945.2| cleavage and polyadenylation specificity factor-like protein
[Leishmania braziliensis MHOM/BR/75/M2904]
Length = 1536
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 68/138 (49%), Gaps = 20/138 (14%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSIST 290
+++++D FV EP++ L E++ TWAGRV W+ TC I ++++
Sbjct: 298 LRNIRDVQFVASAGEPLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLAN 357
Query: 291 TLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSASCALALNNY 345
T H L S ++ LP+DA + +P+ P VL V N + + S + + +N
Sbjct: 358 TAAPHMLSLSEVDGLPYDATSMTPLPAFQDVP-SAVLCVSRNMMVHVSTKSGYGVYVN-- 414
Query: 346 AVSLDSSQELPRSSFSVE 363
A+ + ++ L S+ S E
Sbjct: 415 AMGEEQARSLKSSAVSCE 432
>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
FGSC A4]
Length = 1132
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 107/260 (41%), Gaps = 31/260 (11%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP GR + +Y ++++ Q S G + +G E I R +D
Sbjct: 122 IDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPIITR-IDEL 180
Query: 245 HVKDFIFVHGYI-EPVMVILHE-RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
V+ F+H P + +L+E + +V ++ A S T++ + A
Sbjct: 181 FVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESEFTSIADY-----AQ 235
Query: 303 NLPHDAYKLLAVPSPI---GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
L A L+ VP+P+ GG+L++G +I Y A NN VS Q L ++
Sbjct: 236 ELDLGASHLIPVPAPLAAAGGLLILGETSIKYVD-------ADNNEIVS----QPLEEAT 284
Query: 360 FSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
V W Q D LL+ G L L +V V+R +L + S +
Sbjct: 285 IFV-------AWEQVDSQRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRASVLVY 337
Query: 418 IGNSLFFLGSRLGDSLLVQF 437
+G + F+GS GDS +++
Sbjct: 338 LGGGVVFVGSHQGDSQVIRI 357
>gi|391341059|ref|XP_003744849.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Metaseiulus
occidentalis]
Length = 1223
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 77/391 (19%), Positives = 133/391 (34%), Gaps = 102/391 (26%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ +D V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLEFDDDAVTEIKLKYFDSLPVAQTMHVLKSGFLFVASEFGNHSLYQI 361
Query: 438 T-CGSGTSM--LSSGLKEEFGDIEADAPSTKR----------LRRSSSDALQDMVNGEEL 484
G T SS E GD P + L + + D+ N +
Sbjct: 362 AHLGDNTDEPEFSSIFPLEEGDTFFFLPRELKNLVLVDEMDSLSPIMTARVADLTNEDTP 421
Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
LY + S + LR + S +S EL
Sbjct: 422 QLYAACGRGPRSTMRV---------------------LRHGLEVSEMAVS--------EL 452
Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG +WTV ++ DDEY AY+++S T+VL + + EVT+S
Sbjct: 453 PGNPSAVWTVKKRA--------------DDEYDAYIVVSFINATLVLSIGETVEEVTDS- 497
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-------------------------- 637
F+ A + G ++Q++ G R +
Sbjct: 498 -GFLGTTPTLACHQIGHDALVQIYPEGIRHIRADRRVNEWRTSGKKLIVKCAVNQRQVVI 556
Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
Y D S G N + NS VL +++ ++ +G SDG++
Sbjct: 557 ALTGGELIYFEMD-SSGQLNEYAERKEMNSDVLCMALGSVPAGEQRTKFLAVGSSDGTVH 615
Query: 682 LLVGDPSTCTVSVQTPAAIESSKKPVSSCTL 712
++ DP +C + ES+ + ++ L
Sbjct: 616 VISLDPKSCLSILSVQGMTESNPESLAIVEL 646
>gi|383847297|ref|XP_003699291.1| PREDICTED: splicing factor 3B subunit 3-like [Megachile rotundata]
Length = 1217
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG-CKGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|391341057|ref|XP_003744848.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Metaseiulus
occidentalis]
Length = 1211
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 78/397 (19%), Positives = 137/397 (34%), Gaps = 103/397 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ +D V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLEFDDDAVTEIKLKYFDSLPVAQTMHVLKSGFLFVASEFGNHSLYQI 361
Query: 438 T-CGSGTSM--LSSGLKEEFGDIEADAPSTKR----------LRRSSSDALQDMVNGEEL 484
G T SS E GD P + L + + D+ N +
Sbjct: 362 AHLGDNTDEPEFSSIFPLEEGDTFFFLPRELKNLVLVDEMDSLSPIMTARVADLTNEDTP 421
Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
LY + S + LR + S +S EL
Sbjct: 422 QLYAACGRGPRSTMRV---------------------LRHGLEVSEMAVS--------EL 452
Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG +WTV ++ DDEY AY+++S T+VL + + EVT+S
Sbjct: 453 PGNPSAVWTVKKRA--------------DDEYDAYIVVSFINATLVLSIGETVEEVTDS- 497
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-------------------------- 637
F+ A + G ++Q++ G R +
Sbjct: 498 -GFLGTTPTLACHQIGHDALVQIYPEGIRHIRADRRVNEWRTSGKKLIVKCAVNQRQVVI 556
Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
Y D S G N + NS VL +++ ++ +G SDG++
Sbjct: 557 ALTGGELIYFEMD-SSGQLNEYAERKEMNSDVLCMALGSVPAGEQRTKFLAVGSSDGTVH 615
Query: 682 LLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY-HDKG 717
++ DP +C + ES+ + ++ + H++G
Sbjct: 616 VISLDPKSCLSILSVQGMTESNPESLAIVDMSGHEEG 652
>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
Length = 1064
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 128/565 (22%), Positives = 214/565 (37%), Gaps = 117/565 (20%)
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
++G + L + S GG ++D + ++ E K +L +D + + E
Sbjct: 14 IYGRISVLKLFSAGG-----KQDYLFISTESFKFCILAYDSEKKEIVTKASGNAED---- 64
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
GR + A G L +DP GR +I L +G L+ E G +
Sbjct: 65 --TIGRPTEA-GQLGIIDPDGR----------LIALHLYEGLLKLINIEK------GLNN 105
Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
I+ + N R L+ V D F++G P + +L + KH I +
Sbjct: 106 PIQKTAA-NTR-LEELQVMDMTFLYGCKIPTIAVL------FKDTKDEKH----IVTYEV 153
Query: 289 STTLKQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
S ++ P WS N+ Y + V P+GGVLVV N I Y + + ++A
Sbjct: 154 SQKDQELCPGPWSQSNV--GVYSSMLVAVPLGGVLVVADNGITYMNGRTTRSIA------ 205
Query: 348 SLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+P + F +D + +L D G L +L ++ + V L
Sbjct: 206 -------IPYTKFLAYDRVDKDGSRYLFGD------HFGRLSVLVLLNHQQRVTELKFET 252
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + + + F+GS GDS L++ + E D P+T
Sbjct: 253 LGRTSIPSSISYLDSGVVFIGSSSGDSQLIRL------------------NTEKD-PATD 293
Query: 466 RLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVNIGPLKDFSYGLR 523
S L++ N G + + AQ T S RD G L+ G+
Sbjct: 294 ----SYISHLENFTNIGPIVDFCLVDTEKQGQAQIVTCSGTYRD-----GTLRVIRNGI- 343
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
GI++++ L+EL G KG+W + N S + D YLI+S
Sbjct: 344 --------GIAEKA---LIELEGVKGLWPI------KENDPSDPLNPKD----QYLIVSF 382
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
T VL+ E TE TI N+ ++QV + +++ ++ D
Sbjct: 383 IGYTKVLQFQGEEIEETEFEGLDSNSSTILCSNIDKENVIVQVTNQAINLINPITFKRVD 442
Query: 643 LSFGPSNSESGSGSENSTVLSVSIA 667
PS S S N + +++SI
Sbjct: 443 QWKSPSGSPINLVSSNQSQIALSIG 467
>gi|393212467|gb|EJC97967.1| hypothetical protein FOMMEDRAFT_162310 [Fomitiporia mediterranea
MF3/22]
Length = 1161
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 72/331 (21%), Positives = 128/331 (38%), Gaps = 73/331 (22%)
Query: 306 HDAYKLLAVPSPI-------GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
D+ L+ VP I GGVLV+G +TI ++S ++ S+ ++P++
Sbjct: 224 EDSNLLIPVPPQIKSSWNVNGGVLVLGGSTIAFYSIDRKQKKKNSSSQSKS-STSKIPQA 282
Query: 359 SFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+ A W Q D LL G L LL + + + L + +P +
Sbjct: 283 EVNWPYFDITA-WAQIDEDGLRYLLGDSFGRLALLAINPQYAYLDIVLLGEVSPP---TS 338
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + + ++GS GDS L++ T ++ + + F +I AP + + D+
Sbjct: 339 LTPLASQYIYVGSHFGDSQLIRVTSERSSNGSYLEISDTFKNI---APIMDAVFEDTDDS 395
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
Q + ++ G S G L+ G N DA GI+
Sbjct: 396 GQPTI----ITCSGGEST--------------------GSLRVIRNGANFNEDARIEGIA 431
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--T 592
G+W + + YDD +H Y++++ + T +LE
Sbjct: 432 N-----------ITGMWPIRRQ--------------YDDTFHHYMLVTTDTNTHLLELPN 466
Query: 593 ADLLTEVTESVDY---FVQGRTIAAGNLFGR 620
+ T V+ S D+ + RT+ AGN+ R
Sbjct: 467 SQQETAVSRSNDFSDLTIDSRTLVAGNMLTR 497
>gi|340721347|ref|XP_003399083.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus terrestris]
gi|350406701|ref|XP_003487854.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus impatiens]
Length = 1217
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPELYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|66553024|ref|XP_623333.1| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Apis mellifera]
gi|380015815|ref|XP_003691890.1| PREDICTED: splicing factor 3B subunit 3-like [Apis florea]
Length = 1217
Score = 42.0 bits (97), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|302831461|ref|XP_002947296.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f.
nagariensis]
gi|300267703|gb|EFJ51886.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f.
nagariensis]
Length = 1221
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 71/353 (20%), Positives = 130/353 (36%), Gaps = 76/353 (21%)
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGAN 327
A ++ KH T L ++ L++ W+ + + A L+AVP GGVLV N
Sbjct: 199 AASMAQKHLTFYEMDLGLNNVLRK----WTE-PIDNGANLLVAVPGGADGPGGVLVCAEN 253
Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
I Y +Q A+ L + + S++ A++ +L + ++ GD+
Sbjct: 254 FIIYKNQDHEEVRAVIPRRSDLPGDRGVLIVSYATHKKKAYSFFL------VQSEYGDIY 307
Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
+T+ Y+G V L + + + I + F S G+ L QF
Sbjct: 308 KVTLAYEGEAVTELKIKYFDTIPPCTSIAVLKTGFLFAASEYGNHALYQFV--------- 358
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
G E+ D+E SSS AL G + + + + + D
Sbjct: 359 -GTGEDDEDVE-----------SSSAALVQTEEGFQPVFF--------EPRPLKNLLLID 398
Query: 508 SLVNIGPLKDFS-----------------YGLRINADASATGISKQSNYELVELPGCK-G 549
+ ++ P+ D +G R + G++ + + LPG
Sbjct: 399 EMASLMPITDMKVANLLNEEIPQIYALCGHGPRASLSVLRPGLAV-TELAVSPLPGAPTA 457
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
+WTV ++ DE+ A++++S T+V + + E ES
Sbjct: 458 VWTVRRNAT--------------DEFDAFIVVSFANATLVFSIGEEVKETNES 496
>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
ATCC 10500]
gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
ATCC 10500]
Length = 1140
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 21/137 (15%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G I Y + NN + S+ L ++ V
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKYIDDA-------NNETI----SRPLDEATIFV----- 288
Query: 368 HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSLFF 424
W+Q D LL+ G L L +V D R V+ + + S + +G + F
Sbjct: 289 --AWVQVDGQRWLLADDYGRLFFLMLVLDSRNEVEGWKIDYLGSASRASVLIYLGAGMTF 346
Query: 425 LGSRLGDSLLVQFTCGS 441
+GS GDS +++ + GS
Sbjct: 347 IGSHQGDSQVIRISEGS 363
>gi|242018509|ref|XP_002429717.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
gi|212514723|gb|EEB16979.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
Length = 1218
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 68/323 (21%), Positives = 122/323 (37%), Gaps = 65/323 (20%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVMKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E GD AP AL+++V +E+ S +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPR----------ALRNLVQVDEMD-----SLSP 406
Query: 495 ESAQKTFSFAVRDS-----LVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG 546
A + A D+ L GP L+ +GL + S + ELPG
Sbjct: 407 IMACQVADLANEDTPQLYMLCGRGPRSTLRVLRHGLEV------------SEMAVSELPG 454
Query: 547 -CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+WTV + ++EY AY+I+S T+VL + + EVT+S
Sbjct: 455 NPNAVWTVKRR--------------VEEEYDAYIIVSFVNATLVLSIGETVEEVTDS--G 498
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
F+ + + G ++QV+ G R + N G + T++ +
Sbjct: 499 FLGTTPTLSCSALGDDALVQVYPDGIRHIRADKRV--------NEWKAPGKK--TIMKCA 548
Query: 666 IADPYVLLGMSDGSIRLLVGDPS 688
+ V++ ++ G + DP+
Sbjct: 549 VNQRQVVIALTAGELVYFEMDPT 571
>gi|384490247|gb|EIE81469.1| hypothetical protein RO3G_06174 [Rhizopus delemar RA 99-880]
Length = 1197
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 46/205 (22%), Positives = 84/205 (40%), Gaps = 60/205 (29%)
Query: 543 ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
ELPG +WT ++ DD+YHAY+++S T+VL + + EVT+
Sbjct: 443 ELPGNPSAVWTTKLRA--------------DDQYHAYIVVSFANATLVLSIGETVEEVTD 488
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGSYMTQDL 643
+ + T+A + G ++QV G R I++ + ++ +
Sbjct: 489 T-GFLTNAPTLAVQQI-GEDALVQVHPHGIRHIRADRRVNEWRAPQGQTIVEAATNSRQI 546
Query: 644 SFGPSNSE--------SGSGSENS---------TVLSVS------IADPYVLLGMSDGSI 680
+ SN E G +E+ T L++ + Y+ +G D ++
Sbjct: 547 AIALSNGEIVYFEMDNMGQLNEHQEHRQMSAYITTLALGEVPEGRVRARYIAVGCEDQTV 606
Query: 681 RLLVGDPSTC--TVSVQTPAAIESS 703
R+L DP +C +S+Q + SS
Sbjct: 607 RILSLDPDSCLEPISMQALQGVPSS 631
>gi|121699866|ref|XP_001268198.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
1]
gi|119396340|gb|EAW06772.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
1]
Length = 1140
Score = 41.6 bits (96), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+L++G +I Y V D+++ + R LD A
Sbjct: 248 LIPVPAPLGGLLILGETSIKY---------------VDADNNEIISRP-----LDEATIF 287
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D V+ L + S + +G + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDNQVESWKLDLLGKTSRASVLVYLGGGVLFL 347
Query: 426 GSRLGDSLLVQFTCGS 441
GS GDS +++ + GS
Sbjct: 348 GSHQGDSQVLRISNGS 363
>gi|302680006|ref|XP_003029685.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
gi|300103375|gb|EFI94782.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
Length = 1213
Score = 41.6 bits (96), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 80/380 (21%), Positives = 141/380 (37%), Gaps = 87/380 (22%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ + + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEDVKEVKIKYFDTVPVASALCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSD--ALQDMVNGEELSLYGSAS 491
SS +FG ++ P + D AL D V + +
Sbjct: 368 QKLGDDDDEPEFSSSSYPQFGMADSSMPLPHVHFKPHPLDNLALADEVESLDPIIDSKVL 427
Query: 492 NNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC- 547
N ++ FA GP L+ +GL + S+ +LPG
Sbjct: 428 NLMPNSDTPQIFAA----CGRGPRSSLRTLRHGLEVEESVSS------------DLPGIP 471
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+WT K DD + +Y+I+S T+VL + + EV ++ +
Sbjct: 472 NAVWTTKKKE--------------DDAFDSYIILSFVNGTLVLSIGETIEEVQDT-GFLS 516
Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS------------ 637
T+A + G ++QV +G R I+ +
Sbjct: 517 SAPTLAVQQI-GADALLQVHPQGIRHVLSDRRVNEWRVPQGKSIVQATTNKRQVVVALSS 575
Query: 638 ----YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVG 685
Y DL G N + STVL++SI + P++ +G D ++R++
Sbjct: 576 AELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGEVPEGRQRTPFLAVGCEDQTVRIISL 634
Query: 686 DPSTC--TVSVQTPAAIESS 703
DP + T+S+Q A SS
Sbjct: 635 DPESTLDTISLQALTAPPSS 654
>gi|258570355|ref|XP_002543981.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
gi|237904251|gb|EEP78652.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
Length = 1209
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 40/265 (15%)
Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
LL T+ GDL +T+ V D V+RL L + + S + + N F+ S G
Sbjct: 307 LLQTEDGDLFKVTIDMVEDDNGQPTGEVRRLKLKYFDTVPIASSLCILKNGFLFVASENG 366
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGD--IEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
+ QF + ++F +E AP R R + + L + +N +
Sbjct: 367 NHHFYQFEKLGDDDEETEFTSDDFSSDPLEPLAPVYFRPRPAENLNLVESINSVNPLMSC 426
Query: 489 SASNNTES-AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
+N TE A + ++ + LK +GL + S+ EL +P
Sbjct: 427 KVANLTEDDAPQLYTLCGTGARSTFRTLK---HGLEV---------SEIVESELPSVPS- 473
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+WT K +R +D+Y AY+I+S T+VL + + EVT++ +
Sbjct: 474 -AVWTT--KLTR------------NDQYDAYIILSFTNGTLVLSIGETVEEVTDT-GFLS 517
Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
T+A L G +IQV +G R
Sbjct: 518 SAPTLAVQQL-GEDSLIQVHPKGIR 541
>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
24927]
Length = 1133
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 63/269 (23%), Positives = 107/269 (39%), Gaps = 46/269 (17%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L DP GR G+ +Y G+ I Q G G A++ + + NL
Sbjct: 116 GHLYLADPGGRLLGLYLYEGIFTAIPIKRQS------------KGRGRHAQLPEAEIGNL 163
Query: 239 RD---LDMKHVK--DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
D + M +K + +F++G PV+ +L+ ++++ L+++
Sbjct: 164 DDPCPIRMNELKVINMVFLYGTSVPVIAVLYTDSKKLVHLITYE--------LNVAKRAV 215
Query: 294 QHPLI--W--SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
+ P W A NL H A L+ V +P GG+LV+G + Y + V +
Sbjct: 216 KDPEFAQWGIKANNLDHGAKLLIPVDNPTGGILVIGEQVVSYFHPERT---------VPM 266
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
P S + H + + LLS + G L LL ++ + + + +
Sbjct: 267 KKPLHEPTSFVT------HGK-IDPERYLLSDELGHLYLLLLIIENNKLINMRIENLGEV 319
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
I + N FLGS GDS LV+ +
Sbjct: 320 CQARAIVYLDNGYVFLGSHFGDSTLVRIS 348
>gi|347829304|emb|CCD45001.1| similar to pre-mRNA-splicing factor rse1 [Botryotinia fuckeliana]
Length = 1212
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 129/600 (21%), Positives = 226/600 (37%), Gaps = 94/600 (15%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ + G + ++A G++ +D II+ + +I+++EF + + + F
Sbjct: 62 HDVFGIIRAIAAFRLAGSN----KDYIIITSDSGRITIVEFVPAQNKFNRLHLETFG--- 114
Query: 167 WLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
K G G + VDP+GR V L ++ + SQ E T S
Sbjct: 115 ----KSGVRRVVPGQYLAVDPKGRACLTASVEKNKLVYVLNRNSQA-------ELTISSP 163
Query: 224 GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHH 279
A + V L LD+ GY PV L E + G+ ++
Sbjct: 164 --LEAHKAQTLVFALVALDV----------GYANPVFAALEIDYGESDQDPTGQ-AYDEI 210
Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHY-HSQS 335
+ + L WS + A L VP GVLV G + I Y HS
Sbjct: 211 EKQLVYYELDLGLNHVVRKWSE-PVDRTANILFQVPGGTDGPSGVLVCGEDNITYRHSNQ 269
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV-- 391
+ +A+ + + Q V +L A + LL T GDL +T+
Sbjct: 270 EAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF----FLLQTDDGDLFKITIEM 325
Query: 392 VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
V D V+RL + + + + + + + F+ S G+ QF
Sbjct: 326 VEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEFGNHQFYQFEKLGDDDEE 385
Query: 447 SSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSF 503
+ + ++F G E+ P R + + +L + ++ + +N T E A + +S
Sbjct: 386 TEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMDCKVANLTDEDAPQIYSI 445
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHN 562
+ LK +GL ++ + ELPG +WT K +RG
Sbjct: 446 CGTGARSTFRTLK---HGLEVSEIVES------------ELPGVPSAVWTT--KLTRG-- 486
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
D Y AY+I+S T+VL + + EVT++ + T+A L G
Sbjct: 487 ----------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDS 534
Query: 623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGM-SDGSI 680
+IQV +G R + + + + P + + + N ++V+++ V M SDGS+
Sbjct: 535 LIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSL 593
>gi|302820387|ref|XP_002991861.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
gi|300140399|gb|EFJ07123.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
Length = 1292
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 47/216 (21%), Positives = 85/216 (39%), Gaps = 38/216 (17%)
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFS----YGLRINADASATGISKQSNYELVE----- 543
E Q +F V+ NI P+ DFS YG + + + G ++ + ++
Sbjct: 419 KVEDGQLSFQSFVQ----NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVRIIRNGNSV 474
Query: 544 ---------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
G GIWT+ ++ + D YHA+ +IS T VL
Sbjct: 475 EKLICTPPVYQGVSGIWTMRYR--------------FKDPYHAFFLISFVEETRVLSVGL 520
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
++T++V + Q T+A G L V QV+ ++ + SN S +
Sbjct: 521 NFVDITDAVGFESQVNTLACG-LVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSNPLSTT 579
Query: 655 GSENSTVLSV-SIADPYVLLGMSDGSIRLLVGDPST 689
+ +SV ++ V+L ++ + L++G T
Sbjct: 580 WRKPGYPISVGAVCRSRVILALARPGLLLMLGATQT 615
>gi|313235544|emb|CBY10999.1| unnamed protein product [Oikopleura dioica]
Length = 1185
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 87/428 (20%), Positives = 157/428 (36%), Gaps = 105/428 (24%)
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE------LDAAHATWL 372
GGV+V N + Y N+ D +PR ++ + AHAT
Sbjct: 246 GGVIVCAENYLIY-----------KNFGDQPDIRFPIPRRRNDLDDPERGMIIVAHATHK 294
Query: 373 QNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ LL T+ GDL +T+ + +V + L + ++S + + F+ G
Sbjct: 295 TRSMFFFLLQTEQGDLFKVTLETEEDIVTEIRLKYFDTVPVSSSLCVLRTGFLFVAGEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRR----SSSDALQDMVNGEELS 485
+ L Q T G E + E + + LR D+L ++N E
Sbjct: 355 NHNLYQITRLGEDDDEPEFSSAEPLEEGETFFFTPRGLRNLALTDEMDSLSPVLNCEVAD 414
Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
L A+ +T T R +L + +GL + S + ELP
Sbjct: 415 L---ANEDTPQLYVTCGRGPRSTL------RVLRHGLEV------------SEMAVSELP 453
Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
G +WTV + D ++ +Y+I+S T+VL + + E+T+S
Sbjct: 454 GNPNAVWTV--------------KTSADADHDSYIIVSFVNATLVLSIGETVEEITDS-G 498
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR-------------------------------I 633
+ T+++G L G ++Q++ G R
Sbjct: 499 FLGTTPTLSSG-LMGEDALVQIYPEGIRHIRSDRRVNEWRAPDRKQIVRCACNRQQVVIA 557
Query: 634 LDGS---YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
L G Y D + G N + S ++++ + D ++ +G+SDG++R+
Sbjct: 558 LTGGEIVYFEMDPT-GQLNEYTERREFGSEIIALDVGDVPAGEQRCRFLAVGLSDGTVRI 616
Query: 683 LVGDPSTC 690
+ DP+ C
Sbjct: 617 ISLDPNDC 624
>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
Length = 1542
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
+ A+ G G + GG S + V + R D+K +++D FV E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321
Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
P++ L E++ TWAGRV W+ TC I ++++ + H L S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381
Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
P+D + +P+ P V +H ++S A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441
Query: 351 ------SSQELPRSSFSVELDAAHAT 370
+SQ L V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467
>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
Length = 1128
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L +LD V D F++G+ +P + +L + +H +L
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
PL NL A L+ VP+PIGGV+V+G T+ Y S S L Y S+ S +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
R+ ++ D LL + G L LL T + L +
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ L S I + N ++GS GDS L++
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335
>gi|170041368|ref|XP_001848437.1| splicing factor 3B subunit 3 [Culex quinquefasciatus]
gi|167864946|gb|EDS28329.1| splicing factor 3B subunit 3 [Culex quinquefasciatus]
Length = 1215
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 67/321 (20%), Positives = 118/321 (36%), Gaps = 61/321 (19%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ T+ GD+ +T+ D VV + L + + + + F+ G+ L Q
Sbjct: 302 LVQTEQGDIFKVTLETDDDVVAEIKLKYFDTVPPATAMCVLKTGFLFVACDFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS----- 489
SS + E GD AP L+++V +E+ +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPR----------PLKNLVMVDEIHSFAPILGCQ 411
Query: 490 -ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC- 547
A E + + R +I L+ +GL + S + ELPG
Sbjct: 412 VADLANEDTPQLYLACGRGPRSSIRVLR---HGLEV------------SEMAVSELPGNP 456
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+WTV ++ DDE+ AY+I+S T+VL D + EVT+S F+
Sbjct: 457 NAVWTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGDTVEEVTDS--GFL 500
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
+ G ++QV+ G R + N G + T++ ++
Sbjct: 501 GTTPTLCCSALGDDALVQVYPDGIRHIRADKRV--------NEWKAPGKK--TIIKCAVN 550
Query: 668 DPYVLLGMSDGSIRLLVGDPS 688
V++ +S G + DP+
Sbjct: 551 SRQVVIALSGGELVYFEMDPT 571
>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
Length = 1542
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
+ A+ G G + GG S + V + R D+K +++D FV E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321
Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
P++ L E++ TWAGRV W+ TC I ++++ + H L S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381
Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
P+D + +P+ P V +H ++S A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441
Query: 351 ------SSQELPRSSFSVELDAAHAT 370
+SQ L V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467
>gi|154295205|ref|XP_001548039.1| pre-mRNA splicing factor 3b [Botryotinia fuckeliana B05.10]
Length = 1020
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 129/600 (21%), Positives = 226/600 (37%), Gaps = 94/600 (15%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ + G + ++A G++ +D II+ + +I+++EF + + + F
Sbjct: 62 HDVFGIIRAIAAFRLAGSN----KDYIIITSDSGRITIVEFVPAQNKFNRLHLETFG--- 114
Query: 167 WLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
K G G + VDP+GR V L ++ + SQ E T S
Sbjct: 115 ----KSGVRRVVPGQYLAVDPKGRACLTASVEKNKLVYVLNRNSQA-------ELTISSP 163
Query: 224 GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHH 279
A + V L LD+ GY PV L E + G+ ++
Sbjct: 164 --LEAHKAQTLVFALVALDV----------GYANPVFAALEIDYGESDQDPTGQ-AYDEI 210
Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHY-HSQS 335
+ + L WS + A L VP GVLV G + I Y HS
Sbjct: 211 EKQLVYYELDLGLNHVVRKWSE-PVDRTANILFQVPGGTDGPSGVLVCGEDNITYRHSNQ 269
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV-- 391
+ +A+ + + Q V +L A + LL T GDL +T+
Sbjct: 270 EAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF----FLLQTDDGDLFKITIEM 325
Query: 392 VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
V D V+RL + + + + + + + F+ S G+ QF
Sbjct: 326 VEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEFGNHQFYQFEKLGDDDEE 385
Query: 447 SSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSF 503
+ + ++F G E+ P R + + +L + ++ + +N T E A + +S
Sbjct: 386 TEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMDCKVANLTDEDAPQIYSI 445
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHN 562
+ LK +GL ++ + ELPG +WT K +RG
Sbjct: 446 CGTGARSTFRTLK---HGLEVSEIVES------------ELPGVPSAVWTT--KLTRG-- 486
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
D Y AY+I+S T+VL + + EVT++ + T+A L G
Sbjct: 487 ----------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDS 534
Query: 623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGM-SDGSI 680
+IQV +G R + + + + P + + + N ++V+++ V M SDGS+
Sbjct: 535 LIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSL 593
>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
Length = 1150
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L +LD V D F++G+ +P + +L + +H +L
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
PL NL A L+ VP+PIGGV+V+G T+ Y S S L Y S+ S +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
R+ ++ D LL + G L LL T + L +
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ L S I + N ++GS GDS L++
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335
>gi|406602265|emb|CCH46158.1| Pre-mRNA-splicing factor [Wickerhamomyces ciferrii]
Length = 1123
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 100/255 (39%), Gaps = 81/255 (31%)
Query: 493 NTESAQKTFSFA-VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE--LPG-CK 548
N ++ K +S + V+DS LK YGL IN E+VE LPG
Sbjct: 406 NDDAFTKIYSLSGVKDS----SSLKILQYGLSIN--------------EIVESDLPGIAN 447
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+WT +DE+ YL+IS T+VL + + E+T+S +
Sbjct: 448 KVWTTKLNK--------------NDEFDKYLVISFMDTTLVLSIGENVEEITDS-GLALN 492
Query: 609 GRTIAAGNLFGRRRVIQVFERGAR------------------ILDGSYMTQDLSFGPSNS 650
TI + G ++Q+ G R IL S + ++ G SN
Sbjct: 493 EETIGIQQI-GINSLVQIHSNGIRNIKNGELINEWQPPAGIKILTTSTTNRQIAIGLSND 551
Query: 651 E---------------SGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDP 687
E + S ++S+S+ D P++++G D +IR+L DP
Sbjct: 552 ELVYFEVDDRDRLIEYNERKELTSRIVSLSLGDIPEGRLRSPFLIVGCQDSTIRVLSTDP 611
Query: 688 STC--TVSVQTPAAI 700
+ +S+Q ++I
Sbjct: 612 GSTLELLSLQALSSI 626
>gi|171691144|ref|XP_001910497.1| hypothetical protein [Podospora anserina S mat+]
gi|170945520|emb|CAP71632.1| unnamed protein product [Podospora anserina S mat+]
Length = 1158
Score = 40.8 bits (94), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 72/188 (38%), Gaps = 53/188 (28%)
Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA 341
+I + +K+H + PH +GGV+VVG + Y
Sbjct: 233 LIPVRKVEEEVKRHNFRNTGSAKPH-----------LGGVIVVGETRLLY---------- 271
Query: 342 LNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
++ +++ +LD A W + +V L+ G L LLT+ DG
Sbjct: 272 ----------IDDVTKATVESKLDKASIFVKWAEYNVQTYFLADDYGSLHLLTINTDGAE 321
Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
V+ + L+K + S++ +GN + F+ S GDS L Q D+
Sbjct: 322 VKGMVLTKIGVTSRASELVYLGNEMLFVASHHGDSRLFQL------------------DL 363
Query: 458 EADAPSTK 465
AD P+ K
Sbjct: 364 SADKPADK 371
>gi|115397303|ref|XP_001214243.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192434|gb|EAU34134.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1140
Score = 40.8 bits (94), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 60/146 (41%), Gaps = 25/146 (17%)
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS--LDSSQELPRS 358
A L A L+ VP+P+GG+L++G +I Y NN VS LD +
Sbjct: 237 AQELDLGASHLIPVPAPLGGLLILGETSIKYVDDD-------NNEIVSRLLDEA------ 283
Query: 359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
W Q D LL+ G L L +V D VQ L + S +
Sbjct: 284 -------TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSENQVQGWQLDHLGNTSRASTL 336
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS +++ GS
Sbjct: 337 VYLGGGVIFVGSHQGDSQVLRVGDGS 362
>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1129
Score = 40.8 bits (94), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 102/262 (38%), Gaps = 32/262 (12%)
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP GR + +Y G+ ++ +G GD + G +RIE V + L
Sbjct: 121 LDPTGRFMTLELYEGIVTVVPLTEKGKRK--GDPEVSALGEPVPSRIEEMFVRSSAFLHR 178
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
K + +P++ +L+E + R+ + + + P+
Sbjct: 179 KSPESE-------KPLVALLYEEDEDSKIRLRLRQLAFQTAGTEEQSVAALEPVEGLKEE 231
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
L A L+ VP P GVLV+G I Y N+Y +L + L S+ V
Sbjct: 232 LDLGASHLIPVPGPCYGVLVLGETCITY----------FNDYTKAL-VKKPLQDSTIFV- 279
Query: 364 LDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGR--VVQRLDLSKTNPSVLTSDITTIG 419
W Q N LL+ G L L ++ D VV+ L K + S + +
Sbjct: 280 ------AWEQIDNQRFLLADDFGGLYLFMLLLDDNSGVVEGWRLDKIGETSRASVLVYLD 333
Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ T GS
Sbjct: 334 AGHVFVGSHEGDSQVIRITEGS 355
>gi|18410222|ref|NP_567015.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
gi|18410226|ref|NP_567016.1| putative splicing factor [Arabidopsis thaliana]
gi|7019653|emb|CAB75754.1| spliceosomal-like protein [Arabidopsis thaliana]
gi|7019655|emb|CAB75756.1| spliceosomal-like protein [Arabidopsis thaliana]
gi|332645831|gb|AEE79352.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
gi|332645833|gb|AEE79354.1| putative splicing factor [Arabidopsis thaliana]
Length = 1214
Score = 40.8 bits (94), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 73/326 (22%), Positives = 125/326 (38%), Gaps = 61/326 (18%)
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVLV N + Y +Q A+ + +LP + + AA L+
Sbjct: 246 GVLVCAENFVIYMNQGHPDVRAV------IPRRTDLPAERGVLVVSAAVHKQKTMFFFLI 299
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
T+ GD+ +T+ ++G V L + + + S I + F S G+ L QF
Sbjct: 300 QTEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQA 359
Query: 440 --------GSGTSMLSSGLKEEFGDIEADAPSTKRLRR-SSSDALQDMVNGEELSLYGSA 490
S ++++ + +E F + K L R ++L +++ + L+++
Sbjct: 360 IGEEPDVESSSSNLMET--EEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIF--- 414
Query: 491 SNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG- 546
E + FS R GP L+ GL I A + +LPG
Sbjct: 415 ---EEETPQIFSLCGR------GPRSSLRILRPGLAITEMAVS------------QLPGQ 453
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+WTV S DE+ AY+++S T+VL + + EV +S F
Sbjct: 454 PSAVWTVKKNVS--------------DEFDAYIVVSFTNATLVLSIGEQVEEVNDS--GF 497
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGAR 632
+ A +L G ++QV G R
Sbjct: 498 LDTTPSLAVSLIGDDSLMQVHPNGIR 523
>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
Length = 1139
Score = 40.8 bits (94), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V N+R L+ V D F++G P + +L + + H S T L
Sbjct: 150 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 201
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
P WS N+ Y L VP P+GGVLVV N I Y + + ++A+ +Y L ++
Sbjct: 202 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 256
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
V+ D + L G L +L +++ + V L + + S
Sbjct: 257 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 300
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
I+ + + + ++GS GDS L++
Sbjct: 301 SISYLDSGVVYIGSSSGDSQLIRL 324
>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
Length = 1504
Score = 40.4 bits (93), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 21/135 (15%)
Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD 365
H+ + +GG+LVVG + Y + C + E+P + S+
Sbjct: 562 HNVRNTATATANLGGLLVVGETRLLYIDSTTKCTV-------------EVPLRAASI--- 605
Query: 366 AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN-SL 422
W + D LL+ + G L LLT++ G VV LD+S + S + + + L
Sbjct: 606 --FVAWARYDATHYLLADEYGTLHLLTILVSGAVVDNLDVSPIGKTSRASCLVYLPDRRL 663
Query: 423 FFLGSRLGDSLLVQF 437
F+GS GDS L +
Sbjct: 664 LFVGSHNGDSQLFRL 678
>gi|427798971|gb|JAA64937.1| Putative damage-specific dna binding complex subunit ddb1, partial
[Rhipicephalus pulchellus]
Length = 1259
Score = 40.4 bits (93), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 60/268 (22%), Positives = 103/268 (38%), Gaps = 57/268 (21%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ + G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFLFVAAEFGNHCLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E GD AP AL++++ EEL A T
Sbjct: 362 ARLGEEDEEPEFSSAIPLEEGDTFFFAPR----------ALRNLLPVEELDSLSPAMGCT 411
Query: 495 ------ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELP 545
E + + R GP ++ +GL + S + ELP
Sbjct: 412 IADLANEDTPQLYVACGR------GPRSCIRVLRHGLEV------------SEMAVSELP 453
Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
G +WTV K+ D++Y AY+I+S T+VL + + EVT+S
Sbjct: 454 GNPNAVWTVKRKA--------------DEDYDAYIIVSFVNATLVLSIGETVEEVTDS-G 498
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ T++ + G ++QV+ G R
Sbjct: 499 FLGTTPTLSCAQI-GDDALVQVYPEGIR 525
>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA
repair protein E; AltName: Full=UV-damaged DNA-binding
protein 1
gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
Length = 1181
Score = 40.4 bits (93), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V N+R L+ V D F++G P + +L + + H S T L
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 243
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
P WS N+ Y L VP P+GGVLVV N I Y + + ++A+ +Y L ++
Sbjct: 244 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 298
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
V+ D + L G L +L +++ + V L + + S
Sbjct: 299 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 342
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
I+ + + + ++GS GDS L++
Sbjct: 343 SISYLDSGVVYIGSSSGDSQLIRL 366
>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei
ATCC 18224]
gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei
ATCC 18224]
Length = 1139
Score = 40.0 bits (92), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G I Y +D ++ + S LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKY-----------------IDDAK---NETISNPLDEA 284
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSL 422
W+Q D LL+ G L L +V D + V+ L + S + +G +
Sbjct: 285 TIFVAWVQVDGQRWLLADDYGRLFFLMLVLDSQNEVEGWKLDYLGEASRASVLIYLGAGM 344
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ + GS
Sbjct: 345 TFIGSHQGDSQVIRISEGS 363
>gi|380481704|emb|CCF41690.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
Length = 932
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 14/144 (9%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS-----QELPRSSFS 361
D Y + +P PI V YH + + A A + + + L R+
Sbjct: 179 DPYARIVIPVPI-----VEDEVKRYHKRDTTGAKAQLGGLIVVGETLLVYVDTLTRTVVE 233
Query: 362 VELD--AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
L+ A W D LS G+L LLT+ +G VV L L + S +
Sbjct: 234 SGLNSPAIFVAWAAYDDTNYFLSDDYGNLHLLTIETEGVVVTNLSLRLLGVTSRASCLVH 293
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+GN L FLGS GDS L+Q S
Sbjct: 294 MGNGLLFLGSHYGDSQLLQINMES 317
>gi|401426989|ref|XP_003877978.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322494225|emb|CBZ29522.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 1542
Score = 40.0 bits (92), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 84/185 (45%), Gaps = 37/185 (20%)
Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
GGG S + V + R D+K +++D FV EP++ L E++ TWAGRV
Sbjct: 283 GGGTSLLLRIGTVTHWRLQDVKTALRNIRDIQFVESAGEPLLAFLFEKQPTWAGRVKLLE 342
Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLA------VPSPIG 319
W+ TC I ++++ + H L S ++ LP+D + VPS +
Sbjct: 343 WRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGLPYDVTSMTPLTAFQDVPSAVF 402
Query: 320 GV---LVVGANT-----IHYHSQSASCALALNNYAVSLD------SSQELPRSSFSVELD 365
V ++V +T ++ ++ A +L + AVS + +SQ L V L+
Sbjct: 403 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSFEAVQWRSASQALSTDLVKVNLN 462
Query: 366 AAHAT 370
++AT
Sbjct: 463 FSNAT 467
>gi|440636768|gb|ELR06687.1| pre-mRNA-splicing factor rse1 [Geomyces destructans 20631-21]
Length = 1212
Score = 39.7 bits (91), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 74/327 (22%), Positives = 125/327 (38%), Gaps = 49/327 (14%)
Query: 320 GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA- 377
GVLV G + I Y HS + +A+ + E P+ S+ H
Sbjct: 253 GVLVCGEDNITYRHSNQEAFRVAIPRRK----GATEDPQRKRSIVAGVMHKMRGAAGAFF 308
Query: 378 -LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
LL + GDL +T+ + D V+RL + + + + + + + F+ S
Sbjct: 309 FLLQSDDGDLFKITIEMIEDDNGQPTGEVRRLKIKYFDTVPIATSLCILKSGFLFVASEF 368
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLY 487
G+ QF + + + F A+ P R + + L + ++ +
Sbjct: 369 GNHQFYQFEKLGDDDDETEYISDNFPTDPAEPYTPVYFHPRPAENLNLVESIDSMNPLMD 428
Query: 488 GSASNNTES-AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
+N TE A + +S + LK +GL +N + ELPG
Sbjct: 429 CKVANLTEEDAPQIYSICGTGARSTFRTLK---HGLEVNEIVES------------ELPG 473
Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+WT K +RG DEY AY+I++ T+VL + + EVT++
Sbjct: 474 VPSAVWTT--KLTRG------------DEYDAYIILAFSNGTLVLSIGETVEEVTDT--G 517
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
F+ T A G +IQV +G R
Sbjct: 518 FLSSATTLAVQQLGEDGLIQVHPKGIR 544
>gi|443694993|gb|ELT96001.1| hypothetical protein CAPTEDRAFT_155561 [Capitella teleta]
Length = 1215
Score = 39.7 bits (91), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 59/264 (22%), Positives = 100/264 (37%), Gaps = 49/264 (18%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + S + + F+ S G+ L Q
Sbjct: 302 LTQTEQGDVFKITLETDEDMVTEVRLKYFDTVPVASSMCVLKTGFLFIASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGD--IEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD A P + D+L +++ + L
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLKNLVMVDEMDSLSPIMHCQIADL------ 415
Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
E + F+ R GP L+ +GL + S + ELPG
Sbjct: 416 ANEDTPQLFAMCGR------GPRSTLRVLRHGLEV------------SEMAVSELPGNPN 457
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+WTV +DE+ AY+I+S T+VL + + EVT+S +
Sbjct: 458 AVWTVKRN--------------IEDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGT 502
Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
T++ L G ++Q++ G R
Sbjct: 503 TPTLSCSQL-GDDALVQIYPDGIR 525
>gi|70992271|ref|XP_750984.1| UV-damaged DNA binding protein [Aspergillus fumigatus Af293]
gi|66848617|gb|EAL88946.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
Af293]
gi|159124553|gb|EDP49671.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
A1163]
Length = 1140
Score = 39.7 bits (91), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 60/135 (44%), Gaps = 25/135 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+L++G +I Y V D+++ + R LD A
Sbjct: 248 LIPVPAPLGGLLILGEMSIKY---------------VDADNNEIISRP-----LDEATIF 287
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D V+ L + S + +G + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDSQVESWKLDHLGNTSRASVLVYLGGGILFL 347
Query: 426 GSRLGDSLLVQFTCG 440
GS GDS +++ + G
Sbjct: 348 GSHQGDSQVLRISNG 362
>gi|322700233|gb|EFY91989.1| Pre-mRNA-splicing factor rse-1 [Metarhizium acridum CQMa 102]
Length = 1039
Score = 39.7 bits (91), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 47/105 (44%), Gaps = 19/105 (18%)
Query: 540 ELV--ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
ELV ELPG +WT S D+Y AY+I++ TMVL + +
Sbjct: 461 ELVASELPGTPSAVWTTKLTQS--------------DDYDAYIILTFLHDTMVLSVGETV 506
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
T+VT+S F+ A G+ + QV+ +G R + T+
Sbjct: 507 TQVTDS--GFITTVATLAVQQIGKNSLFQVYSKGIRHIQSGQFTE 549
>gi|296411833|ref|XP_002835634.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295629420|emb|CAZ79791.1| unnamed protein product [Tuber melanosporum]
Length = 1053
Score = 39.7 bits (91), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 106/262 (40%), Gaps = 49/262 (18%)
Query: 186 DPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
DP G+ VY G+ ++I + Q G G I + V+ L++L+
Sbjct: 123 DPGKNMLGIHVYKGIFLVIPQIQQSIKGSRRSRADLDVG-----NIGNPCVVRLKELE-- 175
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
+ D F+ G I PV+ +L++ +G +T +S S L L W +L
Sbjct: 176 -ILDLKFLFGTISPVLAVLYKP----SGADEMAVNTYELSVKSGEVKL----LDWRIRDL 226
Query: 305 P--HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
+A L+ V P G+L++G I Y +NY ++ V
Sbjct: 227 KGGREALFLIPVRPPSNGLLLIGVTKIQY----------FDNYG---------NKTFLPV 267
Query: 363 ELDAAHATW--LQNDVALLSTKTGDLVLLTV---VYDGRVVQRLDL--SKTNPSVLTSDI 415
+ TW L + +L + G L +LT+ + D +V L L + + P +L
Sbjct: 268 DPPMVWVTWEMLSPERYILGDEAGGLHMLTLSAGLMDTKVGLHLKLVGNASIPEILVH-- 325
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
+ L FLGS GDS L+Q
Sbjct: 326 --LNQGLLFLGSHSGDSQLLQL 345
>gi|384500266|gb|EIE90757.1| hypothetical protein RO3G_15468 [Rhizopus delemar RA 99-880]
Length = 1057
Score = 39.7 bits (91), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 52/220 (23%), Positives = 85/220 (38%), Gaps = 39/220 (17%)
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
K V F+ ++P ++IL+E L L T+K L+ +
Sbjct: 142 KKVISLAFLQDTLDPTLLILYEDALE--------------QRLLQMFTIKDRQLVPGDII 187
Query: 304 LPH---DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
L H DA L+A+P +GGVL+V + I Y L +Q P +
Sbjct: 188 LDHFESDASLLIAMPPAVGGVLLVASKFIRY-----------------LKPNQ--PPIAI 228
Query: 361 SVELDAAHATWLQNDV---ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
+ ++ + N+ LL G L LL + + V+ L + S +
Sbjct: 229 GIRSSTINSHCIMNEEGSRVLLGDAEGLLYLLALNTTNQCVESLSFIYLGSISIPSCLAY 288
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ N + F+GS L DS LV +G S + E F ++
Sbjct: 289 LDNDIVFVGSNLADSQLVYIQRTTGESEDILQIIETFANL 328
>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
Length = 1054
Score = 39.7 bits (91), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 39/199 (19%)
Query: 503 FAVRDSLVNIGPLKDFSY---------------------GLRINADASATGISKQSNYEL 541
+V D N+GP+ DF LRI + GI++Q++
Sbjct: 302 ISVIDQFTNLGPITDFCVVDVEKQGQGQLVTCSGTFQDGSLRIIRNG--IGIAEQAS--- 356
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
+ELPG +G+W S + H +LI+S T VL + E TE
Sbjct: 357 IELPGIRGLW-------------SLSNNSNPSSLHRHLIVSFINSTKVLTFSGEEIEETE 403
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 661
+ T+ GN IQ+ G ++D S + + + P S N +
Sbjct: 404 IAGFDSNATTLYCGNTTENNHFIQIATSGIYLVDSSSLMRLDQYTPEKGSINLASCNGSQ 463
Query: 662 LSVSIADPYVLLGMSDGSI 680
+ +S L +SD +
Sbjct: 464 ILISQGSNLTYLEISDSKL 482
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,589,231,647
Number of Sequences: 23463169
Number of extensions: 661713671
Number of successful extensions: 1528851
Number of sequences better than 100.0: 646
Number of HSP's better than 100.0 without gapping: 339
Number of HSP's successfully gapped in prelim test: 307
Number of HSP's that attempted gapping in prelim test: 1525297
Number of HSP's gapped (non-prelim): 1649
length of query: 1004
length of database: 8,064,228,071
effective HSP length: 153
effective length of query: 851
effective length of database: 8,769,330,510
effective search space: 7462700264010
effective search space used: 7462700264010
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)